sphider
所属分类:搜索引擎
开发工具:PHP
文件大小:40KB
下载次数:48
上传日期:2006-03-24 12:57:49
上 传 者:
pm1784
说明: 搜索软件ROBOT
搜索引擎中最重要的一项
PHP编写的一个网络蜘蛛程序
(search software search engine one of the most important preparation of a PHP Web Spider procedures)
文件列表:
Sphider--开源的蜘蛛程序\include\common.txt (623, 2005-09-16)
Sphider--开源的蜘蛛程序\include\index.css (376, 2005-09-16)
Sphider--开源的蜘蛛程序\include\index_footer.inc (16, 2005-09-16)
Sphider--开源的蜘蛛程序\include\index_header.inc (206, 2005-09-16)
Sphider--开源的蜘蛛程序\include\commonfuncs.php (4201, 2005-09-16)
Sphider--开源的蜘蛛程序\include\conf.php (3273, 2005-10-21)
Sphider--开源的蜘蛛程序\include\connect.php (476, 2005-10-21)
Sphider--开源的蜘蛛程序\include (0, 2005-10-21)
Sphider--开源的蜘蛛程序\admin\ext.txt (170, 2005-09-16)
Sphider--开源的蜘蛛程序\admin\auth.php.bak (1253, 2005-10-21)
Sphider--开源的蜘蛛程序\admin\tmp (0, 2005-09-20)
Sphider--开源的蜘蛛程序\admin\admin.css (395, 2005-09-16)
Sphider--开源的蜘蛛程序\admin\admin.php (40695, 2005-10-21)
Sphider--开源的蜘蛛程序\admin\auth.php (1247, 2005-10-21)
Sphider--开源的蜘蛛程序\admin\auth_old.php (757, 2005-09-16)
Sphider--开源的蜘蛛程序\admin\install.php (3346, 2005-09-16)
Sphider--开源的蜘蛛程序\admin\messages.php (3406, 2005-09-16)
Sphider--开源的蜘蛛程序\admin\spider.php (15665, 2005-09-30)
Sphider--开源的蜘蛛程序\admin\spiderfuncs.php (21187, 2005-09-20)
Sphider--开源的蜘蛛程序\admin (0, 2005-10-21)
Sphider--开源的蜘蛛程序\install.txt (6036, 2005-09-20)
Sphider--开源的蜘蛛程序\upgrading.txt (335, 2005-09-16)
Sphider--开源的蜘蛛程序\安装说明.txt (549, 2005-10-21)
Sphider--开源的蜘蛛程序\languages\cn-language.php (867, 2005-10-21)
Sphider--开源的蜘蛛程序\languages\de-language.php (1018, 2005-09-16)
Sphider--开源的蜘蛛程序\languages\ee-language.php (932, 2005-09-16)
Sphider--开源的蜘蛛程序\languages\en-language.php (961, 2005-09-16)
Sphider--开源的蜘蛛程序\languages\es-language.php (998, 2005-09-16)
Sphider--开源的蜘蛛程序\languages\it-language.php (1024, 2005-09-16)
Sphider--开源的蜘蛛程序\languages\nl-language.php (1003, 2005-09-16)
Sphider--开源的蜘蛛程序\languages\pt-language.php (950, 2005-09-20)
Sphider--开源的蜘蛛程序\languages (0, 2005-10-21)
Sphider--开源的蜘蛛程序\sql\tables.sql (1369, 2005-09-16)
Sphider--开源的蜘蛛程序\sql\upgrade_to_1.2.5.sql (160, 2005-09-16)
Sphider--开源的蜘蛛程序\sql\upgrade_to_1.2.6.sql (221, 2005-09-16)
Sphider--开源的蜘蛛程序\sql\upgrade_to_1.2.sql (171, 2005-09-16)
Sphider--开源的蜘蛛程序\sql (0, 2005-10-21)
Sphider--开源的蜘蛛程序\search.css (1072, 2005-09-16)
Sphider--开源的蜘蛛程序\search.php (14144, 2005-10-21)
... ...
============================================
Sphider - a lightweight search engine in PHP
Version 1.2.x
By Ando Saabas ando(a t)cs.ioc.ee
============================================
Sphider is a lightweight web spider and search engine written in PHP, using MySQL as its back end database. It is
suitable for adding search functionality to small or medium sites (up to 10-20,000 pages).
--------
Features
--------
1. Spidering
- Can index both static and dynamic pages.
- Finds links in
, , and tags, and can also follow links given in
javascript as strings via window.location and window.open.
- Respects robots.txt protocol.
- Follows server side redirections.
- Allows spidering to be limited by depth (ie maximum number of clicks from the starting page), by (sub)domain or by
directory.
- Supports indexing of pdf and doc files (using external binaries for file conversion).
- Allows resuming paused spidering.
2. Indexing
- Full text indexing.
- Possbility to exclude common words from being indexed.
- Option to define your custom page ranking function, which can depend on the number of times a given word occurs in the
webpage, whether the word occurs in the domain name, path, or title of the document and also the relative "deepness" of
the url (so that the same page in www.domain.com/ is ranked higher than in www.domain.com/dir1/dir2/foo.html)
3. Searching
- Uses AND operator by default, if more than one query word is used, it finds pages that include all the query words.
- Supports phrase searching.
- Supports excluding words (by putting a '-' in front of a word, any page including the word will be omitted from the
results).
- Option to add and group sites into categories
- Possibility to limit searching to a given category and its subcategories.
4. Size and speed
- Sphider uses reguler expressions to extract links from webpages, so indexing is not particularly fast. Searching is
quite fast, if the database size is reasonable.
-Sphider is very small, its source code being under 70kb in size, probably making it the smallest search engine with
such functionality out there (a pretty good indication of PHP as a rapid prototyping tool).
5. Compatibility
It is a typical LAMP application (but of course it can also be run under Windows). Sphider was designed to be compatible
with older versions of PHP and MySQL, it should work with at least PHP 3 and MySQL 3.23.
6. Licence
Sphider is licenced under GNU General Public Licence.
--------------------------------------------------------------------------------
Ando Saabas 2004
Contact: ando (at) cs.ioc.ee
近期下载者:
相关文件:
收藏者: