swish-efiles

所属分类:搜索引擎
开发工具:Visual C++
文件大小:472KB
下载次数:67
上传日期:2011-02-24 20:57:03
上 传 者john11
说明:  详细说明: 用C语言写的搜索引擎,包含多种建立索引的方式
(C serach engine, contains many methods for index establishing)

文件列表:
src (0, 1999-03-19)
src\Makefile (1216, 1999-01-07)
src\check.c (3140, 1998-12-02)
src\check.h (1081, 1998-12-01)
src\config.h (9319, 1998-12-12)
src\error.c (1048, 1998-12-01)
src\error.h (1024, 1998-12-01)
src\file.c (12070, 1998-12-12)
src\file.h (1849, 1998-12-02)
src\fs.c (10286, 1999-01-07)
src\hash.c (5202, 1998-12-01)
src\hash.h (1423, 1998-12-01)
src\http.c (12463, 1999-01-07)
src\http.h (722, 1998-12-01)
src\httpserver.c (10043, 1998-12-01)
src\httpserver.h (635, 1998-12-01)
src\index.c (29670, 1998-12-15)
src\index.h (2475, 1998-12-01)
src\list.c (1297, 1998-12-01)
src\list.h (1053, 1998-12-01)
src\mem.c (1440, 1998-12-01)
src\mem.h (1091, 1998-12-01)
src\merge.c (23634, 1998-12-01)
src\merge.h (4077, 1998-12-01)
src\search.c (28461, 1998-12-01)
src\search.h (2613, 1998-12-01)
src\swish.c (16397, 1999-01-07)
src\swish.h (14413, 1999-01-07)
src\string.c (14146, 1998-12-12)
src\string.h (1928, 1998-12-12)
src\swishspider (1994, 1999-03-19)
src\test.html (358, 1998-10-22)
src\user.config (7400, 1999-11-18)
src\win32 (0, 1998-10-22)
src\win32\dirent.c (5242, 1998-10-22)
src\win32\dirent.h (1540, 1998-10-22)
src\win32\regex.c (161825, 1998-10-22)
src\win32\regex.h (18702, 1998-10-22)
src\docprop.c (7911, 1998-12-02)
... ...

THE SWISH-E README SYSTEM REQUIREMENTS SWISH-E is written in C, and, up to this time, it has been tested on Solaris 2.5.1 and Dec Alpha 3.2 in house and by users on BSDI 2.0. To compile, a C compiler is needed. Pretty much any standard compiler should do, the ones we used are: gcc version 2.7.0 and ucbb SC3.0.1. The HTTP file access method uses a Perl helper script that requires the LWP,HTTP,HTML modules(Note: depending on your perl installation, you might need to install additional modules required by LWP; for requirements and downloads check http://www.cpan.org or http://theory.uwinnipeg.ca/search/cpan-search.html). It was tested with Perl 5.005 although should probably work with any version 5 release. A tip from Wil Vree about compiling with a Sun C compiler: In the Makefile the following two opions need to be modified CC = /opt/SUNWspro/bin/cc CFLAGS= -xO2 INSTALLATION Downloading the C files and recompiling the executable ------------------------------------------------------ 1) Create a directory where to keep the source code files, for example /opt/swish-e.x %mkdir /opt/swish-e.x 2) Download the swish-efiles.x.x.tar.gz into the directory, uncompress and untar it % gunzip swish-efiles.x.x.tar.gz %tar -xvf swish-efiles.x.x.tar The command should create the following directory and files: src/ Makefile file.h list.c string.h README-SWISH-E fs.c list.h swish.c user.config hash.c mem.c swish.h check.c hash.h mem.h swishspider check.h http.c merge.c swishspider.pl config.h http.h merge.h test.html docprop.c httpserver.c search.c /win32 docprop.h httpserver.h search.h error.c index.c stemmer.c error.h index.h stemmer.h file.c index.swish string.c 3) In the Makefile change the path for the C compiler Change CC = /usr/local/bin/gcc to CC = {PATH}/gcc as stated in the SYSTEM REQUIREMENTS section, you do not need to necessarily use gcc to compile. 4) Make any needed changes in the config.h to customize swish-e to your site. 5) Compile the swish-e executable from the src/ directory %cd src %make 6) Run a quick test to make sure that the executable is working correctly. %make test If the test is successful, you should see the following: sunsite% make test ./swish-e -i test.html -v -f index.swish Checking file "test.html"... test.html (41 words) Removing very common words... no words removed. Writing main index... 25 unique words indexed. Writing file index... 1 file indexed. Running time: Less than a second. Indexing done! ./swish-e -f index.swish -w test # SWISH format 1.x # Search words: test # Name: (no name) # Saved as: index.swish # Counts: 25 words, 1 files # Indexed on: 13/08/97 10:40:54 PDT # Description: (no description) # Pointer: (no pointer) # Maintained by: (no maintainer) 1000 test.html "If you are seeing this, the test was successful!" 358 . 7) Move the swish-e executable to /usr/local/bin 8) For instructions about how to index and search with swish-e check the Quick Start (http://sunsite.berkeley.edu/SWISH-E/Manual/quickstart.html) 9) The user.config file provided in the distribution is set for use with the FILESYSTEM. If you are using HTTP, comment out the directory of FILESYSTEM ONLY and set the ones for HTTP ONLY. QUESTIONS AND TROUBLESHOOTING If you run in any problem during installation, or if you have any question about swish-e use and features, let us know through our discussion group(http://sunsite.berkeley.edu/SWISH-E/Discussion Giulia Hill Library System Office UC Berkeley Libraries ghill@library.berkeley.edu 8/13/97 Last updated 12/10/***

近期下载者

相关文件


收藏者