swish-efiles
所属分类:搜索引擎
开发工具:Visual C++
文件大小:472KB
下载次数:67
上传日期:2011-02-24 20:57:03
上 传 者:
john11
说明: 详细说明:
用C语言写的搜索引擎,包含多种建立索引的方式
(C serach engine, contains many methods for index establishing)
文件列表:
src (0, 1999-03-19)
src\Makefile (1216, 1999-01-07)
src\check.c (3140, 1998-12-02)
src\check.h (1081, 1998-12-01)
src\config.h (9319, 1998-12-12)
src\error.c (1048, 1998-12-01)
src\error.h (1024, 1998-12-01)
src\file.c (12070, 1998-12-12)
src\file.h (1849, 1998-12-02)
src\fs.c (10286, 1999-01-07)
src\hash.c (5202, 1998-12-01)
src\hash.h (1423, 1998-12-01)
src\http.c (12463, 1999-01-07)
src\http.h (722, 1998-12-01)
src\httpserver.c (10043, 1998-12-01)
src\httpserver.h (635, 1998-12-01)
src\index.c (29670, 1998-12-15)
src\index.h (2475, 1998-12-01)
src\list.c (1297, 1998-12-01)
src\list.h (1053, 1998-12-01)
src\mem.c (1440, 1998-12-01)
src\mem.h (1091, 1998-12-01)
src\merge.c (23634, 1998-12-01)
src\merge.h (4077, 1998-12-01)
src\search.c (28461, 1998-12-01)
src\search.h (2613, 1998-12-01)
src\swish.c (16397, 1999-01-07)
src\swish.h (14413, 1999-01-07)
src\string.c (14146, 1998-12-12)
src\string.h (1928, 1998-12-12)
src\swishspider (1994, 1999-03-19)
src\test.html (358, 1998-10-22)
src\user.config (7400, 1999-11-18)
src\win32 (0, 1998-10-22)
src\win32\dirent.c (5242, 1998-10-22)
src\win32\dirent.h (1540, 1998-10-22)
src\win32\regex.c (161825, 1998-10-22)
src\win32\regex.h (18702, 1998-10-22)
src\docprop.c (7911, 1998-12-02)
... ...
THE SWISH-E README
SYSTEM REQUIREMENTS
SWISH-E is written in C, and, up to this time, it has been tested on
Solaris 2.5.1 and Dec Alpha 3.2 in house and by users on BSDI 2.0. To
compile, a C compiler is needed. Pretty much any standard compiler
should do, the ones we used are: gcc version 2.7.0 and ucbb SC3.0.1.
The HTTP file access method uses a Perl helper script that requires
the LWP,HTTP,HTML modules(Note: depending on your perl installation, you might need to
install additional modules required by LWP; for requirements and
downloads check http://www.cpan.org or http://theory.uwinnipeg.ca/search/cpan-search.html).
It was tested with Perl 5.005 although should probably work with any version 5 release.
A tip from Wil Vree about compiling with a Sun C compiler:
In the Makefile the following two opions need to be modified
CC = /opt/SUNWspro/bin/cc
CFLAGS= -xO2
INSTALLATION
Downloading the C files and recompiling the executable
------------------------------------------------------
1) Create a directory where to keep the source code files, for
example /opt/swish-e.x
%mkdir /opt/swish-e.x
2) Download the swish-efiles.x.x.tar.gz into the directory, uncompress and
untar it
% gunzip swish-efiles.x.x.tar.gz
%tar -xvf swish-efiles.x.x.tar
The command should create the following directory and files:
src/
Makefile file.h list.c string.h
README-SWISH-E fs.c list.h swish.c
user.config hash.c mem.c swish.h
check.c hash.h mem.h swishspider
check.h http.c merge.c swishspider.pl
config.h http.h merge.h test.html
docprop.c httpserver.c search.c /win32
docprop.h httpserver.h search.h
error.c index.c stemmer.c
error.h index.h stemmer.h
file.c index.swish string.c
3) In the Makefile change the path for the C compiler
Change
CC = /usr/local/bin/gcc
to
CC = {PATH}/gcc
as stated in the SYSTEM REQUIREMENTS section, you do not need to
necessarily use gcc to compile.
4) Make any needed changes in the config.h to customize swish-e to
your site.
5) Compile the swish-e executable from the src/ directory
%cd src
%make
6) Run a quick test to make sure that the executable is working
correctly.
%make test
If the test is successful, you should see the
following:
sunsite% make test
./swish-e -i test.html -v -f index.swish
Checking file "test.html"...
test.html (41 words)
Removing very common words... no words removed.
Writing main index... 25 unique words indexed.
Writing file index... 1 file indexed.
Running time: Less than a second.
Indexing done!
./swish-e -f index.swish -w test
# SWISH format 1.x
# Search words: test
# Name: (no name)
# Saved as: index.swish
# Counts: 25 words, 1 files
# Indexed on: 13/08/97 10:40:54 PDT
# Description: (no description)
# Pointer: (no pointer)
# Maintained by: (no maintainer)
1000 test.html "If you are seeing this, the test was successful!" 358
.
7) Move the swish-e executable to /usr/local/bin
8) For instructions about how to index and search with swish-e check
the Quick Start (http://sunsite.berkeley.edu/SWISH-E/Manual/quickstart.html)
9) The user.config file provided in the distribution is set for use with the FILESYSTEM.
If you are using HTTP, comment out the directory of FILESYSTEM ONLY and set the ones for
HTTP ONLY.
QUESTIONS AND TROUBLESHOOTING
If you run in any problem during installation, or if you have any
question about swish-e use and features, let us know through our
discussion group(http://sunsite.berkeley.edu/SWISH-E/Discussion
Giulia Hill
Library System Office
UC Berkeley Libraries
ghill@library.berkeley.edu
8/13/97
Last updated 12/10/***
近期下载者:
相关文件:
收藏者: