clucene-0.9.10
所属分类:搜索引擎
开发工具:Visual C++
文件大小:1790KB
下载次数:23
上传日期:2007-01-15 03:38:19
上 传 者:
nmlgb11
说明: 一个搜索引擎的基本代码,其算法有一定的参考价值,与大家共享 - A search engine basic code, its algorithm has the certain reference value, shares with everybody ,Visual C++,搜索引擎/Search Engine
(a search engine's basic code, the algorithm has some reference value. and share- A search engine basic code, its algorithm has the certain reference value, shares with everybody, Visual C, Search engines/Search Engine)
文件列表:
clucene-0.9.10\aclocal.m4 (242376, 2005-11-09)
clucene-0.9.10\APACHE.license (615, 2005-11-02)
clucene-0.9.10\AUTHORS (1139, 2005-11-08)
clucene-0.9.10\autogen.sh (1661, 2005-11-08)
clucene-0.9.10\ChangeLog (0, 2005-07-21)
clucene-0.9.10\config\config.guess (43728, 2005-11-09)
clucene-0.9.10\config\config.sub (31043, 2005-11-09)
clucene-0.9.10\config\depcomp (14841, 2005-11-09)
clucene-0.9.10\config\install-sh (9208, 2005-11-09)
clucene-0.9.10\config\ltmain.sh (182847, 2005-11-09)
clucene-0.9.10\config\missing (10678, 2005-11-09)
clucene-0.9.10\config.h.tmp.in (7422, 2005-11-09)
clucene-0.9.10\configure (838454, 2005-11-09)
clucene-0.9.10\configure.ac (7113, 2005-11-09)
clucene-0.9.10\COPYING (25030, 2005-07-24)
clucene-0.9.10\doc\benchmarks.txt (3296, 2005-11-02)
clucene-0.9.10\doc\clucene.jpg (7432, 2005-07-27)
clucene-0.9.10\doc\helpfooter.htm (101, 2005-11-02)
clucene-0.9.10\doc\helpheader.htm (451, 2005-11-09)
clucene-0.9.10\doc\helpheader.htm.in (460, 2005-11-02)
clucene-0.9.10\doc\Makefile.am (162, 2005-07-27)
clucene-0.9.10\doc\Makefile.in (9157, 2005-11-09)
clucene-0.9.10\doc\tests.txt (677, 2005-11-08)
clucene-0.9.10\doc\users.txt (631, 2005-10-23)
clucene-0.9.10\Doxyfile.in (39277, 2005-08-05)
clucene-0.9.10\HACKING (2581, 2005-11-02)
clucene-0.9.10\INSTALL (2199, 2005-10-11)
clucene-0.9.10\Jamfile (129, 2005-07-21)
clucene-0.9.10\Jamfile.v2 (348, 2005-07-21)
clucene-0.9.10\m4\ac_cxx_have_std.m4 (557, 2005-11-02)
clucene-0.9.10\m4\ac_cxx_have_stl.m4 (617, 2005-11-02)
clucene-0.9.10\m4\ac_cxx_have_wctype_h.m4 (534, 2005-11-02)
clucene-0.9.10\m4\ac_cxx_namespaces.m4 (494, 2005-11-02)
clucene-0.9.10\m4\ac_define_integer_bits.m4 (956, 2005-11-02)
clucene-0.9.10\m4\ax_config_feature.m4 (1399, 2005-11-02)
clucene-0.9.10\m4\ax_prefix_config_h.m4 (7014, 2005-10-04)
clucene-0.9.10\m4\dps_float_byte.m4 (2697, 2005-11-02)
clucene-0.9.10\m4\dps_snprintf_oflow.m4 (1141, 2005-11-02)
clucene-0.9.10\m4\dps_swprintf_works.m4 (848, 2005-11-02)
clucene-0.9.10\m4\mdl_cxx_function_try_blocks.m4 (588, 2005-11-02)
... ...
CLucene README
==============
------------------------------------------------------
CLucene is a C++ port of Lucene.
It is a high-performance, full-featured text search
engine written in C++. CLucene is faster than lucene
as it is written in C++.
------------------------------------------------------
CLucene has contributions from many, see AUTHORS
CLucene is distributed under the GNU Lesser General Public License (LGPL)
*or*
the Apache License, Version 2.0
See the LGPL and APACHE for the respective license information.
Installation
------------
* For Linux, MacOSX, cygwin and MinGW build information, read INSTALL.
* Boost.Jam files are provided in the root directory and subdirectories.
* Microsoft Visual Studio (6&7) are provided in the win32 folder.
Mailing List
------------
Questions and discussion should be directed to the CLucene mailing list
at clucene-developers@lists.sourceforge.net
Find subscription instructions at
http://lists.sourceforge.net/lists/listinfo/clucene-developers
Suggestions and bug reports can be made on our bug tracking database
(http://sourceforge.net/tracker/?group_id=80013&atid=558446)
The latest version
------------------
Details of the latest version can be found on the CLucene sourceforge project
web site: http://www.sourceforge.net/projects/clucene
Documentation
-------------
Documentation is provided at http://clucene.sourceforge.net/doc/doxygen/html/
You can also build your own documentation by running doxygen from the root directory
of clucene.
CLucene is a very close port of Java Lucene, so you can also try looking at the
Java Docs on http://lucene.apache.org/java/
Performance
-----------
Very little benchmarking has been done on clucene. Andi Vajda posted some
limited statistics on the clucene list a while ago with the following results.
There are 250 HTML files under $JAVA_HOME/docs/api/java/util for about
6108kb of HTML text.
org.apache.lucene.demo.IndexFiles with java and gcj:
on mac os x 10.3.1 (panther) powerbook g4 1ghz 1gb:
. running with java 1.4.1_01-99 : 20379 ms
. running with gcj 3.3.2 -O2 : 17842 ms
. running clucene 0.8.9's demo : 9930 ms
I recently did some more tests and came up with these rough tests:
663mb (797 files) of Guttenberg texts
on a Pentium 4 running Windows XP with 1 GB of RAM. Indexing max 100,000 fields
Jlucene: ******53ms. peak mem usage ~72mb, avg ~14mb ram
Clucene: 232141. peak mem usage ~60, avg ~4mb ram
Searching indexing using 10,000 single word queries
Jlucene: ~60078ms and used ~13mb ram
Clucene: ~48359ms and used ~4.2mb ram
近期下载者:
相关文件:
收藏者: