k50RVZDojs03

所属分类:搜索引擎
开发工具:Java
文件大小:771KB
下载次数:41
上传日期:2008-03-16 13:26:44
上 传 者晓风之鸣
说明:  基于IKAnalyzer分词算法的准商业化Lucene中文分词器
(IKAnalyzer segmentation algorithm based on quasi-commercial Lucene Chinese Word Breaker)

文件列表:
dict\connector.dic (23, 2007-08-02)
dict\count.dic (1284, 2007-08-02)
dict\c_number.dic (78, 2007-08-02)
dict\noisechar.dic (230, 2007-08-06)
dict\number_sign.dic (12, 2007-08-02)
dict\other_digit.dic (135, 2007-08-02)
dict\local\local.dic (20013, 2007-08-02)
dict\stopword\stopword.dic (702, 2007-08-07)
dict\suffix\suffix.dic (144, 2007-08-14)
dict\word\wordbase.dic (1769270, 2007-10-01)
org\mira\lucene\analysis\IKTokenizer.java (23513, 2007-12-14)
org\mira\lucene\analysis\IK_CAnalyzer.java (2340, 2007-12-14)
org\mira\lucene\analysis\MIK_CAnalyzer.java (1938, 2007-12-14)
org\mira\lucene\analysis\MTokenDelegate.java (838, 2007-12-14)
org\mira\lucene\analysis\TokenDelegate.java (1813, 2007-12-14)
org\mira\lucene\analysis\dict\Dictionary.java (16924, 2007-10-01)
org\mira\lucene\analysis\dict\DictSegment.java (2858, 2007-12-14)
org\mira\lucene\analysis\dict\Hit.java (1531, 2007-12-14)
org\mira\lucene\analysis\dict\WordType.java (1296, 2007-12-14)
org\mira\lucene\analysis\dict\Dictionary.class (12204, 2007-12-14)
org\mira\lucene\analysis\dict\DictSegment.class (2604, 2007-12-14)
org\mira\lucene\analysis\dict\Hit.class (1467, 2007-12-14)
org\mira\lucene\analysis\dict\WordType.class (1179, 2007-12-14)
org\mira\lucene\analysis\IKTokenizer.class (10518, 2007-12-14)
org\mira\lucene\analysis\IK_CAnalyzer.class (2530, 2007-12-14)
org\mira\lucene\analysis\MIK_CAnalyzer.class (2342, 2007-12-14)
org\mira\lucene\analysis\MTokenDelegate.class (669, 2007-12-14)
org\mira\lucene\analysis\TokenDelegate.class (1650, 2007-12-14)
org\mira\lucene\analysis\dict (0, 2008-01-31)
org\mira\lucene\analysis (0, 2008-01-31)
org\mira\lucene (0, 2008-01-31)
dict\local (0, 2008-01-31)
dict\stopword (0, 2008-01-31)
dict\suffix (0, 2008-01-31)
dict\word (0, 2008-01-31)
org\mira (0, 2008-01-31)
dict (0, 2008-01-31)
org (0, 2008-01-31)

近期下载者

相关文件


收藏者