Nlpir

所属分类:Java编程
开发工具:Java
文件大小:4745KB
下载次数:40
上传日期:2014-01-22 16:06:18
上 传 者reacherxu
说明:  前NLPIR汉语分词系统(又名ICTCLAS2013),主要功能包括中文分词;词性标注;命名实体识别;用户词典功能;支持GBK编码、UTF8编码、BIG5编码。新增微博分词、新词发现与关键词提取;张华平博士先后倾力打造十余年,内核升级10次。国内国际排名均为第一。 项目已经配置好环境,导入Eclipse即可使用,文件内src下的TestUTF8.java可以直接运行,提供了分词接口
(Before NLPIR Chinese word segmentation system (aka ICTCLAS2013), main features include Chinese word speech tagging named entity recognition User dictionary function support GBK encoding, UTF8 encoding, BIG5 encoding. New microblogging word, keyword extraction and discovery of new words Dr. Zhang Huaping has more than ten years effort to build a kernel upgrade 10 times. Both domestic and international ranked first. Project has been configured environment, Eclipse can be used to import, TestUTF8.java the file can be run directly under src, the interface provides word)

文件列表:
Nlpir\.classpath (302, 2014-01-14)
Nlpir\.project (381, 2014-01-14)
Nlpir\.settings\org.eclipse.core.resources.prefs (86, 2014-01-14)
Nlpir\.settings\org.eclipse.jdt.core.prefs (629, 2014-01-14)
Nlpir\bin\kevin\zhang\NLPIR.class (1164, 2014-01-14)
Nlpir\bin\TestUTF8.class (1788, 2014-01-15)
Nlpir\bin\WordSeperation.class (1849, 2014-01-15)
Nlpir\file\Data\BIG2GBK.map (286196, 2014-01-14)
Nlpir\file\Data\BIG5.pdat (468456, 2014-01-14)
Nlpir\file\Data\BIG5.wordlist (158695, 2014-01-14)
Nlpir\file\Data\BiWord.big (3520144, 2014-01-14)
Nlpir\file\Data\charset.type (65540, 2014-01-14)
Nlpir\file\Data\Configure.xml (856, 2014-01-14)
Nlpir\file\Data\CoreDict.pdat (1696620, 2014-01-14)
Nlpir\file\Data\CoreDict.pos (1786424, 2014-01-14)
Nlpir\file\Data\CoreDict.unig (478168, 2014-01-14)
Nlpir\file\Data\FieldDict.pdat (262236, 2014-01-14)
Nlpir\file\Data\FieldDict.pos (72, 2014-01-14)
Nlpir\file\Data\GBK.pdat (549204, 2014-01-14)
Nlpir\file\Data\GBK.wordlist (166985, 2014-01-14)
Nlpir\file\Data\GBK2BIG.map (286196, 2014-01-14)
Nlpir\file\Data\GBK2GBKC.map (286196, 2014-01-14)
Nlpir\file\Data\GBK2UTF.map (286196, 2014-01-14)
Nlpir\file\Data\GBKA.pdat (550848, 2014-01-14)
Nlpir\file\Data\GBKA.wordlist (166985, 2014-01-14)
Nlpir\file\Data\GBKA2UTF.map (286196, 2014-01-14)
Nlpir\file\Data\GBKC.pdat (550848, 2014-01-14)
Nlpir\file\Data\GBKC.wordlist (166985, 2014-01-14)
Nlpir\file\Data\GBKC2GBK.map (286196, 2014-01-14)
Nlpir\file\Data\GranDict.pdat (1978128, 2014-01-14)
Nlpir\file\Data\GranDict.pos (1778776, 2014-01-14)
Nlpir\file\Data\ICTPOS.map (406, 2014-01-14)
Nlpir\file\Data\NewWord.lst (126, 2014-01-14)
Nlpir\file\Data\NLPIR.ctx (37253, 2014-01-14)
Nlpir\file\Data\NLPIR.user (3356, 2014-01-14)
Nlpir\file\Data\NLPIR_First.map (288, 2014-01-14)
Nlpir\file\Data\nr.ctx (2213, 2014-01-14)
Nlpir\file\Data\nr.fsa (3008, 2014-01-14)
Nlpir\file\Data\nr.role (1757200, 2014-01-14)
Nlpir\file\Data\PKU.map (307, 2014-01-14)
... ...

近期下载者

相关文件


收藏者