Chinese--NER

Category: Speech synthesis
Development tool: Java
File size: 26KB
Downloads: 189
Upload date: 2011-02-28 20:32:12
Uploader: wlzmwq
Description: A CRF-based system for recognizing Chinese organization names. It uses the 1998 People's Daily corpus from Peking University as training data. In addition to the commonly used feature templates and part-of-speech features, it uses the last character of each word as a feature, which improves organization-name recognition accuracy. The model is trained by calling the CRF++ package.
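
The last-character feature described above could be prepared roughly as follows. This is a minimal sketch, not the code shipped in the archive: the class name `LastCharFeatureSketch`, its methods, the column order (word, part of speech, last character, label) and the `B-ORG`/`I-ORG`/`O` label scheme are all assumptions made for illustration. It only shows how each segmented word might be written as one tab-separated row in the column format that CRF++ reads, with an empty line closing the sentence.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not the archive's CRFWord.java or OrgName.java.
class LastCharFeatureSketch {

    // Returns the final character of a word as a string (code-point safe).
    static String lastChar(String word) {
        if (word == null || word.isEmpty()) {
            throw new IllegalArgumentException("word must be non-empty");
        }
        int codePoint = word.codePointBefore(word.length());
        return new String(Character.toChars(codePoint));
    }

    // Formats one token as a row: word, POS tag, last character, label.
    // The column order is an assumption, not taken from the archive.
    static String toCrfRow(String word, String pos, String label) {
        return word + "\t" + pos + "\t" + lastChar(word) + "\t" + label;
    }

    // Converts one sentence to rows and appends the empty line
    // that separates sentences in CRF++ training files.
    static List<String> toCrfRows(String[] words, String[] pos, String[] labels) {
        if (words.length != pos.length || words.length != labels.length) {
            throw new IllegalArgumentException("arrays must have equal length");
        }
        List<String> rows = new ArrayList<String>();
        for (int i = 0; i < words.length; i++) {
            rows.add(toCrfRow(words[i], pos[i], labels[i]));
        }
        rows.add("");
        return rows;
    }

    public static void main(String[] args) {
        // Toy sentence with assumed tags and labels, for demonstration only.
        String[] words = {"北京", "大学", "位于", "海淀区"};
        String[] pos = {"ns", "n", "v", "ns"};
        String[] labels = {"B-ORG", "I-ORG", "O", "O"};
        for (String row : toCrfRows(words, pos, labels)) {
            System.out.println(row);
        }
    }
}
```

With rows laid out this way, a CRF++ template line such as `U20:%x[0,2]` would refer to the last-character column of the current token. Whether the archive's template.txt uses this column index is not known from the listing.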

File list:
OrgName\.classpath (232, 2011-02-28)
OrgName\.project (383, 2011-02-28)
OrgName\bin\CRFWord.class (1942, 2011-02-28)
OrgName\bin\OrgName.class (2425, 2011-02-28)
OrgName\bin\Sentence.class (2172, 2011-02-28)
OrgName\bin\corpus.txt (24849, 2011-02-28)
OrgName\bin\evaluation.class (3414, 2011-02-28)
OrgName\bin\template.txt (92, 2011-02-28)
OrgName\src\CRFWord.java (1173, 2011-02-28)
OrgName\src\OrgName.java (1773, 2011-02-28)
OrgName\src\Sentence.java (2012, 2011-02-28)
OrgName\src\corpus.txt (24849, 2011-02-28)
OrgName\src\evaluation.java (3354, 2011-02-28)
OrgName\src\template.txt (92, 2011-02-28)
