5953281

所属分类:图形图象
开发工具:C/C++
文件大小:66KB
下载次数:3
上传日期:2018-02-05 19:06:30
上 传 者wiypmf
说明:  最大概率分词法,这种分词算法能够较好的解决汉语分词中的歧义问题,但分词效率比最大匹配分词算法要低

文件列表:
test\test1.TXT (32151, 1997-12-17)
test\test2.txt (1795, 2002-11-22)
test\test3.txt (4868, 2002-10-30)
test\test4.txt (120, 2002-11-22)
Config.INI (223, 2002-11-22)
KChildFrm.h (1397, 2002-10-24)
MainFrm.h (1463, 2002-10-30)
MPWordSeg.h (352, 2002-10-30)
MyDictionary.h (3071, 2002-10-30)
MyFileApp.h (244, 2002-10-24)
Resource.h (782, 2002-10-30)
StdAfx.h (1054, 2002-10-24)
WordSeg.h (1367, 2002-10-24)
WordSegDoc.h (1486, 2002-10-24)
WordSegView.h (1911, 2002-10-24)
g0ChildFrm.cpp (1534, 2002-10-24)
MainFrm.cpp (3061, 2002-11-01)
MPWordSeg.cpp (6941, 2002-11-22)
MyDictionary.cpp (9373, 2002-11-15)
MyFileApp.cpp (1782, 2002-11-08)
StdAfx.cpp (209, 2002-10-24)
WordSeg.cpp (5485, 2002-11-13)
WordSegDoc.cpp (1777, 2002-10-24)
WordSegView.cpp (2817, 2002-10-24)
res\Toolbar.bmp (1078, 2002-10-24)
WordSeg.aps (45652, 2002-10-30)
WordSeg.clw (2882, 2002-10-30)
WordSeg.dsp (4997, 2002-10-30)
WordSeg.dsw (537, 2002-10-24)
res\WordSeg.ico (1078, 2002-10-24)
res\WordSegDoc.ico (1078, 2002-10-24)
WordSeg.ncb (107520, 2002-11-22)
WordSeg.opt (48640, 2002-11-22)
WordSeg.plg (2435, 2002-11-22)
WordSeg.rc (13698, 2002-10-30)
res\WordSeg.rc2 (399, 2002-10-24)
res (0, 2017-12-01)
test (0, 2017-12-01)

1 本程序说明了用最大概率法进行分词处理的一般过程 2 用户可以修改config.ini文件中的值 3 用于测试的三个文件中: test1是小学语文课本语料 test2是按句分行的语料 test3是包含歧义串的语料

近期下载者

相关文件


收藏者