V3.0
Category: Other
Development tool: Visual C++
File size: 3376KB
Downloads: 52
Upload date: 2013-02-27 14:15:35
Uploader: zws520521
Description: Text classification, including text preprocessing, stop-word removal, learning and training, and finally classification.
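As a rough illustration of the steps named in the description, the sketch below removes stop words, keeps terms by document frequency, and writes each document as a line in the libSVM sparse format (`label index:value ...`). It is only a minimal sketch under stated assumptions: the stop-word list, the sample documents, the `min_df` threshold, and all function names are hypothetical, and nothing here is taken from the packaged executables, whose internals are not shown in this listing. Word segmentation and the SVM training step itself are left out.

```python
from collections import Counter

# Hypothetical stop-word list and pre-segmented documents (label, tokens);
# the real tool's data files and formats are not shown in this listing.
STOP_WORDS = {"the", "a", "of", "is"}
DOCS = [
    (1, ["the", "stock", "price", "rose"]),
    (1, ["stock", "market", "is", "strong"]),
    (-1, ["the", "price", "of", "stock", "fell"]),
]


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def select_features_by_df(docs, min_df=2):
    """Keep terms found in at least min_df documents; map each to a 1-based index."""
    df = Counter()
    for _, tokens in docs:
        df.update(set(tokens))  # count each term once per document
    kept = sorted(term for term, count in df.items() if count >= min_df)
    return {term: i for i, term in enumerate(kept, start=1)}


def to_libsvm_line(label, tokens, vocab):
    """Format one document as 'label index:value ...' with term-frequency values."""
    tf = Counter(t for t in tokens if t in vocab)
    pairs = sorted((vocab[t], n) for t, n in tf.items())  # ascending feature index
    return " ".join([str(label)] + [f"{i}:{n}" for i, n in pairs])


cleaned = [(label, remove_stop_words(tokens)) for label, tokens in DOCS]
vocab = select_features_by_df(cleaned)
lines = [to_libsvm_line(label, tokens, vocab) for label, tokens in cleaned]
print("\n".join(lines))
```

Lines in this format can be saved to a file and passed to a libSVM training program; document frequency is used here only because it is simple to show, and other selection criteria would slot into `select_features_by_df` the same way.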
File list:
文本分类程序(利用libSVM)V3.0\Configure.xml (716, 2009-02-03)
文本分类程序(利用libSVM)V3.0\Data\BiWord.big (3520144, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\charset.type (65540, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\CoreDict.pdat (1696620, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\CoreDict.pos (1786424, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\CoreDict.unig (478168, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\FieldDict.pdat (262236, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\FieldDict.pos (72, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\GranDict.pdat (1978128, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\GranDict.pos (1778776, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\ICTCLAS30.ctx (37253, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\ICTCLAS_First.map (288, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\ICTPOS.map (406, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\nr.ctx (2213, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\nr.fsa (3008, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\nr.role (1757200, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\PKU.map (307, 2009-01-16)
文本分类程序(利用libSVM)V3.0\Data\PKU_First.map (288, 2009-01-16)
文本分类程序(利用libSVM)V3.0\dict.txt (459496, 2010-03-05)
文本分类程序(利用libSVM)V3.0\feature\3.txt (2695, 2010-06-18)
文本分类程序(利用libSVM)V3.0\feature\4.txt (865, 2010-06-18)
文本分类程序(利用libSVM)V3.0\feature\5.txt (3780, 2010-06-18)
文本分类程序(利用libSVM)V3.0\feature\6.txt (484, 2010-06-18)
文本分类程序(利用libSVM)V3.0\featureselection.exe (192598, 2007-08-18)
文本分类程序(利用libSVM)V3.0\getFeature.exe (61440, 2010-03-05)
文本分类程序(利用libSVM)V3.0\getRandFile.exe (249856, 2010-03-06)
文本分类程序(利用libSVM)V3.0\getSVMfeture(df).exe (163840, 2010-06-18)
文本分类程序(利用libSVM)V3.0\getSVMTtrain.exe (86016, 2010-03-06)
文本分类程序(利用libSVM)V3.0\ICTCLAS30.dll (262144, 2009-12-30)
文本分类程序(利用libSVM)V3.0\ICTCLAS30.log (128, 2010-06-18)
文本分类程序(利用libSVM)V3.0\mergeFile.bat (157, 2010-03-06)
文本分类程序(利用libSVM)V3.0\seg\3.txt (3884, 2010-06-18)
文本分类程序(利用libSVM)V3.0\seg\4.txt (706, 2010-06-18)
文本分类程序(利用libSVM)V3.0\seg\5.txt (4085, 2010-06-18)
文本分类程序(利用libSVM)V3.0\seg\6.txt (380, 2010-06-18)
文本分类程序(利用libSVM)V3.0\seg\7.txt (2713, 2010-06-18)
文本分类程序(利用libSVM)V3.0\seg\8.txt (2665, 2010-06-18)
文本分类程序(利用libSVM)V3.0\seg\9.txt (738, 2010-06-18)
... ...
select feature for SVM tool Version 2.0
@author pingpeace
@e-mail jpshen2008@gmail.com
@date 2010-5-2
Notes:
1. If you want to use the tool, please read "readme文本分类的主要流程V2.0" first.
2. You should install VS2005 or the .NET Framework.
3. The path must not contain Chinese characters.
4. If you use the tool in experiments and publish a paper, I hope you will list "Jianping Shen, XuanWang, Wenxiao Zhang, Jiajia Zhang , Machining Learning Based Sentiment Analysis for Finance News Mining, IIH-MSP-2010, 2010" in your references.
Dear All:
Thank you for your interest in the tool and for using it. After I put the tool on my blog for download, I received e-mails from many users. Thank you for reporting the problems you ran into and for giving me so much advice. Your support is what drives me to improve it.
In version 2.0, many problems have been solved. You are welcome to download it, and if you have any problems or advice, please feel free to e-mail me or leave a comment on the blog.
Because I am so busy, I have not been able to debug the tool thoroughly or reduce the runtime of getSVMtrain. Many users ask why I divided the tool into many parts. I did so because I think it makes each part easier to use separately.
If you want to use this tool, I think you should first learn some basics, such as what classification is and what a feature is. If you want the source code, please e-mail me. Meanwhile, you can find the webCrawler and other shared tools developed by me on SourceForge.
Thank you to the many users who call me Prof. or Dr., and to the companies that have invited me. But I am a master's student now, so please don't call me Prof. or Dr. I am currently looking for a PhD position in HK, so if you want to study in HK, you are welcome to contact me. If you want to learn more about me, please see my homepage (http://cs.hitsz.edu.cn/xiaoyou/shenjianping.htm).
My research is on NLP, applications of NLP, and information retrieval. If you are interested in these topics, you are welcome to discuss them with me.