suder

所属分类:自然语言处理
开发工具:Others
文件大小:590800KB
下载次数:0
上传日期:2018-03-25 15:36:10
上 传 者sh-1993
说明:  suder,土耳其文本分类新闻集
(suder,Turkish News Collections for Text Categorization)

文件列表:
LICENSE (1064, 2018-03-25)
suder-v1.tar.gz.partaa (104857600, 2018-03-25)
suder-v1.tar.gz.partab (104857600, 2018-03-25)
suder-v1.tar.gz.partac (104857600, 2018-03-25)
suder-v1.tar.gz.partad (104857600, 2018-03-25)
suder-v1.tar.gz.partae (104857600, 2018-03-25)
suder-v1.tar.gz.partaf (80688886, 2018-03-25)

# SuDer Turkish News Collections for Text Categorization ## Unarchiving Join the parts: ``` cat suder-v1.tar.gz.parta* > suder-v1.tar.gz ``` Decompress: ``` tar zxfv suder-v1.tar.gz ``` ## Corpus info There are two different collections obtained from www.cumhuriyet.com.tr annd www.sabah.com.tr, categories are different for these two different collections csv files contain the meta information including the titles, json files contain texts without the titles. TextId column in csv files map to the corresponding key in the json files. Data that do not contain category information are removed from the meta data but kept in the json files.

近期下载者

相关文件


收藏者