News-Group-Unsupervised-Learning

所属分类:聚类算法
开发工具:Jupyter Notebook
文件大小:5KB
下载次数:0
上传日期:2020-12-27 20:10:17
上 传 者sh-1993
说明:  20个新闻组数据集是大约20000个新闻组文档的集合,它们(几乎)均匀地划分为一个...
(The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his Newsweeder: Learning to filter netnews paper, though he does not explicitly mention this collection. The 20)

文件列表:
UnsupervisedLearning_NewsGroups.ipynb (17946, 2020-12-28)

# News-Group-Unsupervised-Learning The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his Newsweeder: Learning to filter netnews paper, though he does not explicitly mention this collection. The 20 newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.

近期下载者

相关文件


收藏者