kaggle_bbc_news

所属分类:数学计算
开发工具:Jupyter Notebook
文件大小:0KB
下载次数:0
上传日期:2023-10-07 21:51:31
上 传 者sh-1993
说明:  新闻文章分类。非负矩阵分解与监督学习。,
(News articles classification. Non-negative Matrix Factorization vs Supervised Learning.,)

文件列表:
bbc_news_classification_nmf_vs_sl.ipynb (137990, 2023-10-10)

# BBC News articles classification: Non-negative Matrix Factorization vs Supervised Learning ## Abstract This study presents a fraction of an analysis of a BBC News dataset, encompassing Exploratory Data Analysis (EDA) and preprocessing stages, followed by a performance comparison of Non-Negative Matrix Factorization (NMF) against various supervised learning (SL) algorithms. The dataset comprises articles' texts and their categories: business, sport, tech, politics and entartainment. The results of this study showed that SL algorithms such as SVM and Random Forest (RF) scored better than NMF in terms of accuracy, but were completely outperformed by NMF in terms of computational speed. Additionally the NMF provided surprising results, when on every sample size, 50%, 20% and 10% of the train dataset, it got test scores on par with the train scores. SVM and RF models while resulting in higher accuracy than NMF across sample sizes, got more prominent overfitting problem with the data size reduction.

近期下载者

相关文件


收藏者