ch-Engine-Study-Case-News-Question-About-COVID-19
所属分类:搜索引擎
开发工具:Jupyter Notebook
文件大小:156KB
下载次数:0
上传日期:2021-06-05 18:05:59
上 传 者:
sh-1993
说明: 迷你搜索引擎(研究案例:关于新冠肺炎的新闻问题)
(Mini Search Engine (Study Case : News Question About COVID-19))
文件列表:
Mini Search Engine (Study Case : News Question About COVID-19).ipynb (59814, 2021-06-06)
news.csv (749622, 2021-06-06)
# Mini-Search-Engine-Study-Case-News-Question-About-COVID-19
Mini Search Engine (Study Case : News Question About COVID-19)
Steps :
1. Download [Dataset](https://www.kaggle.com/xhlulu/covidqa?select=news.csv) (Crawling)
2. Preprocessing Dataset
- Case Folding
- Tokenizing
- Stemming
- Stop Words
- Re Join Words
3. Indexing / Pembobotan TF-IDF
- TF - IDF (Sklearn)
4. Retrieval / Cosine Similarity
- Cosine Similarity (Sklearn)
5. Perankingan / TOP 10
- Sorting from the value closest to the similarity value
近期下载者:
相关文件:
收藏者: