naver-finance-news-crawler

所属分类:金融证券系统
开发工具:Python
文件大小:32KB
下载次数:0
上传日期:2022-03-26 06:43:32
上 传 者sh-1993
说明:  导航财经新闻爬虫
(naver-finance-news-crawler,Naver Finance News Crawler)

文件列表:
crawler (0, 2022-03-26)
crawler\__init__.py (85, 2022-03-26)
crawler\crawler.py (14960, 2022-03-26)
crawler\utils.py (1515, 2022-03-26)
examples (0, 2022-03-26)
examples\01-main_news_crawling.ipynb (10201, 2022-03-26)
examples\02-process_after_crawling.ipynb (10220, 2022-03-26)
examples\03-query_news_crawling_v01.ipynb (14683, 2022-03-26)
examples\03-query_news_crawling_v02.ipynb (20787, 2022-03-26)
examples\03-query_news_crawling_v03.ipynb (8594, 2022-03-26)
examples\04-insert_db_after_crawling_v01.ipynb (7110, 2022-03-26)
main.py (846, 2022-03-26)
requirements.txt (44, 2022-03-26)

# Naver-Finance-News-Crawler - Code for Naver Finance (Main) News Crawling([link](https://finance.naver.com/news/mainnews.nhn)) ## Usage ``` python main.py --start-date "2021-06-30" --end-date "2021-07-01" --save-dir "data" ``` ## Directories ``` crawler ├── __init__.py ├── crawler.py # MainNewsClawer class └── utils.py # util for crawling examples └── 01-main_news_crawling.ipynb # tutorial notebook main.py # main python file requirements.txt README.md ``` ## Result Example - Result is list of dictionary ``` [ {'press': , 'date': , 'time': , 'title': , 'link': , 'text': (), 'cleaned_text': } ] ``` ``` {'press': '', 'date': '2021-06-30', 'time': '02:00:12', 'title': '[] ', 'link': 'https://finance.naver.com/news/news_read.nhn?article_id=0012492807&office_id=001&mode=mainnews&type=&date=2021-06-30&page=1', 'text': '\n (=) = 29() . CAC40 0.14% 6,567.43, FTSE 100 0.21% 7,087.55 . Stoxx 50 0.43% 4,107.51, DAX30 0.88% 15,690.59 . 6 21 . ING " 2000 5 " . 6 . merciel@yna.co.kr\n', 'cleaned_text': [' 29 .', ' 40 0.14% 6567.43 100 0.21% 7087.55 .', ' 50 0.43% 4107.51 30 0.88% 15690.59 .', ' 6 21 .', ' 2000 5 .', ' 6 .']} ```

近期下载者

相关文件


收藏者