naver-finance-news-crawler
所属分类:金融证券系统
开发工具:Python
文件大小:32KB
下载次数:0
上传日期:2022-03-26 06:43:32
上 传 者:
sh-1993
说明: 导航财经新闻爬虫
(naver-finance-news-crawler,Naver Finance News Crawler)
文件列表:
crawler (0, 2022-03-26)
crawler\__init__.py (85, 2022-03-26)
crawler\crawler.py (14960, 2022-03-26)
crawler\utils.py (1515, 2022-03-26)
examples (0, 2022-03-26)
examples\01-main_news_crawling.ipynb (10201, 2022-03-26)
examples\02-process_after_crawling.ipynb (10220, 2022-03-26)
examples\03-query_news_crawling_v01.ipynb (14683, 2022-03-26)
examples\03-query_news_crawling_v02.ipynb (20787, 2022-03-26)
examples\03-query_news_crawling_v03.ipynb (8594, 2022-03-26)
examples\04-insert_db_after_crawling_v01.ipynb (7110, 2022-03-26)
main.py (846, 2022-03-26)
requirements.txt (44, 2022-03-26)
# Naver-Finance-News-Crawler
- Code for Naver Finance (Main) News Crawling([link](https://finance.naver.com/news/mainnews.nhn))
## Usage
```
python main.py --start-date "2021-06-30" --end-date "2021-07-01" --save-dir "data"
```
## Directories
```
crawler
├── __init__.py
├── crawler.py # MainNewsClawer class
└── utils.py # util for crawling
examples
└── 01-main_news_crawling.ipynb # tutorial notebook
main.py # main python file
requirements.txt
README.md
```
## Result Example
- Result is list of dictionary
```
[
{'press': ,
'date': ,
'time': ,
'title': ,
'link': ,
'text': (),
'cleaned_text': }
]
```
```
{'press': '',
'date': '2021-06-30',
'time': '02:00:12',
'title': '[] ',
'link': 'https://finance.naver.com/news/news_read.nhn?article_id=0012492807&office_id=001&mode=mainnews&type=&date=2021-06-30&page=1',
'text': '\n (=) = 29() . CAC40 0.14% 6,567.43, FTSE 100 0.21% 7,087.55 . Stoxx 50 0.43% 4,107.51, DAX30 0.88% 15,690.59 . 6 21 . ING " 2000 5 " . 6 . merciel@yna.co.kr\n',
'cleaned_text': [' 29 .',
' 40 0.14% 6567.43 100 0.21% 7087.55 .',
' 50 0.43% 4107.51 30 0.88% 15690.59 .',
' 6 21 .',
' 2000 5 .',
' 6 .']}
```
近期下载者:
相关文件:
收藏者: