nihongo-scraper
Category: Educational system applications
Development tool: Python
File size: 0KB
Downloads: 0
Upload date: 2017-07-18 12:52:28
Uploader: sh-1993
Description: Scraper to download Japanese news, quizzes, and other resources for use offline. Data is used for personal study only, and NLP is applied to isolate Kanji for reading cards, for example.
File list:
LICENSE.txt (1061, 2017-07-18)
nihongo-spider.py (736, 2017-07-18)
requirements.txt (21, 2017-07-18)
# Nihongo Scraper
Scraper to download Japanese news, quizzes, and other resources for use offline.
Data is used for personal study only, and NLP is applied to isolate Kanji for
reading cards, for example.
* nihongo-spider scrapes a known quiz site and saves the response as JSON/CSV

The URLs used are hidden, to avoid flooding the sites with requests and to keep bots
from following links from GitHub.
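
The README does not say how Kanji are isolated, so the following is only a minimal sketch of one possible approach: a plain Unicode-range filter rather than a real NLP pipeline. The function name `isolate_kanji` and the sample sentence are hypothetical, and the character ranges are an assumption about what counts as Kanji here.

```python
import re

# Assumption: "Kanji" is approximated by the CJK Unified Ideographs block
# (U+4E00-U+9FFF) plus CJK Extension A (U+3400-U+4DBF).
# Hiragana, katakana, and punctuation fall outside these ranges.
KANJI_RE = re.compile(r"[\u3400-\u4dbf\u4e00-\u9fff]")


def isolate_kanji(text):
    """Return the distinct Kanji in `text`, in order of first appearance."""
    seen = {}
    for ch in KANJI_RE.findall(text):
        # dict keys keep insertion order, which gives an ordered de-duplication
        seen.setdefault(ch, None)
    return list(seen)


if __name__ == "__main__":
    # Hypothetical sample sentence, not taken from any scraped site
    print(isolate_kanji("日本語のニュースを毎日読みます。"))
```

Each returned character could then become the front of one reading card. A morphological analyzer would be needed to group Kanji into words with readings, which this sketch does not attempt.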
## Build
```
git clone https://github.com/kinow/nihongo-scraper.git
cd nihongo-scraper
pip install -r requirements.txt
```
## Execute nihongo-spider
```
cat > .env <<EOF
.../context/path/
EOF
scrapy runspider nihongo-spider.py -o questions.json
```
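
With `-o questions.json`, Scrapy writes the scraped items as a single JSON array. As a rough sketch of how that file might be turned into CSV for a flashcard tool, the helper below converts a JSON array of flat objects into a CSV file. The function name `json_items_to_csv` is hypothetical, and it assumes every item is a flat dictionary; the actual item fields produced by nihongo-spider are not documented here.

```python
import csv
import json


def json_items_to_csv(json_path, csv_path):
    """Convert a JSON array of flat objects into a CSV file; return the row count."""
    with open(json_path, encoding="utf-8") as fh:
        items = json.load(fh)

    # Collect column names in first-seen order, since items may differ in keys.
    fieldnames = []
    for item in items:
        for key in item:
            if key not in fieldnames:
                fieldnames.append(key)

    # newline="" lets the csv module control line endings itself.
    with open(csv_path, "w", encoding="utf-8", newline="") as fh:
        # restval fills in columns that a given item does not have
        writer = csv.DictWriter(fh, fieldnames=fieldnames, restval="")
        writer.writeheader()
        writer.writerows(items)
    return len(items)
```

Calling `json_items_to_csv("questions.json", "questions.csv")` after the spider has run would produce the CSV next to the JSON output. Scrapy's feed exports can also write CSV directly when the output file name ends in `.csv`, which may be simpler when no post-processing is needed.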