news-scrapers-puppeteer
所属分类:DevOps
开发工具:TypeScript
文件大小:0KB
下载次数:0
上传日期:2023-08-27 11:00:43
上 传 者:
sh-1993
说明: 刮新闻木偶,,
(news-scrapers-puppeteer,,)
文件列表:
.env.EXAMPLE (32, 2023-10-06)
.mocharc.json (35, 2023-10-06)
Dockerfile (723, 2023-10-06)
error_extract_new.png (791361, 2023-10-06)
logs.txt (1157667, 2023-10-06)
package-lock.json (260170, 2023-10-06)
package.json (1862, 2023-10-06)
src/ (0, 2023-10-06)
src/.env.EXAMPLE (32, 2023-10-06)
src/ApiManager.ts (1629, 2023-10-06)
src/JobRunner.ts (1133, 2023-10-06)
src/PersistenceManager.ts (5721, 2023-10-06)
src/ScraperApp.ts (13469, 2023-10-06)
src/config/ (0, 2023-10-06)
src/config/scrapingConfigFull.json (61158, 2023-10-06)
src/config/scrapingConfigFullOld.json (13680, 2023-10-06)
src/jobRunnerScript.ts (126, 2023-10-06)
src/mainScript.ts (119, 2023-10-06)
src/managers/ (0, 2023-10-06)
src/managers/ApiManager.ts (1265, 2023-10-06)
src/managers/PersistenceManager.ts (5734, 2023-10-06)
src/managers/UtilsManager.ts (269, 2023-10-06)
src/models/ (0, 2023-10-06)
src/models/GlobalConfig.ts (193, 2023-10-06)
src/models/GlobalConfigSql.ts (793, 2023-10-06)
src/models/NewScraped.ts (420, 2023-10-06)
src/models/NewScrapedSql.ts (3765, 2023-10-06)
src/models/ReviewScraped.ts (242, 2023-10-06)
src/models/ScrapingConfig.ts (359, 2023-10-06)
src/models/ScrapingIndex.ts (481, 2023-10-06)
src/models/ScrapingIndexSql.ts (4382, 2023-10-06)
src/models/ScrapingUrlSql.ts (537, 2023-10-06)
src/models/sequelizeConfig.ts (1493, 2023-10-06)
src/scrapers/ (0, 2023-10-06)
src/scrapers/AbcContentScraper.ts (8957, 2023-10-06)
src/scrapers/AbcIndexScraper.ts (3153, 2023-10-06)
src/scrapers/ContentScraper.ts (384, 2023-10-06)
src/scrapers/ElDiarioesContentScraper.ts (8436, 2023-10-06)
... ...
# News scraper
#
docker build . -t news-scraper
docker stop -t news-scraper
docker run -d --name news-scraper --restart always --network=host news-scraper
docker logs --follow news-scraper
docker stop $(docker ps -a -q)
docker rm $(docker ps -a -q)
run single test using it(..) handle:
npm test -- --grep "NewYorkTimesContentScraper"
npm test -- --grep "GuardianNewContentScraper"
npm test -- --grep "ElDiarioesContentScraper"
npm test -- --grep "PublicoContentScraper"
npm test -- --grep "ElMundoContentScraper"
npm test -- --grep "ElHeraldoSoriaContentScraper"
npm test -- --grep "ScienceNewsContentScraper"
npm test -- --grep "AbcContentScraper"
npm test -- --grep "ElDiarioesIndexScraper"
npm test -- --grep "ElPaisIndexScraper"
npm test -- --grep "PublicoIndexScraper"
npm test -- --grep "GuardianIndexScraper"
npm test -- --grep "NewYorkTimesIndexScraper"
npm test -- --grep "ElHeraldoSoriaIndexScraper"
npm test -- --grep "ScienceNewsIndexScraper"
npm test -- --grep "AbcIndexScraper"
近期下载者:
相关文件:
收藏者: