scraper

Category: Data collection / web crawlers
Development tool: JavaScript
Upload date: 2023-01-04 07:16:13
Uploader: sh-1993
Description: My bachelor's thesis (at FIT CTU in Prague) on a web scraping application for non-programming users.

File list:
LICENSE (1070, 2019-08-15)
presentation/ (0, 2019-08-15)
presentation/assets/ (0, 2019-08-15)
presentation/assets/Octoparse.png (114421, 2019-08-15)
presentation/assets/Scraper_control_panel.png (616110, 2019-08-15)
presentation/assets/ukazka2.mp4 (19721363, 2019-08-15)
presentation/assets/video_snapshot.png (469104, 2019-08-15)
presentation/prezentace.pdf (21301685, 2019-08-15)
presentation/prezentace.tex (5825, 2019-08-15)
src/ (0, 2019-08-15)
src/impl/ (0, 2019-08-15)
src/impl/babel.config.js (305, 2019-08-15)
src/impl/lib/ (0, 2019-08-15)
src/impl/lib/css/ (0, 2019-08-15)
src/impl/lib/css/bootstrap.min.css (155758, 2019-08-15)
src/impl/lib/js/ (0, 2019-08-15)
src/impl/lib/js/bootstrap.min.js (58072, 2019-08-15)
src/impl/lib/js/jquery-3.3.1.min.js (86927, 2019-08-15)
src/impl/lib/js/popper.min.js (21004, 2019-08-15)
src/impl/manifest.json (1067, 2019-08-15)
src/impl/package-lock.json (213456, 2019-08-15)
src/impl/package.json (462, 2019-08-15)
src/impl/scraper.png (1181, 2019-08-15)
src/impl/src/ (0, 2019-08-15)
src/impl/src/backgroundScripts/ (0, 2019-08-15)
src/impl/src/backgroundScripts/background.html (138, 2019-08-15)
src/impl/src/backgroundScripts/background.js (3093, 2019-08-15)
src/impl/src/constants.js (1276, 2019-08-15)
src/impl/src/contentScripts/ (0, 2019-08-15)
src/impl/src/contentScripts/communication.js (4797, 2019-08-15)
src/impl/src/contentScripts/contentScript.css (2687, 2019-08-15)
src/impl/src/contentScripts/contentScript.js (1056, 2019-08-15)
src/impl/src/contentScripts/controller.js (3339, 2019-08-15)
src/impl/src/contentScripts/dataEngine/ (0, 2019-08-15)
src/impl/src/contentScripts/dataEngine/dataEngine.js (2411, 2019-08-15)
src/impl/src/contentScripts/dataEngine/exporter.js (1692, 2019-08-15)
src/impl/src/contentScripts/mainPanelController.js (2710, 2019-08-15)
src/impl/src/contentScripts/previewTable/ (0, 2019-08-15)
... ...

# Web application for online web scraping

#### Author: Jakub Draho

## Project structure

All the source files are located inside the `src/` folder: `src/impl/` contains the code of the application, whereas `src/thesis/` contains the source files of the thesis in LaTeX format. The compiled text (PDF and PS formats) is in the `text/` folder.

## Installation of the app

1. Open the Google Chrome browser.
2. Go to `chrome://extensions/`, or alternatively click on the controls menu (in the top right corner) -> "More tools" -> "Extensions".
3. Turn on Developer mode (the switch is located in the top right corner).
4. Click the `Load unpacked` button in the top left corner.
5. Select the whole `src/impl/` folder in the file dialog that opens.
6. The extension should now be installed and ready to use.

## Running the tests

1. Make sure the npm package manager is installed: `npm --version`
2. Go to the folder with the source files of the app: `cd src/impl/`
3. Install all the required packages: `npm install`
4. Run the tests: `npm test`

## Compilation of the text

1. Go to the folder with the source files of the thesis: `cd src/thesis/`
2. Run `pdflatex thesis.tex` to compile the thesis into PDF format.
3. For PS format, run `pdf2ps thesis.pdf` (once the PDF file exists).
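
The command-line steps for running the tests and compiling the text can be collected into a small shell helper. The following is only a sketch and not part of the repository: the function names `require_cmd`, `run_tests`, and `build_thesis` are made up for illustration, and the functions assume they are called from the repository root, where `src/impl/` and `src/thesis/` are located.

```shell
# Hypothetical helper functions (not part of the repository).
# Assumption: they are called from the repository root.

# require_cmd: succeed only if the given tool is available on PATH.
require_cmd() {
    command -v "$1" >/dev/null 2>&1 || {
        echo "missing required tool: $1" >&2
        return 1
    }
}

# run_tests: install the npm packages and run the test suite of the app.
run_tests() {
    require_cmd npm || return 1
    (cd src/impl && npm install && npm test)
}

# build_thesis: compile the thesis to PDF, then to PS when pdf2ps exists.
build_thesis() {
    require_cmd pdflatex || return 1
    (cd src/thesis && pdflatex thesis.tex) || return 1
    # The PS output is optional, so it is skipped when pdf2ps is missing.
    if command -v pdf2ps >/dev/null 2>&1; then
        (cd src/thesis && pdf2ps thesis.pdf)
    fi
}
```

After sourcing these definitions in a shell, `run_tests` covers the test steps and `build_thesis` covers the compilation steps. Each `cd` runs inside a subshell, so the caller's working directory stays unchanged.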
