scraper
所属分类:数据采集/爬虫
开发工具:JavaScript
文件大小:0KB
下载次数:0
上传日期:2023-01-04 07:16:13
上 传 者:
sh-1993
说明: 我的学士论文(在布拉格FITCTU),关于非编程用户的网络抓取应用程序。,
(My bachelor s thesis (at FIT CTU Prague) on web scraping application for non- programming users.,)
文件列表:
LICENSE (1070, 2019-08-15)
presentation/ (0, 2019-08-15)
presentation/assets/ (0, 2019-08-15)
presentation/assets/Octoparse.png (114421, 2019-08-15)
presentation/assets/Scraper_control_panel.png (616110, 2019-08-15)
presentation/assets/ukazka2.mp4 (19721363, 2019-08-15)
presentation/assets/video_snapshot.png (469104, 2019-08-15)
presentation/prezentace.pdf (21301685, 2019-08-15)
presentation/prezentace.tex (5825, 2019-08-15)
src/ (0, 2019-08-15)
src/impl/ (0, 2019-08-15)
src/impl/babel.config.js (305, 2019-08-15)
src/impl/lib/ (0, 2019-08-15)
src/impl/lib/css/ (0, 2019-08-15)
src/impl/lib/css/bootstrap.min.css (155758, 2019-08-15)
src/impl/lib/js/ (0, 2019-08-15)
src/impl/lib/js/bootstrap.min.js (58072, 2019-08-15)
src/impl/lib/js/jquery-3.3.1.min.js (86927, 2019-08-15)
src/impl/lib/js/popper.min.js (21004, 2019-08-15)
src/impl/manifest.json (1067, 2019-08-15)
src/impl/package-lock.json (213456, 2019-08-15)
src/impl/package.json (462, 2019-08-15)
src/impl/scraper.png (1181, 2019-08-15)
src/impl/src/ (0, 2019-08-15)
src/impl/src/backgroundScripts/ (0, 2019-08-15)
src/impl/src/backgroundScripts/background.html (138, 2019-08-15)
src/impl/src/backgroundScripts/background.js (3093, 2019-08-15)
src/impl/src/constants.js (1276, 2019-08-15)
src/impl/src/contentScripts/ (0, 2019-08-15)
src/impl/src/contentScripts/communication.js (4797, 2019-08-15)
src/impl/src/contentScripts/contentScript.css (2687, 2019-08-15)
src/impl/src/contentScripts/contentScript.js (1056, 2019-08-15)
src/impl/src/contentScripts/controller.js (3339, 2019-08-15)
src/impl/src/contentScripts/dataEngine/ (0, 2019-08-15)
src/impl/src/contentScripts/dataEngine/dataEngine.js (2411, 2019-08-15)
src/impl/src/contentScripts/dataEngine/exporter.js (1692, 2019-08-15)
src/impl/src/contentScripts/mainPanelController.js (2710, 2019-08-15)
src/impl/src/contentScripts/previewTable/ (0, 2019-08-15)
... ...
# Web application for online web scraping
#### Author: Jakub Draho
## Project structure
All the source files are located inside the `src/` folder - `src/impl/` contains
the code of the applicatin, whereas in `src/thesis/` you will find source files
of the thesis in the LaTeX format.
You can also find compiled text (PDF and PS formats) inside the `text/` folder.
## Instalation of the app
1. Open your internet browser Google Chrome
2. Go to the `chrome://extensions/` or alternatively click on the controls menu
(in the top right corner) -> "More tools" -> "Extensions"
3. Turn on the Developer mode (switch is located in the top right corner)
4. Click on the `Load unpacked` button in the top left corner
5. Select the whole `src/impl/` folder in the opened file dialog
6. The extension should be successfully installed by now and ready to use
## Running the tests
1. Make sure you have installed the npm package manager: `npm --version`
2. Go to the folder with source files of the app: `cd src/impl/`
3. Install all the required packages: `npm install`
4. Run the tests `npm test`
## Compilation of the text
1. Go to the folder with source files of the thesis: `cd src/thesis/`
2. Run the command `pdflatex thesis.tex` to compile into PDF format
3. For PS format you can use `pdf2ps thesis.pdf` (if you've already got the PDF
file)
近期下载者:
相关文件:
收藏者: