NLP-newspapersAnalysis
Category: Big data
Development tool: Jupyter Notebook
File size: 0KB
Downloads: 0
Upload date: 2023-09-21 03:27:39
Uploader: sh-1993
Description: Analysis of news headlines for the main newspapers of Perú
File list:
.flake8 (264, 2023-12-04)
LICENSE (1491, 2023-12-04)
conf/ (0, 2023-12-04)
conf/base/ (0, 2023-12-04)
conf/base/catalog.yml (2190, 2023-12-04)
conf/base/logging.yml (822, 2023-12-04)
conf/base/parameters.yml (0, 2023-12-04)
conf/base/parameters_cleaning_and_preprocessing.yml (257, 2023-12-04)
conf/base/parameters_eda.yml (234, 2023-12-04)
conf/base/parameters_feature_engineering.yml (250, 2023-12-04)
conf/base/parameters_sentiment_emotion_analysis.yml (257, 2023-12-04)
conf/base/parameters_topic_modeling.yml (245, 2023-12-04)
docs/ (0, 2023-12-04)
docs/Makefile (5616, 2023-12-04)
docs/_static/ (0, 2023-12-04)
docs/_static/nlp-newspaper-pipeline.svg (172985, 2023-12-04)
docs/api.rst (245, 2023-12-04)
docs/conf.py (3207, 2023-12-04)
docs/data.rst (7264, 2023-12-04)
docs/index.rst (889, 2023-12-04)
docs/make.bat (5122, 2023-12-04)
docs/pipelines.rst (4160, 2023-12-04)
notebooks/ (0, 2023-12-04)
notebooks/0.0-aerv-data-retrieval.ipynb (12026, 2023-12-04)
notebooks/0.1-aerv-dataCleaning.ipynb (33912, 2023-12-04)
notebooks/0.2-aerv-exploratoty-data-analysis.ipynb (30177, 2023-12-04)
notebooks/0.3-aerv-sentiment-analysis.ipynb (668568, 2023-12-04)
notebooks/0.4-aerv-topic-modeling.ipynb (53968, 2023-12-04)
notebooks/0.5-aerv-pipelines.ipynb (278246, 2023-12-04)
notebooks/0.5-aerv-pipelines.py (1446, 2023-12-04)
notebooks/0.6-aerv-specific-word.ipynb (164380, 2023-12-04)
poetry.lock (446307, 2023-12-04)
pyproject.toml (1045, 2023-12-04)
references/ (0, 2023-12-04)
... ...
# NLP - Peruvian Newspapers Analysis
![Python Version](https://img.shields.io/badge/python-%3E=3.9-blue?style=for-the-badge&logo=python&logoColor=white)
[![Powered by Kedro](https://img.shields.io/badge/powered_by-kedro-ffc900?logo=kedro&style=for-the-badge)](https://kedro.org)
![Code style badge](https://img.shields.io/badge/style-black-black?style=for-the-badge)
## Objective
To show how the narrative changes over a period of time in local media, from both
independent and main news outlets in Perú.
### Specific Objectives
- Show associations between different words over a period of time to see how the
narrative changes around certain topics
- Show how the media can control the narrative, by looking into the different
reactions people have to the tweets
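
As a rough illustration of the first objective, word associations can be approximated by counting which words appear together in the same headline, grouped by time period. The sketch below is not the project's implementation: the sample headlines, the monthly period labels, and the function name `cooccurrences_by_period` are all assumptions made for the example.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical sample data: (period, headline) pairs, not taken from the project.
HEADLINES = [
    ("2023-01", "congreso debate reforma"),
    ("2023-01", "congreso aprueba reforma"),
    ("2023-02", "congreso rechaza censura"),
]


def cooccurrences_by_period(rows):
    """Count unordered word pairs that share a headline, grouped by period."""
    counts = defaultdict(Counter)
    for period, headline in rows:
        # Deduplicate and sort so each pair has a single canonical order.
        words = sorted(set(headline.lower().split()))
        counts[period].update(combinations(words, 2))
    return counts


counts = cooccurrences_by_period(HEADLINES)
for period in sorted(counts):
    print(period, counts[period].most_common(3))
```

Comparing the most frequent pairs for one word across periods gives a first look at how the vocabulary around a topic shifts.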
## Project overview
The project consists of six pipelines that go from retrieving the data to
building the data structures used for data analysis and visualisation. Below is
a diagram of the pipeline flow.
![pipeline flow for the project](docs/_static/nlp-newspaper-pipeline.svg)
> You can also check the sister project [nlp-newspapersDashboard](https://github.com/drearondov/nlp-newspapersDashboard),
> where I built a dashboard with the data coming from this project.
### Pipelines
- Data Retrieval
- Cleaning and Preprocessing
- Feature Engineering
- EDA
- Sentiment & Emotion Analysis
- Topic Modeling
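
To give a feel for what a stage such as Cleaning and Preprocessing might do to a headline, here is a minimal sketch in plain Python. It is an assumption about the kind of work involved, not the project's actual Kedro node: the function name `clean_headline` and the tiny stopword list are hypothetical.

```python
import re
import unicodedata

# Hypothetical stopword list for illustration; a real run would use a fuller Spanish list.
STOPWORDS = {"el", "la", "de", "en", "y", "a", "los", "las", "un", "una"}


def clean_headline(text):
    """Lowercase a headline, strip punctuation, and drop stopwords."""
    # Normalise to composed form so accented letters are single code points.
    text = unicodedata.normalize("NFC", text).lower()
    # In Python 3, \w matches Unicode letters, so accented characters survive.
    tokens = re.findall(r"\w+", text)
    return [token for token in tokens if token not in STOPWORDS]


tokens = clean_headline("El Congreso aprueba la reforma en Perú!")
print(tokens)
```

In a Kedro project, a function like this would typically be wrapped in a node and wired to datasets declared in `conf/base/catalog.yml`.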
Full details on the project and the data warehouse can be found in the
documentation (see `docs/`). If you want to learn more about the building
process, you can read the accompanying blog post series on [Nou de Data](noudedata.com).
## Code and Resources used
- **Python version:** `3.11.5`
- **Packages:** Kedro, pandas, NumPy, Plotly, Requests, Gensim,
TextBlob, pysentimiento