NLP-newspapersAnalysis

Category: Big Data
Development tool: Jupyter Notebook
File size: 0KB
Downloads: 0
Upload date: 2023-09-21 03:27:39
Uploader: sh-1993
Description: Analysis of news headlines for the main newspapers of Perú

File list:
.flake8 (264, 2023-12-04)
LICENSE (1491, 2023-12-04)
conf/ (0, 2023-12-04)
conf/base/ (0, 2023-12-04)
conf/base/catalog.yml (2190, 2023-12-04)
conf/base/logging.yml (822, 2023-12-04)
conf/base/parameters.yml (0, 2023-12-04)
conf/base/parameters_cleaning_and_preprocessing.yml (257, 2023-12-04)
conf/base/parameters_eda.yml (234, 2023-12-04)
conf/base/parameters_feature_engineering.yml (250, 2023-12-04)
conf/base/parameters_sentiment_emotion_analysis.yml (257, 2023-12-04)
conf/base/parameters_topic_modeling.yml (245, 2023-12-04)
docs/ (0, 2023-12-04)
docs/Makefile (5616, 2023-12-04)
docs/_static/ (0, 2023-12-04)
docs/_static/nlp-newspaper-pipeline.svg (172985, 2023-12-04)
docs/api.rst (245, 2023-12-04)
docs/conf.py (3207, 2023-12-04)
docs/data.rst (7264, 2023-12-04)
docs/index.rst (889, 2023-12-04)
docs/make.bat (5122, 2023-12-04)
docs/pipelines.rst (4160, 2023-12-04)
notebooks/ (0, 2023-12-04)
notebooks/0.0-aerv-data-retrieval.ipynb (12026, 2023-12-04)
notebooks/0.1-aerv-dataCleaning.ipynb (33912, 2023-12-04)
notebooks/0.2-aerv-exploratoty-data-analysis.ipynb (30177, 2023-12-04)
notebooks/0.3-aerv-sentiment-analysis.ipynb (668568, 2023-12-04)
notebooks/0.4-aerv-topic-modeling.ipynb (53968, 2023-12-04)
notebooks/0.5-aerv-pipelines.ipynb (278246, 2023-12-04)
notebooks/0.5-aerv-pipelines.py (1446, 2023-12-04)
notebooks/0.6-aerv-specific-word.ipynb (164380, 2023-12-04)
poetry.lock (446307, 2023-12-04)
pyproject.toml (1045, 2023-12-04)
references/ (0, 2023-12-04)
... ...

# NLP - Peruvian Newspapers Analysis

![Python Version](https://img.shields.io/badge/python-%3E=3.9-blue?style=for-the-badge&logo=python&logoColor=white) [![Powered by Kedro](https://img.shields.io/badge/powered_by-kedro-ffc900?logo=kedro&style=for-the-badge)](https://kedro.org) ![Code style badge](https://img.shields.io/badge/style-black-black?style=for-the-badge)

## Objective

To show how the narrative changes over a period of time in local media, from both independent and main news outlets in Perú.

### Specific Objectives

- Show associations between different words over a period of time to see how the narrative changes around certain topics
- Show how the media can control the narrative, by looking into the different reactions people have to the tweets

## Project overview

The project consists of 6 pipelines that go from retrieving the data to producing the data structures that can be used for data analysis and visualisation. The picture below shows the pipeline flow.

![pipeline flow for the project](docs/_static/nlp-newspaper-pipeline.svg)

> You can also check the sister project [nlp-newspapersDashboard](https://github.com/drearondov/nlp-newspapersDashboard)
> where I built a dashboard with the data coming from this project.

### Pipelines

- Data Retrieval
- Cleaning and Preprocessing
- Feature Engineering
- EDA
- Sentiment & Emotion Analysis
- Topic Modeling

Full details about the project and the data warehouse can be found in the documentation. If you want to learn more about the building process, you can read the accompanying blog post series on [Nou de Data](noudedata.com).

## Code and Resources used

- **Python version:** `3.11.5`
- **Packages:** Kedro, Pandas, Numpy, Plotly, Requests, Gensim, Textblob, PySentimiento
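
To make the first specific objective (word associations over time) more concrete, here is a minimal sketch of counting word co-occurrences per period. It is not the project's implementation: the `headlines` sample, the `cooccurrence_by_period` helper and the monthly grouping are all assumptions made for illustration, and it uses only the Python standard library rather than the packages listed above.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical sample data: (period, headline) pairs, not taken from the project.
headlines = [
    ("2023-01", "congreso debate reforma"),
    ("2023-01", "congreso aprueba reforma"),
    ("2023-02", "congreso rechaza protesta"),
]


def cooccurrence_by_period(rows):
    """Count how often each unordered word pair shares a headline, per period."""
    counts = defaultdict(Counter)
    for period, text in rows:
        # Deduplicate and sort so each pair has one canonical ordering.
        words = sorted(set(text.lower().split()))
        counts[period].update(combinations(words, 2))
    return counts


pair_counts = cooccurrence_by_period(headlines)
for period in sorted(pair_counts):
    print(period, pair_counts[period].most_common(3))
```

Comparing the counts for the same word pair across periods gives a rough view of how the words surrounding a topic shift. Real headlines would first need the cleaning and preprocessing steps (for example stop-word removal) that the pipelines above cover.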
