robot_reporter

所属分类:机器人/智能制造
开发工具:Jupyter Notebook
文件大小:0KB
下载次数:0
上传日期:2021-03-02 23:08:59
上 传 者sh-1993
说明:  机器学习工具,用于阅读关于新闻主题的推文,并检测关键主题和趋势,
(Machine learning tools to read Tweets on news topics and detect key topics and trends,)

文件列表:
.DS_Store (10244, 2021-03-02)
images/ (0, 2021-03-02)
images/.DS_Store (6148, 2021-03-02)
images/robot2.png (1830211, 2021-03-02)
keywords/ (0, 2021-03-02)
keywords/.DS_Store (18436, 2021-03-02)
keywords/keyword_ashli/ (0, 2021-03-02)
keywords/keyword_ashli/.DS_Store (8196, 2021-03-02)
keywords/keyword_ashli/.ipynb_checkpoints/ (0, 2021-03-02)
keywords/keyword_ashli/.ipynb_checkpoints/code_ashli-checkpoint.ipynb (423922, 2021-03-02)
keywords/keyword_ashli/code_ashli.ipynb (172489, 2021-03-02)
keywords/keyword_ashli/data/ (0, 2021-03-02)
keywords/keyword_ashli/data/.DS_Store (6148, 2021-03-02)
keywords/keyword_huckabee/ (0, 2021-03-02)
keywords/keyword_huckabee/.DS_Store (8196, 2021-03-02)
keywords/keyword_huckabee/.ipynb_checkpoints/ (0, 2021-03-02)
keywords/keyword_huckabee/.ipynb_checkpoints/code_huckabee-checkpoint.ipynb (72, 2021-03-02)
keywords/keyword_huckabee/code_huckabee.ipynb (118873, 2021-03-02)
keywords/keyword_jack/ (0, 2021-03-02)
keywords/keyword_jack/.DS_Store (6148, 2021-03-02)
keywords/keyword_jack/.ipynb_checkpoints/ (0, 2021-03-02)
keywords/keyword_jack/.ipynb_checkpoints/code_jackdorsey-checkpoint.ipynb (202751, 2021-03-02)
keywords/keyword_jack/.ipynb_checkpoints/code_jackdorsey_noretweets-checkpoint.ipynb (153945, 2021-03-02)
keywords/keyword_jack/code_jackdorsey.ipynb (197643, 2021-03-02)
keywords/keyword_jack/code_jackdorsey_noretweets.ipynb (153945, 2021-03-02)
keywords/keyword_jack/data/ (0, 2021-03-02)
keywords/keyword_jack/data/.DS_Store (6148, 2021-03-02)
keywords/keyword_myanmar/ (0, 2021-03-02)
keywords/keyword_myanmar/.DS_Store (8196, 2021-03-02)
keywords/keyword_myanmar/.ipynb_checkpoints/ (0, 2021-03-02)
keywords/keyword_myanmar/.ipynb_checkpoints/code_myanmar_noretweets-checkpoint.ipynb (243839, 2021-03-02)
keywords/keyword_myanmar/code_myanmar_noretweets.ipynb (251530, 2021-03-02)
keywords/keyword_navalny/ (0, 2021-03-02)
keywords/keyword_navalny/.DS_Store (6148, 2021-03-02)
keywords/keyword_navalny/.ipynb_checkpoints/ (0, 2021-03-02)
keywords/keyword_navalny/.ipynb_checkpoints/Untitled1-checkpoint.ipynb (72, 2021-03-02)
keywords/keyword_navalny/.ipynb_checkpoints/code_navalny-checkpoint.ipynb (72, 2021-03-02)
keywords/keyword_navalny/.ipynb_checkpoints/code_navalny_noretweets-checkpoint.ipynb (72, 2021-03-02)
... ...

# robot.reporting.info ## Blog that uses NLP-based topic modeling to analyze Tweets ### Goal of this project: I’m a data scientist who builds programs that can read huge numbers of documents within minutes. These programs can do more than just read documents - they can also detect the underlying topics in these documents, and assign them to categories according to the content in each document. This type of program can be helpful to any professional who needs to read massive amounts of text, allowing them to get a sense of what the text or group of texts is about without having to actually read them. Since I worked as a journalist for many years, I was curious if I could build a program like this that would be useful for understanding the news. That led me to design the program that powers the content on this blog. My "robot" carries out searches of news-related keywords on Twitter, reading through thousands of Tweets within minutes. The robot quickly figures out the main topics expressed in the tens of thousands or even hundreds of thousands of Tweets. That's where the robot's work ends, and where the human work takes over. I put my journalist's cap on, and analyze those lists of topics, which provide insights about the group of Tweets as an aggregate - insights which I could not possibly have reached without the robot having "read" them for me. ### The blog: [Robot Reporting](https://www.robotreporting.info/) Each post follows the following format: NEWS KEYWORD SEARCH: I tell you what term or terms I searched for. MY EXPECTATION: What I expect the Tweets to be about, based on my knowledge as a journalist. ROBOT'S RESULTS: I tell you what main topics the Tweets contain, according to the robot's analysis. WHAT I LEARNED FROM THE ROBOT: Here I highlight the most important or enlightening elements of the robot's results. TECHNICAL NOTES: Details on the statistical and machine learning methods and tools used. I invite you to read my results, and I'd love to hear what you think. Please feel free to e-mail me your reactions and suggestions. Enjoy! Mary MacCarthy maccarthy.mary@gmail.com robotreporting.info

近期下载者

相关文件


收藏者