fakenews

所属分类:其他
开发工具:Python
文件大小:0KB
下载次数:0
上传日期:2017-06-25 17:56:50
上 传 者sh-1993
说明:  假新闻,,
(fakenews,,)

文件列表:
ML/ (0, 2017-06-25)
ML/dat/ (0, 2017-06-25)
ML/dat/clarifai/ (0, 2017-06-25)
ML/dat/clarifai/clarifai_api.py (1921, 2017-06-25)
ML/dat/clarifai/fake_photos.txt (658687, 2017-06-25)
ML/dat/clarifai/fake_urls.py (443, 2017-06-25)
ML/dat/clarifai/not_fake_photos.txt (19980, 2017-06-25)
ML/dat/clarifai/not_fake_urls.py (684, 2017-06-25)
ML/dat/fake/ (0, 2017-06-25)
ML/dat/fake/step_1_count_words.py (2147, 2017-06-25)
ML/dat/fake/step_2_split_data.py (878, 2017-06-25)
ML/dat/fake/step_3_create_data_stats.py (626, 2017-06-25)
ML/dat/fake/step_4_label.py (837, 2017-06-25)
ML/dat/guardian/ (0, 2017-06-25)
ML/dat/guardian/crawl_sites.py (1505, 2017-06-25)
ML/dat/guardian/data.json (106973, 2017-06-25)
ML/dat/guardian/data2.json (11033554, 2017-06-25)
ML/dat/guardian/guardian_api.py (2727, 2017-06-25)
ML/dat/guardian/step_1_count_words.py (1254, 2017-06-25)
ML/dat/guardian/step_2_split_data.py (793, 2017-06-25)
ML/dat/guardian/test.json (1433, 2017-06-25)
ML/dat/nytimes/ (0, 2017-06-25)
ML/dat/nytimes/data.json (1325850, 2017-06-25)
ML/dat/nytimes/nytimes_api.py (2234, 2017-06-25)
ML/dat/site_counts.csv (5263, 2017-06-25)
ML/dat/unigram.txt (254053, 2017-06-25)
ML/src/ (0, 2017-06-25)
ML/src/__init__.py (0, 2017-06-25)
ML/src/args.py (1834, 2017-06-25)
ML/src/data.py (5922, 2017-06-25)
ML/src/inference.py (1064, 2017-06-25)
ML/src/main.py (506, 2017-06-25)
ML/src/models.py (20498, 2017-06-25)
ML/src/regression/ (0, 2017-06-25)
ML/src/regression/main.py (3937, 2017-06-25)
ML/src/regression/utils.py (876, 2017-06-25)
ML/src/utils.py (876, 2017-06-25)
extension/ (0, 2017-06-25)
... ...

# POP! --- Burst your Bubble This past election has made it abundantly clear that we live in very isolated bubbles, reinforced by social media like Facebook and Twitter. At this point, it seems like there is little base of shared knowledge on which to have political discussion. Since the election, there has been a lot of talk of not only blatantly fake news, but the much more pervasive and insidious *biased news*. Visualizations like the Wall Street Journal’s [Red vs. Blue Facebook feed](http://graphics.wsj.com/blue-feed-red-feed/) highlight how different both sides see the world. Until election day, many didn’t even realize how isolated we are. Well, we think it’s time to *pop* our bubbles. ### We introduce the POP chrome extension The POP chrome extension can track what news articles you’ve been reading and automatically detect if they are biased left or right. Over time, POP can help you keep tabs on how one-sided the news you’ve been reading is. ### Powered by PopAI, the artificial intelligence that detects bias in your news reading behavior In the past 48 hours, we had our PopAI read thousands of news articles from different news sources. By looking at millions of words and analyzing the context in which they’re used, PopAI now understands the meaning of words. Given a news article, it has learned to extract its meaning and predict bias. If the news article contains images, PopAI’s bias prediction is augmented by a Clarifai model trained to detect political bias in images. PopAI combines the power of deep learning for text analysis with Clarifai's cutting-edge computer vision technology. ## Start POPing Your Bubble! You can install the POP chrome extension from the [Chrome Web Store](https://chrome.google.com/webstore/category/extensions). Once installed, you can start saving the articles you read by clicking on the extension button in your browser. The extension will also open a side panel showing stats on your reading pattern. ## Technical Details ### Data We trained PopAI with multiple data sources. News articles from the Guardian, we scraped using the Guardian API and news articles from biased news sources we obtained from the [fake news Kaggle challenge](https://www.kaggle.com/mrisdal/fake-news) ### Machine Learning Pipeline We used [Exponential Family Embeddings](https://github.com/mariru/exponential_family_embeddings) to extract semantic features of the vocabulary words. Embeddings are a powerful unsupervised text analysis technique that helps cope with the curse of dimensionality. Discrete representations of text are mapped into a continuous embedding space, which captures the semantics of the text. For each news article we then extract a feature vector, by averaging the word embeddings of the words that appear in the article. The article features are then fed into a neural network which predicts whether an article is biased towards the left towards the right or unbiased. The PopAI training pipeline is implemented in [tensorflow](http://tensorflow.org/), and scales to big data. ### Backend - Django Web Framework ### Frontent - Chrome Extension

近期下载者

相关文件


收藏者