mediawatch

所属分类:中间件编程
开发工具:GO
文件大小:0KB
下载次数:0
上传日期:2023-09-02 06:59:47
上 传 者sh-1993
说明:  赋予新闻机构打击虚假信息的权力,
(Empowering news organizations to fight disinformation,)

文件列表:
.dockerignore (36, 2023-11-06)
.realize.yaml.template (2659, 2023-11-06)
Makefile (3729, 2023-11-06)
assets/ (0, 2023-11-06)
assets/.$MediaWatch Core.drawio.png.bkp (61233, 2023-11-06)
assets/MediaWatch Core.drawio.png (62809, 2023-11-06)
buf.gen-tags.yaml (307, 2023-11-06)
buf.gen.py.yaml (186, 2023-11-06)
buf.gen.yaml (495, 2023-11-06)
buf.lock (587, 2023-11-06)
buf.work.yaml (82, 2023-11-06)
buf.yaml (370, 2023-11-06)
cmd/ (0, 2023-11-06)
cmd/api/ (0, 2023-11-06)
cmd/api/Dockerfile.api (1262, 2023-11-06)
cmd/api/Makefile (1288, 2023-11-06)
cmd/api/main.go (1414, 2023-11-06)
cmd/compare/ (0, 2023-11-06)
cmd/compare/Dockerfile.compare (1290, 2023-11-06)
cmd/compare/Makefile (1292, 2023-11-06)
cmd/compare/main.go (12121, 2023-11-06)
cmd/enrich/ (0, 2023-11-06)
cmd/enrich/.dockerignore (1231, 2023-11-06)
cmd/enrich/.editorconfig (258, 2023-11-06)
cmd/enrich/.pylintrc (18296, 2023-11-06)
cmd/enrich/Dockerfile.enrich (592, 2023-11-06)
cmd/enrich/Makefile (1613, 2023-11-06)
cmd/enrich/VERSION (6, 2023-11-06)
cmd/enrich/enrich/ (0, 2023-11-06)
cmd/enrich/enrich/ai/ (0, 2023-11-06)
cmd/enrich/enrich/ai/model.py (3214, 2023-11-06)
cmd/enrich/enrich/app.py (1544, 2023-11-06)
cmd/enrich/enrich/config/ (0, 2023-11-06)
cmd/enrich/enrich/config/config.py (2398, 2023-11-06)
cmd/enrich/enrich/mediawatch/ (0, 2023-11-06)
cmd/enrich/enrich/mediawatch/enrich/ (0, 2023-11-06)
cmd/enrich/enrich/mediawatch/enrich/v2/ (0, 2023-11-06)
... ...

# MediaWatch ### Empowering news organizations to fight disinformation Despite concentrated efforts to combat disinformation and fake news, many countries still exhibit depressingly low trust in the media. Worse, throughout these countries the role of journalism is often brought into question. The entrenched political affiliation of legacy media casts deep doubts on independent and fair fact-checking. On other hand, digital media operate in an unmapped environment, further complicating the issue of misinformation. MediaWatch aspires to run a pilot project in Greece (108th place in RSF’s 2022 World Press Freedom Index 2022, 32% in media trust in Reuters Institute Digital News Report, 2021) with the further goal to develop a tool that can be used across different countries and media systems. ### Fake News, Mis/Dis–information, Propaganda, all have; *Networks in Common* MediaWatch is a real-time network analysis platform which continuously monitors online media outlets and identifies flows of information - potentially detecting bad actors and networks of propaganda, with the use of advanced AI algorithms for online content analysis and classification. Therefore, it makes it possible to group articles in clusters by similarity, claims, quotes, entities, topics or categories (and any other combination of custom features) helping journalists, researchers and fact checkers to drill-down information by similar allegations, and rapidly respond to arising issues, reducing the time devoted on non-journalistic tasks. To corelate passages or claims within articles we use [go-plagiarism](https://github.com/cvcio/go-plagiarism) as our principal algorithm. Though, **we are not interested in plagiarism itself**, we have found, in our long-term feasibility study, that journalists tend to reproduce passages, claims or articles in full (aka copy-paste), as a process in which **an existing narrative is transformed into multiple similar ones, to extend attention to the agenda and frame**, we call this process **"The Chain of Misinformation"**. ## How it works MediaWatch comprises of multiple micro-servives. In genereal: we collect data for rss feeds using the [feeds](cmd/feeds) micro-service, from twitter using [listen](cmd/listen) and [twitter](cmd/twitter) (which will deprecate*), scrape the data using the [scraper](cmd/scraper) micro-service, enrich the data using [enrich](cmd/enrich) micro-service and using various [AI models](https://huggingface.co/cvcio), and finally create the relationships usgin [compare](cmd/compare) micro-service. All internall communication between micro-services is handled by [kafka](https://kafka.apache.org/) and the [worker](microservice). To store raw articles we use [elasticsearch](https://www.elastic.co/) -different [indices](pkg/es/indeces/) for each language, for the relationships we use [neo4j](https://neo4j.com/), and for any other data we use [mongodb](https://www.mongodb.com/). ![MediaWatch CORE](./assets/MediaWatch%20Core.drawio.png) *micro-services architecture* ## Roadmap In the -not so- near future we plan to introduce multiple new features and micro-services, starting from a unified subscription model for organisations and users to support a cross-organisation fact-checking scheme, where multiple users can share insights, in conjunction with smart-annotations and reports micro-services. Notably: - Users and Organizations - Smart Annotations - Reports - Hidden Votes - Important Features Highlighter (claims, quotes, etc.) - Fully Integrated Lucene Search - Data Exports - Open Source Application - Public API (gRPC, HTTP) - Documentation and Manual *Please if you have any suggestions or feature requests reach us at info@cvcio.org, or via github issues.* ## Contributing If you're new to contributing to Open Source on Github, [this guide](https://opensource.guide/how-to-contribute/) can help you get started. Please check out the contribution guide for more details on how issues and pull requests work. Before contributing be sure to review the [code of conduct](/CODE_OF_CONDUCT.md). ## License This library is distributed under the MIT license found in the [LICENSE](/LICENSE) file.

近期下载者

相关文件


收藏者