datacardsplaybook

Category: DevOps
Development tool: TypeScript
File size: 0KB
Downloads: 0
Upload date: 2023-02-22 16:32:00
Uploader: sh-1993
Description: The Data Cards Playbook helps dataset producers and publishers adopt a people-centered approach to transparency in dataset documentation.

File list:
CONTRIBUTING.md (1103, 2023-02-22)
GSoC/ (0, 2023-02-22)
LICENSE (11358, 2023-02-22)
benchmarkdetectives/ (0, 2023-02-22)
benchmarkdetectives/starter-task/ (0, 2023-02-22)
benchmarkdetectives/starter-task/Aryan_Data_Card.pdf (549491, 2023-02-22)
benchmarkdetectives/starter-task/Avyay_Starter_Tasks.md (314, 2023-02-22)
benchmarkdetectives/starter-task/Dhivya_Data_Card.pdf (361076, 2023-02-22)
benchmarkdetectives/starter-task/Dhivya_Model_Card.pdf (381922, 2023-02-22)
benchmarkdetectives/starter-task/Review Siddharth's Model Card (148, 2023-02-22)
benchmarkdetectives/starter-task/Siddharth_Model_Card.pdf (1330625, 2023-02-22)
labs/ (0, 2023-02-22)
labs/card-builder/ (0, 2023-02-22)
labs/card-builder/build.config.js (1057, 2023-02-22)
labs/card-builder/package.json (658, 2023-02-22)
labs/card-builder/src/ (0, 2023-02-22)
labs/card-builder/src/CardNode.ts (3723, 2023-02-22)
labs/card-builder/src/cardToHTML.ts (4986, 2023-02-22)
labs/card-builder/src/constants.ts (1291, 2023-02-22)
labs/card-builder/src/index.ts (1003, 2023-02-22)
labs/card-builder/src/interactions.js (970, 2023-02-22)
labs/card-builder/src/styles/ (0, 2023-02-22)
labs/card-builder/src/styles/default.scss (7859, 2023-02-22)
labs/card-builder/src/utils/ (0, 2023-02-22)
labs/card-builder/src/utils/htmlUtils.ts (3461, 2023-02-22)
labs/card-builder/src/utils/markdownUtils.ts (1062, 2023-02-22)
labs/interactive-calculators/ (0, 2023-02-22)
playbook/ (0, 2023-02-22)
playbook/Module-Answer/ (0, 2023-02-22)
playbook/Module-Answer/Activities/ (0, 2023-02-22)
... ...

# Data Cards Playbook

The Data Cards Playbook helps dataset producers and publishers adopt a people-centered approach to transparency in dataset documentation. Using the Playbook activities and resources on [our website](https://sites.research.google/datacardsplaybook), you can create transparency-focused metadata schema for datasets across domains, organizational structures, and audience groups.

In this repository, you can:

- Explore templates of Transparency Artifacts (Data Cards, Model Cards, Healthsheets)
- See and contribute examples of Data Cards in this repository

## Data Cards

Data Cards are structured summaries of essential facts about various aspects of ML datasets needed by stakeholders across a dataset's lifecycle for responsible AI development. These summaries provide explanations of processes and rationales that shape the data and consequently the models, such as upstream sources, data collection and annotation methods; training and evaluation methods, intended use; or decisions affecting model performance.

> Read our paper on [Data Cards](https://arxiv.org/abs/2204.01075)
>
> Watch the paper [video from FAccT 2022](https://www.youtube.com/watch?v=jcQ4A2EbFW8)

## Hands-on Data Card creation

Our Data Card template is available in [.docx format](https://github.com/PAIR-code/datacardsplaybook/blob/main/templates/DataCardsExtendedTemplate.docx). It contains numerous sections, questions and guidelines for responses that are designed to comprehensively document any possible dataset.

Along with Data Cards, we've also made [Healthsheets](https://github.com/PAIR-code/datacardsplaybook/blob/main/templates/Healthsheet%20Template.docx) ([Research Paper](https://arxiv.org/abs/2202.13028)) and [Model Card](https://github.com/PAIR-code/datacardsplaybook/blob/main/templates/Model%20Card%20%E2%80%93%20Template.docx) ([Research Paper](https://arxiv.org/abs/1810.03993)) templates available to document healthcare-specific datasets and general purpose models, respectively.
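
To make the idea of a "structured summary" concrete, here is a minimal TypeScript sketch of how a Data Card might be represented as sections of questions and answers and rendered as Markdown. The `DataCardSketch` interface, its field names, the `renderDataCard` function, and the example values are all hypothetical illustrations; they are not the Playbook's template schema, nor the API of the `card-builder` code in this repository.

```typescript
// Hypothetical sketch: names and fields are illustrative assumptions,
// not the official Data Card template or the card-builder API.
interface DataCardSection {
  title: string;
  // Question/answer pairs, loosely mirroring a question-driven template.
  entries: { question: string; answer: string }[];
}

interface DataCardSketch {
  datasetName: string;
  sections: DataCardSection[];
}

// Render the card as Markdown: one heading per section, one bullet per entry.
function renderDataCard(card: DataCardSketch): string {
  const lines: string[] = [`# Data Card: ${card.datasetName}`];
  for (const section of card.sections) {
    lines.push("", `## ${section.title}`);
    for (const { question, answer } of section.entries) {
      // Unanswered questions are flagged rather than silently dropped.
      const text = answer.trim() === "" ? "(not documented)" : answer.trim();
      lines.push(`- **${question}** ${text}`);
    }
  }
  return lines.join("\n");
}

// Made-up example content, for illustration only.
const exampleCard: DataCardSketch = {
  datasetName: "Example Dataset",
  sections: [
    {
      title: "Upstream sources",
      entries: [
        { question: "Where does the data come from?", answer: "Public web pages." },
      ],
    },
    {
      title: "Intended use",
      entries: [{ question: "What is the dataset suitable for?", answer: "" }],
    },
  ],
};

console.log(renderDataCard(exampleCard));
```

Flagging empty answers as "(not documented)" is one possible design choice: it keeps gaps in the documentation visible to readers instead of hiding them.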

## Examples of Data Cards

- [GEM Benchmark Data Cards](https://gem-benchmark.com/data_cards)
- [FIT400m Data Card](https://github.com/google-research/parti/blob/main/data_cards/fit400m_data_card.pdf)
- [WikiDialog-OQ](https://github.com/google-research/dialog-inpainting/blob/main/WikiDialog-OQ_Data_Card.pdf)
- [Open Images Extended - Crowdsourced](https://research.google/tools/datasets/open-images-extended-crowdsourced/)
- [Relative Movie Attributes](https://github.com/google-research-datasets/soft-attributes/blob/main/Data-Description.pdf)
- [More Inclusive Annotated People](https://storage.googleapis.com/openimages/open_images_extended_miap/Open%20Images%20Extended%20-%20MIAP%20-%20Data%20Card.pdf)
- [Translated Wikipedia Biographies](https://research.google/tools/datasets/translated-wikipedia-biographies/#:~:text=The%20Translated%20Wikipedia%20Biographies%20dataset,drop%2C%20possessives%20and%20gender%20agreement.)
- Crowdsourced high-quality multi-speaker speech datasets
  - [Argentinian Spanish](https://research.google/tools/datasets/argentinian-spanish-tts/)
  - [Chilean Spanish](https://research.google/tools/datasets/chilean-spanish-tts/)
  - [Colombian Spanish](https://research.google/tools/datasets/colombian-spanish-tts/)
  - [Peruvian Spanish](https://research.google/tools/datasets/peruvian-spanish-tts/)
  - [Venezuelan Spanish](https://research.google/tools/datasets/venezuelan-spanish-tts/)
- [Ivy Lee's collection of ML model cards and datasheets](https://github.com/ivylee/model-cards-and-datasheets)

Want to add your Data Card to this list? [Open an issue!](https://github.com/PAIR-code/datacardsplaybook/issues/new)

## Frequently Asked Questions (FAQs)

Coming Soon

## Note

The Data Cards Playbook is being actively developed and documentation is likely to change as we improve our methodologies. We want to hear from you! Leave notes, feedback, or suggestions on our GitHub. Use #datacardsplaybook.

## Citation

M. Pushkarna, A. Zaldivar, D. Nanas, et al. Data Cards Playbook. Published March 5, 2021.

## License

The Data Cards Playbook is licensed under a [Creative Commons Attribution-Share Alike 4.0 International License](https://creativecommons.org/licenses/by-sa/4.0/).

## Credits

### Core Team

This work was co-created by Mahima Pushkarna and Andrew Zaldivar and done in collaboration with Reena Jana, Vivian Tsai, and Oddur Kjartansson.

We want to thank Donald Gonzalez, Dan Nanas, Parker Barnes, Laura Rosenstein, Diana Akrong, Monica Caraway, Ding Wang, Danielle Smalls, Aybuke Turker, Emily Brouillet, Andrew Fuchs, Sebastian Gehrmann, Cassie Kozyrkov, Alex Siegman, and Anthony Keene for their immense contributions; and Meg Mitchell and Timnit Gebru for championing this work.

We also want to thank Adam Boulanger, Lauren Wilcox, Parker Barnes, Roxanne Pinto and Aya akmakli for their feedback; Tulsee Doshi, Dan Liebling, Meredith Morris, Lucas Dixon, Fernanda Viegas, Jen Gennai, and Marian Croak for their support.

This work would not have been possible without our workshop and study participants, and numerous partners, whose insights and experiences have shaped this Playbook.

### Special Thanks

This work would not have been possible without our workshop participants, supporters and champions, whose insights and experiences have shaped this Playbook: Lucas Ackerknecht, Hartwig Adam, Seiji Armstrong, Lora Aroyo, Sebastian Assaf, Anurag Batra, Samy Bengio, Louisa Bostrom, Thomas Cadwalader, Michelle Carney, Will Carter, Amanda Casari, Di Dang, Alex David Norton, Tiffany Deng, Emily Denton, Tulsee Doshi, Madeleine Elish, Patrick Gage Kelley, Timnit Gebru, Sara Goetz, Robbie Gonzalez, Alex Hanna, Jing Hua, Ben Hutchinson, Nathan Ie, Robyn Im, Orion Jankowski, Ellen Jiang, Shivani Kapania, David Karam, Daniel Kim, Leslie Lai, Eryka Lehr, Elijah Logan, Daphne Luong, Nicole Maffeo, Meg Mitchell, Maysam Moussalem, Unni Nair, Ricardo Olenewa, Kristen Olson, Praveen Paritosh, Adam Pearce, Angie Peng, Ludovic Peran, Roxanne Pinto, Vinodkumar Prabhakaran, Rida Qadri, Ravi Rajakumar, Hima Rajana, Susanna Ricco, Kevin Robinson, Taylor Roper, Negar Rostamzadeh, Mo Shomrat, Andrew Smart, Jamila Smith-Loud, Nithum Thain, Janel Thamkul, Aybuke Turker, Joseph Thomas, Bobby Tran, James Wang, Martin Wattenberg, James Wexler, Catherine Williams, Catherina Xu, Tabitha Yong, and Ben Zevenbergen.
