Presenter-Centric Image Collection and Annotation: Enhancing Accessibility for the Visually Impaired
This repository contains the proposed dataset of the paper **[Presenter-Centric Image Collection and Annotation: Enhancing Accessibility for the Visually Impaired](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/)** published at *36th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2023*.
We propose an approach to collect data automatically and a protocol to annotate this data specifically for this audience, aiming to support the development of Assistive Technology systems. We provide access to our three datasets: complete dataset, single person dataset, and annotated single person dataset. The complete dataset contains 10.939 images, the single person dataset contains 5689 images and the annotated single person dataset contains 967 images, each accompanied by three descriptive annotations. The images were collected from [youtube.com](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/https://www.youtube.com/).
If you find this code useful for your research, please cite the paper:
```
@INPROCEEDINGS{Ferreira_2023_Sibgrapi,
author={Luísa Ferreira and Daniel Fernandes and Fabio Cerqueira and Marcos Ribeiro and Michel Silva},
booktitle={36th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)},
title={Presenter-Centric Image Collection and Annotation: Enhancing Accessibility for the Visually Impaired},
month = {TO APPEAR},
year={},
volume={},
number={},
pages={},
doi={}
}
```
---
Access Datasets:
===
We're giving you a script along with text files that list the filenames for each image in the dataset. These filenames include important info like YouTube ID and frame ID, which the script uses to fetch the images.
- ### 1 . Install requirements
You can find the necessary libraries to run the Python script in the `dataset/requirements.txt` file.
```bash
pip install -r requirements.txt
```
- ### 2 . Running the script
To get the images from the dataset, use the `dataset/collect_images.py` script. You'll need to give the file path as an argument. For instance, if you want images from the complete dataset, provide the path to the complete_dataset.txt file. The annotations for the annotated single-person dataset are available in `dataset/annotations.csv`. As mentioned in the paper, there are 684 images with three descriptors, 190 with two descriptors, and 93 with a single descriptor.
```bash
python collect_images.py path/to/txt/file
```
---
Following the steps provided, you'll acquire the images from the datasets.
Access annotations for Annotated Single Person Dataset:
===
The annotations are available in `dataset/annotations.csv`. As mentioned in the paper, there are 684 images with three descriptors, 190 with two descriptors, and 93 with a single descriptor. The information about the annotated single-person dataset can be found in the file named annotations.csv within the 'dataset' folder. Each line in the CSV file consists of four columns: image ID, original description, first adjusted description, and second adjusted description. If all columns have descriptions, then the image has three annotations. If the second adjustment column is empty, it means the image has only two descriptors. Lastly, if both the first and second adjustment columns are empty, the image has only one description.
Contact
===
Authors
---
* [Luísa Ferreira](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/https://github.com/ferreiraluisa) - BsC student - UFV - luisa.ferreira@ufv.br.br (Scholarship from PIBIC-UFV/CNPq 2022-2023)
* [Daniel Fernandes](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/https://github.com/daniellf) - PhD Candidate - UFV - daniel.louzada@ufv.br
* Fábio Cerqueira - Professor at Universidade Federal Fluminense (UFF) - UFF - frcerqueira@id.uff.br
* Marcos Henrique - Professor at Universidade Federal de Viosa (UFV) - marcosh.ribeiro@ufv.br
* [Michel Silva](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/https://michelmelosilva.github.io/) - Assistant Professor at Universidade Federal de Viosa (UFV) - michel.m.silva@ufv.br
Institution
---
Universidade Federal de Viosa (UFV)
Departamento de Ciência da Computao
Viosa- Minas Gerais -Brazil
Laboratory
---
![MaVILab](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/https://mavilab-ufv.github.io/images/mavilab-logo.png) | ![UFV](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/https://cdn.discordapp.com/attachments/729689711416967239/844210892916523018/Ygemzly2XsP3gzFbXjFyExvD00B3rBvPbDEOoNOB-4uL4NLF1YKM6kiypik1H4koNc5_sNVAAAy_PDq_kmh_CRmn1dvC1uyeckCs.png)
--- | ---
**MaVILab:** Machine Vision and Intelligence Laboratory
https://mavilab-ufv.github.io/
---
Acknowledgements
===
The authors would like to thanks CAPES, FAPEMIG, PIBIC-UFV/CNPq, CNPq, and BICJr-UFV/FAPEMIG for funding different parts of this work; [Ricson Luiz Oliveira Vilaa](https://github.com/MaVILab-UFV/presenter-centric-dataset-SIBGRAPI-2023/blob/master/https://github.com/ricsonl) for fine-tunning the scene classification CNN; and our tireless annotators Allan Lopes, Júlia Vieira, Júlia Lopes, and Sophia Jorge.
### Enjoy it! :smiley: