Journal
UNIVERSAL ACCESS IN THE INFORMATION SOCIETY
Volume 20, Issue 4, Pages 677-690Publisher
SPRINGER HEIDELBERG
DOI: 10.1007/s10209-020-00740-9
Keywords
Media; Accessibility; Quality; Captions; Authoring tools; Automatic speech recognition (ASR)
Categories
Funding
- Fundacion ONCE (ONCE Foundation)
- Real Patronato sobre Discapacidad (Royal Board on Disability) of the Spanish Ministry of Health, Consumer Affairs and Social Welfare
- UNED Abierta
- FIAPAS
- Spanish Ministry of Education [FPU14/02577]
Ask authors/readers for more resources
Students using YouTube's automatic speech recognition feature to produce captions made errors mainly in the number of characters per line, caption speed, failure to use a new line per participant, and not including sound effects. Recommendations for improvement include providing more guidance in the course, teaching how to edit captions and add sound effects subtitles.
Some people have problems accessing multimedia services available on the web. Contributions to accessibility made by end users who lack institutional support in the design, production or deployment of media accommodations should be considered. The aim of our paper is to assess the quality of the captions produced using YouTube by non-professional subtitlers who only received basic training. We also identify potential improvements either in the subtitling tool or in the training resources, which could enhance the quality of the captions. We conducted a study in which 53 participants of a MOOC on digital accessibility used the automatic speech recognition (ASR) feature of YouTube to produce the captions for a video provided by the teaching staff. We assessed the quality of the captions produced by the students and then compared it with the quality of the captions produced by: (a) a human expert and (b) the ASR-based subtitling by YouTube. Students' errors occurred mainly in the number of characters per line, the speed of the captions, failing to use a new line per participant, and not including sound effects. The course should warn students to use a new line per participant, teach them how to subtitle sound effects, specify the maximum number of characters per line of text, and inform that in some countries such as Spain, captions can be edited but in other countries this may not be possible. Our recommendations for the YouTube editor include improving both the user interface and the ASR, with a view to enhancing ongoing and future research.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available