LESSONS LEARNED AFTER DEVELOPMENT AND USE OF A DATA COLLECTION APP FOR LANGUAGE DOCUMENTATION (LIG-AIKUMA)

Abstract: Lig-Aikuma is a free Android app that runs on a range of mobile phones and tablets. It offers several speech collection modes (recording, respeaking, translation and elicitation) and lets users share recordings with one another. More than 250 hours of speech in 6 languages from sub-Saharan Africa (including 3 oral languages in the process of being documented) have already been collected with Lig-Aikuma. This paper presents the lessons learned after 3 years of development and use of Lig-Aikuma. Although significant data collections were conducted, they were not without difficulties. Some mixed results lead us to stress the importance of design choices, the data sharing architecture and the user manual. We also discuss other potential uses of the app discovered during its deployment: data collection for language revitalisation, data collection for speech technology development (ASR) and enrichment of existing corpora through the addition of spoken comments.

https://hal.archives-ouvertes.fr/hal-02264418
Contributor: Laurent Besacier
Submitted on: Wednesday, August 7, 2019 - 7:14:45 AM
Last modification on: Friday, August 9, 2019 - 1:02:13 AM

File

ligaikumaFINAL.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-02264418, version 1

Citation

Laurent Besacier, Elodie Gauthier, Sylvie Voisin. LESSONS LEARNED AFTER DEVELOPMENT AND USE OF A DATA COLLECTION APP FOR LANGUAGE DOCUMENTATION (LIG-AIKUMA). International Congress of Phonetic Sciences ICPhS 2019, Aug 2019, Melbourne, Australia. ⟨hal-02264418⟩
