Niko Partanen, and Mandana Seyfeddinipur for useful discussions, to the three reviewers for detailed comments, and to Jesse Gates for proofreading. Errors and shortcomings are the authors' responsibility. Financial support from grants ANR-10-LABX-0083 (Laboratoire d'excellence "Empirical Foundations of Linguistics, their living voices to visiting linguists, and the Pangloss Collection depositors, who generously chose to share the fruit of their hard work, pp.2019-2024 ,
, Bibliographical References
A corpus-driven approach to language contact: Endangered languages in a comparative perspective, 2016. ,
URL : https://hal.archives-ouvertes.fr/halshs-01287037
Evaluation Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 2018. ,
Automatic speech recognition for underresourced languages: A survey, Speech Communication, vol.56, pp.85-100, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00953644
Big data, little data, no data: scholarship in the networked world, 2015. ,
Enquête et description des languesà tradition orale. Volume I : l'enquête de terrain et l'analyse grammaticale. Société d'études linguistiques et anthropologiques de France, vol.3, 1971. ,
Annotating multimedia/multi-modal resources with ELAN, Proceedings of LREC, 2004. ,
Field linguistics: A minor manual, vol.60, pp.12-31, 2007. ,
The two-level tonal system of Lataddi Narua. Linguistics of the Tibeto-Burman Area, vol.39, pp.67-104, 2016. ,
Yongning Narua orthography: users' guide and developers' notes, 2018. ,
URL : https://hal.archives-ouvertes.fr/halshs-01956606
Building speech recognition systems for language documentation: the CoEDL Endangered Language Pipeline and Inference System, 2018. ,
, Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU), pp.200-204, 2018.
Elpis, an accessible speech-to-text tool, Proceedings of Interspeech 2019, pp.306-310, 2019. ,
La sémantique du prédicat en mwotlap, vol.84, 2003. ,
Instant annotations-Applying NLP methods to the annotation of spoken language documentation corpora, Proceedings of the Third Workshop on Computational Linguistics for Uralic Languages, pp.25-36, 2017. ,
Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks, Proceedings of the 23rd International Conference on Machine Learning, ICML '06, pp.369-376, 2006. ,
The origin of the peculiarities of the Vietnamese alphabet, Mon-Khmer Studies, vol.39, pp.89-104, 2010. ,
URL : https://hal.archives-ouvertes.fr/halshs-00918824
Adam: A method for stochastic optimization, 3rd International Conference on Learning Representations, 2015. ,
Indigenous language technologies in Canada: Assessment, challenges, and successes, Proceedings of the 27th International Conference on Computational Linguistics, pp.2620-2632, 2018. ,
Using the TEI as pivot format for oral and multimodal language corpora, Journal of the Text Encoding Initiative, p.10, 2016. ,
Development of Linguistic Linked Open Data resources for collaborative data-intensive research in the language sciences: An introduction, Development of Linguistic Linked Open Data Resources for Collaborative Data-Intensive Research in the Language Sciences, 2019. ,
Quel statut pour les données de la recherche après la loi numérique ?, 2016. ,
Documenting and researching endangered languages: The Pangloss Collection. Language Documentation and Conservation, vol.8, pp.119-135, 2014. ,
URL : https://hal.archives-ouvertes.fr/halshs-01003734
Integrating automatic transcription into the language documentation workflow: experiments with Na data and the Persephone toolkit, Language Documentation and Conservation, vol.12, pp.393-429, 2018. ,
URL : https://hal.archives-ouvertes.fr/halshs-01841979
Phonetic lessons from automatic phonemic transcription: preliminary reflections on Na (Sino-Tibetan) and Tsuut'ina (Dene) data, Proceedings of ICPhS XIX (19th International Congress of Phonetic Sciences), 2019. ,
URL : https://hal.archives-ouvertes.fr/halshs-02059313
Tone in Yongning Na: lexical tones and morphotonology, 2017. ,
URL : https://hal.archives-ouvertes.fr/halshs-01094049
Na (Mosuo)-English-Chinese dictionary, 2018. ,
URL : https://hal.archives-ouvertes.fr/halshs-01744420
The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles. Translation and Multilingual Natural Language Processing, Towards a general-purpose linguistic annotation backend, 2018. ,
Linguistic fieldwork, 2001. ,
Speech data acquisition: The underestimated challenge. KALIPHO -Kieler Arbeiten zur Linguistik und Phonetik, vol.3, pp.1-42, 2015. ,
URL : https://hal.archives-ouvertes.fr/halshs-01026295
A TEI-based approach to standardising spoken language transcription, Journal of the Text Encoding Initiative, issue.1, 2011. ,
Language documentation twenty-five years on, Language, vol.94, issue.4, pp.324-345, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01968838
Ethnologue: languages of the world, SIL International, 2017. ,
Efficient speech transcription through respeaking, Proceedings of Interspeech 2013, pp.1087-1091, 2013. ,
Untrained forced alignment of transcriptions and audio for language documentation corpora using WebMAUS, Proceedings of the Ninth International Conference on Language Resources and Evaluation, pp.3940-3947, 2014. ,
LD&C possibilities for the next decade. Language Documentation and Conservation, vol.11, pp.1-4, 2017. ,
Future directions in technological support for language documentation, Proceedings of the Workshop on Computational Methods for Endangered Languages, vol.1, 2019. ,
Underresourced languages: Phonetic results from language archives, The Routledge Handbook of Phonetics, 2019. ,
Language documentation and description, vol.1, pp.35-51, 2003. ,