Conference papers

A data repository for the management of dynamic linguistic datasets

Abstract : This paper addresses the use of Nakala, a dynamic database technology, for the management of language corpora. We present our ongoing attempt to store and classify the multimedia documents of a corpus of oral and written language-learner productions using uniform resource identifiers. The architecture supports query APIs compatible with R packages and other tools, which will facilitate the generation of linguistically enriched datasets for more effective corpus-based study of language acquisition.
Contributor : Leonardo Contreras Roa
Submitted on : Monday, September 13, 2021 - 5:28:38 PM
Last modification on : Friday, October 8, 2021 - 4:28:11 PM


  • HAL Id : hal-03343010, version 1


Thomas Gaillat, Leonardo Contreras Roa, Juvénal Attoumbre. A data repository for the management of dynamic linguistic datasets. CLARIN Annual Conference 2021, Sep 2021, Madrid (online), Spain. ⟨hal-03343010⟩


