A viewing and processing tool for the analysis of a comparable corpus of Kiranti mythology

Abstract : This presentation describes a trilingual corpus of three endangered languages of the Kiranti group (Tibeto-Burman family) from Eastern Nepal. The languages, which are exclusively oral, share a rich mythology, and it is thus possible to build a corpus of the same native narrative material in the three languages. Segments of similar semantic content are tagged with a "similarity" label to identify correspondences among the three language versions of the story. An interface has been developed that displays these similar segments together, making it possible to compare the lexical and morphosyntactic features of each language. A concordancer shows the various occurrences of words or glosses, allowing the languages to be further compared and contrasted.
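The two operations the abstract mentions, grouping segments by similarity label and looking up occurrences of a word or gloss, can be illustrated with a minimal Python sketch. It is not the authors' tool: the data layout, the function names `group_by_similarity` and `concordance`, the language codes, the labels and the English placeholder sentences are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical toy data: (language, similarity_label, segment_text).
# Language codes, labels and sentences are invented placeholders,
# not material from the Kiranti corpus.
SEGMENTS = [
    ("lang_a", "S1", "the hero leaves the village"),
    ("lang_b", "S1", "the hero goes away from home"),
    ("lang_c", "S1", "the hero departs"),
    ("lang_a", "S2", "the hero meets a bird"),
    ("lang_c", "S2", "a bird speaks to the hero"),
]


def group_by_similarity(segments):
    """Map each similarity label to {language: [segment texts]}."""
    groups = defaultdict(lambda: defaultdict(list))
    for language, label, text in segments:
        groups[label][language].append(text)
    # Convert nested defaultdicts to plain dicts for easy comparison.
    return {label: dict(by_lang) for label, by_lang in groups.items()}


def concordance(segments, keyword, width=2):
    """Return (language, left context, keyword, right context) per hit."""
    hits = []
    for language, _label, text in segments:
        tokens = text.split()
        for i, token in enumerate(tokens):
            if token == keyword:
                left = " ".join(tokens[max(0, i - width):i])
                right = " ".join(tokens[i + 1:i + 1 + width])
                hits.append((language, left, token, right))
    return hits


if __name__ == "__main__":
    # Show all language versions that share the label "S1".
    for language, texts in group_by_similarity(SEGMENTS)["S1"].items():
        print(language, texts)
    # Keyword-in-context lookup across all three languages.
    for hit in concordance(SEGMENTS, "bird"):
        print(hit)
```

Keying the grouping on the label rather than on segment position reflects the fact that the corpus is comparable, not parallel: a label may be absent from one language version or appear more than once in another.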
Document type :
Conference papers


Contributor : Aimée Lahaussois
Submitted on : Thursday, June 2, 2016 - 10:42:57 AM
Last modification on : Tuesday, May 28, 2019 - 1:39:56 AM
Long-term archiving on : Saturday, September 3, 2016 - 10:12:17 AM




Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives 4.0 International License


  • HAL Id : halshs-01229423, version 1


Aimée Lahaussois, Séverine Guillaume. A viewing and processing tool for the analysis of a comparable corpus of Kiranti mythology. Building and Using Comparable Corpora, May 2012, Istanbul, Turkey. pp.33-41. ⟨halshs-01229423⟩


