The benefits of increasing the digital availability of Alsatian theater - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

The benefits of increasing the digital availability of Alsatian theater

Pablo Ruiz Fabo
Carole Werner
Delphine Bernhard

Résumé

We present a digital corpus of plays in Alsatian language varieties with rich metadata. We transcribed bibliographical metadata for 359 plays, and annotated social attributes like professional group, social class and gender for 2,253 characters. Regarding full-text availability, TEI encoding is ongoing and already carried out for 25 plays (300,000 tokens). At the moment, corrected OCR output is available for 28 further plays, which await TEI encoding. This material helps computational literary research on Alsatian theater, impossible before given lack of a corpus, and has shown potential to challenge existing knowledge on the tradition. The texts also have relevance for NLP research on low-resource languages. To promote interest in language diversity and in the corpus' language varieties, we also created a corpus exploration interface, which allows navigating the corpus based on plays’ metadata and on character attributes. This interface can be useful for research and, particularly, opens up the resources to a wider non-specialist community.
Fichier principal
Vignette du fichier
ruiz_et_al_dh2022_pub.pdf (4.12 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03660481 , version 1 (22-08-2022)

Identifiants

Citer

Pablo Ruiz Fabo, Carole Werner, Delphine Bernhard. The benefits of increasing the digital availability of Alsatian theater. Digital Humanities 2022, Jul 2022, Tokyo (Online), Japan. pp.567-570, ⟨10.5281/zenodo.7014965⟩. ⟨hal-03660481⟩

Collections

SITE-ALSACE
75 Consultations
14 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More