Conference paper, Year: 2023

Towards Sentence-level Text Readability Assessment for French

Abstract

In this paper, we report on experiments exploring the relation between document-level and sentence-level readability assessment for French. The experiments were run on a tailored open-source corpus, created automatically by aggregating various sources of children's literature. In addition to providing the research community with a freely available corpus, we report sentence readability scores obtained with both classical approaches (i.e. readability formulas) and state-of-the-art deep learning techniques (e.g. fine-tuning of large language models). Results show a relatively strong correlation between document-level and sentence-level readability, suggesting ways to reduce the cost of building annotated sentence-level readability datasets.
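To make the document-level versus sentence-level comparison concrete, here is a minimal sketch of how such a correlation could be measured. It is not the method used in the paper: the scoring function `toy_score` is a hypothetical difficulty proxy (words per sentence plus letters per word) standing in for a real readability formula or a fine-tuned model, the sentence splitter is deliberately naive, and the sample texts are invented for illustration.

```python
import re
from statistics import mean


def split_sentences(text):
    """Naive sentence splitter on ., ! or ? followed by whitespace (illustration only)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def toy_score(text):
    """Hypothetical difficulty proxy, NOT a published readability formula.

    Adds the mean number of words per sentence to the mean number of
    letters per word; higher values are meant to indicate harder text.
    """
    sentences = split_sentences(text)
    words = re.findall(r"[^\W\d_]+", text)  # runs of Unicode letters, so accents are kept
    if not sentences or not words:
        return 0.0
    return len(words) / len(sentences) + mean(len(w) for w in words)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


def doc_vs_sentence_correlation(documents):
    """Correlate each document's score with the mean score of its sentences."""
    doc_scores = [toy_score(d) for d in documents]
    sent_scores = [mean(toy_score(s) for s in split_sentences(d)) for d in documents]
    return pearson(doc_scores, sent_scores)


# Invented sample texts, used only to exercise the functions above.
documents = [
    "Le chat dort. Il est petit.",
    "La chercheuse analyse attentivement les résultats expérimentaux obtenus. "
    "Elle rédige ensuite un rapport particulièrement détaillé.",
    "Le chien mange sa soupe. Le soleil brille aujourd'hui.",
]
r = doc_vs_sentence_correlation(documents)
print(f"document vs. mean sentence score correlation: {r:.3f}")
```

With a real corpus, `toy_score` would be replaced by whichever formula or model is being evaluated. A high correlation would then support reusing document-level labels as approximate sentence-level labels, which is the cost reduction the abstract alludes to.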
Main file
RANLP_2023_Submission14.pdf (211.18 KB)
Origin: Files produced by the author(s)
License: CC BY - Attribution

Dates and versions

hal-04192063, version 1 (31-08-2023)

Identifiers

  • HAL Id: hal-04192063, version 1

Cite

Duy Van Ngo, Yannick Parmentier. Towards Sentence-level Text Readability Assessment for French. Second Workshop on Text Simplification, Accessibility and Readability (TSAR@RANLP2023), Sep 2023, Varna, Bulgaria. ⟨hal-04192063⟩