Building a syntactic-semantic interface for a semi-automatically generated TAG for Arabic

Abstract: Syntactic and semantic resources play an important role in many Natural Language Processing (NLP) tasks by providing information about the correct structural representation of sentences and their meaning. To date, there is no wide-coverage electronic grammar for the Arabic language. In this context, we present a new approach for building a tree adjoining grammar (TAG) that represents the syntax and semantics of Modern Standard Arabic. This grammar is produced semi-automatically with the XMG (eXtensible MetaGrammar) description language. First, the syntax of Arabic is described using the Arab-XMG meta-grammar. Then, semantic information is added by introducing a frame-based semantic dimension into the meta-grammar, exploiting lexical resources such as Arabic VerbNet. Finally, the link between semantics and syntax is established through a syntax-semantics interface that allows sentence meaning to be constructed through semantic role labeling. Experiments were performed to check grammar coverage as well as the syntactic-semantic analysis. The results show that the generated grammar covers the basic syntactic structures of Arabic sentences and the different phrasal structures with a precision of about 92%. Moreover, they confirm the effectiveness of the proposed approach: we were able to semantically parse a set of sentences and build their semantic representations with a precision of about 72%.
Document type:
Journal article
The International Arab Journal of Information Technology, In press, 〈http://www.iajit.org/〉

https://hal.archives-ouvertes.fr/hal-01809771
Contributor: Yannick Parmentier
Submitted on: Thursday, June 7, 2018 - 09:58:03
Last modified on: Friday, June 8, 2018 - 01:17:21

Identifiers

  • HAL Id: hal-01809771, version 1

Citation

Chérifa Ben Khelil, Chiraz Zribi, Denys Duchier, Yannick Parmentier. Building a syntactic-semantic interface for a semi-automatically generated TAG for Arabic. The International Arab Journal of Information Technology, In press, 〈http://www.iajit.org/〉. 〈hal-01809771〉
