Lexical and Morpho-syntactic Features in Word Embeddings - A Case Study of Nouns in Swedish - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Lexical and Morpho-syntactic Features in Word Embeddings - A Case Study of Nouns in Swedish

Ali Basirat
  • Fonction : Auteur
Marc Tang

Résumé

We apply real-valued word vectors combined with two different types of classifiers (linear discriminant analysis and feed-forward neural network) to scrutinize whether basic nominal categories can be captured by simple word embedding models. We also provide a linguistic analysis of the errors generated by the classifiers. The targeted language is Swedish, in which we investigate three nominal aspects: uter/neuter, common/proper, and count/mass. They represent respectively grammatical, semantic, and mixed types of nominal classification within languages. Our results show that word embeddings can capture typical grammatical and semantic features such as uter/neuter and common/proper nouns. Nevertheless, the model encounters difficulties to identify classes such as count/mass which not only combine both grammatical and semantic properties, but are also subject to conversion and shift. Hence, we answer the call of the Special Session on Natural Language Processing in Artificial Intelligence by approaching the topic of interfaces between morphology, lexicon, semantics, and syntax via interdisciplinary methods combining machine learning of language and general linguistics.
Fichier principal
Vignette du fichier
NLPinAI_2018_6_CR.pdf (333.05 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02529163 , version 1 (11-12-2021)

Identifiants

Citer

Ali Basirat, Marc Tang. Lexical and Morpho-syntactic Features in Word Embeddings - A Case Study of Nouns in Swedish. Special Session on Natural Language Processing in Artificial Intelligence, Jan 2018, Funchal, France. pp.663-674, ⟨10.5220/0006729606630674⟩. ⟨hal-02529163⟩
83 Consultations
118 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More