UDLex: Towards Cross-language Subcategorization Lexicons
Résumé
This paper introduces UDLex, a computational framework for the automatic extraction of argument structures for several languages. By exploiting the versatility of the Universal Dependency annotation scheme, our system acquires subcat-egorization frames directly from a dependency parsed corpus, regardless of the input language. It thus uses a universal set of language-independent rules to detect verb dependencies in a sentence. In this paper we describe how the system has been developed by adapting the LexIt (Lenci et al., 2012) framework, originally designed to describe argument structures of Ital-ian predicates. Practical issues that arose when building argument structure representations for typologically different languages will also be discussed.
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...