Recherche et production de corpus de messages pour la multilinguisation de sites de e-commerce en SMS, initialement en arabe
Résumé
In this paper, we present our research in the framework of the CATS project (Classified Ads through SMS) [3]. CATS is a system of management of small Arabic advertisements posted in SMS of buying and selling ( cars, real estate...), currently deployed in Jordan by the FastLink operator. In order to adapt this system to other languages (French, English) and in length by applying it to other sectors (employment, marriage, domestic machines, trade of mobile phone, pages yellow...), we are in the difficulty in finding or building SMS corpora functionally equivalent to a real, Arab and natural corpus. A simple translation of this starting corpus gives it the same type of corpus (real, natural)? We present, in this paper, an answer to this question and a solution for a case of multilinguisation of sites of e-commerce in SMS, initially in Arabic.
Origine : Fichiers produits par l'(les) auteur(s)
Loading...