Towards Higher Quality Internal and Outside Multilingualization of Web Sites

Abstract : High-quality multilingualization of Web sites is increasingly important, but it is unsolvable in most situations where internal quality certification is needed, and not solved in the majority of other situations. We demonstrate this by analyzing a variety of techniques for making the underlying software easily localizable and for managing the translation of textual content in the classical internal mode, that is, by modifying the language-dependent resources. A new idea is that volunteer end users should be able to contribute to the improvement or even the production of translated resources and content. For this, we have developed a piece of PHP code which naive webmasters (neither computer scientists nor professional translators) can add to a Web site to enable internal multilingualization by users with sufficient access rights: in management mode, these users can edit the texts of titles, button labels, messages, etc. in text areas appearing in context in the Web page. If Web site developers follow some recommendations, all textual interface elements should be localizable in this way. Another angle of attack, applicable in all cases where navigating a site through a gateway is possible, consists in replacing the problem of diffusion by the problem of access in multiple languages. We introduce the concept of iMAG (interactive Multilingual Access Gateway, dedicated to a Web site or domain) to solve the problem of higher quality multilingual access. First, by using available MT systems or, by default, morphological processors and bilingual dictionaries, any page of an elected website is made instantly accessible in many languages, with a generally low quality profile, as through usual translation gateways. Over time, the quality profile of textual GUI elements, Web pages and even documents (if accessible in HTML) will improve thanks to outside contributors, who will post-edit or produce the translations from the reading context.
This is only possible because the iMAG associated with the website stores the translations in its translation memory (TM) and the contributed dictionary items in its dictionary. The TM has quality levels, according to the users' profiles, and scores within levels. An API will be proposed so that the developers of the elected website can connect it to its iMAG, retrieve the best-level translations, certify them if necessary, and put them in their localized resources. At that point, external localization meets internal localization.
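As a rough illustration of a TM with quality levels and scores within levels, the following minimal sketch stores several candidate translations per segment and returns the best one. It is not the authors' implementation: the class and method names (`Translation`, `TranslationMemory`, `add`, `best`), the integer level, and the floating-point score are all assumptions made for this example, since the abstract does not specify how levels and scores are represented.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Translation:
    text: str
    level: int    # hypothetical quality level tied to the contributor's profile (higher is better)
    score: float  # hypothetical score within that level (higher is better)


@dataclass
class TranslationMemory:
    # Maps (source segment, target language) to all contributed candidates.
    entries: Dict[Tuple[str, str], List[Translation]] = field(default_factory=dict)

    def add(self, source: str, lang: str, text: str, level: int, score: float) -> None:
        """Store one candidate translation for a segment."""
        self.entries.setdefault((source, lang), []).append(Translation(text, level, score))

    def best(self, source: str, lang: str) -> Optional[Translation]:
        """Return the candidate with the highest level, ties broken by score."""
        candidates = self.entries.get((source, lang), [])
        if not candidates:
            return None
        return max(candidates, key=lambda t: (t.level, t.score))


tm = TranslationMemory()
tm.add("Search", "fr", "Chercher", level=1, score=0.4)    # e.g. raw MT output
tm.add("Search", "fr", "Recherche", level=2, score=0.5)   # e.g. post-edited by a contributor
tm.add("Search", "fr", "Rechercher", level=2, score=0.9)  # same level, higher score
print(tm.best("Search", "fr").text)
```

Comparing on the pair (level, score) means a translation from a higher level always wins over one from a lower level, whatever their scores, which is one plausible reading of "scores within levels".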
Keywords : TM iMAG

Cited literature [11 references]

https://hal.archives-ouvertes.fr/hal-00968752
Contributor : Mathieu Mangeot
Submitted on : Tuesday, April 1, 2014 - 2:49:44 PM
Last modification on : Tuesday, March 5, 2019 - 1:38:31 AM
Long-term archiving on : Tuesday, July 1, 2014 - 11:56:14 AM

File

ONII08_CB-VB-MM-CR.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00968752, version 1

Citation

Christian Boitet, Valérie Bellynck, Mathieu Mangeot, Carlos Ramisch. Towards Higher Quality Internal and Outside Multilingualization of Web Sites. ONII-08 (Summer Workshop on Ontology, NLP, Personalization and IE/IR), Jul 2008, Mumbai, India. pp.8. ⟨hal-00968752⟩
