Building Bilingual Terminologies from Comparable Corpora: The TTC TermSuite

Béatrice Daille 1
1 TALN
LINA - Laboratoire d'Informatique de Nantes Atlantique
Abstract : In this paper, we exploit domain-specific comparable corpora to build bilingual terminologies. We present the monolingual term extraction and bilingual alignment methods that allow us to identify and translate highly specialised terminology. We stress the importance of taking into account both simple and complex terms in a multilingual environment. Such linguistic diversity requires combining several methods to perform both monolingual and bilingual terminology extraction accurately. The methods are implemented in TTC TermSuite, which is based on the UIMA framework.
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-00819594
Contributor : Béatrice Daille
Submitted on : Wednesday, May 1, 2013 - 11:14:58 PM
Last modification on : Thursday, April 5, 2018 - 10:37:00 AM
Long-term archiving on : Friday, August 2, 2013 - 4:06:26 AM

File

TTC_BUCC2012.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00819594, version 1

Citation

Béatrice Daille. Building Bilingual Terminologies from Comparable Corpora: The TTC TermSuite. 5th Workshop on Building and Using Comparable Corpora with special topic "Language Resources for Machine Translation in Less-Resourced Languages and Domains", co-located with LREC 2012, May 2012, Istanbul, Turkey. ⟨hal-00819594⟩
