Amharic Speech Recognition for Speech Translation

Abstract : The state-of-the-art speech translation can be seen as a cascade of Automatic Speech Recognition, Statistical Machine Translation and Text-To-Speech synthesis. In this study an attempt is made to experiment on Amharic speech recognition for Amharic-English speech translation in tourism domain. Since there is no Amharic speech corpus, we developed a read-speech corpus of 7.43hr in tourism domain. The Amharic speech corpus has been recorded after translating standard Basic Traveler Expression Corpus (BTEC) under a normal working environment. In our ASR experiments phoneme and syllable units are used for acoustic models, while morpheme and word are used for language models. Encouraging ASR results are achieved using morpheme-based language models and phoneme-based acoustic models with a recognition accuracy result of 89.1%, 80.9%, 80.6%, and 49.3% at character, morph, word and sentence level respectively. We are now working towards designing Amharic-English speech translation through cascading components under different error correction algorithms.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [14 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01350050
Contributor : Laurent Besacier <>
Submitted on : Friday, July 29, 2016 - 3:12:01 PM
Last modification on : Tuesday, February 12, 2019 - 1:31:17 AM
Document(s) archivé(s) le : Sunday, October 30, 2016 - 11:30:28 AM

File

WOLDEYOHANNIS_ET_AL - Amharic ...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01350050, version 1

Collections

Citation

Michael Melese, Laurent Besacier, Million Meshesha. Amharic Speech Recognition for Speech Translation. Atelier Traitement Automatique des Langues Africaines (TALAF). JEP-TALN 2016, Jul 2016, Paris, France. ⟨hal-01350050⟩

Share

Metrics

Record views

282

Files downloads

1137