Breaking the Unwritten Language Barrier: The BULB Project, Proceedings of SLTU (Spoken Language Technologies for Under- Resourced Languages), 2016. ,
DOI : 10.1016/j.procs.2016.04.023
URL : https://hal.archives-ouvertes.fr/halshs-01428027
A case study on using speech-to-translation alignments for language documentation. arXiv preprint, 2017. ,
Dbpedia: A nucleus for a web of open data. The semantic web, pp.722-735, 2007. ,
Neural machine translation by jointly learning to align and translate . arXiv preprint, 2014. ,
Listen and translate: A proof of concept for endto-end speech-to-text translation, NIPS workshop on End-to-end Learning for Speech and Audio Processing, 2016. ,
End-to-end automatic speech translation of audiobooks, Accepted to Acoustics, Speech and Signal Processing (ICASSP) IEEE International Conference on Acoustics, Speech and Signal Processing, 2018. ,
Nltk: the natural language toolkit, Proceedings of the COLING/ACL on Interactive presentation sessions, pp.69-72, 2006. ,
Parallel Speech Collection for Under-resourced Language Studies Using the Lig-Aikuma Mobile Device App, Proceedings of SLTU (Spoken Language Technologies for Under- Resourced Languages), 2016. ,
DOI : 10.1016/j.procs.2016.04.030
URL : https://hal.archives-ouvertes.fr/hal-01350065
Microsoft speech language translation (mslt) corpus: The iwslt 2016 release for english, french and german, 2016. ,
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01303135
google-diff-match-patch-diff, match and patch libraries for plain text, 2012. ,
Evaluating machine translation output with automatic sentence segmentation, International Workshop on Spoken Language Translation (IWSLT) 2005, 2005. ,
Librispeech: An ASR corpus based on public domain audio books, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5206-5210, 2015. ,
DOI : 10.1109/ICASSP.2015.7178964
URL : http://www.clsp.jhu.edu/%7Eguoguo/papers/icassp2015_librispeech.pdf
Improved speechto-text translation with the fisher and callhome spanishenglish speech translation corpus, 2013. ,
The kaldi speech recognition toolkit, IEEE 2011 workshop on automatic speech recognition and understanding, p.192584, 2011. ,
Parallel corpora for medium density languages. Amsterdam Studies in the Theory and History of Linguistic Science Series 4, p.247, 2007. ,
DOI : 10.1075/cilt.292.32var
URL : http://eprints.sztaki.hu/7902/1/Kornai_1762382_ny.pdf
Sequence-to-sequence models can directly transcribe foreign speech. arXiv preprint, 2017. ,
DOI : 10.21437/interspeech.2017-503
URL : http://arxiv.org/pdf/1703.08581