Curriculum-Based Transfer Learning for an Effective End-to-End Spoken Language Understanding and Domain Portability

Abstract: We present an end-to-end approach to extract semantic concepts directly from the speech audio signal. To overcome the lack of data available for this spoken language understanding (SLU) approach, we investigate a transfer learning strategy based on the principles of curriculum learning. This strategy allows us to exploit out-of-domain data that can help to prepare a fully neural architecture. Experiments carried out on the French MEDIA and PORTMEDIA corpora show that this end-to-end SLU approach reaches the best results published to date on this task. We compare our approach to a classical pipeline approach that combines ASR with NLP tools (POS tagging, lemmatization, chunking, and others) to enrich the ASR outputs before they are fed to a text-to-concepts SLU system. Finally, we explore the promising capacity of our end-to-end SLU approach to address the problem of domain portability.

https://hal.archives-ouvertes.fr/hal-02307813
Contributor: Yannick Estève
Submitted on: Thursday, October 10, 2019 - 12:08:16 AM
Last modification on: Tuesday, October 15, 2019 - 11:11:47 AM

Citation

Antoine Caubrière, Natalia Tomashenko, Antoine Laurent, Emmanuel Morin, Nathalie Camelin, et al. Curriculum-Based Transfer Learning for an Effective End-to-End Spoken Language Understanding and Domain Portability. Interspeech 2019, Sep 2019, Graz, Austria. pp. 1198-1202, ⟨10.21437/Interspeech.2019-1832⟩. ⟨hal-02307813⟩
