Quaternion Denoising Encoder-Decoder for Theme Identification of Telephone Conversations

Abstract : In the last decades, encoder-decoders or autoencoders (AE) have received a great interest from researchers due to their capability to construct robust representations of documents in a low dimensional subspace. Nonetheless, autoencoders reveal little in way of spoken document internal structure by only considering words or topics contained in the document as an isolate basic element, and tend to overfit with small corpus of documents. Therefore, Quaternion Multi-layer Perceptrons (QMLP) have been introduced to capture such internal latent dependencies , whereas denoising autoencoders (DAE) are composed with different stochastic noises to better process small set of documents. This paper presents a novel autoencoder based on both hitherto-proposed DAE (to manage small corpus) and the QMLP (to consider internal latent structures) called "Quater-nion denoising encoder-decoder" (QDAE). Moreover, the paper defines an original angular Gaussian noise adapted to the speci-ficity of hyper-complex algebra. The experiments, conduced on a theme identification task of spoken dialogues from the DE-CODA framework, show that the QDAE obtains the promising gains of 3% and 1.5% compared to the standard real valued de-noising autoencoder and the QMLP respectively.
Document type :
Conference papers
Complete list of metadatas

Cited literature [18 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02107632
Contributor : Titouan Parcollet <>
Submitted on : Tuesday, April 23, 2019 - 5:22:30 PM
Last modification on : Friday, May 10, 2019 - 1:25:57 AM

File

0a47312b0c016633a9865255d8b2e6...
Files produced by the author(s)

Identifiers

Collections

Citation

Titouan Parcollet, Mohamed Morchid, Georges Linarès. Quaternion Denoising Encoder-Decoder for Theme Identification of Telephone Conversations. Interspeech 2017, Aug 2017, Stockholm, Sweden. pp.3325-3328, ⟨10.21437/Interspeech.2017-1029⟩. ⟨hal-02107632⟩

Share

Metrics

Record views

3

Files downloads

21