R. Sarikaya, G. E. Hinton, and A. Deoras, Application of deep belief networks for natural language understanding, IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), vol.22, issue.4, pp.778-784, 2014.

I. V. Serban, A. Sordoni, Y. Bengio, A. Courville, and J. Pineau, Building end-to-end dialogue systems using generative hierarchical neural network models, Thirtieth AAAI Conference on Artificial Intelligence, 2016.

Y. Chen, D. Hakkani-tür, G. Tür, J. Gao, and L. Deng, End-toend memory networks with knowledge carryover for multi-turn spoken language understanding, Interspeech, pp.3245-3249, 2016.

P. Haghani, A. Narayanan, M. Bacchiani, G. Chuang, N. Gaur et al., From audio to semantics: Approaches to end-to-end spoken language understanding, 2018 IEEE Spoken Language Technology Workshop (SLT), pp.720-726, 2018.

D. Serdyuk, Y. Wang, C. Fuegen, A. Kumar, B. Liu et al., Towards end-to-end spoken language understanding, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5754-5758, 2018.

Y. Liu and S. Li, Recognizing implicit discourse relations via repeated reading: Neural networks with multi-level attention, 2016.

P. Li, W. Lam, L. Bing, W. Guo, and H. Li, Cascaded attention based unsupervised information distillation for compressive summarization, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp.2081-2090, 2017.

J. Zhang, H. Luan, M. Sun, F. Zhai, J. Xu et al., Improving the transformer translation model with document-level context, 2018.

G. Tur and R. Mori, Spoken language understanding: Systems for extracting semantic information from speech, 2011.

M. Morchid, R. Dufour, G. Linares, and Y. Hamadi, Latent topic model based representations for a robust theme identification of highly imperfect automatic transcriptions, Computational Linguistics and Intelligent Text Processing, pp.596-605, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01293908

K. Janod, M. Morchid, R. Dufour, G. Linares, and R. Mori, Deep stacked autoencoders for spoken language understanding, ISCA INTERSPEECH, vol.1, issue.2, 2016.
URL : https://hal.archives-ouvertes.fr/hal-02356395

C. Xiong, V. Zhong, and R. Socher, Dynamic coattention networks for question answering, 2016.

N. Ryant, E. Bergelson, K. Church, A. Cristià, J. Du et al., Enhancement and analysis of conversational speech, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5154-5158, 2017.

E. M. Hoey and K. H. Kendrick, Research methods in psycholinguistics: A practical guide, pp.151-173, 2017.

R. Pappagari, J. Villalba, and N. Dehak, Joint verification-identification in end-to-end multi-scale cnn framework for topic identification, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.6199-6203, 2018.

Y. Esteve, M. Bouallegue, C. Lailler, M. Morchid, R. Dufour et al., Integration of word and semantic features for theme identification in telephone conversations, Natural Language Dialog Systems and Intelligent Assistants, pp.223-231, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01433213

K. Janod, M. Morchid, R. Dufour, G. Linares, and R. Mori, Denoised bottleneck features from deep autoencoders for telephone conversation analysis, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.9, pp.1809-1820, 2017.
URL : https://hal.archives-ouvertes.fr/hal-02356138

J. Sun, W. Guo, Z. Chen, and Y. Song, Topic detection in conversational telephone speech using cnn with multi-stream inputs, ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.7285-7289, 2019.

S. Mobin and B. Olshausen, Auditory separation of a conversation from background via attentional gating, 2019.

T. Parcollet, M. Morchid, P. Bousquet, R. Dufour, G. Linarès et al., Quaternion neural networks for spoken language understanding, Spoken Language Technology Workshop (SLT), pp.362-368, 2016.
URL : https://hal.archives-ouvertes.fr/hal-02107532

W. R. Hamilton, Ii. on quaternions; or on a new system of imaginaries in algebra, The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, vol.25, issue.163, pp.10-13, 1844.
URL : https://hal.archives-ouvertes.fr/in2p3-00008076

S. J. Sangwine, Fourier transforms of colour images using quaternion or hypercomplex, numbers, Electronics letters, vol.32, issue.21, pp.1979-1980, 1996.

S. Pei and C. Cheng, Color image processing by using binary quaternion-moment-preserving thresholding technique, IEEE Transactions on Image Processing, vol.8, issue.5, pp.614-628, 1999.

N. A. Aspragathos and J. K. Dimitros, A comparative study of three methods for robot kinematics, Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on, vol.28, pp.135-145, 1998.

P. Arena, L. Fortuna, L. Occhipinti, and M. G. Xibilia, Neural networks for quaternion-valued function approximation, Circuits and Systems, 1994. ISCAS'94, vol.6, pp.307-310, 1994.

P. Arena, L. Fortuna, G. Muscato, and M. G. Xibilia, Multilayer perceptrons to approximate quaternion valued functions, Neural Networks, vol.10, issue.2, pp.335-342, 1997.

A. Hirose and S. Yoshida, Generalization characteristics of complexvalued feedforward neural networks in relation to signal coherence, IEEE Transactions on Neural Networks and learning systems, vol.23, issue.4, pp.541-551, 2012.

M. Tygert, J. Bruna, S. Chintala, Y. Lecun, S. Piantino et al., A mathematical motivation for complex-valued convolutional networks, Neural computation, vol.28, issue.5, pp.815-825, 2016.

I. Danihelka, G. Wayne, B. Uria, N. Kalchbrenner, and A. Graves, Associative long short-term memory, 2016.

S. Wisdom, T. Powers, J. Hershey, J. L. Roux, and L. Atlas, Fullcapacity unitary recurrent neural networks, Advances in Neural Information Processing Systems, pp.4880-4888, 2016.

T. Parcollet, M. Morchid, and G. Linares, Quaternion denoising encoder-decoder for theme identification of telephone conversations, Proc. Interspeech, pp.3325-3328, 2017.
URL : https://hal.archives-ouvertes.fr/hal-02107632

, Deep quaternion neural networks for spoken language understanding, Automatic Speech Recognition and Understanding Workshop, pp.504-511, 2017.

C. J. Gaudet and A. S. Maida, Deep quaternion networks, 2018 International Joint Conference on Neural Networks (IJCNN), pp.1-8, 2018.

T. Parcollet, Y. Zhang, M. Morchid, C. Trabelsi, G. Linarès et al., Quaternion convolutional neural networks for end-toend automatic speech recognition, 19th Annual Conference of the International Speech Communication Association, pp.22-26, 2018.

,

T. Parcollet, M. Ravanelli, M. Morchid, G. Linarès, C. Trabelsi et al., Quaternion recurrent neural networks, International Conference on Learning Representations, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02107628

T. Parcollet, M. Morchid, G. Linarès, and R. Mori, Bidirectional quaternion long short-term memory recurrent neural networks for speech recognition, ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.8519-8523, 2019.

T. Isokawa, N. Matsui, and H. Nishimura, Quaternionic neural networks: Fundamental properties and applications, Complex-Valued Neural Networks: Utilizing High-Dimensional Parameters, pp.411-439, 2009.

N. Matsui, T. Isokawa, H. Kusamichi, F. Peper, and H. Nishimura, Quaternion neural network with geometrical operators, Journal of Intelligent & Fuzzy Systems, vol.15, issue.3, pp.149-164, 2004.

J. Diebel, Representing attitude: Euler angles, unit quaternions, and rotation vectors, Matrix, vol.58, pp.1-35, 2006.

X. Glorot and Y. Bengio, Understanding the difficulty of training deep feedforward neural networks, Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp.249-256, 2010.

V. Nair and G. E. Hinton, Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th international conference on machine learning (ICML-10), pp.807-814, 2010.

B. Karlik and A. V. Olgac, Performance analysis of various activation functions in generalized mlp architectures of neural networks, International Journal of Artificial Intelligence and Expert Systems, vol.1, issue.4, pp.111-122, 2011.

L. Trottier, P. Gigu, and B. Chaib-draa, Parametric exponential linear unit for deep convolutional neural networks, 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)

, IEEE, pp.207-214, 2017.

S. , D. Leo, and P. Rotelli, Local hypercomplex analyticity, 1997.

B. C. Ujang, C. Jahanchahi, C. C. Took, and D. Mandic, Quaternion valued neural networks and nonlinear adaptive filters, 2010.

B. C. Ujang, C. C. Took, and D. P. Mandic, Quaternion-valued nonlinear adaptive filtering, IEEE Transactions on Neural Networks, vol.22, issue.8, pp.1193-1206, 2011.

D. P. Mandic, C. Jahanchahi, and C. C. Took, A quaternion gradient operator and its applications, IEEE Signal Processing Letters, vol.18, issue.1, pp.47-50, 2011.

C. J. Willmott and K. Matsuura, Advantages of the mean absolute error (mae) over the root mean square error (rmse) in assessing average model performance, Climate research, vol.30, issue.1, pp.79-82, 2005.

T. Nitta, A quaternary version of the back-propagation algorithm, Neural Networks, 1995. Proceedings., IEEE International Conference on, vol.5, pp.2753-2756, 1995.

T. Isokawa, T. Kusakabe, N. Matsui, and F. Peper, Quaternion neural network and its application, International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, pp.318-324, 2003.

C. Trabelsi, O. Bilaniuk, Y. Zhang, D. Serdyuk, S. Subramanian et al., Deep complex networks, 2017.

D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent dirichlet allocation, Journal of machine Learning research, vol.3, pp.993-1022, 2003.

F. Bechet, B. Maza, N. Bigouroux, T. Bazillon, M. El-beze et al., Decoda: a call-centre human-human spoken conversation corpus, LREC, pp.1343-1347, 2012.

C. Lailler, A. Landeau, F. Béchet, Y. Estève, and P. Deléglise, Enhancing the ratp-decoda corpus with linguistic annotations for performing a large range of nlp tasks, LREC, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01433189

G. Linares, P. Nocéra, D. Massonie, and D. Matrouf, The lia speech recognition system: from 10xrt to 1xrt, Text, Speech and Dialogue, pp.302-308, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01318280

V. Van-asch, Macro-and micro-averaged evaluation measures, Belgium: CLiPS, 2013.

S. Robertson, Understanding inverse document frequency: on theoretical arguments for idf, Journal of documentation, vol.60, issue.5, pp.503-520, 2004.

R. Krestel, P. Fankhauser, and W. Nejdl, Latent dirichlet allocation for tag recommendation, Proceedings of the third ACM conference on Recommender systems, pp.61-68, 2009.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, Advances in neural information processing systems, pp.3111-3119, 2013.

A. Joulin, E. Grave, P. Bojanowski, and T. Mikolov, Bag of tricks for efficient text classification, 2016.