, Acknowledgements COMPRISE has received funding from the European Union's Horizon 2020 Research and Innovation Programme under Grant Agreement No. 825081. 6. Bibliographical References

E. Byambakhishig, K. Tanaka, R. Aihara, T. Nakashika, T. Takiguchi et al., Error correction of automatic speech recognition based on normalized web distance, Fifteenth Annual Conference of the International Speech Communication Association (INTERSPEECH), pp.2852-2856, 2014.

J. C. Chou, C. C. Yeh, and H. Y. Lee, One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization, Proc. INTERSPEECH, pp.664-668, 2019.

. H2020-comprise-project, Deliverable D2.1: Baseline speech and text transformation and model learning library, 2019.

S. Kang, J. H. Kim, and J. Seo, Post-error correction in automatic speech recognition using discourse information, Advances in Electrical and Computer Engineering, vol.14, issue.2, pp.53-57, 2014.

Y. Oualil, M. Schulder, H. Helmke, A. Schmidt, and D. Klakow, Real-time integration of dynamic context information for improving automatic speech recognition, Sixteenth Annual Conference of the International Speech Communication Association (INTERSPEECH), pp.2107-21111, 2015.

J. Qian, H. Du, J. Hou, L. Chen, T. Jung et al., Hidebehind: Enjoy Voice Input with Voiceprint Unclonability and Anonymity, Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems, pp.82-94, 2018.

B. M. Srivastava, N. Vauquier, M. Sahidullah, A. Bellet, M. Tommasi et al., Evaluating voice conversion-based privacy protection against informed attackers, Proc. ICASSP, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02355115

D. Sundermann and H. Ney, VTLNbased voice conversion, Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology, pp.556-559, 2003.

Y. C. Tam, Y. Lei, J. Zheng, and W. Wang, ASR error detection using recurrent neural network language model and complementary ASR, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2312-2316, 2014.

Z. Zhou, X. Song, R. Botros, and L. Zhao, A Neural Network Based Ranking Framework to Improve ASR with NLU Related Knowledge Deployed, ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.6450-6454, 2019.