Enhanced Search and Navigation on Conversational Speech
Résumé
Huge amounts of conversational speech continually flow through call centers world wide but remain inaccessible. In the context of a French research project, we adapted our industrial search and navigation engine as to be able to process conversational speech. Our full text search engine indexes the transcripts of an automatic speech recognition system. We adapted our processing at two crucial levels: text analysis and user interface. To tackle the problem of disfluencies, a special language model has been developed for the integrated part-of-speech tagger. This text based approach enables the use of current named entity recognition and data mining methods. The user interface takes into account the nature of conversational speech documents without leaving the user behind. We will demonstrate the operational search and navigation engine with a special accent on the user interface. The index will contain a 150h corpus of automatic transcripts and some of the corresponding anonymized audio files.