Semantic and Phonetic Automatic Reconstruction of Medical Dictations
Abstract
Automatic speech recognition (ASR) has become a valuable tool in large-scale document production environments such as medical dictation. Although manual post-processing is still needed to correct speech-recognition errors and to produce documents that adhere to various stylistic and formatting conventions, a large part of the document production process is carried out by the ASR system. Improving the quality of the system output requires knowledge about the multi-layered relationship between the dictated texts and the final documents. With this knowledge, typical speech-recognition errors can be avoided, and proper style and formatting can be anticipated in the ASR part of the document production process. Yet while vast amounts of recognition results and manually edited final reports are constantly being produced, error-free literal transcripts of the actually dictated texts are a scarce and costly resource, because they have to be created by manually transcribing the audio files.
Domains
Linguistics
Source: Files produced by the author(s)