Audiovisual diarization of people in video content - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Multimedia Tools and Applications Année : 2012

Audiovisual diarization of people in video content

Résumé

Audio-Visual People Diarization (AVPD) is an original framework that simultaneously improves audio, video, and audiovisual diarization results. Following a literature review of people diarization for both audio and video content and their limitations, which includes our own contributions, we describe a proposed method for associating both audio and video information by using co-occurrence matrices and present experiments which were conducted on a corpus containing TV news, TV debates, and movies. Results show the effectiveness of the overall diarization system and confirm the gains audio information can bring to video indexing and vice versa.

Dates et versions

hal-03220748 , version 1 (07-05-2021)

Identifiants

Citer

Elie El Khoury, Christine Sénac, Philippe Joly. Audiovisual diarization of people in video content. Multimedia Tools and Applications, 2012, 68, pp.747--775. ⟨10.1007/s11042-012-1080-6⟩. ⟨hal-03220748⟩
45 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More