Multiview approaches to event detection and scene analysis

Abstract : This chapter addresses sound scene and event classification in multiview settings, that is, settings where the observations are obtained from multiple sensors, each sensor contributing a particular view of the data (e.g., audio microphones, video cameras, etc.). We briefly introduce some of the techniques that can be exploited to effectively combine the data conveyed by the different views under analysis for a better interpretation. We first provide a high-level presentation of generic methods that are particularly relevant in the context of multiview and multimodal sound scene analysis. Then, we more specifically present a selection of techniques used for audiovisual event detection and microphone array-based scene analysis.
Complete list of metadatas

Cited literature [176 references]  Display  Hide  Download
Contributor : Romain Serizel <>
Submitted on : Thursday, November 16, 2017 - 5:31:21 PM
Last modification on : Thursday, January 9, 2020 - 12:08:07 PM



Slim Essid, Sanjeel Parekh, Ngoc Duong, Romain Serizel, Alexey Ozerov, et al.. Multiview approaches to event detection and scene analysis. Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer, pp.243-276, 2017, 978-3319634494. ⟨10.1007/978-3-319-63450-0_9⟩. ⟨hal-01620341⟩



Record views


Files downloads