Search - Laboratoire Jean Kuntzmann

12 results

Audio-Visual Multiple-Speaker Tracking for Robot Perception

Yutong Ban
Artificial Intelligence [cs.AI]. Université Grenoble Alpes, 2019. English. ⟨NNT : 2019GREAM017⟩
Thesis tel-02163418v4

Tracking Multiple Persons Based on a Variational Bayesian Model

Yutong Ban, Sileye Ba, Xavier Alameda-Pineda, Radu Horaud
Computer Vision – ECCV 2016 Workshops, Oct 2016, Amsterdam, Netherlands. pp.52-67, ⟨10.1007/978-3-319-48881-3_5⟩
Conference paper hal-01359559v2

Exploiting the Complementarity of Audio and Visual Data in Multi-Speaker Tracking

Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud
ICCVW 2017 - IEEE International Conference on Computer Vision Workshops, Oct 2017, Venice, Italy. pp.446-454, ⟨10.1109/ICCVW.2017.60⟩
Conference paper hal-01577965v1

Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers

Yutong Ban, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43 (5), pp.1761-1776. ⟨10.1109/TPAMI.2019.2953020⟩
Journal article hal-01950866v2

ODANet: Online Deep Appearance Network for Identity-Consistent Multi-Person Tracking

Guillaume Delorme, Yutong Ban, Guillaume Sarrazin, Xavier Alameda-Pineda
ICPR 2021 - 25th International Conference on Pattern Recognition / Workshops, Jan 2021, Milano / Virtual, Italy. pp.803-818, ⟨10.1007/978-3-030-68780-9_60⟩
Conference paper hal-03188744v2

Tracking Multiple Audio Sources with the Von Mises Distribution and Variational EM

Yutong Ban, Xavier Alameda-Pineda, Christine Evers, Radu Horaud
IEEE Signal Processing Letters, 2019, 26 (6), pp.798-802. ⟨10.1109/LSP.2019.2908376⟩
Journal article hal-01969050v1

Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking

Yutong Ban, Xiaofei Li, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. pp.6553-6557, ⟨10.1109/ICASSP.2018.8462100⟩
Conference paper hal-01718114v1

A Cascaded Multiple-Speaker Localization and Tracking System

Xiaofei Li, Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud
IWAENC - LOCATA Challenge Workshop - a satellite event of IWAENC 2018, Sep 2018, Tokyo, Japan. pp.1-5
Conference paper hal-01957137v1

Tracking a Varying Number of People with a Visually-Controlled Robotic Head

Yutong Ban, Xavier Alameda-Pineda, Fabien Badeig, Sileye Ba, Radu Horaud
IEEE/RSJ International Conference on Intelligent Robots and Systems, Sep 2017, Vancouver, Canada. pp.4144-4151, ⟨10.1109/IROS.2017.8206274⟩
Conference paper hal-01542987v2

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments

Xiaofei Li, Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud
IEEE Journal of Selected Topics in Signal Processing, 2019, 13 (1), pp.88-103. ⟨10.1109/JSTSP.2019.2903472⟩
Journal article hal-01851985v2

How To Train Your Deep Multi-Object Tracker

Yihong Xu, Aljosa Osep, Yutong Ban, Radu Horaud, Laura Leal-Taixé, et al.
IEEE Conference on Computer Vision and Pattern Recognition, Jun 2020, Seattle WA, United States. pp.6786-6795, ⟨10.1109/CVPR42600.2020.00682⟩
Conference paper hal-02534894v1

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots

Xavier Alameda-Pineda, Soraya Arias, Yutong Ban, Guillaume Delorme, Laurent Girin, et al.
ACMMM 2019 - 27th ACM International Conference on Multimedia, Oct 2019, Nice, France. pp.1059-1061, ⟨10.1145/3343031.3350590⟩
Conference paper hal-02354514v1