Filtrer vos résultats
- 62
- 20
- 48
- 26
- 3
- 2
- 1
- 1
- 1
- 2
- 79
- 18
- 5
- 2
- 1
- 1
- 8
- 7
- 12
- 8
- 4
- 7
- 10
- 6
- 3
- 3
- 6
- 5
- 2
- 82
- 82
- 65
- 27
- 9
- 4
- 4
- 3
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 82
- 40
- 27
- 14
- 11
- 11
- 10
- 8
- 6
- 6
- 6
- 5
- 5
- 4
- 4
- 4
- 4
- 4
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
82 résultats
|
Towards Probabilistic Generative Models for Socially Intelligent RobotsComputer Vision and Pattern Recognition [cs.CV]. Université Grenoble - Alpes, 2020
HDR
tel-03192456v1
|
||
|
Egocentric Audio-Visual Scene Analysis : a machine learning and signal processing approachGeneral Mathematics [math.GM]. Université de Grenoble, 2013. English. ⟨NNT : 2013GRENM024⟩
Thèse
tel-00880117v2
|
||
|
Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational AutoencodersIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2020, Barcelona, Spain. pp.7534-7538, ⟨10.1109/ICASSP40776.2020.9053730⟩
Communication dans un congrès
hal-02534911v1
|
||
|
Geometrically-constrained Robust Time Delay Estimation Using Non-coplanar Microphone ArraysEUSIPCO 2012 - 20th European Signal Processing Conference, Aug 2012, Bucharest, Romania. pp.1309-1313
Communication dans un congrès
hal-00768763v1
|
||
|
Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech EnhancementICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. pp.1-5, ⟨10.1109/ICASSP39728.2021.9414097⟩
Communication dans un congrès
hal-03155445v1
|
||
|
Geometrically-constrained time delay estimation-based sound source localisation (gTDESSL)[Research Report] RR-7988, INRIA. 2012, pp.28
Rapport
hal-00704986v2
|
||
|
A Geometric Approach to Sound Source Localization from Time-Delay Estimates2013
Rapport
hal-00910081v2
|
||
|
Vision-Guided Robot HearingThe International Journal of Robotics Research, 2015, 34 (4-5), pp.437-456. ⟨10.1177/0278364914548050⟩
Article dans une revue
hal-00990766v1
|
||
|
A Geometric Approach to Sound Source Localization from Time-Delay EstimatesIEEE Transactions on Audio, Speech and Language Processing, 2014, 22 (6), pp.1082-1095. ⟨10.1109/TASLP.2014.2317989⟩
Article dans une revue
hal-00910081v3
|
||
|
Successor Feature Neural Episodic ControlNeurIPS 2021 - 35th International Conference on Neural Information Processing Systems, Dec 2021, Virtual, Canada. pp.1-12
Communication dans un congrès
hal-03426874v1
|
||
|
Mixture of Inference Networks for VAE-based Audio-visual Speech EnhancementIEEE Transactions on Signal Processing, 2021, 69, pp.1899-1909. ⟨10.1109/TSP.2021.3066038⟩
Article dans une revue
hal-02926172v2
|
||
|
Successor Feature RepresentationsTransactions on Machine Learning Research Journal, 2023, pp.1-35
Article dans une revue
hal-03426870v1
|
||
Multimodal behavior analysis in the wildAcademic Press (Elsevier), 2018
Ouvrages
hal-01858395v1
|
|||
|
Variational Inference and Learning of Piecewise-linear Dynamical SystemsIEEE Transactions on Neural Networks and Learning Systems, 2022, 33 (8), pp.3753 - 3764. ⟨10.1109/TNNLS.2021.3054407⟩
Article dans une revue
hal-02745527v3
|
||
|
The Geometry of Sound-Source Localization using Non-Coplanar microphone ArraysWASPAA 2013 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2013, New Paltz, United States. pp.1-4, ⟨10.1109/WASPAA.2013.6701896⟩
Communication dans un congrès
hal-00848876v1
|
||
|
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and SeparationTransactions on Machine Learning Research Journal, 2024, pp.1-19
Article dans une revue
hal-03584014v1
|
||
|
Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory MappingIEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩
Article dans une revue
hal-01485540v1
|
||
|
Benchmarking Methods for Audio-Visual Recognition Using Tiny Training SetsICASSP 2013 - IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2013, Vancouver, Canada. pp.3662-3666, ⟨10.1109/ICASSP.2013.6638341⟩
Communication dans un congrès
hal-00861645v1
|
||
|
PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose EstimationWACV 2021 - IEEE Winter Conference on Applications of Computer vision, Jan 2021, Waikoloa, United States. pp.1-11, ⟨10.1109/WACV48630.2021.00284⟩
Communication dans un congrès
hal-02971754v1
|
||
|
Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR FrameworkLVA/ICA 2017 - 13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, France. pp.459-468, ⟨10.1007/978-3-319-53547-0_43⟩
Communication dans un congrès
hal-01646098v1
|
||
|
A Comprehensive Analysis of Deep RegressionIEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42 (9), pp.2065-2081. ⟨10.1109/TPAMI.2019.2910523⟩
Article dans une revue
hal-01754839v1
|
||
|
Expression-preserving face frontalization improves visually assisted speech processingInternational Journal of Computer Vision, 2023, 131 (5), pp.1122-1140. ⟨10.1007/s11263-022-01742-1⟩
Article dans une revue
hal-03902610v2
|
||
Speaker-Adaptive Acoustic-Articulatory Inversion using Cascaded Gaussian Mixture RegressionIEEE/ACM Transactions on Audio, Speech and Language Processing, 2015, 23 (12), pp.2246-2259. ⟨10.1109/TASLP.2015.2464702⟩
Article dans une revue
hal-01231197v1
|
|||
|
A Recurrent Variational Autoencoder for Speech EnhancementICASSP 2020 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, May 2020, Barcelone (virtual), Spain. pp.371-375, ⟨10.1109/ICASSP40776.2020.9053164⟩
Communication dans un congrès
hal-02329000v2
|
||
|
DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture ModelECCV 2018 - European Conference on Computer Vision, Sep 2018, Munich, Germany. pp.205-221, ⟨10.1007/978-3-030-01228-1_13⟩
Communication dans un congrès
hal-01851511v1
|
||
|
Tracking Multiple Persons Based on a Variational Bayesian ModelComputer Vision – ECCV 2016 Workshops, Oct 2016, Amsterdam, Netherlands. pp.52-67, ⟨10.1007/978-3-319-48881-3_5⟩
Communication dans un congrès
hal-01359559v2
|
||
|
Sound Representation and Classification Benchmark for Domestic RobotsICRA 2014 - IEEE International Conference on Robotics and Automation, May 2014, Hong Kong, China. pp.6285-6292, ⟨10.1109/ICRA.2014.6907786⟩
Communication dans un congrès
hal-00952092v1
|
||
|
Sound-Event Recognition with a Companion HumanoidHumanoids 2012 - IEEE International Conference on Humanoid Robotics, Nov 2012, Osaka, Japan. pp.104-111, ⟨10.1109/HUMANOIDS.2012.6651506⟩
Communication dans un congrès
hal-00768767v1
|
||
|
A Proposal-based Paradigm for Self-supervised Sound Source Localization in VideosCVPR 2022 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States. pp.1-10, ⟨10.1109/CVPR52688.2022.00110⟩
Communication dans un congrès
hal-03626420v1
|
||
|
CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentificationICPR 2020 - 25th International Conference on Pattern Recognition, Jan 2021, Milano, Italy. pp.4428-4435, ⟨10.1109/ICPR48806.2021.9412431⟩
Communication dans un congrès
hal-02882285v1
|