TS-NET: COMBINING MODALITY SPECIFIC AND COMMON FEATURES FOR MULTIMODAL PATCH MATCHING

Sovann En 1 Alexis Lechervy 1 Frédéric Jurie 1
1 Equipe Image - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen
Abstract : Multimodal patch matching addresses the problem of finding the correspondences between image patches from two different modalities, e.g. RGB vs sketch or RGB vs near-infrared. The comparison of patches of different modalities can be done by discovering the information common to both modalities (Siamese like approaches) or the modality-specific information (Pseudo-Siamese like approaches). We observed that none of these two scenarios is optimal. This motivates us to propose a three-stream architecture, dubbed as TS-Net, combining the benefits of the two. In addition, we show that adding extra constraints in the intermediate layers of such networks further boosts the performance. Experimentations on three multimodal datasets show significant performance gains in comparison with Siamese and Pseudo-Siamese networks.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [20 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01804551
Contributor : Frederic Jurie <>
Submitted on : Thursday, May 31, 2018 - 5:14:17 PM
Last modification on : Thursday, February 7, 2019 - 2:46:44 PM
Document(s) archivé(s) le : Saturday, September 1, 2018 - 2:56:04 PM

Files

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01804551, version 1
  • ARXIV : 1806.01550

Citation

Sovann En, Alexis Lechervy, Frédéric Jurie. TS-NET: COMBINING MODALITY SPECIFIC AND COMMON FEATURES FOR MULTIMODAL PATCH MATCHING. ICIP, IEEE International Conference on Image Processing, Oct 2018, Athens, Greece. ⟨hal-01804551⟩

Share

Metrics

Record views

102

Files downloads

85