Bicameral Structuring and Synthetic Imagery for Jointly Predicting Instance Boundaries and Nearby Occlusions from a Single Image

Matthieu Grard 1, 2 Emmanuel Dellandréa 2 Liming Chen 2
2 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : Oriented boundary detection is a challenging task aimed at both delineating category-agnostic object instances and inferring their spatial layout from a single RGB image. State-of-the-art deep convolutional networks for this task rely on two independent streams that predict boundaries and occlusions respectively, although both require similar local and global cues, and occlusions cause boundaries. We therefore propose a fully convolutional bicameral structuring, composed of two cascaded decoders sharing one deep encoder, linked altogether by skip connections to combine local and global features, for jointly predicting instance boundaries and their unoccluded side. Furthermore, state-of-the-art datasets contain real images with few instances and occlusions mostly due to objects occluding the background, thereby missing meaningful occlusions between instances. For evaluating the missing scenario of dense piles of objects as well, we introduce synthetic data (Mikado), which extensibly contains more instances and inter-instance occlusions per image than the PASCAL Instance Occlusion Dataset (PIOD), the COCO Amodal dataset (COCOA), and the Densely Segmented Supermarket Amodal dataset (D2SA). We show that the proposed network design outperforms the two-stream baseline and alternative archiectures for oriented boundary detection on both PIOD and Mikado, and the amodal segmentation approach on COCOA as well. Our experiments on D2SA also show that Mikado is plausible in the sense that it enables the learning of performance-enhancing representations transferable to real data, while drastically reducing the need of hand-made annotations for finetuning.
Complete list of metadatas

Cited literature [75 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02151828
Contributor : Matthieu Grard <>
Submitted on : Thursday, June 13, 2019 - 10:51:26 AM
Last modification on : Wednesday, November 20, 2019 - 3:21:33 AM

Files

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02151828, version 1
  • ARXIV : 1906.07480

Citation

Matthieu Grard, Emmanuel Dellandréa, Liming Chen. Bicameral Structuring and Synthetic Imagery for Jointly Predicting Instance Boundaries and Nearby Occlusions from a Single Image. 2019. ⟨hal-02151828⟩

Share

Metrics

Record views

32

Files downloads

59