
Pedestrian attribute recognition with part-based CNN and combined feature representations

Abstract : In video surveillance, pedestrian attributes such as gender, clothing or hair type are useful cues for identifying people. The main challenge in pedestrian attribute recognition is the large variation in the visual appearance and location of attributes caused by different poses and camera views. In this paper, we propose a neural network that combines high-level learnt Convolutional Neural Network (CNN) features with low-level handcrafted features to address the problem of highly varying viewpoints. We first extract robust low-level Local Maximal Occurrence (LOMO) features and learn a body part-specific CNN to model attribute patterns related to different body parts. For small datasets with little training data, we propose a new learning strategy in which the CNN is pre-trained in a triplet structure on a person re-identification task and then fine-tuned on attribute recognition. Finally, we fuse the two feature representations to recognise pedestrian attributes. Our approach achieves state-of-the-art results on three public pedestrian attribute datasets.
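
The triplet pre-training and feature fusion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the loss formulation, the margin, or the fusion operator, so everything here is an assumption for illustration only: the function names are hypothetical, the loss is the common hinge-style triplet loss on Euclidean distances, the margin of 1.0 is arbitrary, and fusion is shown as plain concatenation.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge-style triplet loss (assumed form, not taken from the paper).

    The loss is zero once the anchor is closer to the positive (same person)
    than to the negative (different person) by at least `margin`.
    """
    gap = euclidean(anchor, positive) - euclidean(anchor, negative)
    return max(0.0, gap + margin)


def fuse(cnn_features, lomo_features):
    """Fuse learnt and handcrafted features by concatenation (an assumption)."""
    return list(cnn_features) + list(lomo_features)


# Toy 2-D embeddings: the triplet is already well separated, so the loss is zero.
well_separated = triplet_loss([0.0, 0.0], [0.0, 0.0], [3.0, 4.0])

# Swapping positive and negative violates the margin and yields a positive loss.
violated = triplet_loss([0.0, 0.0], [3.0, 4.0], [0.0, 0.0])

# The fused vector would then feed an attribute classifier (not shown).
fused = fuse([0.1, 0.2], [0.3])
```

In a real pipeline the embeddings would come from the part-based CNN and the handcrafted vector from a LOMO extractor; both are replaced by toy lists here.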
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01625470
Contributor : Yiqiang Chen
Submitted on : Thursday, June 21, 2018 - 11:55:46 AM
Last modification on : Wednesday, July 8, 2020 - 12:43:46 PM
Document(s) archived on : Tuesday, September 25, 2018 - 12:14:40 AM

File

chen_visapp2018.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01625470, version 1

Citation

Yiqiang Chen, Stefan Duffner, Andrei Stoian, Jean-Yves Dufour, Atilla Baskurt. Pedestrian attribute recognition with part-based CNN and combined feature representations. VISAPP2018, Jan 2018, Funchal, Portugal. ⟨hal-01625470⟩

Metrics

Record views

221

File downloads

350