Recognition and Localization of Food in Cooking Videos

Nachwa Aboubakr 1 Rémi Ronfard 2 James L. Crowley 1
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
2 IMAGINE - Intuitive Modeling and Animation for Interactive Graphics & Narrative Environments
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : In this paper, we describe experiments with techniques for locating foods and recognizing food states in cooking videos. We describe production of a new data set that provides annotated images for food types and food states. We compare results with two techniques for detecting food types and food states, and then show that recognizing type and state with separate classifiers improves recognition results. We then use this to provide detection of composite activation maps for food types. The results provide a promising first step towards construction of narratives for cooking actions.
Document type :
Conference papers
Complete list of metadatas
Contributor : Nachwa Aboubakr <>
Submitted on : Friday, July 6, 2018 - 7:54:43 PM
Last modification on : Thursday, May 2, 2019 - 3:30:30 PM



Nachwa Aboubakr, Rémi Ronfard, James L. Crowley. Recognition and Localization of Food in Cooking Videos. Joint Workshop on Multimedia for Cooking and Eating Activities and Multimedia Assisted Dietary Management (CEA-MADiMa 2018), Jul 2018, Stockholm, Sweden. pp.21-24, ⟨10.1145/3230519.3230590⟩. ⟨hal-01815512v3⟩



Record views


Files downloads