Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities in Conjunction with The 2017 International Joint Conference on Artificial Intelligence

G. Amato, P. Bolettieri, V. Monteiro de Lira, C. I. Muntean, R. Perego, and C. Renso, Social Media Image Recognition for Food Trend Analysis, SIGIR, pp.1333-1336, 2017.

G. Andrew, R. Arora, J. Bilmes, and K. Livescu, Deep canonical correlation analysis, ICML, pp.1247-1255, 2013.

F. R. Bach and M. I. Jordan, Kernel independent component analysis, Journal of Machine Learning Research, vol.3, pp.1-48, 2002.

O. Beijbom, N. Joshi, D. Morris, S. Saponas, and S. Khullar, Menu-Match: Restaurant-Specific Food Logging from Images, 2015 IEEE Winter Conference on Applications of Computer Vision, pp.844-851, 2015.

L. Bossard, M. Guillaumin, and L. Van Gool, Food-101 - Mining Discriminative Components with Random Forests, ECCV, 2014.

J. Chen and C. Ngo, Deep-based ingredient recognition for cooking recipe retrieval, Proceedings of the 2016 ACM on Multimedia Conference, pp.32-41, 2016.

J. Chen and C. Ngo, Deep-based Ingredient Recognition for Cooking Recipe Retrieval, MultiMedia Modeling, pp.32-41, 2016.

J. Chen, L. Pang, and C. Ngo, Cross-Modal Recipe Retrieval: How to Cook this Dish, MultiMedia Modeling, pp.588-600, 2017.

M. Chen, K. Dhingra, W. Wu, L. Yang, R. Sukthankar et al., PFID: Pittsburgh fast-food image dataset, ICIP, pp.289-292, 2009.

D. Elsweiler, C. Trattner, and M. Harvey, Exploiting Food Choice Biases for Healthier Recipe Recommendation, SIGIR, pp.575-584, 2017.

G. M. Farinella, D. Allegra, and F. Stanco, A Benchmark Dataset to Study the Representation of Food Images, pp.584-599, 2015.

R. Hadsell, S. Chopra, and Y. LeCun, Dimensionality Reduction by Learning an Invariant Mapping, CVPR, pp.1735-1742, 2006.

J. Harashima, Y. Someya, and Y. Kikuta, Cookpad Image Dataset: An Image Collection As Infrastructure for Food Research, SIGIR, pp.1229-1232, 2017.

Z. Harris, Distributional structure, Word, vol.10, pp.146-162, 1954.

K. He, X. Zhang, S. Ren, and J. Sun, Deep Residual Learning for Image Recognition, 2015.

S. Hochreiter and J. Schmidhuber, Long Short-Term Memory, Neural Comput, vol.9, pp.1735-1780, 1997.

H. Hotelling, Relations between two sets of variates, Biometrika, vol.28, issue.4, pp.321-377, 1936.

J. Hu, J. Lu, and Y. P. Tan, Discriminative Deep Metric Learning for Face Verification in the Wild, CVPR, pp.1875-1882, 2014.

J. Jeon, V. Lavrenko, and R. Manmatha, Automatic Image Annotation and Retrieval Using Cross-media Relevance Models, SIGIR, pp.119-126, 2003.

A. Karpathy and L. Fei-Fei, Deep visual-semantic alignments for generating image descriptions, CVPR, pp.3128-3137, 2015.

Y. Kawano and K. Yanai, Food image recognition with deep convolutional features, UbiComp '14, pp.589-593, 2014.

Y. Kawano and K. Yanai, FoodCam: A Real-Time Mobile Food Recognition System Employing Fisher Vector, MMM, pp.369-373, 2014.

D. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

R. Kiros, R. Salakhutdinov, and R. S. Zemel, Unifying visual-semantic embeddings with multimodal neural language models, TACL, 2015.

R. Kiros, Y. Zhu, R. Salakhutdinov, R. S. Zemel et al., Skip-Thought Vectors, NIPS, pp.3294-3302, 2015.

T. Kusmierczyk and K. Nørvåg, Online Food Recipe Title Semantics: Combining Nutrient Facts and Topics, CIKM, pp.2013-2016, 2016.

T. Kusmierczyk, C. Trattner, and K. Nørvåg, Understanding and predicting online food recipe production patterns, HT, pp.243-248, 2016.

P. L. Lai and C. Fyfe, Kernel and nonlinear canonical correlation analysis, International Journal of Neural Systems, vol.10, pp.365-377, 2000.

M. T. Law, N. Thome, and M. Cord, Quadruplet-wise image similarity learning, ICCV, pp.249-256, 2013.

A. Lazaridou, N. T. Pham, and M. Baroni, Combining Language and Vision with a Multimodal Skip-gram Model, NAACL HLT, pp.153-163, 2015.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, NIPS, pp.3111-3119, 2013.

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh et al., ImageNet Large Scale Visual Recognition Challenge, IJCV, vol.115, pp.211-252, 2015.

A. Salvador, N. Hynes, Y. Aytar, J. Marin, F. Ofli et al., Learning Cross-modal Embeddings for Cooking Recipes and Food Images, CVPR, 2017.

S. Sanjo and M. Katsurai, Recipe Popularity Prediction with Deep Visual-Semantic Fusion, CIKM, pp.2279-2282, 2017.

F. Schroff, D. Kalenichenko, and J. Philbin, FaceNet: A unified embedding for face recognition and clustering, CVPR, pp.815-823, 2015.

A. Sun, S. S. Bhowmick, K. T. N. Nguyen, and G. Bai, Tag-based Social Image Retrieval: An Empirical Evaluation, J. Am. Soc. Inf. Sci. Technol, vol.62, pp.2364-2381, 2011.

C. Trattner and D. Elsweiler, Investigating the Healthiness of Internet-Sourced Recipes: Implications for Meal Planning and Recommender Systems, pp.489-498, 2017.

X. Wang, D. Kumar, N. Thome, M. Cord, and F. Precioso, Recipe recognition with large multimodal food dataset, pp.1-6, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01196959

K. Q. Weinberger and L. K. Saul, Distance Metric Learning for Large Margin Nearest Neighbor Classification, J. Mach. Learn. Res, vol.10, pp.207-244, 2009.

J. Wu, Z. Lin, and H. Zha, Joint Latent Subspace Learning and Regression for Cross-Modal Retrieval, SIGIR, pp.917-920, 2017.

E. P. Xing, M. I. Jordan, S. J. Russell, and A. Y. Ng, Distance Metric Learning with Application to Clustering with Side-Information, NIPS, pp.521-528, 2003.

F. Yan and K. Mikolajczyk, Deep correlation for matching images and text, CVPR, pp.3441-3450, 2015.