D. Aldavert, M. Rusiñol, R. Toledo, and J. Lladós, A study of Bag-of-Visual-Words representations for handwritten keyword spotting, International Journal on Document Analysis and Recognition (IJDAR), vol.30, issue.11, pp.223-234, 2015.
DOI : 10.1109/ICDAR.2013.169

U. L. Altintakan and A. Yazici, Towards Effective Image Classification Using Class-Specific Codebooks and Distinctive Local Features, IEEE Transactions on Multimedia, vol.17, issue.3, pp.323-332, 2015.
DOI : 10.1109/TMM.2014.2388312

D. Arthur and S. Vassilvitskii, k-means++: The advantages of careful seeding, Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, pp.1027-1035, 2007.

D. A. Chanti and A. Caplier, Spontaneous Facial Expression Recognition using Sparse Representation, Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, pp.64-74, 2017.
DOI : 10.5220/0006118000640074

URL : https://hal.archives-ouvertes.fr/hal-01485279

T. Furuya and R. Ohbuchi, Dense sampling and fast encoding for 3D model retrieval using bag-of-visual features, Proceeding of the ACM International Conference on Image and Video Retrieval, CIVR '09, p.26, 2009.
DOI : 10.1145/1646396.1646430

K. Grauman and T. Darrell, The pyramid match kernel: Efficient learning with sets of features, Journal of Machine Learning Research, vol.8, pp.725-760, 2007.

W. Hariri, H. Tabia, N. Farah, D. Declercq, and A. Benouareth, Geometrical and Visual Feature Quantization for 3D Face Recognition, Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2017.
DOI : 10.5220/0006101701870193

R. T. Ionescu, M. Popescu, and C. Grozea, Local learning to improve bag of visual words model for facial expression recognition, Workshop on challenges in representation learning, ICML, 2013.

T. Kanade, J. F. Cohn, and Y. Tian, Comprehensive database for facial expression analysis, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), pp.46-53, 2000.
DOI : 10.1109/AFGR.2000.840611

URL : http://www.pitt.edu/%7Ejeffcohn/biblio/Cohn-Kanade_Database.pdf

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

J. Leskovec, A. Rajaraman, and J. D. Ullman, Mining of massive datasets, 2014.
DOI : 10.1017/CBO9781139924801

M. Lyons, S. Akamatsu, M. Kamachi, and J. Gyoba, Coding facial expressions with Gabor wavelets, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition, pp.200-205, 1998.
DOI : 10.1109/AFGR.1998.670949

URL : http://www.mis.atr.co.jp/~mlyons/pub_pdf/fg98-1.pdf

M. Pantic, M. Valstar, R. Rademaker, and L. Maat, Web-Based Database for Facial Expression Analysis, 2005 IEEE International Conference on Multimedia and Expo, p.5, 2005.
DOI : 10.1109/ICME.2005.1521424

URL : http://pubs.doc.ic.ac.uk/Pantic-ICME05-2/Pantic-ICME05-2.pdf

X. Peng, L. Wang, X. Wang, and Y. Qiao, Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice, Computer Vision and Image Understanding, vol.150, pp.109-125, 2016.
DOI : 10.1016/j.cviu.2016.03.013

P. Scovanner, S. Ali, and M. Shah, A 3-dimensional sift descriptor and its application to action recognition, Proceedings of the 15th international conference on Multimedia , MULTIMEDIA '07, pp.357-360, 2007.
DOI : 10.1145/1291233.1291311

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, p.1470, 2003.
DOI : 10.1109/ICCV.2003.1238663

A. Tcherkassof, D. Dupré, B. Meillon, N. Mandran, M. Dubois et al., Dynemo : A Video Database of Natural Facial Expressions of Emotions, The International journal of Multimedia & Its Applications, vol.5, issue.5, pp.61-80, 2013.
DOI : 10.5121/ijma.2013.5505

URL : https://hal.archives-ouvertes.fr/hal-00939067

J. C. Van-gemert, C. J. Veenman, A. W. Smeulders, and J. Geusebroek, Visual Word Ambiguity, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.7, pp.321271-1283, 2010.
DOI : 10.1109/TPAMI.2009.132

Y. Xie, S. Jiang, and Q. Huang, Weighted visual vocabulary to balance the descriptive ability on general dataset, Neurocomputing, vol.119, pp.478-488, 2013.
DOI : 10.1016/j.neucom.2013.03.004

S. Zhang, Q. Tian, G. Hua, Q. Huang, and W. Gao, Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications, IEEE Transactions on Image Processing, vol.20, issue.9, pp.2664-2677, 2011.
DOI : 10.1109/TIP.2011.2128333

Q. Zhu, Y. Zhong, B. Zhao, G. Xia, and L. Zhang, Bag-of-Visual-Words Scene Classifier With Local and Global Features for High Spatial Resolution Remote Sensing Imagery, IEEE Geoscience and Remote Sensing Letters, vol.13, issue.6, pp.747-751, 2016.
DOI : 10.1109/LGRS.2015.2513443