Conference paper, year: 2010

Normalized Kernels as Similarity Indices

Julien Ah-Pine

Abstract

Measuring similarity between objects is a fundamental issue for numerous applications in data mining and machine learning. In this paper, we are interested in kernels, and in particular in kernel normalization methods that aim at designing proximity measures that better fit the definition and the intuition of a similarity index. To this end, we introduce a new family of normalization techniques that extends cosine normalization. Our approach refines the cosine measure between vectors in the feature space by considering another geometry-based score: the ratio of the mapped vectors' norms. We show that, unlike most unnormalized kernels, the resulting normalized kernels satisfy the basic axioms of a similarity index. Furthermore, we prove that the proposed normalized kernels are themselves kernels. Finally, we assess these similarity measures on clustering tasks using a clustering approach based on kernel PCA. Our experiments on several real-world datasets show the potential benefits of the proposed normalized kernels over cosine normalization and the Gaussian RBF kernel.
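The two geometric ingredients named in the abstract, the cosine measure in feature space and the ratio of the mapped vectors' norms, can both be read off a Gram matrix. The following is a minimal sketch of those two quantities only, not the paper's normalization family, whose exact form the abstract does not give. The function names `cosine_normalize` and `norm_ratio`, the choice of a linear kernel, the toy data, and the convention of taking the ratio as smaller norm over larger norm are all assumptions made for illustration.

```python
import numpy as np

def cosine_normalize(K):
    """Cosine normalization of a Gram matrix: K[i, j] / sqrt(K[i, i] * K[j, j])."""
    norms = np.sqrt(np.diag(K))          # ||phi(x_i)|| for each object
    return K / np.outer(norms, norms)

def norm_ratio(K):
    """Pairwise ratio of feature-space norms, smaller over larger (assumed convention)."""
    norms = np.sqrt(np.diag(K))
    return np.minimum.outer(norms, norms) / np.maximum.outer(norms, norms)

# Toy data (hypothetical): the second point is the first one scaled by 2.
X = np.array([[1.0, 0.0],
              [2.0, 0.0],
              [0.0, 3.0]])
K = X @ X.T                              # linear kernel, chosen only for simplicity

C = cosine_normalize(K)
R = norm_ratio(K)
print(C)
print(R)
```

In this toy case the cosine measure between the first two points is 1 even though the points differ, while their norm ratio is 0.5. This is the kind of information the cosine measure alone discards and that a norm-based score can restore.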
Main file
kernels_normalization_final.pdf (191.8 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01504523 , version 1 (10-04-2017)

Cite

Julien Ah-Pine. Normalized Kernels as Similarity Indices. 14th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2010), Jun 2010, Hyderabad, India. pp.362 - 373, ⟨10.1007/978-3-642-13672-6_36⟩. ⟨hal-01504523⟩
