Skip to Main content Skip to Navigation
Conference papers

DEvIANT : Discovering significant exceptional (dis)agreement within groups

Adnene Belfodil 1 Wouter Duivesteijn 2 Marc Plantevit 3 Sylvie Cazalens 1 Philippe Lamarre 1
1 BD - Base de Données
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
3 DM2L - Data Mining and Machine Learning
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : We consider any type of data featuring individuals (e.g., parliamentarians , customers) performing observable actions (e.g., votes, ratings) on entities (e.g., legislative procedures, movies). In such data, we aim to find contexts (i.e. subgroup of entities) for which an exceptional (dis)agreement is observed among a group of individuals. To this end, we introduce the novel problem of discovering statistically significant exceptional contextual intra-group agreement patterns. To handle the data sparsity, we use the Krippendorff's Alpha measure to assess the agreement among individuals. We devise a branch-and-bound algorithm, named DEvIANT, to discover such patterns. DEvIANT exploits both closure operators and tight optimistic estimates. We derive analytic approximations for the confidence intervals (CIs) associated to patterns for a computationally efficient significance assessment. We prove that these approximate CIs are nested along specialization of patterns. This makes it possible to incorporate pruning properties in the algorithm to early discard non-significant patterns. Empirical study on several datasets demonstrates the efficiency and the usefulness of DEvIANT.
Complete list of metadata

Cited literature [44 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02961093
Contributor : Marc Plantevit <>
Submitted on : Thursday, October 8, 2020 - 10:44:51 AM
Last modification on : Tuesday, June 1, 2021 - 2:08:09 PM
Long-term archiving on: : Saturday, January 9, 2021 - 6:18:49 PM

File

main.pdf
Files produced by the author(s)

Identifiers

Citation

Adnene Belfodil, Wouter Duivesteijn, Marc Plantevit, Sylvie Cazalens, Philippe Lamarre. DEvIANT : Discovering significant exceptional (dis)agreement within groups. Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2019), Sep 2019, Würzburg, Germany. pp.3-20, ⟨10.1007/978-3-030-46150-8_1⟩. ⟨hal-02961093⟩

Share

Metrics

Record views

29

Files downloads

45