Exceptional contextual subgraph mining

Mehdi Kaytoue 1, Marc Plantevit 1, Albrecht Zimmermann 1, Anes Bendimerad 1, Céline Robardet 1
1 DM2L - Data Mining and Machine Learning
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract: Many relational data result from the aggregation of several individual behaviors described by some characteristics. For instance, a bike-sharing system may be modeled as a graph where vertices stand for bike-share stations and connections represent bike trips made by users from one station to another. Stations and trips are described by additional information such as the description of the geographical environment of the stations (business vs. residential area, closeness to POI, elevation, urbanization density, etc.), or of the bike trips (timestamp, user profile, weather, events and other special conditions about the trip). Identifying highly connected components (such as communities or quasi-cliques) in this graph provides interesting insights into global usages but does not capture mobility profiles that characterize a subpopulation. To tackle this problem we propose an approach rooted in exceptional model mining to find exceptional contextual subgraphs, i.e., subgraphs generated from a context or a description of the individual behaviors that is exceptional (behaves in a different way) compared to the whole augmented graph. The dependency between a context and an edge is assessed by a χ² test and the weighted relative accuracy measure is used to only retain contexts that strongly characterize connected subgraphs. We present an original algorithm that uses sophisticated pruning techniques to restrict the search space of vertices, context refinements, and edges to be considered. An experimental evaluation on synthetic data and two real-life datasets demonstrates the effectiveness of the proposed pruning mechanisms, as well as the relevance of the discovered patterns.
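
The two measures named in the abstract can be illustrated with a minimal sketch. It uses the textbook forms of the χ² statistic for a 2x2 contingency table and of weighted relative accuracy (WRAcc); the paper's exact adaptation of these measures to edges and contexts is not given on this page, so the function names, the table layout, and the trip counts below are all hypothetical.

```python
# Illustrative sketch only: generic chi-squared and WRAcc formulas,
# not the formulation used in the paper. All names and counts are hypothetical.

# Chi-squared critical value for 1 degree of freedom at the 0.05 level.
CHI2_CRITICAL_1DF_005 = 3.841


def chi2_2x2(a, b, c, d):
    """Pearson chi-squared statistic for the 2x2 table [[a, b], [c, d]].

    Assumed layout: rows = trips inside / outside the context,
    columns = trips on the edge / on any other edge.
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        # A degenerate table (an empty row or column) carries no evidence.
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def wracc(n_context, n_context_target, n_total, n_target):
    """Weighted relative accuracy: P(context) * (P(target | context) - P(target))."""
    coverage = n_context / n_total
    return coverage * (n_context_target / n_context - n_target / n_total)


# Hypothetical counts: 300 trips in total, 100 of them inside the context;
# the edge carries 30 trips inside the context and 10 outside it.
statistic = chi2_2x2(30, 70, 10, 190)
edge_depends_on_context = statistic > CHI2_CRITICAL_1DF_005
quality = wracc(n_context=100, n_context_target=30, n_total=300, n_target=40)
```

With these made-up counts the statistic exceeds the critical value, so the edge would be treated as dependent on the context, and the positive WRAcc indicates that the edge is over-represented inside the context relative to the whole graph.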

https://hal.archives-ouvertes.fr/hal-01488732
Contributor: Mehdi Kaytoue
Submitted on: Wednesday, April 19, 2017 - 4:57:45 PM
Last modification on: Thursday, November 21, 2019 - 2:32:23 AM

Citation

Mehdi Kaytoue, Marc Plantevit, Albrecht Zimmermann, Anes Bendimerad, Céline Robardet. Exceptional contextual subgraph mining. Machine Learning, Springer Verlag, 2017, 106 (8), pp. 1171-1211. ⟨10.1007/s10994-016-5598-0⟩. ⟨hal-01488732v2⟩
