Exceptional Model Mining for Behavioral Data Analysis

Adnene Belfodil 1, 2, 3
2 DM2L - Data Mining and Machine Learning
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
3 BD - Base de Données
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : With the rapid proliferation of data platforms collecting and curating data related to various domains such as governments data, education data, environment data or product ratings, more and more data are available online. This offers an unparalleled opportunity to study the behavior of individuals and the interactions between them. In the political sphere, being able to query datasets of voting records provides interesting insights for data journalists and political analysts. In particular, such data can be leveraged for the investigation of exceptionally consensual/controversial topics. Consider data describing the voting behavior in the European Parliament (EP). Such a dataset records the votes of each member (MEP) in voting sessions held in the parliament, as well as information on the parliamentarians (e.g., gender, national party, European party alliance) and the sessions (e.g., topic, date). This dataset offers opportunities to study the agreement or disagreement of coherent subgroups, especially to highlight unexpected behavior. It is to be expected that on the majority of voting sessions, MEPs will vote along the lines of their European party alliance. However, when matters are of interest to a specific nation within Europe, alignments may change and agreements can be formed or dissolved. For instance, when a legislative procedure on fishing rights is put before the MEPs, the island nation of the UK can be expected to agree on a specific course of action regardless of their party alliance, fostering an exceptional agreement where strong polarization exists otherwise. In this thesis, we aim to discover such exceptional (dis)agreement patterns not only in voting data but also in more generic data, called behavioral data, which involves individuals performing observable actions on entities. We devise two novel methods which offer complementary angles of exceptional (dis)agreement in behavioral data: within and between groups. These two approaches called Debunk and Deviant, ideally, enables the implementation of a sufficiently comprehensive tool to highlight, summarize and analyze exceptional comportments in behavioral data. We thoroughly investigate the qualitative and quantitative performances of the devised methods. Furthermore, we motivate their usage in the context of computational journalism.
Complete list of metadatas

Cited literature [310 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/tel-02335097
Contributor : Adnene Belfodil <>
Submitted on : Monday, November 4, 2019 - 9:59:27 AM
Last modification on : Tuesday, November 19, 2019 - 2:42:22 AM

Identifiers

  • HAL Id : tel-02335097, version 1

Citation

Adnene Belfodil. Exceptional Model Mining for Behavioral Data Analysis. Databases [cs.DB]. Univ Lyon, CNRS, ENS de Lyon, Université Claude-Bernard Lyon 1, LIP, F-69342, Lyon Cedex 07, France; INSA LYON, 2019. English. ⟨NNT : 2019LYSEI086⟩. ⟨tel-02335097⟩

Share

Metrics

Record views

70

Files downloads

121