Skip to Main content Skip to Navigation
New interface

Rule Discovery in Labeled Sequential Data: Application to Game Analytics

Romain Mathonat 1, 2 
2 DM2L - Data Mining and Machine Learning
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : It is extremely useful to exploit labeled datasets not only to learn models and perform predictive analytics but also to improve our understanding of a domain and its available targeted classes. The subgroup discovery task has been considered for more than two decades. It concerns the discovery of rules covering sets of objects having interesting properties, e.g., they characterize a given target class. Though many subgroup discovery algorithms have been proposed for both transactional and numerical data, discovering rules within labeled sequential data has been much less studied. In that context, exhaustive exploration strategies can not be used for real-life applications and we have to look for heuristic approaches. In this thesis, we propose to apply bandit models and Monte Carlo Tree Search to explore the search space of possible rules using an exploration-exploitation trade-off, on different data types such as sequences of itemset or time series. For a given budget, they find a collection of top-k best rules in the search space w.r.t chosen quality measure. They require a light configuration and are independent from the quality measure used for pattern scoring. To the best of our knowledge, this is the first time that the Monte Carlo Tree Search framework has been exploited in a sequential data mining setting. We have conducted thorough and comprehensive evaluations of our algorithms on several datasets to illustrate their added-value, and we discuss their qualitative and quantitative results. To assess the added-value of one or our algorithms, we propose a use case of game analytics, more precisely Rocket League match analysis. Discovering interesting rules in sequences of actions performed by players and using them in a supervised classification model shows the efficiency and the relevance of our approach in the difficult and realistic context of high dimensional data. It supports the automatic discovery of skills and it can be used to create new game modes, to improve the ranking system, to help e-sport commentators, or to better analyse opponent teams, for example.
Document type :
Complete list of metadata

Cited literature [145 references]  Display  Hide  Download
Contributor : Romain MATHONAT Connect in order to contact the contributor
Submitted on : Tuesday, October 27, 2020 - 12:31:18 PM
Last modification on : Friday, September 30, 2022 - 11:34:16 AM


Files produced by the author(s)


  • HAL Id : tel-02970006, version 2


Romain Mathonat. Rule Discovery in Labeled Sequential Data: Application to Game Analytics. Computer Science [cs]. Université de Lyon, 2020. English. ⟨NNT : ⟩. ⟨tel-02970006v2⟩



Record views


Files downloads