Statistics and Data Quality: Towards More Collaboration Between These Communities
Abstract
In the summer of 1980, during a conference given at the Institute of Statistics of Paris, a very impressive presentation on an FCA analysis, one that opened up multiple lines of investigation, turned out to be false because it was based on inaccurate data. Thirty years later, data quality is an autonomous discipline with dedicated academic master's courses (Talburt et al. (2006)), publications (Redman (2001), Wand and Wang (1996)) and software (Gouasdoue et al. (2007)). In fact, a plethora of dimensions, metrics, models and database design techniques (Wang et al. (2001)) are now defined to handle data and their quality in the same flow, thus helping statisticians qualify and evaluate their results (Berti-Equille (2007)). On the other hand, statistical models have been proposed to define the dimensions' metrics, detect outliers and anomalous data, analyze data heterogeneity, etc. (Batini and Scannapieco (2006)). Let us, then, build a bridge between the two communities and have a data quality track at CompStat 2011!