Base de caractérisation des valeurs manquantes

Leila Ben Othman; François Rioult; Sadok Ben Yahia; Bruno Crémilleux

Article Dans Une Revue Revue des Sciences et Technologies de l'Information - Série TSI : Technique et Science Informatiques Année : 2011

Base de caractérisation des valeurs manquantes

(1) , (2) , , (2)

1
2

Leila Ben Othman

Fonction : Auteur

Groupe de Recherche en Informatique, Image et Instrumentation de Caen

François Rioult

Fonction : Auteur
PersonId : 8867
IdHAL : francois-rioult
ORCID : 0000-0001-8162-0997
IdRef : 099427494

Equipe CODAG - Laboratoire GREYC - UMR6072

Sadok Ben Yahia

Fonction : Auteur
PersonId : 1031974
ORCID : 0000-0001-8939-8948

Bruno Crémilleux

Fonction : Auteur
PersonId : 15791
IdHAL : bruno-cremilleux
ORCID : 0000-0001-8294-9049
IdRef : 083548335

Equipe CODAG - Laboratoire GREYC - UMR6072

Résumé

When tackling real-life datasets, it is common to face the existence of missing values within data. Explaining the origin of the missing values appearance allows to better control the quality of the data, as well as proposing suitable handling methods, e.g., their completion. The abundant literature heavily relies on the missing value appearance models proposed by Little and Rubin. However, a careful scrutiny of these statistic-based models highlights that they constitute an actual hamper towards their use by data mining techniques. The main thrust of this paper is the proposition of a new model for missing values appearance. Such introduced models rely on the use of the proper implication basis.

Mots clés

Intégrité donnée Analyse statistique Modélisation Complétude Qualité information Association statistique Fouille donnée Analyse donnée Contrôle qualité Information incomplète Donnée manquante Data integrity Statistical analysis Modeling Completeness Information quality Statistical association Data mining Data analysis Quality control Incomplete information Missing data

Greyc Référent : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01017369

Soumis le : mercredi 2 juillet 2014-12:21:42

Dernière modification le : mercredi 27 mars 2024-11:24:19

Dates et versions

hal-01017369 , version 1 (02-07-2014)

Identifiants

HAL Id : hal-01017369 , version 1

Citer

Leila Ben Othman, François Rioult, Sadok Ben Yahia, Bruno Crémilleux. Base de caractérisation des valeurs manquantes. Revue des Sciences et Technologies de l'Information - Série TSI : Technique et Science Informatiques, 2011, 30 (10), 24 p. ⟨hal-01017369⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS GREYC GREYC-CODAG COMUE-NORMANDIE ENSICAEN UNICAEN

64 Consultations

0 Téléchargements

Base de caractérisation des valeurs manquantes

Résumé

Mots clés

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager