Unary and n-ary inclusion dependency discovery in relational databases

Fabien de Marchi; Stéphane Lopes; Jean-Marc Petit

doi:10.1007/s10844-007-0048-x

Journal Articles Journal of Intelligent Information Systems Year : 2008

Unary and n-ary inclusion dependency discovery in relational databases

(1) , (2) , (1)

1
2

Fabien de Marchi

Function : Author
PersonId : 6947
IdHAL : fabien-de-marchi
IdRef : 078523125

Base de Données

Stéphane Lopes

Function : Author
PersonId : 13737
IdHAL : stephane-lopes

Parallélisme, Réseaux, Systèmes, Modélisation

Jean-Marc Petit

Function : Author
PersonId : 4224
IdHAL : jean-marc-petit
ORCID : 0000-0002-0015-745X

Base de Données

Abstract

Foreign keys form one of the most fundamental constraints for relational databases. Since they are not always defined in existing databases, the discovery of foreign keys turns out to be an important and challenging task. The underlying problem is known to be the inclusion dependency (IND) inference problem. In this paper, data-mining algorithms are devised for IND inference in a given database. We propose a two-step approach. In the first step, unary INDs are discovered thanks to a new preprocessing stage which leads to a new algorithm and to an efficient implementation. In the second step, n-ary IND inference is achieved. This step fits in the framework of levelwise algorithms used in many data-mining algorithms. Since real-world databases can suffer from some data inconsistencies, approximate INDs, i.e. INDs which almost hold, are considered. We show how they can be safely integrated into our unary and n-ary discovery algorithms. An implementation of these algorithms has been achieved and tested against both synthetic and real-life databases. Up to our knowledge, no other algorithm does exist to solve this data-mining problem.

Keywords

Inclusion dependency discovery Relational databases

Domains

Databases [cs.DB]

Stéphane Lopes : Connect in order to contact the contributor

https://hal.science/hal-00287078

Submitted on : Tuesday, June 10, 2008-5:00:59 PM

Last modification on : Wednesday, July 5, 2023-3:28:04 PM

Dates and versions

hal-00287078 , version 1 (10-06-2008)

Identifiers

HAL Id : hal-00287078 , version 1
DOI : 10.1007/s10844-007-0048-x

Cite

Fabien de Marchi, Stéphane Lopes, Jean-Marc Petit. Unary and n-ary inclusion dependency discovery in relational databases. Journal of Intelligent Information Systems, 2008, ⟨10.1007/s10844-007-0048-x⟩. ⟨hal-00287078⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS UVSQ LABEXIMU INSA-GROUPE UDL

118 View

0 Download

Unary and n-ary inclusion dependency discovery in relational databases

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share