Relational Knowledge Discovery in a Chinese Characters Database

Abstract : This article describes a novel application ofInductive Logic Programming (ILP) to the problem of data mining relational databases. The task addressed here consists in mining a relational database ofmore than 200,000 ground facts describing 6768 Chinese characters. Mining this relational database may be recast in an ILP setting, where the form ofthe association rules searched are represented as nondeterminate Horn clauses, a type of clause known for being computationally hard to learn. We have introduced a new kind of language bias, S-structural indeterminate clauses, which takes into account the meaning of part-of predicates that play a key role in the complexity oflearning in structural domains. The ILP algorithm REPART has been specifically developed to learn S-structural indeterminate clauses. Its efficiency lies in a particular change ofrepresentation, so as to enable one to use propositional learners. This article presents original results discovered by REPART that exemplify how ILP algorithms may not only scale up efficiently to large relational databases but also discover useful and computationally hard to learn patterns.
Document type :
Journal articles
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01184490
Contributor : Lip6 Publications <>
Submitted on : Friday, August 14, 2015 - 4:07:54 PM
Last modification on : Tuesday, May 14, 2019 - 10:31:45 AM

Identifiers

Citation

Jean-Daniel Zucker, Jean-Gabriel Ganascia, Isabelle Bournaud. Relational Knowledge Discovery in a Chinese Characters Database. Applied Artificial Intelligence, Taylor & Francis, 1998, 12 (5), pp.455-488. ⟨10.1080/088395198117712⟩. ⟨hal-01184490⟩

Share

Metrics

Record views

131