Conference paper. Year: 2003

Federating Heterogeneous Data Sources

Abstract

To cope with the difficulties of Web information search, many technologies related to Web search engines have been proposed and successfully applied. Rather than building yet another general-purpose Web search engine, this paper couples text mining and XML view caching techniques within a Web mediation architecture and presents a prototype framework for topic-centric Web information search. Given a topic domain, domain-specific information is extracted from the Web documents belonging to that domain, and text-mining techniques are then applied to discover the semantics contained in the Web information. Next, the extracted information is integrated into a domain-specific common concept model defined using Semantic Web languages. Finally, an XML-based mediator allows users to query the integrated Web information using XQuery. Once Web information is represented in the concept model with an explicit semantic hierarchy that programs can interpret, user queries against specific fragments of Web documents can be carried out. An important part of our work aims at integrating XML view and cache techniques to manage Web information. Checksums are used to monitor updates to Web pages. A prototype centered on popular French finance sites is under construction.
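The checksum-based update monitoring mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which checksum function or data structures the prototype uses, so SHA-256 and the names `page_checksum` and `ChecksumMonitor` below are assumptions made purely for illustration, not the authors' implementation.

```python
import hashlib


def page_checksum(content: bytes) -> str:
    """Return a hex digest of the page content (SHA-256 is an assumed choice)."""
    return hashlib.sha256(content).hexdigest()


class ChecksumMonitor:
    """Hypothetical helper: remembers the last checksum seen for each URL."""

    def __init__(self) -> None:
        self._seen: dict[str, str] = {}

    def has_changed(self, url: str, content: bytes) -> bool:
        """Record the new checksum and report whether it differs from the last one.

        A URL that has never been seen is reported as changed, so that its
        cached view would be built on first contact.
        """
        digest = page_checksum(content)
        previous = self._seen.get(url)
        self._seen[url] = digest
        return previous != digest


if __name__ == "__main__":
    monitor = ChecksumMonitor()
    url = "http://example.org/finance"  # placeholder URL, not from the paper
    print(monitor.has_changed(url, b"<html>v1</html>"))  # first sighting
    print(monitor.has_changed(url, b"<html>v1</html>"))  # identical content
    print(monitor.has_changed(url, b"<html>v2</html>"))  # modified content
```

In a cache built on this idea, a cached XML view of a page would only need to be refreshed when `has_changed` reports a difference, which avoids re-extracting pages whose content is unchanged.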
Main file
DangNgoc-Gardarin2003.pdf (117.51 KB)
Origin: files produced by the author(s)

Dates and versions

hal-00733477 , version 1 (28-12-2012)

Identifiers

  • HAL Id : hal-00733477 , version 1

Cite

Tuyet-Tram Dang-Ngoc, Georges Gardarin. Federating Heterogeneous Data Sources. IASTED International Conference on Information and Knowledge Sharing (IKS 2003), Nov 2003, Scottsdale, United States. p. 193-198, ISBN 0-88986-396-2 (407). ⟨hal-00733477⟩

Collections

CNRS UVSQ
