Annotation of Scientific Summaries for Information Retrieval.

Fidelia Ibekwe-Sanjuan; Fernandez Silvia; Eric Sanjuan; Charton Eric

Communication Dans Un Congrès Année : 2008

Annotation of Scientific Summaries for Information Retrieval.

(1) , (2) , (2) , (2)

1
2

Fidelia Ibekwe-Sanjuan

Fonction : Auteur correspondant
PersonId : 180321
IdHAL : fidelia-ibekwe
ORCID : 0000-0001-8862-7729
IdRef : 11366396X

Connectez-vous pour contacter l'auteur

Equipe de recherche de Lyon en sciences de l'information et de la communication

Fernandez Silvia

Fonction : Auteur

Laboratoire Informatique d'Avignon

Eric Sanjuan

Fonction : Auteur
PersonId : 912763
IdHAL : eric-sanjuan
ORCID : 0000-0002-4057-6691

Laboratoire Informatique d'Avignon

Charton Eric

Fonction : Auteur

Laboratoire Informatique d'Avignon

Résumé

We present a methodology combining surface NLP and Machine Learning techniques for ranking asbtracts and generating summaries based on annotated corpora. The corpora were annotated with meta-semantic tags indicating the category of information a sentence is bearing (objective, findings, newthing, hypothesis, conclusion, future work, related work). The annotated corpus is fed into an automatic summarizer for query-oriented abstract ranking and multi- abstract summarization. To adapt the summarizer to these two tasks, two novel weighting functions were devised in order to take into account the distribution of the tags in the corpus. Results, although still preliminary, are encouraging us to pursue this line of work and find better ways of building IR systems that can take into account semantic annotations in a corpus.

Mots clés

term weighting Corpus annotation discourse structure analysis automatic summarization document ranking term weighting.

Domaines

Sciences de l'information et de la communication Recherche d'information [cs.IR]

Fichier principal

ibekwe-ESAIR-08-final.pdf (344.4 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Fidelia Ibekwe : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00635699

Soumis le : mardi 25 octobre 2011-17:14:40

Dernière modification le : mardi 3 octobre 2023-14:14:03

Archivage à long terme le : jeudi 15 novembre 2012-10:32:14

Dates et versions

hal-00635699 , version 1 (25-10-2011)

Identifiants

HAL Id : hal-00635699 , version 1
ARXIV : 1110.5722

Citer

Fidelia Ibekwe-Sanjuan, Fernandez Silvia, Eric Sanjuan, Charton Eric. Annotation of Scientific Summaries for Information Retrieval.. ECIR'08 Workshop on: Exploiting Semantic Annotations for Information Retrieval, Mar 2008, Glasgow, United Kingdom. pp.70-83. ⟨hal-00635699⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LYON3 UNIV-AVIGNON UNIV-LYON1 UNIV-LYON2 ELICO LIA UDL

203 Consultations

416 Téléchargements

Annotation of Scientific Summaries for Information Retrieval.

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager