Single Document Keyphrase Extraction Using Sentence Clustering and Latent Dirichlet Allocation - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Single Document Keyphrase Extraction Using Sentence Clustering and Latent Dirichlet Allocation

Claude Pasquier

Résumé

This paper describes the design of a system for extracting keyphrases from a single document The principle of the algorithm is to cluster sentences of the documents in order to highlight parts of text that are semantically related. The clusters of sentences, that reflect the themes of the document, are then analyzed to find the main topics of the text. Finally, the most important words, or groups of words, from these topics are proposed as keyphrases.
Fichier principal
Vignette du fichier
article.pdf (46.78 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01151516 , version 1 (28-09-2021)

Identifiants

  • HAL Id : hal-01151516 , version 1

Citer

Claude Pasquier. Single Document Keyphrase Extraction Using Sentence Clustering and Latent Dirichlet Allocation. 5th International Workshop on Semantic Evaluation (SemEval '10), Jul 2010, Uppsala, Sweden. pp.154-157. ⟨hal-01151516⟩
89 Consultations
35 Téléchargements

Partager

Gmail Facebook X LinkedIn More