A Corpus Processing and Analysis Pipeline for Quickref - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

A Corpus Processing and Analysis Pipeline for Quickref

Antoine Hacquard
  • Fonction : Auteur
  • PersonId : 1097779

Résumé

Quicklisp is a library manager working with your existing Common Lisp implementation to download and install around 2000 libraries, from a central archive. Quickref, an application itself written in Common Lisp, generates, automatically and by introspection, a technical documentation for every library in Quicklisp, and produces a website for this documentation. In this paper, we present a corpus processing and analysis pipeline for Quickref. This pipeline consists of a set of natural language processing blocks allowing us to analyze Quicklisp libraries, based on natural language contents sources such as README files, docstrings, or symbol names. The ultimate purpose of this pipeline is the generation of a keyword index for Quickref, although other applications such as word clouds or topic analysis are also envisioned. CCS CONCEPTS • Information systems → Information extraction; Retrieval effectiveness; Presentation of retrieval results; • Software and its engineering → Software libraries and repositories.
Fichier principal
Vignette du fichier
hacquard.21.els.pdf (571.03 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03216684 , version 1 (04-05-2021)

Identifiants

Citer

Antoine Hacquard, Didier E Verna. A Corpus Processing and Analysis Pipeline for Quickref. 14th European Lisp Symposium, May 2021, Online, France. ⟨10.5281/zenodo.4714443⟩. ⟨hal-03216684⟩
36 Consultations
61 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More