Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers

Abstract : In this paper, we present the automatic annotation of bibliographical references' zone in papers and articles of XML/TEI format. Our work is applied through two phases: first, we use machine learning technology to classify bibliographical and non-bibliographical paragraphs in papers, by means of a model that was initially created to differentiate between the footnotes containing or not containing bibliographical references. The previous description is one of BILBO's features, which is an open source software for automatic annotation of bibliographic reference. Also, we suggest some methods to minimize the margin of error. Second, we propose an algorithm to find the largest list of bibliographical references in the article. The improvement applied on our model results an increase in the model's efficiency with an Accuracy equal to 85.89. And by testing our work, we are able to achieve 72.23% as an average for the percentage of success in detecting bibliographical references' zone.
Document type :
Conference papers
Complete list of metadatas

Cited literature [4 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01771689
Contributor : Amal Htait <>
Submitted on : Thursday, April 19, 2018 - 4:51:08 PM
Last modification on : Friday, March 22, 2019 - 11:34:05 AM
Long-term archiving on : Tuesday, September 18, 2018 - 4:17:58 PM

File

AmalHtait_LREC2016.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01771689, version 1

Collections

Citation

Amal Htait, Sébastien Fournier, Patrice Bellot. Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers. LREC, May 2016, Portoroz, Slovenia. ⟨hal-01771689⟩

Share

Metrics

Record views

97

Files downloads

34