Development of text and image processing for digital libraries: the Bibliothèques Virtuelles Humanistes project and the digitization of Renaissance documents

Abstract : The BVH project have been rewarded with the Succeed award 2014. This prize recognizes the successful implementation of a digitisation programme, especially those exploiting the latest technology and the output of research for the digitisation of historical text. The BVH (Bibliothèques Virtuelles Humanistes: Virtual Humanistic Libraries) is a research program devoted to the digitization and electronic publication of original source documents from the Renaissance period. Since 2003, its website has published digital facsimiles, selected Early Modern imprints (1450-1650) mainly from regional collections, and transcriptions of French texts of the same period, encoded according to the XML-TEI standard. Particular attention is paid to achieving great accuracy in the bibliographical description as regards the true states of originals and the closest correspondence between two distinct corpora, facsimile and text, linked by several levels of metadata in the main catalogue. The BVH team works in close collaboration with researchers from the Computer Science Laboratory of Tours (LI-Tours) to develop new technologies in the fields of image processing and pattern recognition. Open source software for layout analysis and text transcriptions, AGORA and RETRO, enables us to perform automatic extraction of graphic components from digitized books, and thus to build up specialized databases of iconographic and typographical material. As a member of the TEI consortium, we actively contribute to the development of a specialised schema for the transcription of Renaissance documents. Each step of processing and every component developed at the BVH is also intended for use by the whole digital community, creating a model for the digital library of the future.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01458415
Contributor : Sandrine Breuil <>
Submitted on : Thursday, February 16, 2017 - 2:17:00 PM
Last modification on : Wednesday, November 6, 2019 - 1:48:05 PM
Long-term archiving on: Wednesday, May 17, 2017 - 12:21:14 PM

File

SuceedAwards_BVH_nomination_20...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution - NonCommercial - ShareAlike 4.0 International License

Identifiers

  • HAL Id : hal-01458415, version 1

Collections

Citation

Toshinori Uetani, Rémi Jimenes, Sandrine Breuil, Jorge Fins, Marie-Luce Demonet, et al.. Development of text and image processing for digital libraries: the Bibliothèques Virtuelles Humanistes project and the digitization of Renaissance documents. 2014. ⟨hal-01458415⟩

Share

Metrics

Record views

227

Files downloads

177