DEBORA: Digital AccEss to BOoks of the RenAissance

Frank Le Bourgeois 1 Hubert Emptoz 1
1 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : EBORA (Digital AccEss to BOoks of the RenAissance) is a multidisciplinary European project aiming at digitizing and thus making rare sixteenth century books more accessible. End-users, librarians, historians, researchers in book history and computer scientists participated in the development of remote and collaborative access to digitized Renaissance books, necessary because of the reduced accessibility to digital libraries in image mode through the Internet. The size of files for the storage of images, the lack of a standard file format exchange suitable for progressive transmission, and limited querying possibilities currently limit remote access to digital libraries. To improve accessibility, historical documents must be digitized and retro-converted to extract a detailed description of the image contents suited to users’ needs. Specialists of the Renaissance have described the metadata generally required by end-users and the ideal functionalities of the digital library. The retro-conversion of historical documents is a complex process that includes image capture, metadata extraction, image storage and indexing, automatic conversion in a reusable electronic form, publication on the Internet, and data compression for faster remote access. The steps of this process cannot be developed independently. DEBORA proposes a global approach to retro-conversion from the digitization to the final functionalities of the digital library centered on users’ needs. The retro-conversion process is mainly based on a document image analysis system that simultaneously extracts the metadata and compresses the images. We also propose a file format to describe compressed books as heterogeneous data (images/text/links/ annotation/physical layout and logical structure) suitable for progressive transmission, editing, and annotation. DEBORA is an exploratory project that aims at demonstrating the feasibility of the concepts by developing prototypes tested by end-users.
Document type :
Journal articles
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01593285
Contributor : Équipe Gestionnaire Des Publications Si Liris <>
Submitted on : Tuesday, September 26, 2017 - 9:39:18 AM
Last modification on : Thursday, November 1, 2018 - 1:19:22 AM

Links full text

Identifiers

Citation

Frank Le Bourgeois, Hubert Emptoz. DEBORA: Digital AccEss to BOoks of the RenAissance. International Journal on Document Analysis and Recognition, Springer Verlag, 2007, 2-4, 9, pp.193-221. ⟨10.1007/s10032-006-0030-0⟩. ⟨hal-01593285⟩

Share

Metrics

Record views

120