Legal deposit of the French Web: harvesting strategies for a national domain - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Legal deposit of the French Web: harvesting strategies for a national domain

Le dépôt légal du web français : élaborer des stratégies de collecte pour un domaine national

Résumé

According to French Copyright Law voted on August 1 st , 2006, the Bibliothèque nationale de France ("BnF", or "the Library") is in charge of collecting and preserving the French Internet. The Library has established a "mixed model" of Web archiving, which combines broad crawls of the .fr domain, focused crawls and e-deposits. Thanks to its research partnership with the Internet Archive, BnF has performed four annual broad crawls since 2004. The last one has been made with noticeably different features: one of the most important was the use of the all-comprehensive list of the .fr domain names, given to BnF by the AFNIC (“Association française pour le nommage Internet en cooperation”, the registry for the .fr) after an agreement was signed between both institutions in September 2007. The technical choices made before and during a crawl have a decisive impact on the future shape of the collection. These decisions must therefore be taken according to the legal and intellectual frame within which the crawl is performed: for BnF, it is the five-centuries-old tradition of the legal deposit. To assess the consequences and the outcomes of the different technical solutions available, we propose to analyze the results of the BnF’s last crawl and to compare them to those of previous harvests. These studies also prove to be useful in our attempt to characterize the 2007 French Web.
Fichier principal
Vignette du fichier
LasfarguesOuryWendland-IWAW-2008-en.pdf (312.2 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01098538 , version 1 (26-12-2014)

Licence

Paternité

Identifiants

  • HAL Id : hal-01098538 , version 1

Citer

France Lasfargues, Clément Oury, Bert Wendland. Legal deposit of the French Web: harvesting strategies for a national domain. International Web Archiving Workshop, Sep 2008, Aarhus, Denmark. ⟨hal-01098538⟩

Collections

BNF BNF_DDL
679 Consultations
1402 Téléchargements

Partager

Gmail Facebook X LinkedIn More