Improved deduplication through parallel binning - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Improved deduplication through parallel binning

Résumé

Many modern storage systems use deduplication in order to compress data by avoiding storing the same data twice. Deduplication needs to use data stored in the past, but accessing information about all data stored can cause a severe bottleneck. Similarity based deduplication only accesses information on past data that is likely to be similar and thus more likely to yield good deduplication. We present an adaptive deduplication strategy that extends Extreme Binning and investigate theoretically and experimentally the effects of the additional bin accesses.
Fichier non déposé

Dates et versions

hal-01495371 , version 1 (24-03-2017)

Identifiants

Citer

Zhike Zhang, Deepavali Bhagwat, Witold Litwin, Darrell D.E. Long, Thomas Schwarz. Improved deduplication through parallel binning. 2012 IEEE 31st International Performance Computing and Communications Conference (IPCCC), Dec 2012, Austin, United States. pp.130-141, ⟨10.1109/PCCC.2012.6407746⟩. ⟨hal-01495371⟩
33 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More