Database Replication in Large Scale Systems: Optimizing the Number of Replicas

Modou Gueye, Idrissa Sarr 1, Samba Ndiaye
1 BD - Bases de Données
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : In distributed systems, replication is used to ensure availability and improve performance. However, under the heavy workloads of distributed systems such as Web 2.0 applications or Global Distribution Systems, the benefit of replication is limited if its degree (i.e., the number of replicas) is not controlled. Since every replica must eventually apply all updates, there is a point beyond which adding more replicas does not increase throughput, because every replica is saturated by applying updates. Moreover, if the replication degree exceeds this optimal threshold, the surplus replicas generate overhead through extra communication messages. In this paper, we propose a replication management solution that reduces the number of useless replicas. To this end, we define two mathematical models that approximate the number of replicas needed to achieve a given level of performance. We also demonstrate the feasibility of our replication management model through simulation. The results show the effectiveness and accuracy of our models.
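The saturation argument in the abstract can be illustrated with a small sketch. This is not one of the paper's two models, which the abstract does not spell out; it is a textbook-style capacity bound under assumptions introduced here: every replica has the same hypothetical `capacity` (operations per second), a fraction `update_fraction` of all operations are updates that every replica must apply, and reads are spread evenly over the replicas. Communication overhead is ignored.

```python
def max_throughput(n, capacity, update_fraction):
    """Upper bound on total operations/sec with n replicas (toy model).

    Assumption: each replica executes every update plus 1/n of the reads,
    so one client operation costs (u + (1 - u) / n) units of work per replica.
    """
    per_op_cost = update_fraction + (1.0 - update_fraction) / n
    return capacity / per_op_cost


def min_replicas(target, capacity, update_fraction):
    """Smallest n whose bound reaches `target`, or None if no n can."""
    if max_throughput(1, capacity, update_fraction) >= target:
        return 1
    # With updates present, the bound approaches capacity / update_fraction
    # as n grows but never reaches it, so higher targets are unreachable.
    if update_fraction > 0 and target >= capacity / update_fraction:
        return None
    n = 1
    while max_throughput(n, capacity, update_fraction) < target:
        n += 1
    return n
```

Under these assumptions the bound grows with `n` but flattens toward `capacity / update_fraction`, which is the point past which extra replicas only apply updates. A call such as `min_replicas(2400, 1000, 0.2)` asks for the smallest replication degree that reaches a given target in this toy setting.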
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01297999
Contributor : Lip6 Publications
Submitted on : Tuesday, April 5, 2016 - 11:26:56 AM
Last modification on : Thursday, March 21, 2019 - 1:11:46 PM

Citation

Modou Gueye, Idrissa Sarr, Samba Ndiaye. Database Replication in Large Scale Systems: Optimizing the Number of Replicas. EDBT09 International Workshop on Data Management in Peer-to-peer systems (DAMAP), Mar 2009, Saint Petersburg, Russia. pp.3-9, ⟨10.1145/1698790.1698794⟩. ⟨hal-01297999⟩
