Posts

Showing posts from April, 2021

Data Center Failures and CRDB Replication

Image
Data Center Failures and CRDB Replication David Lukens, 2021-04-03 Executive Summary The purpose of this document is to explain how replication of data within CockroachDB can be used to tolerate various failure scenarios and the impacts to query execution times. This simulation will explore the effect of different replication factors on a database cluster that spans multiple Availability Zones in multiple Regions. Overall, more replicas allows for more nodes to be down at the same time, while both R=5 and R=7 allow for the loss of an entire region. But, counterintuitively, R=7 only allows for a single Availability Zone to be lost, while R=5 allows for two. Description In a discussion, the question came up, “What happens to geographically diverse replicas when something fails?” The result was a simulation of geo-partitioned replicas and creation of a visual representation to understand this. This was for a 15 node CockroachDB cluster, with five Availability Zones (AZs or Data Cent...