Skip to content
Define The Cloud

The Intersection of Technology and Reality

Define The Cloud

The Intersection of Technology and Reality

Redundancy in Data Storage: Part 2: Geographical Replication

Joe Onisick (@JoeOnisick), February 21, 2011

This is a followup to my previous article, Redundancy in Data Storage: Part 1: RAID Levels, where I discussed various site-local data redundancy technologies. Here, I will attempt to detail many of the choices available to provide redundancy beyond the data center that organizations use to solve disaster recovery, business continuity, and continuity of operations (COOP).

It’s obvious that site-local redundancy isn’t enough for critical applications. The threat of natural disasters is always looming, regional power outages occur, building electrical and mechanical systems fail, and backhoes seem to hate fiber optic cable. Enterprises therefore attempt to use geographic redundancy to ensure that even when these things happen critical applications and data remain available. At the heart of making an application geographically redundant is making sure the application’s data resides in more than one geographical location. There are a number of technology and architectural choices that can be used to achieve this geographical replication of data. Often these solutions will be evaluated in terms of cost, RTO (recovery time objective), and RPO (recovery point objective), as I outlined in Disaster Recovery and the Cloud.

One obvious place to build redundancy is at the storage area network level. There are a variety of technologies available to replicate SAN volumes between geographic locations. Synchronous replication tightly couples the primary and backup sites and does not return success to the storage controller until a write completes in both locations, providing a zero RPO. However, synchronous replication requires very fast network connections and requires that the backup site be located very close to the primary location because otherwise latency will severely reduce storage performance. To allow the sites to be further apart, asynchronous replication can be used where the changes are streamed to the backup site but completion of the I/O is signalled before receiving an acknowledgement. Finally, point-in-time replication generates many snapshots of the storage and sends the delta between each snapshot.

All of these SAN replication approaches are bandwidth intensive. Applications make many changes to the disk as part of their ordinary functioning and these changes are almost certainly not encoded in a dense fashion that allows them to efficiently cross networks. An application might make small updates to the same disk block many times in short order and all of these changes would have to be sent across the network in asynchronous or synchronous replication. Point in time replication lowers this overhead a small amount (because redundant changes between snapshots are not sent) at the cost of worse RPO.

Redundancy can also be implemented through database replication. Just as in SAN replication, synchronous, asynchronous, and snapshot-based techniques are available. Many of the same tradeoffs apply, although generally database changes can be sent more efficiently across a WAN. Unfortunately, effectively using database replication to provide geographic redundancy is difficult. For one, database replication can only stand on its own if all of the critical application data resides within the database. This is often not the case. Moreover, sophisticated database deployments involving data partitioning, federation, and integration often greatly complicate replication to the point that effective configuration becomes prohibitive.

Finally, the application itself can handle data redundancy. Often the highest end applications (for instance financial, logistics, and reservation systems) require the federation of data at the application level. This allows extreme top-end performance to be reached and also allows compliance with various types of data jurisdiction requirements (for instance, national directives requiring customer identifiable information to remain in the country of origin.) Unfortunately, this is very difficult and error prone.

Data redundancy is only one piece of the business continuity problem. Applications require other infrastructure to run, such as the network and application servers. Some organizations are using virtualized approaches here with some success to build geographically redundant architectures. Others rely on configuration management technologies to ensure that the disaster recovery sites remain synchronized and ready to handle workload. Another important point to consider is how to handle moving the active instance of the application to the backup site, and also how to re-establish redundancy after a failure and move applications back to the primary. Any approach to provide geographic redundancy must be designed carefully and continually tested well, because today’s complicated application architectures provide too many opportunities for mistakes to be made in provisioning redundancy.

These replication techniques still require the site-local mechanisms like RAID discussed in part 1, because otherwise the facilities involved would be far too unreliable, and also require significant investments in network links, replication technologies, and personnel effort. Also, for the most part, these technologies require the duplication of infrastructure for disaster recovery purposes. In my forthcoming part 3, I will discuss emerging approaches in cloud architectures that unify redundancy mechanisms and significantly simplify the effort involved in implementing resilient business systems.

About the Author

Michael Lyle (@MPLyle) is CTO and co-founder of Translattice, and is responsible for the company’s strategic technical direction.  He is a recognized leader in developing distributed systems technologies and has extensive experience in datacenter and information technology operations.

Share this:

  • Facebook
  • X

Related posts:

  1. Redundancy in Data Storage: Part 1: RAID Levels
  2. Have We Taken Data Redundancy too Far?
  3. Digging Into the Software Defined Data Center
  4. Storage Protocols
  5. Intel’s Betting the Storage I/O Farm on the CPU
Technical Deep Dive

Post navigation

Previous post
Next post

Related Posts

Data Center Bridging

March 12, 2010May 26, 2010

Data Center Bridging (DCB) is a group of IEEE standard protocols designed to support I/O consolidation.  DCB enables multiple protocols with very different requirements to run over the same Layer 2 10 Gigabit Ethernet infrastructure.  Because DCB is currently discussed along with Fibre Channel over Ethernet (FCoE) it’s not uncommon…

Share this:

  • Facebook
  • X
Read More

FCoE multi-hop; Do you Care?

May 30, 2010May 30, 2010

There is a lot of discussion in the industry around FCoE’s current capabilities, and specifically around the ability to perform multi-hop transmission of FCoE frames and the standards required to do so.  A recent discussion between Brad Hedlund at Cisco and Ken Henault at HP (http://bit.ly/9Kj7zP) prompted me to write…

Share this:

  • Facebook
  • X
Read More

SDN – Centralized Network Command and Control

June 22, 2012May 18, 2020

Software Defined Networking (SDN) is a hot topic in the data center and cloud community.  The geniuses <sarcasm> over at IDC predict a $2 billion market by 2016 (expect this number to change often between now and then, and look closely at what they count in the cost.) The concept…

Share this:

  • Facebook
  • X
Read More

Comments (7)

  1. Pingback: Tweets that mention Redundancy in Data Storage: Part 2: Geographical Replication — Define The Cloud -- Topsy.com
  2. organic colon cleanse recipe says:
    July 8, 2014 at 2:11 am

    Here is on the best way to lose 20 lbs in 14 days a weird method.
    Having a cleanse you’ll be able become wounded or to recover faster
    once you get cuts.

    Reply
  3. contest to win a free iphone 5 says:
    August 25, 2014 at 4:51 pm

    We’ll talk about the many options from going to the Apple store to
    actually purchasing an iphone glass repair kit.
    It releases October 14 to retail, pre-orders begin October 7.
    For all the relaxation of other people who sort the bulk of the individuals, a likelihood to win a i – Pad
    is like a the moment in a life span give.

    Reply
  4. collar blue rhinestone says:
    November 9, 2014 at 2:22 am

    Nice post. I was checking continuously this weblog and I am inspired!
    Extremely useful information specially the closing section :
    ) I care for such information a lot. I used to be looking
    for this particular information for a long
    time. Thanks and good luck.

    Reply
  5. Alpha X Boost And Muscle X Boost says:
    February 23, 2017 at 8:35 pm

    Thanks a lot for being our instructor on this subject.
    My spouse and i enjoyed your current article quite definitely and most of all
    cherished how you really handled the issues I regarded as being controversial.
    You happen to be always incredibly kind towards readers much like me
    and assist me in my existence. Thank you.

    Reply
  6. Vibrant C says:
    March 18, 2017 at 12:46 am

    We’re a gaggle of volunteers and opening a brand new scheme in our community.
    Your website provided us with valuable info to work on. You
    have performed an impressive job and our whole neighborhood
    can be thankful to you.

    Reply
  7. Libidogene says:
    April 25, 2017 at 1:11 am

    Hello colleagues, nice piece of writing and good
    arguments commented at this place, I am really enjoying by these.

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Creative Commons License
This work by Joe Onisick and Define the Cloud, LLC is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License

Disclaimer

All brand and company names are used for identification purposes only. These pages are not sponsored or sanctioned by any of the companies mentioned; they are the sole work and property of the authors. While the author(s) may have professional connections to some of the companies mentioned, all opinions are that of the individuals and may differ from official positions of those companies. This is a personal blog of the author, and does not necessarily represent the opinions and positions of his employer or their partners.
©2025 Define The Cloud | WordPress Theme by SuperbThemes