Efficient approaches to location fault tolerance (Part 3)
We continue discussing trends in the advancement of storage virtualization. Two posts ago, we covered advanced data virtualization, and the last post we covered self organizing systems. We are now onto the future for location fault tolerance.
Current storage products are focused mainly on setting up storage tiers within a single data center, and using replication to gain offsite data protection. Yes, these products do address multiple data centers being managed centrally, but typical architectures do not have data being managed across data centers besides the replicated copies.
Why the single data center consolidation focus for tiers? Most likely, it is because the network is a big unknown for storage vendors who are trying to create sellable products. Taking the network out of the picture ensures QoS and performance. Further, clustered systems are difficult to run over WANs.
Using replication does solve site failure scenarios, but since it is making full copies of data, it does not provide ideal storage efficiency. Coupling the RAID parity (suppose 120% raw capacity required) with replication, perhaps to one or two sites, it is easy to calculate 240% to 360% raw storage is required. Ramp up this system to petabytes, and it gets expensive very fast.
Considering Dispersal, devices can be placed in multiple locations, and the M of N configuration can allow for location fault tolerance. For example, if the M of N configuration is 12 of 16, and the 16 data packets are stored across 16 devices in 4 locations, a location failure still leaves 12 available data packets to reconstruct that data.
Access with object storage is through a non-clustered approach since software clients can access the dsNet system, (as well as through accesser appliances). dsNet systems can efficiently run over WANs and store data packets in multiple data centers. This architecture will prove to be important for scale out systems in the future.
Coupling advanced data virtualization via Dispersal with features to store data packets at multiple sites solves the storage efficiency problems with replication – today’s expensive and predominant multi-site data protection feature. Cleversafe is the leader in this approach, and it will prove a requirement as storage virtualization progresses and networks improve.
We’ll discuss a shift in management systems in more detail in the next post.
