Wednesday, August 12, 2009

VTL Vendor comparison for Data DeDuplication

In continuation to my original post "DeDuplicating Data Backups" Here are the details for the follow-up

As I was saying in my requirement list, one of the main objectives is to have integration with NetBackup OST. Now, only few vendors have that - DataDomain, Sepaton, FalconStor, Quantum, Copan (OEM of FalconStor), Diligent/IBM

I had taken each of these individually and compared their features as to what they claim they can do and can they meet my requirements. Based on that FalconStor came out on top of the list.

Feature

Sepaton

Diligent/IBM

Data Domain

FalconStor

Quantum

NetApp

Appliance or Software based

Appliance

Appliance

Appliance

Both

Appliance

Appliance

De-Duplication?

Y

Y

Y

Y

Y

y

Source/Target based de-dup

Target

Target

Target

Target

Target

Target

De-Duplication type (Inline/Post-Processing)

Post

Inline

Inline

Post

Post/policy-based

Post

If post processing, need to wait until all data is received?

Y

NA

NA

Configurable

Configurable

Y

De-duplication level

File

File

sub-file

sub-file

sub-file

block/sub-file

Fixed/Variable Length segment size (and size) - Granuality

Differencing


Variable

Fixed

Variable

Variable

De-duplication technology

Delta Diff.

Delta Diff.

Hash

Hash

Hash

Delta Diff.

Global (if multiple devices) de-duplication?

Y

Y

N

Y

N

N

Periodic/Scheduled scrubbing to remove unclaimed blocks?



Y

N


Y

Max Devices in global de-dup

5

2

1

8

1

1

Max throughput per device (Ingest rate)

600MB/s

900MB/s

750MB/s

1.5GB/s

880MB/s

600MB/s

Max throughput in max config (Ingest rate)

3GB/s (11TB/hr)

NA

750MB/s

12GB/s (43TB/hr)

NA

NA

De-duplication speeds (per device/globally)

1500MB/s (5.5TB/hr)

NA

750MB/s

4GB/s

500MB/s

Not Published

Restore speeds same as backup? (Impact of rehydration)

N

N

N

Y

N

N

Encryption

N



Y


N

Compression

Y

Y

Y

Y

Y

Y

Integrated Replication

Y

Y

Y

Y

Y

Y

Replication technology (FC/IP)

IP

IP

IP

Both

IP

FC (DWDM)

Bi-directional replication on same set of devices?



Y

Y

Y


Network Optimized replication? (dedup/compress)

Y

Y

Y

Y

Y

N

Physical tape integration?

N

N

N

Y

Y

Y

Integration with Symantec OST?

N

N

Y

Y

Y

N

Shared storage (leverage existing storage infrastructure & no vendor tie-in for backend storage)

Y

Y

N

Y

N

N

Dynamic Addition of capacity to VTL

N

Y

N

Y

N

N

HA configuration (in case if one of the VTL appliance fails)

Y

Y

N

Y

N

N

If Appliance, RAID level used

5

Vendor Qualification Matrix

5

Vendor Qualification Matrix

5

5

Scalability (with Max node config - HA only)

1.2PB

4

768TB

2.4PB (SIR) / 32PB (VTL)

220TB

128TB

1 comment:

  1. Thanks so much for this great post! I have been trying to learn all about data deduplication so I can understand it better. It is so awesome how this chart works and can provide you with that information! I know it can be a confusing subject for most. Can you tell me where I can find more information on this?

    ReplyDelete