cancel
Showing results for 
Search instead for 
Did you mean: 

Data Deduplication!!

Shyam_Prasad
Level 4
Certified
Hi,

What is meant by De-duplication?
Is De-duplication enabled by default in NetBackup?
If  not how to goahead for De-duplication?
Pls let me know..

--Jaya
1 ACCEPTED SOLUTION

Accepted Solutions

Seth_Bokelman
Level 5
Certified
De-duplication is the elimination of redundant data from backup media.  It applies mostly to backups performed to disk systems, not tape systems, and it's not a feature of NetBackup, per se, though I believe you can do it with Symantec's PureDisk product.  Typically, people who are using VTL devices, such as Data Domain or NetApp, will use DeDupe so that they can store the equivalent of 200 TB of backup data on 20 TB of disk.  So, you could attach one of these devices to your NetBackup installation, and then you would have deduplication with NetBackup.

Basically, it works by breaking all of your data into chunks, performing a hash of each of those chunks and looking for hashes that are identical.  The systems then compare the data, and if it's truely identical, they'll only keep one copy and throw away they others.  As this is all happening on the storage shelf level, it's transparent to your backup software, and the space savings are massive, as backup data often doesn't change much between fulls...

That's a quick 'n dirty explanation, I'd also suggest the Wikipedia article: en.wikipedia.org/wiki/Data_deduplication

View solution in original post

2 REPLIES 2

Seth_Bokelman
Level 5
Certified
De-duplication is the elimination of redundant data from backup media.  It applies mostly to backups performed to disk systems, not tape systems, and it's not a feature of NetBackup, per se, though I believe you can do it with Symantec's PureDisk product.  Typically, people who are using VTL devices, such as Data Domain or NetApp, will use DeDupe so that they can store the equivalent of 200 TB of backup data on 20 TB of disk.  So, you could attach one of these devices to your NetBackup installation, and then you would have deduplication with NetBackup.

Basically, it works by breaking all of your data into chunks, performing a hash of each of those chunks and looking for hashes that are identical.  The systems then compare the data, and if it's truely identical, they'll only keep one copy and throw away they others.  As this is all happening on the storage shelf level, it's transparent to your backup software, and the space savings are massive, as backup data often doesn't change much between fulls...

That's a quick 'n dirty explanation, I'd also suggest the Wikipedia article: en.wikipedia.org/wiki/Data_deduplication

Shyam_Prasad
Level 4
Certified
Thanks Seth for ur Sharing... :)