cancel
Showing results for 
Search instead for 
Did you mean: 

What does the de-duplication verify step actually do?

robnicholson
Level 6

A policy based upon a deduplication storage folder with client side de-duplication splits the task into two halves for performance reasons. Whilst backing up a 3TB server yesterday, the server went offline due to an intermittent fault on one of the hard disks (which is why we're backing it up :-)) but the verify task started and is still running.

Do I infer correctly from this that the verify stage is checking the validity of the data in the de-duplication storage folder itself and isn't doing a byte level compare against the source?

Allied to this, if one does a full backup of a large server with de-duplication that has just been backed up, the backup is far quicker than a normal backup-to-disk folder as the majority of the data already exists in the de-duplication folder and doesn't get sent across the network and written to backup store.

But what about the verify step of a full backup? Does it always read and verify EVERY file in the backup set whether or not it was actually written during that backup or does it only verify the files that had to be refreshed in the backup due to a change?

Final question for now. With a full backup with client side deduplication, is EVERY file read from disk by the client, split into 64k blocks and checked if they already exist in the deduplication storage folder? Or is there some clever kind of check that the client can carry out without reading the entire file to see if it's changed?

Cheers, Rob.

3 REPLIES 3

itsmeaffinity
Level 6

Hi

 

Verify basically calcualte the checksum of the data being backed up usig algorithm's to see data being backed up is vaild or not

Please also check this for more details on it

http://www.symantec.com/docs/TECH18698

Thank You

robnicholson
Level 6

Thanks for that. Makes sense and explains why it's better to run the verify stage on the media server not on the client as the checksum calculation runs much faster locally on the media server than on the client.

I'll do a full backup again ASAP which should only end up refreshing a relatively small number of blocks in the dedup folder. If the verify is very quick, it would show that it's only verifying the files refreshed, not all the files.

Cheers, Rob.

itsmeaffinity
Level 6

Hi

 

Thanks for your response & if this helps you please mark this solved

 

Thank You