Hi,
After several months working alongside Dell to resolve CRC errors, and after changing every piece of hardware, I'm now thinking the problem may be software related.
Backup Exec 9.1 - All updates (inc. SP3)
Windows 2003 Server (was a Dell 2550 - now an HP DL380)
2 x SCSI Cards - 39160 and 29160
1 x Dell PV-132T SDLT Library (2 SDLT drives)
1 x Overland NEO SDLT Library (2 SDLT drives)
OK...
General CRC errors started around 4 months ago, where backups would fail and mark the device as offline.
I called Dell (as recommended in Veritas support documents) and they agreed to replace the drive.
We have tried changing media, replacing media and disposing of problem tapes (losing critical data in the process).
To cut a very long story short: 4 months later the server, the SCSI controller card, several tape drives and the SCSI cable have all been replaced. Dell has done firmware updates on the library, drives and SCSI card (apparently the Dell 39160 card requires a specific firmware version). The BIOS, and even the system backplane and SCSI controllers, have all been updated as well.
About 4 weeks ago Dell agreed to completely replace our WHOLE unit (library chassis, drives, cable, controller card etc., brand new from the factory) as long as we agreed to discard ALL our existing tapes and start over, in case a damaged tape had damaged a drive head, which in turn damaged another tape, creating a fault cycle. We did.
A week later backups are failing with CRC errors again - and it is worse now than before! :(
Backups are successful maybe twice a week!!!
The only common factor here is the software.
The event IDs are (in order):
33152 - Write Failure - Error=Error_CRC
57665 - Dell 3 Reported an error to write data to media - Data Error (crc check)
33152 - Rewind Failure (after device is offline)
58053 - Robotic Library Error
The whole sequence takes about 3 minutes from the write failure to the device failing.
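In case it helps anyone compare timings, the gap could be measured from an exported event list with something like the rough Python sketch below. The timestamps are made-up placeholders, and the tuple layout and the function name failure_windows are my own assumptions, not anything Backup Exec produces; only the event IDs and descriptions come from the log above.

```python
from datetime import datetime

# Hypothetical sample rows: (timestamp, event ID, description).
# The times are invented for illustration only.
SAMPLE_EVENTS = [
    ("2005-01-10 20:04:00", 33152, "Write Failure - Error=Error_CRC"),
    ("2005-01-10 20:05:10", 57665, "Data Error (crc check)"),
    ("2005-01-10 20:06:30", 33152, "Rewind Failure"),
    ("2005-01-10 20:07:00", 58053, "Robotic Library Error"),
]

def failure_windows(events, start_id=33152, end_id=58053):
    """Return seconds from each write failure to the next library error."""
    windows = []
    start = None
    for stamp, event_id, text in events:
        when = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S")
        # 33152 is logged for both write and rewind failures, so the
        # description is checked too (assumed to start with "Write Failure").
        if event_id == start_id and text.startswith("Write Failure") and start is None:
            start = when
        elif event_id == end_id and start is not None:
            windows.append((when - start).total_seconds())
            start = None
    return windows

print(failure_windows(SAMPLE_EVENTS))
```

Running the same thing over the events from both libraries would show whether the two fail at the same moments or independently.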
At first I thought it was the media, but after purchasing tapes from Fuji, Quantum and Dell, all giving the same errors, it can't be.
Also, and here's the twist: JOB1 starts at 8pm, writes 500 MB to TAPE4 in DRIVE 2 and fails with a CRC error.
I then choose to re-run the job, which it does using the same drive, same tape and same backup, and it runs fine!!
Also, tapes I thought were faulty have been used again without a problem in the drive that indicated an error.
Also, the NEO library uses a completely separate SCSI card and its own tapes (media is not shared between devices) - and that gets CRC errors too, with exactly the same symptoms.
Any help \ ideas - or a solution!!?