07-04-2012 08:27 AM
Netbackup 7.1.0.2 / Windows 2008 server x64 / Dell ML6020 with 6 LTO3 drives.
For the last couple of weeks Netbackup has been down'ing one of my tape drives pretty much every night. But it's not happening every single night. When the error occurs I see a message like the following in the Problems report and the media server's application event log:
TapeAlert Code: 0x1f, Type: Critical, Flag: HARDWARE B, from drive (index -1)
I can reset and then up the drive via the admin console, and run manual backups and then scheduled backups will run without issues. But on other occassions overnight backups run and the drive is taken down by NBU.
The Dell management console and the hardware itself does not show any errors. I did open a case with Symantec, but they dismissed it and said it's a hardware fault and to contact the vendor. I spoke to Dell who have been very helpful, but it's definetely looking like a NBU issue.
Any ideas on what else to look at?
Solved! Go to Solution.
07-04-2012 11:19 AM
You have a hardware issue with the drive.
0x1F: 'Hardware B: Tape drive has a problem not read/write related',
This message has actually been generated by te tapedrive, as the firmware has detected that there is some problem.
No matter what the hardware vendor may say, this issue cannot be caused by NBU, it is impossible.
NBU is only reporting the error.
The TAPEALERT is actually geberated and held on the tapedrive (firmware/ memory), the codes are onlt read and displayed by NBU.
From http://www.symantec.com/docs/TECH169477
TapeAlert / Tape Alert
Oct 11 08:59:31 media bptm[3771]: [ID 228150 daemon.warning] TapeAlert Code: 0x03, Type: Warning, Flag: HARD ERROR, from drive TLD0_LTO4_DRIVE1 (index 4), Media Id R0TP01
07-04-2012 08:42 AM
It seems H/W issue but, below things need to check ...
Is that drive asking for cleaning ??
Is tape drive drivers update as recent version ?
What the OS logs says at the time of drive down issue ?
07-04-2012 08:54 AM
07-04-2012 10:47 AM
Hi,
I have encountered the above issue in our environment, we have HP LTO4 ( EML e series Library) Drive. I logged a case with HP and vendor suggested to upgrade the drive to latest firmware version. After the upgrdation the drive worked fine.
07-04-2012 11:19 AM
You have a hardware issue with the drive.
0x1F: 'Hardware B: Tape drive has a problem not read/write related',
This message has actually been generated by te tapedrive, as the firmware has detected that there is some problem.
No matter what the hardware vendor may say, this issue cannot be caused by NBU, it is impossible.
NBU is only reporting the error.
The TAPEALERT is actually geberated and held on the tapedrive (firmware/ memory), the codes are onlt read and displayed by NBU.
From http://www.symantec.com/docs/TECH169477
TapeAlert / Tape Alert
Oct 11 08:59:31 media bptm[3771]: [ID 228150 daemon.warning] TapeAlert Code: 0x03, Type: Warning, Flag: HARD ERROR, from drive TLD0_LTO4_DRIVE1 (index 4), Media Id R0TP01
07-04-2012 11:53 AM
07-04-2012 01:29 PM
It is very very simple:
Jul 4 04:58:10 weabsunprd08 bptm[623]: [ID 995414 daemon.warning] TapeAlert Code: 0x03, Type: Warning, Flag: HARD ERROR, from drive LTO4_NAC_I2K_D07 (index 31), Media Id W45684
You have a hardware issue.
In NetBackup, there are not too many times that we can say we are 100% certain, but this is one of them.
You do not have a NetBackup issue, you have a hardware (or firmware) issue.
The issue is given to you in the tapealert message, which, as I explained above - is generated by the tapedrive firmware when it detects some issue.
This TN http://www.symantec.com/docs/TECH169477
... explains tapealert and meny other drive/ library issues.
It was created with the purpose of deflecting calls to the right people, that is the hardware vendors. Yes, there will be a few exceptions, the common exceptions are listed in the TN, but even then, the troubleshooting should start at the hardware /os level.
Martin
07-04-2012 01:57 PM
Have a look at what tapealert.org says about the errors coming from your tape drive:
http://www.tapealert.org/archives/184
Please give these TapeAlerts to your hardware vendor and ask them to explain how this is a software issue.
NBU is DOWN'ing the drive BECAUSE of these errors. The order of events says it all....
07-04-2012 03:37 PM
From tapealert.org
"A “tape alert” message is a critical, warning, or informational alert that occurs due to a tape drive or robotic library hardware event. These “tape alert” messages are stored on the tape drive or robotic library. Applications like Backup Exec™ orSANtools® SMARTMon-UX query the tape device or robotic library for these “tape alert” messages and display the “tape alerts” to the user."
This detail, from the tapealert.org site, explains exactly what I explained.
Key points:
"occurs due to a tape drive or robotic library hardware event"
"Applications like Backup Exec™ orSANtools® SMARTMon-UX query the tape device"
So, this shows with no doubt, that backup applications do not cause tapealerts.
As marianne says:
"Please give these TapeAlerts to your hardware vendor and ask them to explain how this is a software issue."
Martin