05-12-2011 12:40 PM
I believe there may be something wrong with one of two drives, but would like some confirmation...
I am getting failures on “IBM.ULTRIUM-HH4.000”, and the drive is going down afterwards. I would ‘Up’ the drive, run another job, and it would fail and go back ‘Down’ again. However, the jobs running on drive “IBM.ULTRIUM-HH4.001”, every job so far seems to be finishing successfully.
I am running NetBackup 7.0 (build 20100104) on Windows2008 R2 Datacenter.
Hardware is SpectraLogic T50e (BlueScale v.11.3.1) with IBM LTO4 Half Height Fibre (fw: A23E) FWA23fddd
Here is a sample of some status logs...
Samples of Error Jobs:
5/12/2011 10:08:54 AM - started process bptm (5432)
5/12/2011 10:08:54 AM - requesting resource 21908L
5/12/2011 10:08:54 AM - granted resource 21908L
5/12/2011 10:08:54 AM - granted resource IBM.ULTRIUM-HH4.000
5/12/2011 10:08:56 AM - started process bptm (5432)
5/12/2011 10:08:56 AM - mounting 21908L
5/12/2011 10:10:04 AM - mounted; mount time: 00:01:08
5/12/2011 10:22:26 AM - Error bptm(pid=5432) io_ioctl_ndmp (MTFSF) failed on media id 21908L, drive index 0, return code 18 (NDMP_XDR_DECODE_ERR) (tmisc.c.1441)
5/12/2011 10:22:26 AM - Error bptm(pid=5432) NDMP SDK: stub called for missing shared library entry "ndmp_get_error_name"
5/12/2011 10:22:26 AM - Error bptm(pid=5432) NDMP SDK: continuing without looking up error name; returning "?"
5/12/2011 10:22:26 AM - Error bpimport(pid=7052) Status = media position error.
5/12/2011 10:22:26 AM - end Import; elapsed time: 00:13:32
media position error(86)
5/12/2011 9:45:12 AM - started process bptm (5068)
5/12/2011 9:45:12 AM - requesting resource 21908L
5/12/2011 9:45:12 AM - granted resource 21908L
5/12/2011 9:45:12 AM - granted resource IBM.ULTRIUM-HH4.000
5/12/2011 9:45:14 AM - started process bptm (5068)
5/12/2011 9:45:14 AM - mounting 21908L
5/12/2011 9:46:20 AM - mounted; mount time: 00:01:06
5/12/2011 9:56:49 AM - Error bptm(pid=5068) io_ioctl_ndmp (MTFSF) failed on media id 21908L, drive index 0, return code 18 (NDMP_XDR_DECODE_ERR) (tmisc.c.1441)
5/12/2011 9:56:49 AM - Error bptm(pid=5068) NDMP SDK: stub called for missing shared library entry "ndmp_get_error_name"
5/12/2011 9:56:49 AM - Error bptm(pid=5068) NDMP SDK: continuing without looking up error name; returning "?"
5/12/2011 9:56:49 AM - Error bpimport(pid=6224) Status = media position error.
5/12/2011 9:56:49 AM - end Import; elapsed time: 00:11:37
media position error(86)
(this is one that stated on “000”, and passed the same tape to “001”, then finished successfully)
5/12/2011 8:11:46 AM - started process bptm (6200)
5/12/2011 8:11:46 AM - requesting resource 21907L
5/12/2011 8:11:47 AM - granted resource 21907L
5/12/2011 8:11:47 AM - granted resource IBM.ULTRIUM-HH4.000
5/12/2011 8:11:48 AM - started process bptm (6200)
5/12/2011 8:11:48 AM - mounting 21907L
5/12/2011 8:12:02 AM - Error bptm(pid=6200) error requesting media, TpErrno = Robot operation failed
5/12/2011 8:12:02 AM - current media 21907L complete, requesting next resource IBM.ULTRIUM-HH4.000:Import:21907L
5/12/2011 8:14:22 AM - granted resource 21907L
5/12/2011 8:14:22 AM - granted resource IBM.ULTRIUM-HH4.001
5/12/2011 8:14:23 AM - started process bptm (6200)
5/12/2011 8:14:23 AM - mounting 21907L
5/12/2011 8:15:32 AM - mounted; mount time: 00:01:09
5/12/2011 8:15:32 AM - positioning 21907L to file 557
5/12/2011 8:17:48 AM - positioned 21907L; position time: 00:02:16
5/12/2011 8:17:49 AM - begin reading
5/12/2011 8:19:31 AM - end reading; read time: 00:01:42
the requested operation was successfully completed(0)
Sample of Successful jobs:
5/12/2011 9:45:41 AM - begin Import
5/12/2011 9:45:41 AM - started process bptm (676)
5/12/2011 9:45:41 AM - requesting resource 21909L
5/12/2011 9:45:42 AM - granted resource 21909L
5/12/2011 9:45:42 AM - granted resource IBM.ULTRIUM-HH4.001
5/12/2011 9:45:43 AM - started process bptm (676)
5/12/2011 9:45:43 AM - mounting 21909L
5/12/2011 9:47:34 AM - mounted; mount time: 00:01:51
5/12/2011 10:04:02 AM - end Import; elapsed time: 00:18:21
the requested operation was successfully completed(0)
Solved! Go to Solution.
05-12-2011 02:32 PM
Based on the information given, I agree ...
io_ioctl_ndmp (MTFSF) failed on media id 21908L
.. is a positioning error. The actual positioning is carried out by the o/s / tape drivers - nothing to do with NBU.
Martin
05-12-2011 01:07 PM
Looks like a bad drive if same media works OK in another drive
05-12-2011 01:12 PM
This is what I was thinking, but wanted another set of eyes on the error messages if anything sticks out at you. :)
05-12-2011 02:32 PM
Based on the information given, I agree ...
io_ioctl_ndmp (MTFSF) failed on media id 21908L
.. is a positioning error. The actual positioning is carried out by the o/s / tape drivers - nothing to do with NBU.
Martin