07-28-2014 11:34 AM
Hi All
Configured SL8500 - ACS - 5 drives in Solaries media server. One drive is Down state. tryied manuvally but it's again down.
Verified messages logs shows below Fatel error..
Jul 29 04:09:49 ssbkupmm016v su: [ID 366847 auth.notice] 'su root' succeeded for d705297 on /dev/pts/3
Jul 29 04:11:56 ssbkupmm016v ltid[21107]: [ID 402974 daemon.notice] Operator/EMM server has UP'ed drive 01000110_lto3 (device 4)
Jul 29 04:12:06 ssbkupmm016v avrd[21473]: [ID 124861 daemon.error] Fatal open error on 01000110_lto3 (device 4, /dev/rmt/4cbn), errno = 2 (No such file or directory), DOWN'ing it
Kindly suggest on this
Solved! Go to Solution.
07-28-2014 11:47 AM
it looks like Drive path has been changed or Drive is not visiable in OS level
what is the output of below commands,
mt -f /dev/rmt/4cbn status
tpautoconf -t
tpconfig -d
you need to compare the outputs and make sure that all paths are visiable in tpconfig -d is also visible in tpautoconf -t
if they are not you need to update the configurations..
07-28-2014 11:47 AM
it looks like Drive path has been changed or Drive is not visiable in OS level
what is the output of below commands,
mt -f /dev/rmt/4cbn status
tpautoconf -t
tpconfig -d
you need to compare the outputs and make sure that all paths are visiable in tpconfig -d is also visible in tpautoconf -t
if they are not you need to update the configurations..
07-28-2014 11:59 AM
I agree with Nagalla.
Looks like OS has lost connectivity to the tape drive.
Troubleshooting attempts should be at OS level.
Important commands now are:
mt -f /dev/rmt/4cbn status
and
/usr/openv/volmgr/bin/scan
This will show us what the OS sees.
07-28-2014 12:44 PM
Hi
# date
Tue Jul 29 05:38:33 EST 2014
# uname -a
SunOS Meidaserver 5.10 Generic_147440-19 sun4u sparc SUNW,Sun-Fire-V490
# mt -f /dev/rmt/4cbn status
/dev/rmt/4cbn: No such file or directory
# tpautoconf -t
TPAC60 HP Ultrium 3-SCSI L6HS MXP1004KAN -1 -1 -1 -1 /dev/rmt/3cbn - -
TPAC60 HP Ultrium 3-SCSI L6HS HU10715JNR -1 -1 -1 -1 /dev/rmt/1cbn - -
TPAC60 HP Ultrium 3-SCSI L6HS HU10708N21 -1 -1 -1 -1 /dev/rmt/2cbn - -
TPAC60 HP Ultrium 3-SCSI L6HS MXP0901BAF -1 -1 -1 -1 /dev/rmt/0cbn - -
# tpconfig -d
Id DriveName Type Residence
Drive Path Status
****************************************************************************
0 01020105_lto3 hcart3 ACS(2) ACS=1, LSM=2, PANEL=1, DRIVE=5
/dev/rmt/0cbn UP
1 01000105_lto3 hcart3 ACS(2) ACS=1, LSM=0, PANEL=1, DRIVE=5
/dev/rmt/1cbn UP
2 01010110_lto3 hcart3 ACS(2) ACS=1, LSM=1, PANEL=1, DRIVE=10
/dev/rmt/2cbn UP
3 01010105_lto3 hcart3 ACS(2) ACS=1, LSM=1, PANEL=1, DRIVE=5
/dev/rmt/3cbn UP
4 01000110_lto3 hcart3 ACS(2) ACS=1, LSM=0, PANEL=1, DRIVE=10
/dev/rmt/4cbn DOWN
Currently defined robotics are:
ACS(2) ACSLS server = xxxxx
EMM Server = xxxx
FYI
Other outputs provided below..
bash-3.00# tpconfig -l
Device Robot Drive Robot Drive Device Second
Type Num Index Type DrNum Status Comment Name Path Device Path
robot 2 - ACS - - - - xxxxxxxx
drive - 0 hcart3 - UP - 01020105_lto3 /dev/rmt/0cbn ACS=1, LSM=2, PANEL=1, DRIVE=5
drive - 1 hcart3 - UP - 01000105_lto3 /dev/rmt/1cbn ACS=1, LSM=0, PANEL=1, DRIVE=5
drive - 2 hcart3 - UP - 01010110_lto3 /dev/rmt/2cbn ACS=1, LSM=1, PANEL=1, DRIVE=10
drive - 3 hcart3 - UP - 01010105_lto3 /dev/rmt/3cbn ACS=1, LSM=1, PANEL=1, DRIVE=5
drive - 4 hcart3 - DOWN - 01000110_lto3 /dev/rmt/4cbn ACS=1, LSM=0, PANEL=1, DRIVE=10 =======================> Issue drive
bash-3.00#
--------------------------------------------------------------------------------------------------------------------------------
bash-3.00# ls -ltr /dev/rmt/*cbn
lrwxrwxrwx 1 root root 73 Jul 16 2007 /dev/rmt/0cbn -> ../../devices/pci@8,600000/SUNW,emlxs@1/fp@0,0/st@w500104f000904e13,0:cbn
lrwxrwxrwx 1 root root 73 Jul 16 2007 /dev/rmt/1cbn -> ../../devices/pci@8,600000/SUNW,emlxs@1/fp@0,0/st@w500104f000904e73,0:cbn
lrwxrwxrwx 1 root root 73 Jul 16 2007 /dev/rmt/2cbn -> ../../devices/pci@8,600000/SUNW,emlxs@1/fp@0,0/st@w500104f000904e34,0:cbn
lrwxrwxrwx 1 root root 73 Jul 16 2007 /dev/rmt/3cbn -> ../../devices/pci@8,700000/SUNW,emlxs@3/fp@0,0/st@w500104f000904e43,0:cbn
lrwxrwxrwx 1 root root 73 Jul 16 2007 /dev/rmt/4cbn -> ../../devices/pci@8,700000/SUNW,emlxs@3/fp@0,0/st@w500104f000904e64,0:cbn =======> Issue drive
--------------------------------------------------------------------------------------------------------------------------------
Could not find the one of the tape controller from below output. (500104f000904e64) , Based on the below output we see only 4 controler for tape hence 4 drives are working.
bash-3.00# cfgadm -al
Ap_Id Type Receptacle Occupant Condition
c0 scsi-bus connected configured unknown
c0::dsk/c0t0d0 CD-ROM connected configured unknown
c1 fc-private connected configured unknown
c1::500000e011f259c1 disk connected configured unknown
c1::500000e011f25b31 disk connected configured unknown
c2 fc-fabric connected configured unknown
c2::500104f000904e43 tape connected configured unknown
c3 fc connected unconfigured unknown
c4 fc connected unconfigured unknown
c5 fc-fabric connected configured unknown
c5::500104f000904e13 tape connected configured unknown
c5::500104f000904e34 tape connected configured unknown
c5::500104f000904e73 tape connected configured unknown
usb0/1 unknown empty unconfigured ok
usb0/2 unknown empty unconfigured ok
usb0/3 unknown empty unconfigured ok
usb0/4 unknown empty unconfigured ok
bash-3.00#
--------------------------------------------------------------------------------------------------------------------------------
bash-3.00# luxadm -e port
/devices/pci@9,600000/SUNW,qlc@2/fp@0,0:devctl CONNECTED
/devices/pci@8,700000/SUNW,emlxs@3/fp@0,0:devctl CONNECTED ===========> Drive 3 and 4 are using this port but Drive 4 is only NOT working.
/devices/pci@8,700000/SUNW,emlxs@4/fp@0,0:devctl NOT CONNECTED
/devices/pci@8,700000/SUNW,emlxs@5/fp@0,0:devctl NOT CONNECTED
/devices/pci@8,600000/SUNW,emlxs@1/fp@0,0:devctl CONNECTED
bash-3.00#
07-28-2014 12:46 PM
FYI..
# /usr/openv/volmgr/bin/scan
************************************************************
*********************** SDT_TAPE ************************
*********************** SDT_CHANGER ************************
************************************************************
------------------------------------------------------------
Device Name : "/dev/rmt/3cbn"
Passthru Name: "/dev/sg/c0tw500104f000904e43l0"
Volume Header: ""
Port: -1; Bus: -1; Target: -1; LUN: -1
Inquiry : "HP Ultrium 3-SCSI L6HS"
Vendor ID : "HP "
Product ID : "Ultrium 3-SCSI "
Product Rev: "L6HS"
Serial Number: "MXP1004KAN"
WWN : ""
WWN Id Type : 0
Device Identifier: ""
Device Type : SDT_TAPE
NetBackup Drive Type: 16
Removable : Yes
Device Supports: SCSI-3
Flags : 0x0
Reason: 0x0
------------------------------------------------------------
Device Name : "/dev/rmt/1cbn"
Passthru Name: "/dev/sg/c0tw500104f000904e73l0"
Volume Header: ""
Port: -1; Bus: -1; Target: -1; LUN: -1
Inquiry : "HP Ultrium 3-SCSI L6HS"
Vendor ID : "HP "
Product ID : "Ultrium 3-SCSI "
Product Rev: "L6HS"
Serial Number: "HU10715JNR"
WWN : ""
WWN Id Type : 0
Device Identifier: ""
Device Type : SDT_TAPE
NetBackup Drive Type: 16
Removable : Yes
Device Supports: SCSI-3
Flags : 0x0
Reason: 0x0
------------------------------------------------------------
Device Name : "/dev/rmt/2cbn"
Passthru Name: "/dev/sg/c0tw500104f000904e34l0"
Volume Header: ""
Port: -1; Bus: -1; Target: -1; LUN: -1
Inquiry : "HP Ultrium 3-SCSI L6HS"
Vendor ID : "HP "
Product ID : "Ultrium 3-SCSI "
Product Rev: "L6HS"
Serial Number: "HU10708N21"
WWN : ""
WWN Id Type : 0
Device Identifier: ""
Device Type : SDT_TAPE
NetBackup Drive Type: 16
Removable : Yes
Device Supports: SCSI-3
Flags : 0x0
Reason: 0x0
------------------------------------------------------------
Device Name : "/dev/rmt/0cbn"
Passthru Name: "/dev/sg/c0tw500104f000904e13l0"
Volume Header: ""
Port: -1; Bus: -1; Target: -1; LUN: -1
Inquiry : "HP Ultrium 3-SCSI L6HS"
Vendor ID : "HP "
Product ID : "Ultrium 3-SCSI "
Product Rev: "L6HS"
Serial Number: "MXP0901BAF"
WWN : ""
WWN Id Type : 0
Device Identifier: ""
Device Type : SDT_TAPE
NetBackup Drive Type: 16
Removable : Yes
Device Supports: SCSI-3
Flags : 0x0
Reason: 0x0
#
07-28-2014 03:06 PM
OS Level not avialable. Thats why i'm requesting you to please do the cable trace by coordinating with hardware team so that we can get the switch port info where drive10_lto3 is connected to. any other why to collect switch port info..
what is next step and suggest me...
07-28-2014 03:33 PM
07-28-2014 10:40 PM
I do not quite understand what you are asking here from a backup software community:
... i'm requesting you to please do the cable trace by coordinating with hardware team so that we can get the switch port info where drive10_lto3 is connected to. any other why to collect switch port info..
It seems you are aware of the fact that there is a hardware issue and that nothing from NetBackup side can be done to fix it.
Coordinating with hardware/SAN management team and tracing cables is something only you can do.
Good luck!
07-29-2014 09:54 AM
Hey Marianne
Thanks
I am working with SAN Team on this. After investigating we fond it's cabling issue. .
And library end vendor conformed every thing fine. Now I am working with SAN Team for tracing cables...