cancel
Showing results for 
Search instead for 
Did you mean: 

WARNING: VxVM vxio V-5-0-2 Subdisk disk04-28 block 6654400: Uncorrectable read error

Jay008
Level 3
Certified

Hi ,

 

i am new to the veritas volume manager.i saw some message in /var/ad/messages like " WARNING: VxVM vxio V-5-0-2 Subdisk disk04-28 block 6654400: Uncorrectable read error".the disk04 is c2t4d0s2. i have issued some commands iostat,vxdisk list,vxprint -ht .In vxprint -ht i got a message like c2t4d0s2 is failling.but all the subdisk are in active state only.please fiind and help me to solve the issue. i pasted the output below.

 

 

bash-2.05# vxdisk list
DEVICE       TYPE            DISK         GROUP        STATUS
c2t0d0s2     auto:sliced     rootdisk     rootdg       online
c2t1d0s2     auto:sliced     disk01       rootdg       online
c2t2d0s2     auto:sliced     disk02       p2mbadg      online
c2t3d0s2     auto:sliced     disk03       p2mbadg      online
c2t4d0s2     auto:sliced     disk04       p2mbadg      online
c2t5d0s2     auto:sliced     disk05       p2mbadg      online
c4t121d0s2   auto:none       -            -            online invalid


bash-2.05#  vxprint -ht
Disk group: rootdg

DG NAME         NCONFIG      NLOG     MINORS   GROUP-ID
ST NAME         STATE        DM_CNT   SPARE_CNT         APPVOL_CNT
DM NAME         DEVICE       TYPE     PRIVLEN  PUBLEN   STATE
RV NAME         RLINK_CNT    KSTATE   STATE    PRIMARY  DATAVOLS  SRL
RL NAME         RVG          KSTATE   STATE    REM_HOST REM_DG    REM_RLNK
CO NAME         CACHEVOL     KSTATE   STATE
VT NAME         NVOLUME      KSTATE   STATE
V  NAME         RVG/VSET/CO  KSTATE   STATE    LENGTH   READPOL   PREFPLEX UTYPE
PL NAME         VOLUME       KSTATE   STATE    LENGTH   LAYOUT    NCOL/WID MODE
SD NAME         PLEX         DISK     DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
SV NAME         PLEX         VOLNAME  NVOLLAYR LENGTH   [COL/]OFF AM/NM    MODE
SC NAME         PLEX         CACHE    DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
DC NAME         PARENTVOL    LOGVOL
SP NAME         SNAPVOL      DCO

dg rootdg       default      default  122000   1119494390.10.p.com


Disk group: p2mbadg

DG NAME         NCONFIG      NLOG     MINORS   GROUP-ID
ST NAME         STATE        DM_CNT   SPARE_CNT         APPVOL_CNT
DM NAME         DEVICE       TYPE     PRIVLEN  PUBLEN   STATE
RV NAME         RLINK_CNT    KSTATE   STATE    PRIMARY  DATAVOLS  SRL
RL NAME         RVG          KSTATE   STATE    REM_HOST REM_DG    REM_RLNK
CO NAME         CACHEVOL     KSTATE   STATE
VT NAME         NVOLUME      KSTATE   STATE
V  NAME         RVG/VSET/CO  KSTATE   STATE    LENGTH   READPOL   PREFPLEX UTYPE
PL NAME         VOLUME       KSTATE   STATE    LENGTH   LAYOUT    NCOL/WID MODE
SD NAME         PLEX         DISK     DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
SV NAME         PLEX         VOLNAME  NVOLLAYR LENGTH   [COL/]OFF AM/NM    MODE
SC NAME         PLEX         CACHE    DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
DC NAME         PARENTVOL    LOGVOL
SP NAME         SNAPVOL      DCO

dg p2mbadg      default      default  15000    1029939189.1953..com

dm disk02       c2t2d0s2     auto     10175    143328960 -
dm disk03       c2t3d0s2     auto     10175    143328960 -
dm disk04       c2t4d0s2     auto     10175    143328960 FAILING
dm disk05       c2t5d0s2     auto     10175    143328960 -

 

v  system_temp03 -           ENABLED  ACTIVE   31453184 SELECT    -        fsgen
pl system_temp03-01 system_temp03 ENABLED ACTIVE 31454016 CONCAT  -        RW
sd disk04-28    system_temp03-01 disk04 80420928 31454016 0       c2t4d0   ENA

v  system_temp04 -           ENABLED  ACTIVE   31453184 SELECT    -        fsgen
pl system_temp04-01 system_temp04 ENABLED ACTIVE 31454016 CONCAT  -        RW
sd disk05-28    system_temp04-01 disk05 80420928 31454016 0       c2t5d0   ENA

v  system_temp05 -           ENABLED  ACTIVE   31453184 SELECT    -        fsgen
pl system_temp05-01 system_temp05 ENABLED ACTIVE 31454016 CONCAT  -        RW
sd disk02-29    system_temp05-01 disk02 111874944 31454016 0      c2t2d0   ENA

v  system_temp06 -           ENABLED  ACTIVE   31453184 SELECT    -        fsgen
pl system_temp06-01 system_temp06 ENABLED ACTIVE 31454016 CONCAT  -        RW
sd disk03-29    system_temp06-01 disk03 111874944 31454016 0      c2t3d0   ENA

v  system_temp07 -           DISABLED CLEAN    31453184 SELECT    -        fsgen

v  system_temp08 -           ENABLED  ACTIVE   31453184 SELECT    -        fsgen
pl system_temp07-01 system_temp08 ENABLED ACTIVE 31454016 CONCAT  -        RW
sd disk04-29    system_temp07-01 disk04 111874944 31454016 0      c2t4d0   ENA
pl system_temp08-01 system_temp08 ENABLED ACTIVE 31454016 CONCAT  -        RW
sd disk05-29    system_temp08-01 disk05 111874944 31454016 0      c2t5d0   ENA

v  tempdb       -            ENABLED  ACTIVE   2396160  SELECT    -        fsgen
pl tempdb-01    tempdb       ENABLED  ACTIVE   2401536  STRIPE    2/128    RW
sd disk02-18    tempdb-01    disk02   76055424 1200768  0/0       c2t2d0   ENA
sd disk03-18    tempdb-01    disk03   76055424 1200768  1/0       c2t3d0   ENA
pl tempdb-02    tempdb       ENABLED  ACTIVE   2401536  STRIPE    2/128    RW
sd disk04-10    tempdb-02    disk04   15559104 1200768  0/0       c2t4d0   ENA
sd disk05-10    tempdb-02    disk05   15559104 1200768  1/0       c2t5d0   ENA

 

bash-2.05# tail -f /var/adm/messages
Dec 29 18:46:36 vxio: [ID 771159 kern.warning] WARNING: VxVM vxio V-5-0-2 Subdisk disk04-28 block 6654400: Uncorrectable read error
Dec 29 18:47:27  scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b7335,0 (ssd8):
Dec 29 18:47:27   SCSI transport failed: reason 'timeout': giving up
Dec 29 18:47:27  scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b7335,0 (ssd8):
Dec 29 18:47:27   Error for Command: read(10)                Error Level: Retryable
Dec 29 18:47:27  scsi: [ID 107833 kern.notice]    Requested Block: 87105408                  Error Block: 87105408
Dec 29 18:47:27  scsi: [ID 107833 kern.notice]    Vendor: SEAGATE                            Serial Number: 0221K1PV72
Dec 29 18:47:27  scsi: [ID 107833 kern.notice]    Sense Key: Unit Attention
Dec 29 18:47:27  scsi: [ID 107833 kern.notice]    ASC: 0x29 (<vendor unique code 0x29>), ASCQ: 0x3, FRU: 0x4
Dec 29 18:55:47  sshd[12440]: [ID 530472 auth.error] Kerberos mechanism library initialization error: No profile file open.

 

bash-2.05# echo |format
Searching for disks...done


AVAILABLE DISK SELECTIONS:
       0. c1t0d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b6dca,0
       1. c1t1d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b74be,0
       2. c1t2d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b72e8,0
       3. c1t3d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b72ac,0
       4. c1t4d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b7335,0
       5. c1t5d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@8,600000/SUNW,qlc@2/fp@0,0/ssd@w21000004cf9b72ab,0
       6. c2t0d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@9,700000/pci@2/SUNW,qlc@4/fp@0,0/ssd@w22000004cf9b6dca,0
       7. c2t1d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@9,700000/pci@2/SUNW,qlc@4/fp@0,0/ssd@w22000004cf9b74be,0
       8. c2t2d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@9,700000/pci@2/SUNW,qlc@4/fp@0,0/ssd@w22000004cf9b72e8,0
       9. c2t3d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@9,700000/pci@2/SUNW,qlc@4/fp@0,0/ssd@w22000004cf9b72ac,0
      10. c2t4d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@9,700000/pci@2/SUNW,qlc@4/fp@0,0/ssd@w22000004cf9b7335,0

 

1 ACCEPTED SOLUTION

Accepted Solutions

sunshine_2
Level 4

all these error messages are indicative that one of the disk ( c2t4d0s2  ) is failing or it has BAD BLOCKS.

 

iostat -En|grep -i errors..

 

if you see a lot of HARD and SOFT and TRANSPORT errors  , plan for a disk replacement.

View solution in original post

3 REPLIES 3

sunshine_2
Level 4

all these error messages are indicative that one of the disk ( c2t4d0s2  ) is failing or it has BAD BLOCKS.

 

iostat -En|grep -i errors..

 

if you see a lot of HARD and SOFT and TRANSPORT errors  , plan for a disk replacement.

Jay008
Level 3
Certified

Hi Sunshine,

Happy new year...am very happy to see you after long time.thanks for ur reply.i just want your mail id .because i ve prepared some docs for disk replacement.i can't paste here.so please let me know. i need your help always....  thanking you....

 

-JAY

Gaurav_S
Moderator
Moderator
   VIP    Certified

Hello Jay,

 

Sunshine is correct, check the iostat for hard/transport errors, if you find them growing, call for replacement of disk drive.

 

Failing flag appears on a disk when a disk had write error (which you can see in messages), Its an indicator flag. You can manually run "vxedit -g <dg> set failing=off <disk>" & can remove failing flag for timebeing, but in case flag reappears then u should go for disk replacement.

 

Thanks

Gaurav