I/O fencing disk test
Hello,

While testing the disk to be used for fencing prior to configuration, I am prompted for the password on both nodes over and over again. Kindly advise.

vxfentsthdw -r -w /dev/hdiskpower2 -n use /usr/bin/rsh

VERITAS vxfentsthdw version 5.1.1.0 AIX

The utility vxfentsthdw works on the two nodes of the cluster. The utility verifies that the shared storage one intends to use is configured to support I/O fencing. It issues a series of vxfenadm commands to setup SCSI-3 registrations on the disk, verifies the registrations on the disk, and removes the registrations from the disk.

The logfile generated for vxfentsthdw is /var/VRTSvcs/log/vxfen/vxfentsthdw.log.1

Enter the first node of the cluster: ibwebprod
root@ibwebprod's password:
Enter the second node of the cluster: ibwebstdby
root@ibwebstdby's password:
root@ibwebprod's password:

VCS 5.1 - DG import fails to the new node
Hi,

I recently added a new node to a cluster and it joined the cluster with no problem, but when I try to import a DG on the new node, the import fails. Note that this DG is available and can be imported on a different node.

Here is the log from when it WORKED on an existing node:

2014/01/16 00:49:42 VCS NOTICE V-16-1-10301 Initiating Online of Resource md_DG (Owner: unknown, Group: mdm) on System D-node6
2014/01/16 00:49:44 VCS WARNING V-16-10001-1014 (D-node6) DiskGroup:md_DG:online:Diskgroups will be imported without reservations
2014/01/16 00:49:46 VCS NOTICE V-16-10001-1009 (D-node6) DiskGroup:md_DG:online:vxdg import succeeded on Disk Group md_DG
2014/01/16 00:49:47 VCS NOTICE V-16-10001-1010 (D-node6) DiskGroup:md_DG:online:Volumes in Disk Group md_DG are started. Any mirrors are updated in background

And here is the log from when it FAILED on the new node:

2014/01/16 00:45:11 VCS NOTICE V-16-1-10301 Initiating Online of Resource md_DG (Owner: unknown, Group: mdm) on System d-node9
2014/01/16 00:45:12 VCS WARNING V-16-10001-1014 (d-node9) DiskGroup:md_DG:online:Diskgroups will be imported without reservations
2014/01/16 00:45:12 VCS WARNING V-16-10001-1016 (d-node9) DiskGroup:md_DG:online:vxdg import (clear flag) failed.
2014/01/16 00:45:12 VCS WARNING V-16-10001-1017 (d-node9) DiskGroup:md_DG:online:Trying force import for the diskgroup.
2014/01/16 00:45:12 VCS ERROR V-16-10001-1003 (d-node9) DiskGroup:md_DG:online:** ERROR: vxdg import (force) failed on Disk Group md_DG
2014/01/16 00:45:12 VCS ERROR V-16-10001-1004 (d-node9) DiskGroup:md_DG:online:** ERROR: vxdg import failed on Disk Group md_DG after vxdctl enable
2014/01/16 00:45:13 VCS INFO V-16-2-13716 (d-node9) Resource(mdm_DG): Output of the completed operation (online)
==============================================
VxVM vxdg ERROR V-5-1-10978 Disk group md_DG: import failed: No valid disk found containing disk group
VxVM vxdg ERROR V-5-1-10978 Disk group md_DG: import failed: No valid disk found containing disk group
VxVM vxdg ERROR V-5-1-10978 Disk group md_DG: import failed: No valid disk found containing disk group
==============================================

Looking through older discussions, I tried changing the following tunable before importing, but it still did not work:

# vxdmpadm settune dmp_cache_open=off
Tunable value will be changed immediately
# vxdmpadm gettune all | grep cache
dmp_cache_open off on

I got the same error. The same thing shows up in /var/adm/messages:

Jan 16 00:45:12 d-node9 Had[17566]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1003 (dp-node9) DiskGroup:mdm_DG:online:** ERROR: vxdg import (force) failed on Disk Group mdm_DG
Jan 16 00:45:12 d-node9 vxdmp: [ID 631182 kern.notice] NOTICE: VxVM vxdmp V-5-0-0 removed disk array DISKS, datype = Disk
Jan 16 00:45:12 d-node9 vxdmp: [ID 803759 kern.notice] NOTICE: VxVM vxdmp V-5-0-34 added disk array DISKS, datype = Disk
Jan 16 00:45:12 d-node9 Had[17566]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1004 (dp-node9) DiskGroup:mdm_DG:online:** ERROR: vxdg import failed on Disk Group mdm_DG after vxdctl enable

Any suggestions?

Thanks,
Joao

VCS Cluster not starting.
Hello All,

I am having difficulty getting VCS started on this system. I have attached what I have so far. I would appreciate any comments or suggestions on where to go from here. Thank you.

The hostnames in main.cf correspond to those of the servers.

hastatus -sum
VCS ERROR V-16-1-10600 Cannot connect to VCS engine
VCS WARNING V-16-1-11046 Local system not available

hasys -state
VCS ERROR V-16-1-10600 Cannot connect to VCS engine

hastop -all -force
VCS ERROR V-16-1-10600 Cannot connect to VCS engine

hastart / hastart -onenode
dmesg: Exiting: Another copy of VCS may be running

engine_A.log
2013/10/22 15:16:43 VCS NOTICE V-16-1-11051 VCS engine join version=4.1000
2013/10/22 15:16:43 VCS NOTICE V-16-1-11052 VCS engine pstamp=4.1 03/03/05-14:58:00
2013/10/22 15:16:43 VCS NOTICE V-16-1-10114 Opening GAB library
2013/10/22 15:16:43 VCS NOTICE V-16-1-10619 'HAD' starting on: db1
2013/10/22 15:16:45 VCS INFO V-16-1-10125 GAB timeout set to 15000 ms
2013/10/22 15:17:00 VCS CRITICAL V-16-1-11306 Did not receive cluster membership, manual intervention may be needed for seeding

#gabconfig -a
GAB Port Memberships
===============================================================

#lltstat -nvv
LLT node information:
    Node      State     Link  Status  Address
  * 0 db1     OPEN
                        bge1  UP      00:03:BA:15
                        bge2  UP      00:03:BA:15
    1 db2     CONNWAIT
                        bge1  DOWN
                        bge2  DOWN

bash-2.05$ lltconfig
LLT is running

ps -ef | grep had
root 826 1 0 15:16:43 ? 0:00 /opt/VRTSvcs/bin/had
root 836 1 0 15:16:45 ? 0:00 /opt/VRTSvcs/bin/hashadow

Can't locate object method "QEMU" Error During Upgrade to 6.0.x
Hi all,

I have a two-node VCS cluster running on two KVM virtual machines with RHEL 5.5. The VCS version is currently 5.1 SP1 RP2 and I'm trying to upgrade it to version 6.0.1. Right after I run the installer script I get the following error:

[root@Hostname rhel5_x86_64]# pwd
/shared_data/VCS/dvd1-redhatlinux/rhel5_x86_64
[root@Hostname rhel5_x86_64]# ./installer
Can't locate object method "QEMU" via package "Padv::RHEL5x8664" at /shared_data/VCS/dvd1-redhatlinux/rhel5_x86_64/scripts/EDR/Padv/Linux.pm line 1061.
[root@Hostname rhel5_x86_64]#

Has anyone encountered this error before? I've contacted support, and so far they have only instructed me to approach Red Hat and open a ticket with them.

Any help will be much appreciated.

Thanks,
Yair

need urgent help
While booting, the errors below appear in dmesg:

Sun Oct 14 16:25:41 UTC 2012 Oct 14 14:22:18 rmgdbp09 vxdmp: [ID 917986 kern.notice] NOTICE: VxVM vxdmp V-5-0-112 disabled path 118/0x710 belonging to the dmpnode 286/0xc8 Oct 14 14:23:52 rmgdbp09 vxdmp: [ID 736771 kern.notice] NOTICE: VxVM vxdmp V-5-0-148 enabled path 118/0x710 belonging to the dmpnode 286/0xc8 Oct 14 14:29:16 rmgdbp09 vxdmp: [ID 917986 kern.notice] NOTICE: VxVM vxdmp V-5-0-112 disabled path 118/0x700 belonging to the dmpnode 286/0x50 Oct 14 14:33:18 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,32 (ssd223): Oct 14 14:33:18 rmgdbp09 offline Oct 14 14:33:52 rmgdbp09 vxdmp: [ID 736771 kern.notice] NOTICE: VxVM vxdmp V-5-0-148 enabled path 118/0x700 belonging to the dmpnode 286/0x50 Oct 14 14:38:18 rmgdbp09 vxdmp: [ID 917986 kern.notice] NOTICE: VxVM vxdmp V-5-0-112 disabled path 118/0x6f8 belonging to the dmpnode 286/0x80 Oct 14 14:38:52 rmgdbp09 vxdmp: [ID 736771 kern.notice] NOTICE: VxVM vxdmp V-5-0-148 enabled path 118/0x6f8 belonging to the dmpnode 286/0x80 Oct 14 14:39:45 rmgdbp09 vxesd[141]: [ID 360244 daemon.notice] Event Source daemon started Oct 14 14:39:45 rmgdbp09 syseventd[75]: [ID 617319 daemon.error] SIGHUP caught - reloading modules Oct 14 14:39:46 rmgdbp09 syseventd[75]: [ID 661968 daemon.error] Daemon restarted Oct 14 14:41:14 rmgdbp09 pseudo: [ID 129642 kern.info] pseudo-device: laner0 Oct 14 14:41:14 rmgdbp09 genunix: [ID 936769 kern.info] laner0 is /pseudo/laner@0 Oct 14 14:46:17 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 14:46:17 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 14:49:17 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 14:49:17 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 14:51:17 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING:
/pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 14:51:17 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 14:53:17 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 14:53:17 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 14:56:18 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 14:56:18 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 14:58:21 rmgdbp09 llt: [ID 144861 kern.notice] LLT INFO V-14-1-10009 LLT Protocol available Oct 14 14:58:32 rmgdbp09 lmx: [ID 504452 kern.notice] LMX Multiplexor available Oct 14 14:58:34 rmgdbp09 root: [ID 702911 daemon.error] CAPTURE_UPTIME ERROR: /var/opt/SUNWsrsrp missing Oct 14 14:58:34 rmgdbp09 savecore: [ID 686316 auth.error] no dump device configured Oct 14 14:58:34 rmgdbp09 last message repeated 1 time Oct 14 14:58:34 rmgdbp09 genunix: [ID 454863 kern.info] dump on /dev/dsk/c1t0d0s1 size 32793 MB Oct 14 14:58:34 rmgdbp09 ntpdate[1207]: [ID 558275 daemon.notice] adjust time server 193.88.157.124 offset -0.367765 sec Oct 14 14:58:35 rmgdbp09 pseudo: [ID 129642 kern.info] pseudo-device: tod0 Oct 14 14:58:35 rmgdbp09 genunix: [ID 936769 kern.info] tod0 is /pseudo/tod@0 Oct 14 14:58:35 rmgdbp09 pseudo: [ID 129642 kern.info] pseudo-device: pm0 Oct 14 14:58:35 rmgdbp09 genunix: [ID 936769 kern.info] pm0 is /pseudo/pm@0 Oct 14 14:58:35 rmgdbp09 gab: [ID 872190 kern.notice] GAB INFO V-15-1-20021 GAB available Oct 14 14:58:35 rmgdbp09 gab: [ID 222459 kern.notice] GAB INFO V-15-1-20026 Port a registration waiting for seed port membership Oct 14 14:58:36 rmgdbp09 gab: [ID 222459 kern.notice] GAB INFO V-15-1-20026 Port d registration waiting for seed port membership Oct 14 14:58:36 rmgdbp09 llt: [ID 860062 kern.notice] LLT INFO V-14-1-10024 link 1 (ce5) node 0 active Oct 14 14:58:36 rmgdbp09 llt: [ID 860062 
kern.notice] LLT INFO V-14-1-10024 link 0 (ce1) node 0 active Oct 14 14:58:36 rmgdbp09 xntpd[1314]: [ID 702911 daemon.notice] xntpd 3-5.93e Mon Sep 20 15:47:11 PDT 1999 (1) Oct 14 14:58:36 rmgdbp09 xntpd[1314]: [ID 301315 daemon.notice] tickadj = 5, tick = 10000, tvu_maxslew = 495, est. hz = 100 Oct 14 14:58:37 rmgdbp09 xntpd[1314]: [ID 798731 daemon.notice] using kernel phase-lock loop 0041 Oct 14 14:58:37 rmgdbp09 last message repeated 1 time Oct 14 14:58:40 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port d gen 2d7c05 membership 01 Oct 14 14:58:40 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port a gen 2d7c06 membership 01 Oct 14 15:00:01 rmgdbp09 vcsmm: [ID 743917 kern.notice] VCS RAC ERROR V-10-1-15013 vcsmm_ioctl: driver not configured Oct 14 15:07:12 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 15:07:12 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 15:15:00 rmgdbp09 vcsmm: [ID 743917 kern.notice] VCS RAC ERROR V-10-1-15013 vcsmm_ioctl: driver not configured Oct 14 15:22:17 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 15:22:17 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 15:30:00 rmgdbp09 vcsmm: [ID 743917 kern.notice] VCS RAC ERROR V-10-1-15013 vcsmm_ioctl: driver not configured Oct 14 15:32:17 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 15:32:17 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 15:38:17 rmgdbp09 scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/SUNW,qlc@1/fp@0,0/ssd@w50060e8006cfa802,0 (ssd226): Oct 14 15:38:17 rmgdbp09 SCSI transport failed: reason 'timeout': giving up Oct 14 15:45:00 rmgdbp09 vcsmm: [ID 743917 kern.notice] VCS RAC ERROR V-10-1-15013 vcsmm_ioctl: driver not configured Oct 14 16:00:00 rmgdbp09 vcsmm: 
[ID 743917 kern.notice] VCS RAC ERROR V-10-1-15013 vcsmm_ioctl: driver not configured Oct 14 16:08:22 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port b gen 2d7c0a membership 01 Oct 14 16:08:22 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port o gen 2d7c09 membership 01 Oct 14 16:08:22 rmgdbp09 vcsmm: [ID 357760 kern.notice] VCS RAC INFO V-10-1-15047 mmpl_reconfig_ioctl: dev_ioctl failed, vxfen may not be configured Oct 14 16:08:22 rmgdbp09 vxfen: [ID 725375 kern.notice] NOTICE: VCS FEN INFO V-11-1-35 Fencing driver going into RUNNING state Oct 14 16:08:42 rmgdbp09 snmpXdmid: [ID 723131 daemon.error] Error in Adding Row for Subscription Table Entry Oct 14 16:08:42 rmgdbp09 snmpXdmid: [ID 132663 daemon.error] Failed to add filter to SP for Event delivery Oct 14 16:09:49 rmgdbp09 syslog[4934]: [ID 702911 daemon.notice] VCS INFO V-16-1-11240 Command Server: running with security OFF Oct 14 16:09:49 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-10619 'HAD' starting on: rmgdbp09 Oct 14 16:09:50 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-10620 Waiting for local cluster configuration status Oct 14 16:09:51 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-10624 Local cluster configuration stale Oct 14 16:09:51 rmgdbp09 last message repeated 1 time Oct 14 16:09:51 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-11034 Registering for cluster membership Oct 14 16:09:51 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-11035 Waiting for cluster membership Oct 14 16:09:56 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port h gen 2d7c19 membership 01 Oct 14 16:09:56 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS INFO V-16-1-10077 Received new cluster membership Oct 14 16:09:56 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-10086 System (Node '0') is in Regular Membership - Membership: 0x3 Oct 14 16:09:56 rmgdbp09 Had[4933]: [ID 702911 
daemon.notice] VCS NOTICE V-16-1-10086 System rmgdbp09 (Node '1') is in Regular Membership - Membership: 0x3 Oct 14 16:09:56 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-10075 Building from remote system Oct 14 16:09:58 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:09:58 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-10066 Entering RUNNING state Oct 14 16:09:58 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS NOTICE V-16-1-50311 VCS Engine: running with security OFF Oct 14 16:10:00 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:10:04 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:10:09 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port u gen 2d7c1b membership ;1 Oct 14 16:10:09 rmgdbp09 gab: [ID 674723 kern.notice] GAB INFO V-15-1-20038 Port u gen 2d7c1b k_jeopardy 0 Oct 14 16:10:09 rmgdbp09 gab: [ID 513393 kern.notice] GAB INFO V-15-1-20040 Port u gen 2d7c1b visible 0 Oct 14 16:10:14 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port v gen 2d7c1a membership 01 Oct 14 16:10:15 rmgdbp09 pseudo: [ID 129642 kern.info] pseudo-device: devinfo0 Oct 14 16:10:15 rmgdbp09 genunix: [ID 936769 kern.info] devinfo0 is /pseudo/devinfo@0 Oct 14 16:10:20 rmgdbp09 vxvm:vxconfigd: [ID 456610 daemon.notice] V-5-1-7900 CVM_VOLD_CONFIG command received Oct 14 16:10:20 rmgdbp09 vxvm:vxconfigd: [ID 699813 daemon.notice] V-5-1-7899 CVM_VOLD_CHANGE command received Oct 14 16:10:20 rmgdbp09 vxvm:vxconfigd: [ID 322665 daemon.notice] V-5-1-7961 establishing cluster Oct 14 16:10:25 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port w gen 2d7c20 membership 01 Oct 14 16:10:31 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT 
returned 0 Oct 14 16:11:16 rmgdbp09 last message repeated 40 times Oct 14 16:11:25 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(srvm_vol_dg) - monitor procedure did not complete within the expected time. Oct 14 16:11:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(RMAN_VOL01) - monitor procedure did not complete within the expected time. Oct 14 16:11:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPUB_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:11:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPTL_VOL_oracle) - monitor procedure did not complete within the expected time. Oct 14 16:11:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPTL_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:11:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PAVS_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:11:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPUB_VOL_oracle) - monitor procedure did not complete within the expected time. Oct 14 16:11:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PAVS_VOL_oracle) - monitor procedure did not complete within the expected time. 
Oct 14 16:13:35 rmgdbp09 vxio: [ID 535583 kern.warning] WARNING: VxVM vxio V-5-0-164 Failed to join cluster rmgprod, aborting Oct 14 16:13:35 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port u closed Oct 14 16:13:35 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port v closed Oct 14 16:13:35 rmgdbp09 vxvm:vxconfigd: [ID 955251 daemon.error] V-5-1-8765 cluster join for node 1 failed: Connection timed out Oct 14 16:13:39 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port w closed Oct 14 16:13:39 rmgdbp09 vxvm:vxconfigd: [ID 741513 daemon.notice] V-5-1-7901 CVM_VOLD_STOP command received Oct 14 16:13:39 rmgdbp09 vxvm:vxconfigd: [ID 778436 daemon.error] V-5-1-4109 -1 returned from volcvm_establish Oct 14 16:13:39 rmgdbp09 vxvm:vxconfigd: [ID 886039 daemon.warning] V-5-1-4852 cluster_establish: timed out Oct 14 16:13:39 rmgdbp09 vxvm:vxconfigd: [ID 453237 daemon.error] V-5-1-11178 kernel_fail_join() : join timed out during reconfiguration (12, -1) Oct 14 16:13:39 rmgdbp09 vxvm:vxconfigd: [ID 565473 daemon.notice] V-5-1-9543 Timeout is not reset: another reconfig in progress Oct 14 16:14:10 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:14:10 rmgdbp09 reason: request to join failed Oct 14 16:15:00 rmgdbp09 vcsmm: [ID 832650 kern.notice] VCSMM: mm_deblog_sz = 1048576 Oct 14 16:15:00 rmgdbp09 vcsmm: [ID 706457 kern.notice] VCSMM: mm_msglog_sz = 128 Oct 14 16:15:00 rmgdbp09 vcsmm: [ID 838746 kern.notice] VCSMM: mm_slave_max = 1024 Oct 14 16:15:10 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:15:10 rmgdbp09 reason: request to join failed Oct 14 16:15:12 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13066 (rmgdbp09) Agent is calling clean for resource(cvm_clus) because the resource is not up even after online completed. 
Oct 14 16:15:13 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:15:17 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port u gen 2d7c22 membership ;1 Oct 14 16:15:17 rmgdbp09 gab: [ID 674723 kern.notice] GAB INFO V-15-1-20038 Port u gen 2d7c22 k_jeopardy 0 Oct 14 16:15:17 rmgdbp09 gab: [ID 513393 kern.notice] GAB INFO V-15-1-20040 Port u gen 2d7c22 visible 0 Oct 14 16:15:22 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port v gen 2d7c1c membership 01 Oct 14 16:15:28 rmgdbp09 vxvm:vxconfigd: [ID 456610 daemon.notice] V-5-1-7900 CVM_VOLD_CONFIG command received Oct 14 16:15:28 rmgdbp09 vxvm:vxconfigd: [ID 699813 daemon.notice] V-5-1-7899 CVM_VOLD_CHANGE command received Oct 14 16:15:28 rmgdbp09 vxvm:vxconfigd: [ID 322665 daemon.notice] V-5-1-7961 establishing cluster Oct 14 16:15:33 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port w gen 2d7c22 membership 01 Oct 14 16:15:39 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:16:44 rmgdbp09 last message repeated 23 times Oct 14 16:16:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PAVS_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:16:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPTL_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:16:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPUB_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:16:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPTL_VOL_oracle) - monitor procedure did not complete within the expected time. 
Oct 14 16:16:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(RMAN_VOL01) - monitor procedure did not complete within the expected time. Oct 14 16:16:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPUB_VOL_oracle) - monitor procedure did not complete within the expected time. Oct 14 16:16:45 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PAVS_VOL_oracle) - monitor procedure did not complete within the expected time. Oct 14 16:16:52 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:17:02 rmgdbp09 last message repeated 16 times Oct 14 16:17:25 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(srvm_vol_dg) - monitor procedure did not complete within the expected time. Oct 14 16:18:43 rmgdbp09 vxio: [ID 535583 kern.warning] WARNING: VxVM vxio V-5-0-164 Failed to join cluster rmgprod, aborting Oct 14 16:18:43 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port u closed Oct 14 16:18:43 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port v closed Oct 14 16:18:43 rmgdbp09 vxvm:vxconfigd: [ID 955251 daemon.error] V-5-1-8765 cluster join for node 1 failed: Connection timed out Oct 14 16:18:47 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port w closed Oct 14 16:18:47 rmgdbp09 vxvm:vxconfigd: [ID 741513 daemon.notice] V-5-1-7901 CVM_VOLD_STOP command received Oct 14 16:18:47 rmgdbp09 vxvm:vxconfigd: [ID 778436 daemon.error] V-5-1-4109 -1 returned from volcvm_establish Oct 14 16:18:47 rmgdbp09 vxvm:vxconfigd: [ID 886039 daemon.warning] V-5-1-4852 cluster_establish: timed out Oct 14 16:18:47 rmgdbp09 vxvm:vxconfigd: [ID 453237 daemon.error] V-5-1-11178 kernel_fail_join() : join timed out during reconfiguration (12, -1) Oct 14 16:18:47 rmgdbp09 vxvm:vxconfigd: [ID 565473 daemon.notice] V-5-1-9543 
Timeout is not reset: another reconfig in progress Oct 14 16:19:18 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:19:18 rmgdbp09 reason: request to join failed Oct 14 16:20:18 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:20:18 rmgdbp09 reason: request to join failed Oct 14 16:20:19 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13066 (rmgdbp09) Agent is calling clean for resource(cvm_clus) because the resource is not up even after online completed. Oct 14 16:20:20 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:20:25 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port u gen 2d7c24 membership ;1 Oct 14 16:20:25 rmgdbp09 gab: [ID 674723 kern.notice] GAB INFO V-15-1-20038 Port u gen 2d7c24 k_jeopardy 0 Oct 14 16:20:25 rmgdbp09 gab: [ID 513393 kern.notice] GAB INFO V-15-1-20040 Port u gen 2d7c24 visible 0 Oct 14 16:20:30 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port v gen 2d7c1e membership 01 Oct 14 16:20:36 rmgdbp09 vxvm:vxconfigd: [ID 456610 daemon.notice] V-5-1-7900 CVM_VOLD_CONFIG command received Oct 14 16:20:36 rmgdbp09 vxvm:vxconfigd: [ID 699813 daemon.notice] V-5-1-7899 CVM_VOLD_CHANGE command received Oct 14 16:20:36 rmgdbp09 vxvm:vxconfigd: [ID 322665 daemon.notice] V-5-1-7961 establishing cluster Oct 14 16:20:41 rmgdbp09 gab: [ID 316943 kern.notice] GAB INFO V-15-1-20036 Port w gen 2d7c24 membership 01 Oct 14 16:20:47 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:21:44 rmgdbp09 last message repeated 20 times Oct 14 16:21:46 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPTL_VOL_oracle) - monitor procedure did not complete within the 
expected time. Oct 14 16:21:46 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPUB_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:21:46 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPUB_VOL_oracle) - monitor procedure did not complete within the expected time. Oct 14 16:21:46 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PAVS_VOL_oracle) - monitor procedure did not complete within the expected time. Oct 14 16:21:46 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PPTL_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:21:46 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(PAVS_VOL_arch) - monitor procedure did not complete within the expected time. Oct 14 16:21:46 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(RMAN_VOL01) - monitor procedure did not complete within the expected time. Oct 14 16:21:50 rmgdbp09 vxfen: [ID 214757 kern.notice] NOTICE: VCS FEN INFO V-11-1-34 The ioctl VXFEN_IOC_CLUSTSTAT returned 0 Oct 14 16:22:10 rmgdbp09 last message repeated 19 times Oct 14 16:22:24 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13027 (rmgdbp08) Resource(srvm_vol_dg) - monitor procedure did not complete within the expected time. 
Oct 14 16:23:20 rmgdbp09 sshd[8133]: [ID 800047 auth.error] error: PAM: System error for illegal user sumitsg from 193.88.157.75 Oct 14 16:23:25 rmgdbp09 last message repeated 1 time Oct 14 16:23:29 rmgdbp09 sshd[8180]: [ID 800047 auth.error] error: ssh_msg_send: write Oct 14 16:23:51 rmgdbp09 vxio: [ID 535583 kern.warning] WARNING: VxVM vxio V-5-0-164 Failed to join cluster rmgprod, aborting Oct 14 16:23:51 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port u closed Oct 14 16:23:51 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port v closed Oct 14 16:23:51 rmgdbp09 vxvm:vxconfigd: [ID 955251 daemon.error] V-5-1-8765 cluster join for node 1 failed: Connection timed out Oct 14 16:23:55 rmgdbp09 gab: [ID 397130 kern.notice] GAB INFO V-15-1-20032 Port w closed Oct 14 16:23:55 rmgdbp09 vxvm:vxconfigd: [ID 741513 daemon.notice] V-5-1-7901 CVM_VOLD_STOP command received Oct 14 16:23:55 rmgdbp09 vxvm:vxconfigd: [ID 778436 daemon.error] V-5-1-4109 -1 returned from volcvm_establish Oct 14 16:23:55 rmgdbp09 vxvm:vxconfigd: [ID 886039 daemon.warning] V-5-1-4852 cluster_establish: timed out Oct 14 16:23:55 rmgdbp09 vxvm:vxconfigd: [ID 453237 daemon.error] V-5-1-11178 kernel_fail_join() : join timed out during reconfiguration (12, -1) Oct 14 16:23:55 rmgdbp09 vxvm:vxconfigd: [ID 565473 daemon.notice] V-5-1-9543 Timeout is not reset: another reconfig in progress Oct 14 16:24:26 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:24:26 rmgdbp09 reason: request to join failed Oct 14 16:25:26 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:25:26 rmgdbp09 reason: request to join failed Oct 14 16:25:27 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-13066 (rmgdbp09) Agent is calling clean for resource(cvm_clus) because the resource is not up even after online 
completed. Oct 14 16:25:28 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-1005 (rmgdbp09) CVMCluster:???:monitor:node - state: out of cluster Oct 14 16:25:28 rmgdbp09 reason: request to join failed Oct 14 16:25:29 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-10303 Resource cvm_clus (Owner: unknown, Group: cvm) is FAULTED (timed out) on sys rmgdbp09 Oct 14 16:25:30 rmgdbp09 arp: [ID 994748 kern.notice] ar_query_xmit: Could not find the ace Oct 14 16:25:30 rmgdbp09 last message repeated 1 time Oct 14 16:25:31 rmgdbp09 Had[4933]: [ID 702911 daemon.notice] VCS ERROR V-16-1-10205 Group cvm is faulted on system rmgdbp09

root@rmgdbp09 #

The server came up; however, no service group is online and it is stuck at the cvm service group.

root@rmgdbp09 # /etc/vx/bin/vxclustadm nodestate
state: out of cluster
reason: request to join failed
root@rmgdbp09 #
root@rmgdbp09

root@rmgdbp08 # gabconfig -a
GAB Port Memberships
===============================================================
Port a gen 2d7c06 membership 01
Port b gen 2d7c0a membership 01
Port d gen 2d7c05 membership 01
Port f gen 2d7c18 membership 0
Port f gen 2d7c18 visible ;1
Port h gen 2d7c19 membership 01
Port o gen 2d7c09 membership 01
Port q gen 2d7c16 membership 0
Port q gen 2d7c16 visible ;1
Port v gen 2d7c1f membership 0
Port v gen 2d7c1f visible ;1
Port w gen 2d7c25 membership 0
Port w gen 2d7c25 visible ;1
root@rmgdbp08 #

Any clue from anybody?

ISCSI reboot problem with Veritas Cluster Server
Hi,

I have a problem with iSCSI and VCS, specifically on RHEL. As you know, when you reboot a system that uses iSCSI, the device names usually change (for example, /dev/sdf becomes /dev/sdm), so I found a way to work around this using the UUID (not the cluster UUID). That is the ID you get from the blkid command.

In VCS I have configured the BlockDevice attribute with /dev/sdX. I have been looking for documentation on whether I can configure a UUID instead of /dev/sdX, but I cannot find any reference to it, so please tell me how I should configure a UUID in VCS.

Thanks for your help!

LLT WARNING V-14-1-10498
Hi,

I have seen this error in the Solaris log and I want to find out why it is happening:

Jun 30 19:07:07 dp-node9 llt: [ID 216203 kern.notice] LLT WARNING V-14-1-10498 recvarpack cross links? links 0 and 1 saw the same peer link number 1 for node 1
Jun 30 19:12:07 dp-node9 last message repeated 1 time

Here are some considerations:

1- Each link of the HB network is connected to a different switch, although both are in the same VLAN.
2- The other nodes we have use exactly the same configuration, and this error does not show up on them.

Thanks,
Paula

need a solution
We have a 2-node cluster running version 5.1. We experienced an outage, and I think it was due to the error messages below. Can someone shed some light on these messages?

qlc: [ID 630585 kern.info] NOTICE: Qlogic qlc(1): Loop OFFLINE
qlc: [ID 630585 kern.info] NOTICE: Qlogic qlc(1): Loop ONLINE
fctl: [ID 999315 kern.warning] WARNING: fctl(4): AL_PA=0xe8 doesn't exist in LILP map
scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/pci@9/SUNW,qlc@0/fp@0,0/ssd@w203400a0b875f9d9,0 (ssd3): Command failed to complete...Device is gone
scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/pci@9/SUNW,qlc@0/fp@0,0/ssd@w203400a0b875f9d9,0 (ssd3): Command failed to complete...Device is gone
scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/pci@9/SUNW,qlc@0/fp@0,0/ssd@w203400a0b875f9d9,0 (ssd3): Command failed to complete...Device is gone
scsi: [ID 243001 kern.info] /pci@0,600000/pci@0/pci@9/SUNW,qlc@0/fp@0,0 (fcp4): offlining lun=0 (trace=0), target=e8 (trace=2800004)
vxdmp: [ID 631182 kern.notice] NOTICE: VxVM vxdmp V-5-0-0 removed disk array 600A0B800075F9D9000000004D2334F5, datype = ST2540-
vxdmp: [ID 443116 kern.notice] NOTICE: VxVM vxdmp V-5-0-0 i/o error occured (errno=0x6) on dmpnode 334/0x2c
last message repeated 59 times
vxdmp: [ID 480808 kern.notice] NOTICE: VxVM vxdmp V-5-0-112 disabled path 118/0x18 belonging to the dmpnode 334/0x28 due to open failure
vxdmp: [ID 824220 kern.notice] NOTICE: VxVM vxdmp V-5-0-111 disabled dmpnode 334/0x28

What does dmpnode 334/0x28 signify? I have forgotten how to map this to a device; all I remember is that it is in hexadecimal.

Also, what could be the cause? Is it the HBA, given that the issue starts with messages like the ones below?

qlc: [ID 630585 kern.info] NOTICE: Qlogic qlc(1): Loop OFFLINE
qlc: [ID 630585 kern.info] NOTICE: Qlogic qlc(1): Loop ONLINE
fctl: [ID 999315 kern.warning] WARNING: fctl(4): AL_PA=0xe8 doesn't exist in LILP map

Online config LLT interface
Recently I realized that an LLT interface in a cluster is not working because the nodes are in different VLANs (the interface uses the broadcast configuration):

Node 0:

NODE0:~ # cat /etc/llttab
set-node NODE0
set-cluster 1047
set-timer peerinact:3200
link eth2 eth-00:10:18:0b:7e:12 - ether - -
link eth3 eth-00:10:18:0b:7e:13 - ether - -
link-lowpri eth0 eth-00:14:5e:7b:08:1a - ether - -

NODE0:~ # ifconfig eth0
eth0 Link encap:Ethernet HWaddr 00:14:5E:7B:08:1A
inet addr:10.92.5.134 Bcast:10.92.5.255 Mask:255.255.254.0

Node 1:

NODE1:~ # cat /etc/llttab
set-node NODE1
set-cluster 1047
set-timer peerinact:3200
link eth2 eth-00:0e:0c:ba:3e:ae - ether - -
link eth3 eth-00:0e:0c:ba:41:96 - ether - -
link-lowpri eth0 eth-00:14:5e:7a:aa:3c - ether - -

NODE1:~ # ifconfig eth0
eth0 Link encap:Ethernet HWaddr 00:14:5E:7A:AA:3C
inet addr:10.93.146.123 Bcast:10.93.147.255 Mask:255.255.254.0

I think I should configure eth0 as a UDP interface, shouldn't I? I tried something like the following and it seemed to work:

Node 0:

//Specifying the other node IP as the broadcast address
lltconfig -t eth0 -d eth0 -b udp -I 10.92.5.134 -B 10.93.146.124

Node 1:

//Specifying the other node IP as the broadcast address
lltconfig -t eth0 -d eth0 -b udp -I 10.93.146.123 -B 10.92.5.134

Is this correct? What should I define in the llttab file?

Regards,
joagmv
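As a quick sanity check on the addresses quoted in the post above, here is a minimal Python sketch (not part of the original post) that uses the standard ipaddress module to confirm that the two eth0 addresses sit in different IP subnets and to recompute the broadcast addresses that ifconfig reports. The helper name same_subnet is hypothetical, chosen only for illustration. The sketch only does the subnet arithmetic; it says nothing about how LLT itself should be configured.

```python
import ipaddress


def same_subnet(addr_a: str, addr_b: str, netmask: str) -> bool:
    """Return True when both addresses fall in the same network for the given mask.

    Hypothetical helper for illustration; not part of any VCS/LLT tooling.
    """
    net_a = ipaddress.ip_interface(f"{addr_a}/{netmask}").network
    net_b = ipaddress.ip_interface(f"{addr_b}/{netmask}").network
    return net_a == net_b


# Address and mask of eth0 on each node, copied from the ifconfig output in the post.
node0 = ipaddress.ip_interface("10.92.5.134/255.255.254.0")
node1 = ipaddress.ip_interface("10.93.146.123/255.255.254.0")

# Print each node's network and its broadcast address.
print(node0.network, node0.network.broadcast_address)
print(node1.network, node1.network.broadcast_address)

# Check whether the two nodes share an IP subnet.
print(same_subnet("10.92.5.134", "10.93.146.123", "255.255.254.0"))
```

The computed broadcast addresses can be compared with the Bcast values in the ifconfig output, and the final line shows whether the two nodes share a subnet, which is the situation the post describes as the nodes being in different VLANs.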