VCS cannot startup
Hi All , The enviornment is configured two node form the active / passive cluster, i have maintenance for active node , switch to passive node to online cluster, but check the status is in parital, I have no idea what happen on the issue. Can you please advice how to fix it ? root@devuaebms42 # gabconfig -a GAB Port Memberships =============================================================== Port a gen 63750d membership 01 Port h gen 63750b membership ;1 Port h gen 63750b visible 0 ^Croot@devuaebms42 # hastatus -sum -- SYSTEM STATE -- System State Frozen A devuaebms41 EXITED 0 A devuaebms42 RUNNING 0 -- GROUP STATE -- Group System Probed AutoDisabled State B cf_bms_sg_01 devuaebms41 Y Y OFFLINE B cf_bms_sg_01 devuaebms42 Y N PARTIAL Many thanks, HongSolved2.8KViews0likes9CommentsVCS Cluster not starting.
Hello All, I am having difficulties trying to get VCS started on this system. I have attached what I have got so far. I apperciate any comments or suggestions as to go from here. Thank you The hostnames in the main.cf corrosponds to that of the servers. hastatus -sum VCS ERROR V-16-1-10600 Cannot connect to VCS engine VCS WARNING V-16-1-11046 Local system not available hasys -state VCS ERROR V-16-1-10600 Cannot connect to VCS engine hastop -all -force VCS ERROR V-16-1-10600 Cannot connect to VCS engine hastart / hastart -onenode dmesg: Exiting: Another copy of VCS may be running engine_A.log 2013/10/22 15:16:43 VCS NOTICE V-16-1-11051 VCS engine join version=4.1000 2013/10/22 15:16:43 VCS NOTICE V-16-1-11052 VCS engine pstamp=4.1 03/03/05-14:58:00 2013/10/22 15:16:43 VCS NOTICE V-16-1-10114 Opening GAB library 2013/10/22 15:16:43 VCS NOTICE V-16-1-10619 'HAD' starting on: db1 2013/10/22 15:16:45 VCS INFO V-16-1-10125 GAB timeout set to 15000 ms 2013/10/22 15:17:00 VCS CRITICAL V-16-1-11306 Did not receive cluster membership, manual intervention may be needed for seeding #gabconfig -a GAB Port Memberships =============================================================== #lltstat -nvv LLT node information: Node State Link Status Address * 0 db1 OPEN bge1 UP 00:03:BA:15 bge2 UP 00:03:BA:15 1 db2 CONNWAIT bge1 DOWN bge2 DOWN bash-2.05$ lltconfig LLT is running ps -ef | grep had root 826 1 0 15:16:43 ? 0:00 /opt/VRTSvcs/bin/had root 836 1 0 15:16:45 ? 0:00 /opt/VRTSvcs/bin/hashadowSolved18KViews3likes4CommentsVCS Cluster not starting.
Hi I am facing problem while trying to start VCS . From LOG : ============================================================== tail /var/VRTSvcs/log/engine_A.log 2014/01/13 21:39:14 VCS NOTICE V-16-1-11050 VCS engine version=5.1 2014/01/13 21:39:14 VCS NOTICE V-16-1-11051 VCS engine join version=5.1.00.0 2014/01/13 21:39:14 VCS NOTICE V-16-1-11052 VCS engine pstamp=Veritas-5.1-10/06/09-14:37:00 2014/01/13 21:39:14 VCS INFO V-16-1-10196 Cluster logger started 2014/01/13 21:39:14 VCS NOTICE V-16-1-10114 Opening GAB library 2014/01/13 21:39:14 VCS NOTICE V-16-1-10619 ‘HAD’ starting on: nsscls01 2014/01/13 21:39:16 VCS INFO V-16-1-10125 GAB timeout set to 30000 ms 2014/01/13 21:39:16 VCS NOTICE V-16-1-11057 GAB registration monitoring timeout set to 200000 ms 2014/01/13 21:39:16 VCS NOTICE V-16-1-11059 GAB registration monitoring action set to log system message 2014/01/13 21:39:31 VCS CRITICAL V-16-1-11306 Did not receive cluster membership, manual intervention may be needed for seeding ============================================================================================= root@nsscls01# hastatus -sum VCS ERROR V-16-1-10600 Cannot connect to VCS engine VCS WARNING V-16-1-11046 Local system not available Please advice how can I start the VCS.Solved16KViews2likes11Commentsunsuccessful cluster failover occured because of nic faulted
Hello, Platform: solaris 11 Logs: Jun 23 16:36:49 nodeA Had[5211]: [ID 702911 daemon.notice] VCS ERROR V-16-1-54031 Resource csgnic (Owner: Unspecified, Group: ClusterService) is FAULTED on sys nodeA Jun 23 16:37:49 nodeA Had[5211]: [ID 702911 daemon.notice] VCS ERROR V-16-1-54031 Resource nic_proxy_aggr1 (Owner: Unspecified, Group: oracle) is FAULTED on sys nodeA Question 1) I need more detail about the problem. I tried to check /var/log/messages, /var/adm/messages, /var/fm/fmd/*files and I can’t see anything related with this error. Which logs should be checked on the solaris 11 system for this situation? Question 2) What kind of method do you advise to investigate the nic problem forgetting more information on platform? Question 3) What kind of configuration should I do for handling with nic failures?Solved3.3KViews1like7CommentsSolaris, live migragtion on failover instead of stop/start of LDOMs
Hi all, I have the following setup: Solaris 11.3 SPARC LDM 3.3 VCS 7.1 3x SPARC T5-2 Servers 2x I/O domains per system From Visualization Guide about Failover scenarios: Domain state: Control domain Alternate I/O VCS behavior Up Up No fail over Up Down No fail over Down Up Fail over* Down Down Fail over** * VCS behavior would be “No fail over” with service group in auto-disabled state if the LDom resource attribute DomainFailurePolicy for the control domain is set to “ignore” and the LDom service group attribute SysDownPolicy is set to “AutoDisableNoOffline”. ** VCS behavior would be “No fail over” with service group in auto-disabled state if the LDom resource attribute DomainFailurePolicy for the control and other I/O domain is set to “ignore” and the LDom service group attribute SysDownPolicy is set to “AutoDisableNoOffline” So far so good, but VCS doesn't triggers the Live Migration, instead Logical Domain stops on one system and starts on another. Is here the possibility to change the behavior from stop/start to migrate? AFAIK, the Oracle (Sun) Cluster can do that afterclrs set -p Migration_type=MIGRATE ldom1-rs Thank you!1.2KViews0likes0CommentsMULTINICB resource faulty and not getting cleared
Hi Team, I am seeing MultinicB resource fault as shown below D Ossfs Proxy ossfs_p1 et-coreg-admin2 D PubLan MultiNICB pub_mnic et-coreg-admin2 D Sybase1 Proxy syb1_p1 et-coreg-admin2 Pub_mnic is faulted and in turn proxy resources that mirror the status of MUltinICB resources. Below error seen on 3 rd June Jun 3 10:39:17 et-coreg-admin2 in.mpathd[6604]: [ID 168056 daemon.error] All Interfaces in group pub_mnic have failed Jun 3 10:39:18 et-coreg-admin2 Had[6102]: [ID 702911 daemon.notice] VCS ERROR V-16-1-10303 Resource pub_mnic (Owner: Unspecified, Group: PubLan) is FAULTED (timed out) on sys et-coreg-admin2 As of now interfaces seems ok and network is ok. I want to clear this resource but being a Persistent resource it should recover itself once network issue resolved. # ifconfig -a lo0: flags=1001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8131 index 1 inet 117.0.0.1 netmask ff000000 bnxe0: flags=19040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER,FAILED> mtu 1500 index 1 inet 10.106.111.66 netmask ffffff80 broadcast 10.106.111.117 groupname pub_mnic ether 14:58:d0:54:18:18 bnxe0:1: flags=11000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4,FAILED> mtu 1500 index 1 inet 10.106.111.70 netmask ffffff80 broadcast 10.106.111.117 bnxe1: flags=19040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER,FAILED> mtu 1500 index 3 inet 10.106.111.68 netmask ffffff80 broadcast 10.106.111.117 groupname pub_mnic ether 14:58:d0:54:18:1c hares -display pub_mnic #Resource Attribute System Value pub_mnic Group global PubLan pub_mnic Type global MultiNICB pub_mnic AutoStart global 1 pub_mnic Critical global 1 pub_mnic Enabled global 1 pub_mnic LastOnline global admin1 pub_mnic MonitorOnly global 0 pub_mnic ResourceOwner global pub_mnic TriggerEvent global 0 pub_mnic ArgListValues admin1 UseMpathd 1 1 MpathdCommand 1 /usr/lib/inet/in.mpathd ConfigCheck 1 1 MpathdRestart 1 1 Device 4 bnxe0 0 bnxe1 1 NetworkHosts 1 10.106.111.51 LinkTestRatio 1 1 IgnoreLinkStatus 1 1 NetworkTimeout 1 100 OnlineTestRepeatCount 1 3 OfflineTestRepeatCount 1 3 NoBroadcast 1 0 DefaultRouter 1 0.0.0.0 Failback 1 0 GroupName 1 "" Protocol 1 IPv4 pub_mnic ArgListValues admin1 UseMpathd 1 1 MpathdCommand 1 /usr/lib/inet/in.mpathd ConfigCheck 1 1 MpathdRestart 1 1 Device 4 bnxe0 0 bnxe1 1 NetworkHosts 1 10.106.111.51 LinkTestRatio 1 1 IgnoreLinkStatus 1 1 NetworkTimeout 1 100 OnlineTestRepeatCount 1 3 OfflineTestRepeatCount 1 3 NoBroadcast 1 0 DefaultRouter 1 0.0.0.0 Failback 1 0 GroupName 1 "" Protocol 1 IPv4 pub_mnic ConfidenceLevel admin1 0 pub_mnic ConfidenceLevel admin1 0 pub_mnic ConfidenceMsg admin1 pub_mnic ConfidenceMsg admin1 pub_mnic Flags admin1 pub_mnic Flags admin1 pub_mnic IState admin1 not waiting pub_mnic IState admin1 not waiting pub_mnic MonitorMethod admin1 Traditional pub_mnic MonitorMethod admin1 Traditional pub_mnic Probed admin1 1 pub_mnic Probed admin1 1 pub_mnic Start admin1 0 pub_mnic Start admin1 0 pub_mnic State admin1 ONLINE pub_mnic State admin1 FAULTED pub_mnic ComputeStats global 0 pub_mnic ConfigCheck global 1 pub_mnic DefaultRouter global 0.0.0.0 pub_mnic Failback global 0 pub_mnic GroupName global pub_mnic IgnoreLinkStatus global 1 pub_mnic LinkTestRatio global 1 pub_mnic MpathdCommand global /usr/lib/inet/in.mpathd pub_mnic MpathdRestart global 1 pub_mnic NetworkHosts global 10.106.111.51 pub_mnic NetworkTimeout global 100 pub_mnic NoBroadcast global 0 pub_mnic OfflineTestRepeatCount global 3 pub_mnic OnlineTestRepeatCount global 3 pub_mnic Protocol global IPv4 pub_mnic TriggerResStateChange global 0 pub_mnic UseMpathd global 1 pub_mnic ContainerInfo admin1 Type Name Enabled pub_mnic ContainerInfo admin1 Type Name Enabled pub_mnic Device admin1 bnxe0 0 bnxe1 1 pub_mnic Device admin1 bnxe0 0 bnxe1 1 pub_mnic MonitorTimeStats admin1 Avg 0 TS pub_mnic MonitorTimeStats admin1 Avg 0 TS pub_mnic ResourceInfo admin1 State Valid Msg TS pub_mnic ResourceInfo admin1 State Stale Msg TS Please help to solve this asapSolved1.7KViews0likes3Commentscfsmount1 &cfsmount2 resource could not offline
I met a problem about vcs. Environment: HW T5220 Server *2 + ax4-5; Problem description: When executing “init 6” or “hastop –all” command in cluster system, resource cfsmount1&cfsmount2 could not been offline normally; Checked with the HW state(EMC connective state ,disk, system, iostat –En),the output of “vxdisk , vxprint, vxdmpadm, fuser, mount –v etc.” I tried to umount /var/opt/mediation/MMStorage manually, it did not succeed and it look like the process has hung up; Please see check list in attach file check_point.log , engine_A.log and main.cf . Could you give me some advice about how to fix the problem?3.1KViews0likes9Commentscfsmount1 &cfsmount2 resource could not offline
I met a problem about vcs. Environment: HW T5220 Server *2 + ax4-5; SW EMM8 ICP1505 Problem description: When executing “init 6” or “hastop –all” command in cluster system, resource cfsmount1&cfsmount2 could not been offline normally; Checked with the HW state(EMC connective state ,disk, system, iostat –En),the output of “vxdisk , vxprint, vxdmpadm, fuser, mount –v etc.” I tried to umount /var/opt/mediation/MMStorage manually, it did not succeed and it look like the process has hung up; Please see check list in attach file check_point.log , engine_A.log and main.cf . Could you give me some advice about how to fix the problem?580Views1like1Commentmonitoring of 2 nic resources in a global cluster
hi all. I'm using an application in a global cluster environment formed by 2 mini-cluster with one node each one, configured by a supplier. this application is using 2 NICs: one for the LAN of the management of the servers and one for the LAN of themanagement of the network elements. The supplier only configured a resource associated to the NIC of the management so, in case of a fault of this NIC the application can switch, while in case of a fault of the NIC of the network element the application freezes. This behaviour is obviously unacceptable, but the answer of the supplier was: the application doesn't support this feature (2 NIC resources together, apart from the heartbeat NIC). Is this possible ? I know that you cannot know how the application works, but I think that the configuration of the NIC resources is totally indipendent of the application itself, in whatever way it works. Is this correct ? thanks in advance and BR Tiziano1.6KViews0likes5Comments