09-22-2013 12:20 PM
I have a VCS VVR issue, shown below. I have tried recovering the RVG and reattaching the RLINK, but it is not working:
root@ojnbu2 # vradmin -g catalog_dg -l repstatus rvg_ojota
Replicated Data Set: rvg_ojota
Primary:
Host name: 10.1.230.172
RVG name: rvg_ojota
DG name: catalog_dg
RVG state: enabled for I/O (passthru)
Data volumes: 1
VSets: 0
SRL name: srlvol
SRL size: 150.00 G
Total secondaries: 1
Secondary:
Host name: 10.1.110.132
RVG name: rvg_ojota
DG name: catalog_dg
Rlink from Primary: rlk_10.1.110.132_rvg_ojota
Rlink to Primary: rlk_10.1.230.172_rvg_ojota
Configured mode: asynchronous
Latency protection: off
SRL protection: autodcm
Data status: consistent, stale
Replication status: not replicating (primary needs recovery)
Current mode: N/A
Logging to: N/A
Timestamp Information: N/A
Bandwidth Limit: N/A
Compression Mode: Off
09-22-2013 12:39 PM
Your output shows the RVG state is PASSTHRU, so you need to dissociate and re-associate the SRL. See this extract from the VVR admin guide:
When a Primary SRL header error occurs, writes to the RVG continue; however, all RLINKs are put in the STALE state. The RVG is operating in PASSTHRU mode. To recover from an SRL header error:
1 Stop the RVG.
# vxrvg -g hrdg stop hr_rvg
2 Dissociate the SRL from the RVG.
# vxvol -g hrdg dis hr_srl
3 Repair or restore the SRL. Even if the problem can be fixed by repairing the underlying subdisks, the SRL must still be dissociated and reassociated to initialize the SRL header.
4 Make sure the SRL is started, and then reassociate the SRL:
# vxvol -g hrdg start hr_srl
# vxvol -g hrdg aslog hr_rvg hr_srl
5 Start the RVG:
# vxrvg -g hrdg start hr_rvg
6 Restore the data volumes from backup if needed. Synchronize all the RLINKs.
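The quoted steps can be strung together as a small dry-run script. This is only a sketch: the `run` helper, the `srl_header_recovery` function and the `RUN` switch are made up for illustration, and the defaults are the admin guide's example names (hrdg, hr_rvg, hr_srl). It prints the commands and executes nothing unless RUN=1 is set.

```shell
#!/bin/sh
# Dry-run sketch of the SRL header recovery steps quoted above.
# DG, RVG and SRL default to the admin guide's example names; override them
# in the environment. run, srl_header_recovery and RUN are illustrative names.
DG=${DG:-hrdg}
RVG=${RVG:-hr_rvg}
SRL=${SRL:-hr_srl}

run() {
    # Print the command; execute it only when RUN=1, stopping on failure.
    echo "# $*"
    if [ "${RUN:-0}" = "1" ]; then
        "$@" || exit 1
    fi
}

srl_header_recovery() {
    run vxrvg -g "$DG" stop "$RVG"           # 1. stop the RVG
    run vxvol -g "$DG" dis "$SRL"            # 2. dissociate the SRL
    # 3. repair or restore the SRL here if the underlying storage needs it
    run vxvol -g "$DG" start "$SRL"          # 4. make sure the SRL is started
    run vxvol -g "$DG" aslog "$RVG" "$SRL"   #    then reassociate it
    run vxrvg -g "$DG" start "$RVG"          # 5. start the RVG
}

# Usage (prints the five commands without running them):
#   DG=catalog_dg RVG=rvg_ojota SRL=srlvol sh this_script.sh
# after adding a call to srl_header_recovery at the end of the file.
```

Step 6 (restoring data volumes and synchronizing the RLINKs) is left out on purpose, since it depends on the state of the Secondary.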
09-22-2013 12:46 PM
Thanks Mike.
Can I do this online, dynamically?
What is the impact, please?
09-22-2013 12:50 PM
But there is no device error on the RVG and SRL.
See the vxprint and vxinfo status below:
09-22-2013 12:52 PM
So how am I going to repair the SRL if the status of the SRL and RVG is OK? As you said below:
3 Repair or restore the SRL. Even if the problem can be fixed by repairing the
underlying subdisks, the SRL must still be dissociated and reassociated to
initialize the SRL header.
09-22-2013 01:07 PM
The RVG state being PASSTHRU means there was a problem accessing the SRL at some point, but it looks OK now, so you just need to follow the steps to re-initialise the SRL header.
I THINK you can't stop the RVG online, which is a bit rubbish if I'm right, as you can normally delete the RVG and RLINK online, so you may end up having to do this and re-create the RDS.
Mike
09-23-2013 03:16 AM
This implies there will be no downtime. The configuration is GCO with VVR replicating the NetBackup catalogue from prod (one site) to DR (remote site).
That is to say, after recreating the RVG and SRL, replication has to be started.
Correct me if I am wrong.
I am grateful.
09-23-2013 05:57 AM
You should be able to do this without downtime. Try dissociating the SRL without stopping the RVG first, as I think this is possible while the RVG is in PASSTHRU mode. If you have issues dissociating the SRL while the volumes are mounted, you can delete the RDS and re-create it online.
You should freeze all service groups in VCS while you are doing this work.
Whatever method you use, you will need to restart replication.
Mike
09-24-2013 02:42 AM
Hello House.
I have executed the plan of action below, but I still have the same issue:
1. vxvol -g catalog_dg -f dis srlvol
2. vxvol -g catalog_dg aslog rvg_ojota srlvol
3. vradmin -g catalog_dg -a startrep rvg_ojota
After these, the VVR status is still the same as above. See below:
root@ojnbu2 # vradmin -g catalog_dg -l repstatus rvg_ojota
Replicated Data Set: rvg_ojota
Primary:
Host name: 10.1.230.172
RVG name: rvg_ojota
DG name: catalog_dg
RVG state: enabled for I/O (passthru)
Data volumes: 1
VSets: 0
SRL name: srlvol
SRL size: 150.00 G
Total secondaries: 1
Secondary:
Host name: 10.1.110.132
RVG name: rvg_ojota
DG name: catalog_dg
Rlink from Primary: rlk_10.1.110.132_rvg_ojota
Rlink to Primary: rlk_10.1.230.172_rvg_ojota
Configured mode: asynchronous
Latency protection: off
SRL protection: autodcm
Data status: consistent, stale
Replication status: not replicating (primary needs recovery)
Current mode: N/A
Logging to: N/A
Timestamp Information: N/A
Bandwidth Limit: N/A
Compression Mode: Off
Please advise.
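When comparing repstatus output before and after a change, it can help to pull out just the fields that matter. A minimal sketch, assuming the output keeps the "label: value" layout shown above; `repstatus_field` and `rvg_in_passthru` are hypothetical helper names, not VVR commands.

```shell
#!/bin/sh
# Sketch: extract single fields from "vradmin ... repstatus" text on stdin.
# Assumes each field is printed as "Label: value", possibly indented.

repstatus_field() {
    # $1 = field label (e.g. "RVG state"); prints the first matching value.
    awk -v key="$1" '
        { line = $0; sub(/^[ \t]+/, "", line) }
        index(line, key ":") == 1 {
            val = substr(line, length(key) + 2)
            sub(/^[ \t]+/, "", val)
            print val
            exit
        }'
}

rvg_in_passthru() {
    # Succeeds if the "RVG state" value on stdin mentions passthru.
    case "$(repstatus_field 'RVG state')" in
        *passthru*) return 0 ;;
        *) return 1 ;;
    esac
}

# Usage, with the names from this thread:
#   vradmin -g catalog_dg -l repstatus rvg_ojota | repstatus_field "Replication status"
#   vradmin -g catalog_dg -l repstatus rvg_ojota | rvg_in_passthru && echo "still passthru"
```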
09-24-2013 02:58 AM
When did the RVG state show passthru? Was this after you associated the SRL volume, or did it change to passthru when you started replication?
Were the rlinks detached before you ran "vxvol -g catalog_dg aslog rvg_ojota srlvol", i.e. had you run "vradmin stoprep" or "vxrlink det"? When you run "vradmin startrep" this attaches the rlinks, so I would think they need to be detached first.
Mike
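The ordering suggested here (detach the rlinks, reassociate the SRL, then start replication) could be sketched as a dry run like this. It is an assumption-laden outline: `run`, `reinit_srl_detached` and `RUN` are illustrative names, and the stoprep argument form simply mirrors the startrep command used earlier in the thread, so check it against the vradmin manual page before running anything for real.

```shell
#!/bin/sh
# Dry-run sketch: stop replication before reassociating the SRL, then restart.
# Defaults are the names used in this thread. Nothing is executed unless RUN=1.
DG=${DG:-catalog_dg}
RVG=${RVG:-rvg_ojota}
SRL=${SRL:-srlvol}

run() {
    # Print the command; execute it only when RUN=1, stopping on failure.
    echo "# $*"
    if [ "${RUN:-0}" = "1" ]; then
        "$@" || exit 1
    fi
}

reinit_srl_detached() {
    run vradmin -g "$DG" stoprep "$RVG"        # detach the rlinks first (assumed form)
    run vxvol -g "$DG" -f dis "$SRL"           # as run in the thread
    run vxvol -g "$DG" aslog "$RVG" "$SRL"     # reassociate to rewrite the SRL header
    run vradmin -g "$DG" -a startrep "$RVG"    # restart replication, as in the thread
}

# Usage: add a call to reinit_srl_detached at the end to print the sequence.
```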
09-24-2013 03:02 AM
Sorry, it was after starting replication. The passthru state has now cleared, as below.
I think I need to recover now:
09-24-2013 05:04 AM
Yes, hopefully a vxrvg or vxrlink recover will now fix the issue.
Mike
09-24-2013 05:13 AM
It fixed it, but there is another issue surfacing:
The catalog_dg on the secondary host gets deported and throws the error below:
I imported the replicated disk group manually and replication starts, but after a few minutes it stops again because the disk group gets deported again.
See below:
09-24-2013 05:18 AM
How do we stop the cluster from deporting catalog_dg after it is imported manually? Maybe by setting the attributes to manual, outside of the cluster?
09-24-2013 05:55 AM
To stop VCS taking action on resources, freeze the service group.
Mike
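As a dry-run sketch of that advice: freeze the group, then import the disk group by hand. The service group name is not given anywhere in this thread, so `catalog_sg` is a placeholder, and `run`, `freeze_group`, `unfreeze_group`, `import_dg` and `RUN` are illustrative names. Nothing is executed unless RUN=1.

```shell
#!/bin/sh
# Dry-run sketch: freeze a VCS service group before manual disk group work.
# SG is a placeholder; the real service group name is not given in the thread.
SG=${SG:-catalog_sg}
DG=${DG:-catalog_dg}

run() {
    # Print the command; execute it only when RUN=1.
    echo "# $*"
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    fi
}

freeze_group()   { run hagrp -freeze "$SG"; }     # VCS stops acting on the group
unfreeze_group() { run hagrp -unfreeze "$SG"; }   # hands control back to VCS
import_dg()      { run vxdg import "$DG"; }       # manual import of the disk group

# Usage: freeze first, then import.
#   freeze_group && import_dg
```

Unfreezing is kept as a separate step because it returns control of the disk group to VCS, which is what was deporting it in the first place.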
09-24-2013 07:13 AM
Many thanks,
I have frozen the service group containing catalog_dg and imported it manually, so VVR has started as below, and the cluster has not deported it for 5 minutes now. Still monitoring: