Highlighted

VVR

Hi

I am new to VVR and need assistance please.

Replication works fine until I create the “replication service group” in VCS.

VCS initiates a disk group deport on secondary node 2 and then replication stops.

If you freeze the “replication service group” the disk group on the secondary node stays imported and replication works.

I obviously missing something. Shouldnt the disk groups be in an imported state on all node for VVR replication to work?

 

Thanks

Marius

1 Solution

Accepted Solutions
Accepted Solution!

It appears you are trying to

It appears you are trying to replicate between your 2 nodes in your cluster SRV003457 and SRV003458 - this is called a Replicated Data Cluster (RDC) which requires that you set up the replication service group as a hybrid service group.  If you are using SQL, there are instructions on how to do this in the SFW_HA_DR_SQL2008_Solutions_6.0.1.pdf and SFW_HA_DR_SQL2012_Solutions_6.0.1.pdf guides or equivalent for your versions of SFWHA and SQL.

Your  PRDdatdg-RVG should look like:

 

group PRDdatdg-RVG (
    SystemList = { SRV003457 = 0, SRV003458 = 1 }
    Parallel = 2
    AutoStartList = { SRV003457, SRV003458 }
    SystemZones = { SRV003457 = 0, SRV003458 = 1 }

    )

    IP PRDdatdg-RVG-IP (
        Address@SRV003457 = IP_for_SRV003457
        Address@SRV003458 = IP_for_SRV003458

        SubNetMask = "255.255.255.0"
        MACAddress @SRV003457 = 00-0A-F7-20-29-0A
        MACAddress @SRV003458 = 00-0A-F7-20-29-14
        )

    NIC PRDdatdg-RVG-NIC (
        MACAddress @SRV003457 = 00-0A-F7-20-29-0A
        MACAddress @SRV003458 = 00-0A-F7-20-29-14
        )

    VMDg CLS10-SQL-PROD-SG-VMDg (
        DiskGroupName @SRV003457 = PRDdatdg
        DiskGroupName @SRV003458 = PRDdatdg
        )

    VvrRvg PRDdatdg-RVG-VvrRvg (
        RVG = PRDdatdg-RDS
        VMDgResName = CLS10-SQL-PROD-SG-VMDg
        IPResName = PRDdatdg-RVG-IP
        )

    PRDdatdg-RVG-VvrRvg requires CLS10-SQL-PROD-SG-VMDg
    PRDdatdg-RVG-VvrRvg requires PRDdatdg-RVG-IP
    PRDdatdg-RVG-IP requires PRDdatdg-RVG-NIC

 

You do not have "Parallel = 2" which means your PRDdatdg-RVG is a failover group only allowed online on one node and therefore you get errors:
2014/04/28 08:09:26 VCS ERROR V-16-1-10214 Concurrency Violation:CurrentCount increased above 1 for failover group PRD_MSDTC-RVG
2014/04/28 08:09:26 VCS WARNING V-16-6-15034 (SRV003458) violationSmiley Surprisedfflining group PRD_MSDTC-RVG on system SRV003458

 

If you stop VCS ("hastop -all" from one node or "net stop had" on both nodes) and then change main.cf as above and then restart VCS ("hastart" or "net start had" from both nodes), then it should work, but if you are using SQL, then usually you should be using the SQL agents set up using the SQL wizards as in guides mentioned above which will also configure RDC for you.

Ideally you should be using virtual IPs for replication (these should be separate from application IPS) as this means you can add more nodes to your cluster in the furture (so for instance you could add node SRV003456 and then you could replicate from SRV003456 to SRV003458 or replicate from SRV003457 to SRV003458 so the virtual replication IP would fail between SRV003456 and SRV003457) , but if you only ever have 2 nodes, you don't technically need to use virtual IPs and if this is the case then you should remove the IP (PRDdatdg-RVG-IP) resource.

Mike

View solution in original post

3 Replies

Hello, Yes, diskgroups will

Hello,

Yes, diskgroups will be in imported state on the remote site for replication to work. VVR works at block level replication so volumes have to be visible on remote site for replication to work. Volumes will appear only if diskgroup is imported. This is also proved once you keep group frozen, replication works. The problem here is why VCS deports the diskgroup ?

I  am wondering if your systemlist is correct on remote site ? can you attach main.cf for review & also attach engine_A.log which carries entries of VCS taking diskgroup offline.

 

G

See attched Thanks for

See attched

Thanks for helping

 

Accepted Solution!

It appears you are trying to

It appears you are trying to replicate between your 2 nodes in your cluster SRV003457 and SRV003458 - this is called a Replicated Data Cluster (RDC) which requires that you set up the replication service group as a hybrid service group.  If you are using SQL, there are instructions on how to do this in the SFW_HA_DR_SQL2008_Solutions_6.0.1.pdf and SFW_HA_DR_SQL2012_Solutions_6.0.1.pdf guides or equivalent for your versions of SFWHA and SQL.

Your  PRDdatdg-RVG should look like:

 

group PRDdatdg-RVG (
    SystemList = { SRV003457 = 0, SRV003458 = 1 }
    Parallel = 2
    AutoStartList = { SRV003457, SRV003458 }
    SystemZones = { SRV003457 = 0, SRV003458 = 1 }

    )

    IP PRDdatdg-RVG-IP (
        Address@SRV003457 = IP_for_SRV003457
        Address@SRV003458 = IP_for_SRV003458

        SubNetMask = "255.255.255.0"
        MACAddress @SRV003457 = 00-0A-F7-20-29-0A
        MACAddress @SRV003458 = 00-0A-F7-20-29-14
        )

    NIC PRDdatdg-RVG-NIC (
        MACAddress @SRV003457 = 00-0A-F7-20-29-0A
        MACAddress @SRV003458 = 00-0A-F7-20-29-14
        )

    VMDg CLS10-SQL-PROD-SG-VMDg (
        DiskGroupName @SRV003457 = PRDdatdg
        DiskGroupName @SRV003458 = PRDdatdg
        )

    VvrRvg PRDdatdg-RVG-VvrRvg (
        RVG = PRDdatdg-RDS
        VMDgResName = CLS10-SQL-PROD-SG-VMDg
        IPResName = PRDdatdg-RVG-IP
        )

    PRDdatdg-RVG-VvrRvg requires CLS10-SQL-PROD-SG-VMDg
    PRDdatdg-RVG-VvrRvg requires PRDdatdg-RVG-IP
    PRDdatdg-RVG-IP requires PRDdatdg-RVG-NIC

 

You do not have "Parallel = 2" which means your PRDdatdg-RVG is a failover group only allowed online on one node and therefore you get errors:
2014/04/28 08:09:26 VCS ERROR V-16-1-10214 Concurrency Violation:CurrentCount increased above 1 for failover group PRD_MSDTC-RVG
2014/04/28 08:09:26 VCS WARNING V-16-6-15034 (SRV003458) violationSmiley Surprisedfflining group PRD_MSDTC-RVG on system SRV003458

 

If you stop VCS ("hastop -all" from one node or "net stop had" on both nodes) and then change main.cf as above and then restart VCS ("hastart" or "net start had" from both nodes), then it should work, but if you are using SQL, then usually you should be using the SQL agents set up using the SQL wizards as in guides mentioned above which will also configure RDC for you.

Ideally you should be using virtual IPs for replication (these should be separate from application IPS) as this means you can add more nodes to your cluster in the furture (so for instance you could add node SRV003456 and then you could replicate from SRV003456 to SRV003458 or replicate from SRV003457 to SRV003458 so the virtual replication IP would fail between SRV003456 and SRV003457) , but if you only ever have 2 nodes, you don't technically need to use virtual IPs and if this is the case then you should remove the IP (PRDdatdg-RVG-IP) resource.

Mike

View solution in original post