Forum Discussion

Steppan's avatar
Steppan
Level 4
9 years ago

New File Server - New Backup Solution - Need advise

Hello!

Have spent some time reading the cummunity forum and collecting information regarding file servers environments and netbackup but I would really like some more focused help on my particular environment here at the company. I will appreciate any help.

Today we have a VMware Cluster with Dell Compellent LUNs as datastores and some as RDM disks too.

Our file server is a single 2012 R2 VM and has a 12Tb RDM LUN mapped to it (physical mode), where the main folders are shared to users (we have many quotas and file screening policies configured too).

Today we have 11Tb deduped (by windows feature) into something near 7Tb of data in that disk. We have no redundancy (I mean failover), no storage replication, just HA from VMware cluster.

We recently bought a solution with Netbackup (30Tb dedup license), Dell DR4100 backup appliance (+30Tb of capacity), 2-drive LTO7 tapelibrary (24 slots) and licenses to backup VMware Environment, SQL (physical failover cluster), Exchange (physical DAG), Sharepoint and everything else.

The main goal is to take advantage from Netbackup features like Synthetic Full, Dedup, Granular Recovery, OST integration, and to make less use of tape than we have today - this is the reason we bought a backup appliance - and aside that, we still need to deploy a new file server infrastructure.

We have Microsoft EA and SA on our Windows Server Datacenter Licenses and we're planning to use the new Windows 2016 Server Stretched Cluster Feature.

This is a brief of the cluster:

https://technet.microsoft.com/en-us/windows-server-docs/storage/storage-replica/stretch-cluster-replication-using-shared-storage?f=255&MSPPError=-2147217396

We have 2 datacenters (LAN and SAN connected between them), one with HP Blade and HP 3PAR Storage and another one with Dell Blade and Dell Compellent Storage. Using Windows 2016 Stretched Cluster we plan to use these 2 blade/storage kits in separate datacenters, using VMware clusters on both, hosting two fileserver VM nodes on HP side, two fileserver VM nodes on Dell side.

My questions are regarding the Netbackup features and best practices and the alignment with a good performance and redundant capable fileserver.

1. Will Netbackup Full synthetic feature be compatible with that storage replication windows feature (the hardware is arriving and the software is still in order processing, so let's assume that the installed version will be Netbackup 8)?

2. What would be the best practice in LUN presentation to file server? Considering the size of data (lets assume 15Tb), is it better to present 4 or more LUNs with smaller sizes or one big 15Tb LUN? How will it affect the backup job performance?

3. What is the best practice for backing up the fileserver? Using disk agent into it's OS or using VMware integration from Netbackup and it's CBT functions? What about the restore?

4. We have today windows dedup activated, the Dell DR4100 backup appliance has it's dedup and the netbackup has it's dedup features too. Is it better to disable windows dedup to avoid dedup rehydration? What would be the best dedup feature choice? DR4100 or Netbackup? Source or destination?

Thank you

Steppan

  • If the Dell blades for the ESXi hosts are FC or CNA (i.e. leading to NPIV) connected and the Dell Compellent SAN LUNs really do appear (to native Windows OS) as FC HBA based multi-path LUNs, then you should be able to leverage SAN off-host backups via your bare metal (on a Dell blade) (i.e. non-virtualised) Windows based NetBackup Media Server - and your backups... should... be rapid!   (assuming your lowly storage array can keep up with NetBackup) - but it may transpire that the LAN ingest at the Dell DRxxxx is the bottleneck... but not if you can also leverage client-side dedupe on the bare metal NetBackup Media Server, and use the oodles of CPU that we tend to have on blades so that NetBackup Media Server (acting as client) can fingerprint all those blocks coming up so rapidly from the SAN.

    Sound like you have a really nice project on your hands.  Hope it goes well for you.

  • You will need to put a client on the VM, you cannot snapshot RDMs from hypervisors.

    Netbackup will probably not be aware of any storage replication, as it is happening at lower level.

    Normally my suggestion would be to use accelerator, but not sure it is supported with anything but Netbackup's own deduplication feature.

    Usually better to present more LUNs as every "physical disk" gets a queue in the OS, of course there can be other limits.

    You should probably consider backup for file/folder restores and backup for disaster recovery of the full volume(s)

    Being a RDM snapshot/clone might be worth to look into.

    • Steppan's avatar
      Steppan
      Level 4

      You're right, I forgot the fact that RDM won't be able to do snapshot.

      Great info on the number of LUNs. I've never worked with Netbackup. I came from the HP world, always worked with Data Protector, but had some contact with other platforms like backup exec and arcserve brightstor. Based on that, I don't know what to expect from Netbackup. The objective of this topic is to align my company's infrastructure with the new coming backup solution.

      Thanks!

  • See page 66 of the HCL for NetBackup v7.7.x here:

    https://www.veritas.com/support/en_US/article.000025228

    ...which looks like Dell DRxxxx firmware version 3.0 and above supports Accelerator into their dedupe stack.

    And (oddly enough) page 66 (also) of the SCL for NetBackup v7.7.x here:

    https://www.veritas.com/support/en_US/article.000025229

    ...indicates that backups of Windows 2012 R2 NTFS native dedupe volumes is supported, but you cannot perform an "NTFS dedupe aware" backup, only something like a normal backup, but you can use Accelerator (which is a different form of optimisation) and this will greatly reduce the folder walking of a full backup when that full backup is sent directly to the Dell DR4100.

    Page 21 of the NetBackup for Virtual environments compatility doc , here:

    https://www.veritas.com/support/en_US/article.000006177

    ...says that NetBackup does not support VM style backups of VMs containing NTFS native dedupe volumes, plus those volumes are on RDMs anyway and so couldn't be backed-up by VM style backups anyway.

    .

    So, for your big file server with NTFS native dedupe voluems on RDM LUNs... the backup style will be... plain client inside the VM, using an MS-Windows backup policy type, making sure that "Enable optimized backup of Windows deduplicated volumes" is *not* selected on the backup policy, configure an SLP with two stages, a backup stage with say 2/3/4 weeks retention and a duplication stage with say 5/6/7/8 or months or years retention, and finally use this SLP as the storage unit backup target in the policy header and/or schedule definition.

    .

    As ever, when you're deploying something new... who did the sizing?  RPO/RTO considered?  Did you realize that any restore of 7TB of NTFS dedupe data (i.e. 11TB of actual data) will require a restore target of 11TB... so you may need to two stage this particular restore in a DR scenario.

    HTH.

    • Steppan's avatar
      Steppan
      Level 4

      SDO,

      This fact about the restore really annoys me. There was a guy from a third part IT services provider playing a POC for backup exec here before we decided to go with Netbackup and he told me that even if I choose to make a granular restore, the entire content of data (lets say 11Tb not-deduped) needs to be "mounted" on another LUN just to collect a 1Mb granular file. During BE POC we backep up a clone VM from the file server containing the same data of it but using VMDK instead of RDM, using VMware integration. When I tried to do a restore, it says that I would have to choose a "temporary mount disk" or something like that as BE would need to extract the entire data, even if I need to restore one single granular file. Is it true for Netbackup too?

      We have many granular restores for this file server (I would say 50 per week) nowadays with data protector. The RTO is not a concern, we have 1 full backup every weekend and 5 enhanced incrementals during the week at 8pm. The concern is being able to restore a 1Mb excel sheet in a matter of minutes without waiting for a long extraction/mapping/mounting time for such big bunch of data when the only thing I want is a tiny file.

      Considering the fact that RDM won't be able to snapshot (so we can't use VMware integration) and Michael's approach on presenting more than one unique huge LUN, the best way would be converting the actual one RDM LUN model to a multi disk VMDK scenario, at different datastores (different storage boxes for sure), disable the windows dedup service and use the VMware integration for backing up this VM?

      The RPO and RTO will be considered of course. The strategy for file server is to backup it up to DR4100 with synthetic full during the year and 1 or 2 times per year we are going to copy the data from DR4100 to LTO7 tape for archiving. We need to safeguard these tapes for 5 years. We need faster restore and lower dependency on tape handle (our safe where the tapes are stored is located 20km from here in another plant).

      Today we have:

      Yearly jobs (Permanent)

      Monthly jobs (1 year protection)

      Weekly jobs (1 month protection)

      Daily jobs (7 days protection)

      The backup is made first to a Software D2D appliance for faster backup and restore (very tight backup window). Only Yearly and Monthly jobs are copied to tape and deleted from disk. 

      Thank you!

      • sdo's avatar
        sdo
        Moderator

        1) [Steppan] This fact about the restore really annoys me. There was a guy from a third part IT services provider playing a POC for backup exec here before we decided to go with Netbackup and he told me that even if I choose to make a granular restore, the entire content of data (lets say 11Tb not-deduped) needs to be "mounted" on another LUN just to collect a 1Mb granular file. During BE POC we backep up a clone VM from the file server containing the same data of it but using VMDK instead of RDM, using VMware integration. When I tried to do a restore, it says that I would have to choose a "temporary mount disk" or something like that as BE would need to extract the entire data, even if I need to restore one single granular file. Is it true for Netbackup too?
        [sdo] I think that BE might be considered as having the edge here because BE does appear to support "file" level recovery from a VM style backup of a VM containing an NTFS native dedupe volume. But we already know that NetBackup doesn't even support VM style backups of a VM containing an NTFS native dedupe volume, so to answer your question, then "no", NetBackup doesn't do that, because NetBackup can't do that. I think that the real issue here is any file copy out of an NTFS native dedupe volume requires random access and an understanding of the meta-data... so maybe one day... when MS and Veritas get it together on this... then we may see a feature where file level restore out of a backup image containing NTFS native dedupe is possible from a backup image residing on disk (e.g. BasicDisk, AdvancedDisk, MSDP, OST)... but there's several complex layers of abstraction to pick through. Personally I think we should consider ourselves lucky that we can even take a backup in the first place. ;)

        2) [Steppan] We have many granular restores for this file server (I would say 50 per week) nowadays with data protector. The RTO is not a concern, we have 1 full backup every weekend and 5 enhanced incrementals during the week at 8pm. The concern is being able to restore a 1Mb excel sheet in a matter of minutes without waiting for a long extraction/mapping/mounting time for such big bunch of data when the only thing I want is a tiny file.
        [sdo] Have you considered enabling VSS on a daily schedule, as this may offer you the ability to recover changed/deleted files without having to restore from a backup product. But this very much depends upon the rate of change... and so sometimes it is useful - especially with such large file systems, to provision an additional disk/vmdk/LUN to act as VSS delta space, so that the original base source volume cannot fill up even if the VSS delta space fills up.

        3) [Steppan] Considering the fact that RDM won't be able to snapshot (so we can't use VMware integration) and Michael's approach on presenting more than one unique huge LUN, the best way would be converting the actual one RDM LUN model to a multi disk VMDK scenario, at different datastores (different storage boxes for sure), disable the windows dedup service and use the VMware integration for backing up this VM?
        [sdo] That sounds like it would fly. And your Dell DRxxxx appliance appears to support the "Accel_VMware" feature, so you should be able to perform accelerated VMstyle BLIB/CBT backups if you used VMDKs instead of hard RDM LUNs inside your big file server.

        4) [Steppan] The RPO and RTO will be considered of course. The strategy for file server is to backup it up to DR4100 with synthetic full during the year and 1 or 2 times per year we are going to copy the data from DR4100 to LTO7 tape for archiving. We need to safeguard these tapes for 5 years. We need faster restore and lower dependency on tape handle (our safe where the tapes are stored is located 20km from here in another plant).
        [sdo] IMO, I wouldn't consider "NetBackup synthetic full" because this involves post processing to re-munge to re-create a new full backup. I think you would be better off with NetBackup Accelerator - either as plain client inside the VM, or using Accelerator on the VM style backups themselves. What is not clear to me is whether the Dell DRxxxx can leverage FC or iSCSI based shared LUN access to the VMFS datastores on your FC/iSCSI/SAN attached storage array - or whether all VMFS data-store block level backup traffic is going to have to be passed out through the NIC of the ESXi host, to the NetBackup Media Server (element of your master) and then in turn passed on again via LAN to your Dell DRxxxx.  You can think of Accelerator as a bit like a forever incremental.  Only the changed blocks are ever moved, even when doing a "full", and so the volume of LAN traffic from "VMDK/datastore -> ESXi -> NetBackup -> Backup Appliance" should only ever be as high as your actual rate of data change on disk.