08-13-2014 10:43 AM
Hello,
I'm having an issue where my VMware backups are not running. When I run a job it just stays qued up. In the Activity Monitor under the State Details it says Disk media is not active (Dedupe_media01_STU, media01).
What do I need to do in order to get the jobs to run?
Environment:
Server | OS | NetBackup Version |
Master server | Solaris 10 | 7.5.0.7 |
Media server 02 | Solaris 10 | 7.5.0.7 |
VM Media server 00 | Win2008 R2 | 7.5.0.7 |
VM Media server 01 | Win2008 R2 | 7.5.0.7 |
Media server bar-2 | SLES 11 | 7.1 |
Media server bar0 | SLES 11 | 7.5.0.5 |
Media server bar1 | SLES 11 | 7.5.0.5 |
Media server bar2 | SLES 11 | 7.5.0.5 |
Solved! Go to Solution.
08-13-2014 11:18 AM
nbemmcmd -updatehost -machinename string -machinetype media -machinestateop set_disk_active -masterserver string
If that doesn't work then you need to look at
nbdevquery -listdp -stype PureDisk -U
and check if the pool is actually up
nbdevquery -liststs -stype PureDisk -U
will show you if the storage server is up.
Also make sure spoold and spad are running on the media server.
08-13-2014 11:18 AM
nbemmcmd -updatehost -machinename string -machinetype media -machinestateop set_disk_active -masterserver string
If that doesn't work then you need to look at
nbdevquery -listdp -stype PureDisk -U
and check if the pool is actually up
nbdevquery -liststs -stype PureDisk -U
will show you if the storage server is up.
Also make sure spoold and spad are running on the media server.
08-13-2014 11:20 AM
Do I run this command on the media or master server? And what do I use for the string?
08-13-2014 11:40 AM
Are you saying the media server is not active because the disk is full?
What are retention levels on this dedupe disk?
Are backups getting duplicated anywhere else?
Have a look at this TN to understand expiration and data removal process from dedupe disk and how you can manually reclaim expired storage space.
http://www.symantec.com/docs/TECH124914
08-14-2014 08:07 AM
We opened a ticket with tech support and this is what I sent to them.
This is what the is in job details
------------------------------
info nbjm (pid=8220) starting backup job (jobid=58769) for client splunk-1, policy VMWARE_DEV_DEV, schedule Diff-inc
info nbjm (pid=8220) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=58769, request id:{3867C4B0-23B9-11E4-A63B-0010E020C290})
requesting resource masterserver-2.NBU_CLIENT.MAXJOBS. splunk-1
requesting resource masterserver-2.NBU_POLICY.MAXJOBS.VMWARE_ DEV_DEV
awaiting resource Dedupe_Winmedia01_STU.Waiting for resources.
Reason: Disk media server is not active, Media server: Winmedia01,
Robot Type(Number): NONE(N/A), Media ID: N/A, Drive Name: N/A,
Volume Pool: NetBackup, Storage Unit: Dedupe_Winmedia01_STU, Drive Scan Host: N/A,
Disk Pool: Dedupe_Winmedia01_DP, Disk Volume: PureDiskVolume
-------------------------------------------------------------------------------------
We found this, which you might find interesting.
Under Media and Device Management --> Devices --> Media Servers
Winmedia00 status is set to offline
Winmedia01 status is set to Active for Tape
All other media servers are set to Active for Tape and Disk.
We have a firewall called HBSS or ePolicy. They make changes from time to time that messes stuff up. So what ports do we need to have open in order to connect to the disk pools? The Disk Pools are on a NetApp device.
Are there commands can we run that check for connectivity between the Master server and the media servers or the Disk pools?
08-14-2014 08:10 AM
thanks I will read the technote. we are using only 22% of the disk pool space
08-14-2014 08:23 AM
we are using only 22% of the disk pool space
So, where is the 'error 129 Disk storage unit full' in your subject coming from?
08-14-2014 08:29 AM
The error is in the status, when we run the job. As you can see from the post above this one the The Windows media servers are showing that they can't reach the disk pool. My co-worker reran the GUI tool to change the status of the media server but the status didn't change.
08-14-2014 09:05 AM
You have not showed us any logs or Job details showing status 129.
Only 'Disk media server is not active' which is a comms problem.
NBU 7.5 and 7.6 needs port 1556 (PBX) only between master and media server.
See NetBackup Network Ports Reference Guide
Any chance the firewall is doing NAT or PAT?
This is not supported.
08-14-2014 10:03 AM
I ran the nbdevquery -listdp -stype PureDisk -U and the all the pools show up
I ran nbdevquery -liststs -stype PureDisk -U and the pools show up
This is a Windows media sever and I was not able to find spoold or spad in the services list.
08-14-2014 10:06 AM
I asked the firewall guys and they say the firewall is down and there are no errors or blocks reported in the logs.
08-14-2014 10:14 AM
What about nbemmcmd -updatehost -machinename string -machinetype media -machinestateop set_disk_active -masterserver string?
08-14-2014 10:26 AM
Hi,
1. If winmedia01 return status "Winmedia01 status is set to Active for Tape" it means that 1556 port is working.
You can verify this by bptestbpcd command.
2. If I understand you notice - "we are using only 22% of the disk pool" it means that any backup was runned to the disk pool in previous ? Or this 22% is free capacity for whole deduplication pool and 78% is used by another system ? Please, describe here detail.
3. So, as wrote Marianne and Riaan, please attach here output detail. I recommend you run nbsu command from /usr/openv/netbackup/bin/support/nbsu on master server a attach here file created by this. The output of this script describes the location of the file.
Petr
08-14-2014 10:46 AM
This is the output of the command.
NBEMMCMD, Version: 7.5.0.7
Command completed successfully
Thanks.
08-14-2014 10:58 AM
Riaan.Badenhorst,
Thanks your command fixed the issue.
08-14-2014 11:17 AM
Pleasure