05-14-2018 10:41 AM
Hi, I have 2 frozen tapes.
i checked the tape logs on Admin Console for the Media/tape and it shows "media id xxx load operation reported an error".
I checked the \veritas\netbackup\db\media\errors and found the following:
03/07/18 14:29:26 502322 10 POSITION_ERROR 0003
03/07/18 15:04:04 502322 10 POSITION_ERROR 0003
03/08/18 10:16:14 504921 10 WRITE_ERROR 0003
03/23/18 10:54:23 502513 10 POSITION_ERROR 0003
04/23/18 15:01:36 500780 1 WRITE_ERROR 0014
04/23/18 15:01:39 500780 1 TAPE_ALERT 0014 0x34001000 0x00000000
04/30/18 06:44:28 502412 -1 RESERVE_ERROR 0004 0 1 0 0
05/07/18 06:00:51 500529 -1 RESERVE_ERROR 0006 0 1 0 0
05/07/18 06:54:11 505158 -1 RESERVE_ERROR 0002 0 1 0 0
Does this mean something to anyone?Should i be looking somewhere else?
Is there a way to find out what drive was used for this particulat tape?
I have used ROBTEST to move tapes from slot to drives in the library and everything seems to work there. Drives dont need cleaning either as per the SL Console. Automatic cleaning is enabled.
05-14-2018 12:27 PM
I would check the tape itself
05-14-2018 01:07 PM
check the tape for physical damage? or something else?
This is what i got: bpmedialist -m 502427
Server Host = brm-up-nbu-6
id rl images allocated last updated density kbytes restores
vimages expiration last read <------- STATUS ------->
On Hold
--------------------------------------------------------------------------------
502427 5 3 05/08/2018 00:10 05/11/2018 07:16 hcart2 452767786 0
3 08/12/2018 06:00 N/A FROZEN
0
05-14-2018 07:55 PM
05-15-2018 12:32 AM
Are you looking at 'errors' file on media server brm-up-nbu-6?
What is the timestamp for the tape logs error?
Do you have bptm log folder on brm-up-nbu-6?
This log is the best place to look (at least level 3 logging) along with OS System log (/var/log/messages on Linux).
05-15-2018 09:45 AM - edited 05-15-2018 01:19 PM
HI, @Marianne @Amol_Nair
on the media server i tried to run the following but it didnt return any output.
egrep "501636|502542" /usr/openv/netbackup/db/media/errors
Ive attached the bptm logs of the media server. Also the tape logs timestamp too.
yesterday i only had 2 Frozen and today i have 20 total.
05-15-2018 01:30 PM
That's a lot of tape errors.
05-15-2018 03:32 PM
@Alexis_Jeldrezthese are old tapes which come back from iron mountain.
so im not sure how this actually works but when i ran: /usr/openv/volmgr/bin/robtest on my media server the output was:
No locally-controlled robots with test utilities are configured
From my master server:
brm-up-nbum-1:~ root # tpconfig -d
Id DriveName Type Residence
Drive Path Status
****************************************************************************
Currently defined robotics are:
TLD(0) robot control host = brm-up-nbu-1
TLD(1) robot control host = brm-up-nbu-1
EMM Server = brm-up-nbum-1
That means only brm-up-nbu-1 can do robtest? is that how it should be setup? or thats up to preferece?
i did the robtest on brm-up-nbu-1(media server) and i was able to move tapes to two different drives. i only did two because ive heard it gets stuck if you use it more then 5mins.
05-15-2018 10:44 PM
05-16-2018 03:07 AM
I agree with @Amol_Nair - log snippets do not help. The media id is not always part of the error.
We need full logs or at least ALL entries for a particular job/PID (e.g. 53251).
For load operation error, you need syslog (/var/log/messages) on the robot control host - brm-up-nbu-1.
Note that you need Media Manager processes to run in VERBOSE mode in order to have meaningful logging in messages file:
Add VERBOSE to vm.conf on robot control host plus ALL media servers, followed by restart of NBU.
If we look at the RESERVE_ERROR entries in your opening post, it seems that there may be reservation conflicts of drives between media servers in SSO environment.
So, extremely important to verify that SSO is configured correctly and that Persistent Binding is in place between HBA and OS on all media servers.
You may want to check vmdareq output on the master server at regular intervals during peack backup window to view drive assignments.
05-16-2018 10:21 AM - edited 05-16-2018 10:23 AM
I am very new to netbackup. i did find the log files you guys were asking for.
@Marianneyou mentioned "it seems that there may be reservation conflicts of drives between media servers in SSO environment.So, extremely important to verify that SSO is configured correctly and that Persistent Binding is in place between HBA and OS on all media servers."
How do i go about this? Where should i start from. Any documents would be helpful.
Really appreciate everyones help.
05-17-2018 12:03 AM
05-17-2018 01:54 AM
I am curious to see vmdareq output on the master server.
(In /usr/openv/volmgr/bin.)
Please remember to add VERBOSE entry to vm.conf on robot control host and all other media servers, followed by NBU restart.
This will ensure detailed logging around tape movement and other NBU Media Manager actions.
The reservation conflicts is what is causing your issues.
You need to verify that Persistent Binding is in place.
Once this is done, you can re-do SSO config by running the Device Config Wizard, select robot control host and all media servers sharing the drives. Once completed, the SSO config should be fine.
05-18-2018 10:50 AM - edited 05-18-2018 10:58 AM
Hi @Marianne,
is this the correct way to add VERBOSE entry to vm.conf:
MM_SERVER_NAME=hostname
MM_SERVER_NAME=hostname
MM_SERVER_NAME=hostname
DAYS_TO_KEEP_LOGS = number
VERBOSE
I would add all the media servers,not master, to the robot control host media server only OR i do the above on all media servers? or just add all media servers on the robot control host media server and just add VERBOSE to all the other media servers?Days to keep i can put =3. that way it will keep the logs for 3days?
i attached the vmdareq, looks wierd to me.
@Amol_NairI read the links you provided, but when i clicked on the Emulex article for more info it wasnt loading.
so im unsure how to do Persistent Binding.
FYI, i have a drive paths going down everyday. I just UP them. should i be tackling this first and then moving on to frozen tapes?
The person who was working on NBU is no longer here, and im filling in with hardly any background.
05-18-2018 10:53 AM
i saved the vmdareq in notepad but it keeps saying the conetens of the attachment doesnt match its file type, even tho i saved it as all file types. Hopefully word works.
05-18-2018 11:09 AM
05-18-2018 11:32 AM - edited 05-22-2018 09:09 AM
Thanks @Marianne ill do that.
vmdareq output is all over the place. some is readable while other is just junk. i edited the doc and removed all the junk incase you wanted a look.
05-22-2018 08:04 AM
Hi @Marianne,
After adding the VERBOSE to vm.conf file do i need to restart the media servers or restart the nbu service?
05-23-2018 09:39 PM
05-24-2018 12:15 AM