02-12-2009 03:11 AM
Hi All,
A netbackup newbie here.
The SUN veritas netbackup support is crap, so as a last resort i'm turning to this Forum .
I think that my netbackup is writing ot the wrong volume pool
The problematic client is subctuxs1 and the policy name is TEST_SUBCTUXS1_FS in this case.
Its is writing to the 'SUBCTUXS1' volume pool instead of to the 'OFFSITE' volume pool,
because W325,W338L2,W302L2,....
are from the 'SUBCTUXS1' volume pool.
See below for the volume group info, configuration details of
the TEST_SUBCTUXS1_FS policy and the error logs from bptm/log.xxxxx
Any help or suggestions would be VERY much appreciated.
Rgds
Henley
root@SUBCTUXS7:netbackup[562]# bpmedialist -p SUBCTUXS1 | grep ^W3
W301L2 3 246 02/22/2008 01:03 03/05/2008 01:28 hcart2 386381248 0
W302L2 3* 104 02/07/2009 01:25 02/12/2009 03:07 hcart2 633243328 0
W325L2 3 298 01/15/2009 03:03 01/23/2009 03:03 hcart2 853282112 0
W338L2 3 101 01/28/2009 01:16 02/07/2009 01:25 hcart2 902444224 0
W352L2 3 142 01/21/2009 01:09 02/12/2009 01:25 hcart2 537587136 0
W371L2 3 766 10/04/2008 03:09 02/04/2009 01:25 hcart2 548301120 0
W384L2 3 328 12/10/2008 01:13 01/15/2009 03:03 hcart2 944233504 0
W300L2 5 3 01/03/2009 13:58 01/16/2009 18:46 hcart2 188601536 2
But this policy is supposed to use the 'OFFSITE' volume pool
root@SUBCTUXS7:netbackup[565]# tail db/class/TEST_SUBCTUXS1_FS/info
CHKPT_INTERVAL 0
ENABLE_PFI 0
OFFHOST_BACKUP 0
USE_ALT_CLIENT 0
USE_DATA_MOVER 0
DATA_MOVER_TYPE 2
CLASS_ID FB820DBC1DD111B2B0890003BACA00B5
RESIDENCE SUBCTUXS1-hcart2-robot-tld-0 *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL*
POOL OFFSITE NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup
#VMD5_DIGEST=9e 6e cd 35 d5 f8 51 e1 11 00 96 b3 86 c3 a5 6c
the bptm log file shows the following errors
19:28:12.251 [439] <2> select_media: added 57 media id's to list that matched robot number/type and media type
19:28:12.251 [439] <2> select_media: skipping media id W325L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W338L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W301L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W302L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W302L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W352L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W352L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: skipping media id W384L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.252 [439] <2> select_media: consider allowing retention level 0 on media (W371L2) that is 3 since its enabled by NetBackup configuration
19:28:12.252 [439] <2> select_media: skipping media id W371L2, it is not in correct storage unit or volume pool
19:28:12.252 [439] <2> select_media: getting new media id for retention level 0
19:28:12.252 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=vmd address=10.39.142.136 protocol=tcp non-reserved port=13701
19:28:12.252 [439] <2> nb_bind_on_port_addr: bound to port 62467
19:28:12.254 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.254 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.357 [439] <2> vmdb_query_scratch_bypool2: server returned: EXIT_STATUS 35
19:28:12.357 [439] <2> string_to_record: read response: EXIT_STATUS 35
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: unable to unpack string from server: volume does not exist in database (35)
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: query_scratchbypool2 request status: volume does not exist in database (35)
19:28:12.358 [439] <2> set_job_details: Sending Tfile jobid (98543)
19:28:12.358 [439] <2> set_job_details: LOG 1234351692 16 bptm 439 Media Manager volume pool OFFSITE has no more unassigned media in robotic device TL
D(0)
19:28:12.358 [439] <2> set_job_details: Done
19:28:12.358 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=bpdbm address=10.39.142.136 protocol=tcp non-reserved port=13721
19:28:12.359 [439] <2> nb_bind_on_port_addr: bound to port 62468
19:28:12.359 [439] <2> logconnections: BPDBM CONNECT FROM 10.39.142.130.62468 TO 10.39.142.136.13721
19:28:12.360 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.361 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.361 [439] <2> check_authentication: no authentication required
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.432 [439] <16> select_media: Media Manager volume pool OFFSITE has no more unassigned media in robotic device TLD(0)
19:28:12.432 [439] <2> bptm: EXITING with status 96 <----------
19:28:16.578 [443] <2> bptm: INITIATING (VERBOSE = 5): -count -cmd -rt 8 -rn 0 -stunit SUBCTUXS1-hcart2-robot-tld-0 -den 14 -mt 2 -masterversion 51000
0
root@SUBCTUXS7:netbackup[562]# bpmedialist -p SUBCTUXS1 | grep ^W3
W301L2 3 246 02/22/2008 01:03 03/05/2008 01:28 hcart2 386381248 0
W302L2 3* 104 02/07/2009 01:25 02/12/2009 03:07 hcart2 633243328 0
W325L2 3 298 01/15/2009 03:03 01/23/2009 03:03 hcart2 853282112 0
W338L2 3 101 01/28/2009 01:16 02/07/2009 01:25 hcart2 902444224 0
W352L2 3 142 01/21/2009 01:09 02/12/2009 01:25 hcart2 537587136 0
W371L2 3 766 10/04/2008 03:09 02/04/2009 01:25 hcart2 548301120 0
W384L2 3 328 12/10/2008 01:13 01/15/2009 03:03 hcart2 944233504 0
W300L2 5 3 01/03/2009 13:58 01/16/2009 18:46 hcart2 188601536 2
But this policy is supposed to use the 'OFFSITE' volume pool
root@SUBCTUXS7:netbackup[565]# tail db/class/TEST_SUBCTUXS1_FS/info
CHKPT_INTERVAL 0
ENABLE_PFI 0
OFFHOST_BACKUP 0
USE_ALT_CLIENT 0
USE_DATA_MOVER 0
DATA_MOVER_TYPE 2
CLASS_ID FB820DBC1DD111B2B0890003BACA00B5
RESIDENCE SUBCTUXS1-hcart2-robot-tld-0 *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL*
POOL OFFSITE NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup
#VMD5_DIGEST=9e 6e cd 35 d5 f8 51 e1 11 00 96 b3 86 c3 a5 6c
the bptm log file shows the following errors
19:28:12.251 [439] <2> select_media: added 57 media id's to list that matched robot number/type and media type
19:28:12.251 [439] <2> select_media: skipping media id W325L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W338L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W301L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W302L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W302L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W352L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W352L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: skipping media id W384L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.252 [439] <2> select_media: consider allowing retention level 0 on media (W371L2) that is 3 since its enabled by NetBackup configuration
19:28:12.252 [439] <2> select_media: skipping media id W371L2, it is not in correct storage unit or volume pool
19:28:12.252 [439] <2> select_media: getting new media id for retention level 0
19:28:12.252 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=vmd address=10.39.142.136 protocol=tcp non-reserved port=13701
19:28:12.252 [439] <2> nb_bind_on_port_addr: bound to port 62467
19:28:12.254 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.254 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.357 [439] <2> vmdb_query_scratch_bypool2: server returned: EXIT_STATUS 35
19:28:12.357 [439] <2> string_to_record: read response: EXIT_STATUS 35
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: unable to unpack string from server: volume does not exist in database (35)
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: query_scratchbypool2 request status: volume does not exist in database (35)
19:28:12.358 [439] <2> set_job_details: Sending Tfile jobid (98543)
19:28:12.358 [439] <2> set_job_details: LOG 1234351692 16 bptm 439 Media Manager volume pool OFFSITE has no more unassigned media in robotic device TL
D(0)
19:28:12.358 [439] <2> set_job_details: Done
19:28:12.358 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=bpdbm address=10.39.142.136 protocol=tcp non-reserved port=13721
19:28:12.359 [439] <2> nb_bind_on_port_addr: bound to port 62468
19:28:12.359 [439] <2> logconnections: BPDBM CONNECT FROM 10.39.142.130.62468 TO 10.39.142.136.13721
19:28:12.360 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.361 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.361 [439] <2> check_authentication: no authentication required
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.432 [439] <16> select_media: Media Manager volume pool OFFSITE has no more unassigned media in robotic device TLD(0)
19:28:12.432 [439] <2> bptm: EXITING with status 96 <----------
19:28:16.578 [443] <2> bptm: INITIATING (VERBOSE = 5): -count -cmd -rt 8 -rn 0 -stunit SUBCTUXS1-hcart2-robot-tld-0 -den 14 -mt 2 -masterversion 51000
0
02-12-2009 06:34 AM
Schedules can overwrite the default volume pool and storage unit for the policy.
Double check how your backup schedule is setup.
02-12-2009 08:49 AM
19:28:12.358 [439] <2> set_job_details: LOG 1234351692 16 bptm 439 Media Manager volume pool OFFSITE has no more unassigned media in robotic device TL
looking at your logs all the tape in volume pool got disqualified because they were:
19:28:12.251 [439] <2> select_media: skipping media id W301L2, either FULL or FROZE or SUSPENDED or IMPORTED
or
19:28:12.252 [439] <2> select_media: skipping media id W371L2, it is not in correct storage unit or volume pool
1) check the the tapes to see if they are frozen or imported or suspened and clean that up to make the avail
2) the tapes in the pool show to be hcart2 - check the storage unit you have the policy set up to use and verify that storage unit is using drives set at hcart2
02-12-2009 09:23 AM
Can you paste the bppllist <policy name> -U ?
This will tell us if you have any different pool under your schedules also under your ITC config if you have one.
regards.
02-12-2009 09:31 AM
horly,
The "bptm" log is just listing media that it is considering for the backup, it disregards each of them and then goes to the Scratch pool to find an AVAILABLE media. It finds no scratch pool , or no Scratch pool in the robot, and fails with the "status code 96".
Run the "available_media" script. Look in your "OFFSITE" pool for media being the robot (robot type will be TLD) and having the status "AVAILABLE" on the far right column.
The schedule is looking at the right pool, the pool just does not have any "AVAIALBLE" media in the robot.
Hope this helps, or moves you along further.
Let me know.
02-12-2009 11:49 PM
hi sunbird, seems like you are right. The output from the available_media script tells me
that none of the media in the OFFSITE pool is avail. Why can it use an 'ACTIVE' media ?
is it because it 'claimed' by another client? The client for this failed policy is 'subctuxs1' and
I dont see this server in the server host for the bpmedialist -p OFFSITE output.
output of "bpmedialist -p OFFSITE" is included as well.
If this is the case, can i recycle the 2 FULL media in the OFFSITE pool?
If so how can i do that ? i.e What is the command to do that ?
Many thanks for all the response submitted .
<output of available_media>
OFFSITE pool
W318L2 HCART2 NONE - - - 3* 561934528 ACTIVE
W349L2 HCART2 TLD 0 4 - 3* 381757088 ACTIVE
W351L2 HCART2 NONE - - - 5 130040320 ACTIVE
W381L2 HCART2 TLD 0 9 - 3 143630944 ACTIVE
W382L2 HCART2 NONE - - - 3* 311192672 ACTIVE
W383L2 HCART2 NONE - - - 3* 654227040 ACTIVE
W385L2 HCART2 TLD 0 7 - 5 185643168 ACTIVE
W387L2 HCART2 TLD 0 5 - 3* 248176512 ACTIVE
W388L2 HCART2 NONE - - - 3* 669421792 ACTIVE
W389L2 HCART2 NONE - - - 3* 446785728 ACTIVE
W390L2 HCART2 NONE - - - 5 126568096 ACTIVE
W333L2 HCART2 TLD 0 6 - 3* 787991136 FULL
W386L2 HCART2 TLD 0 3 - 3* 763563840 FULL
<output of bpmedialist -p OFFISTE>
root@SUBCTUXS7:logs[555]# bpmedialist -p OFFSITE | grep -v ^\$
Server Host = SUBCTUXS7
id rl images allocated last updated density kbytes restores
vimages expiration last read <------- STATUS ------->
--------------------------------------------------------------------------------
W318L2 3* 48 10/26/2008 03:39 01/18/2009 05:03 hcart2 561934528 0
1 02/18/2009 05:03 N/A
W333L2 3* 68 09/28/2008 03:39 12/21/2008 19:31 hcart2 787991136 0
1 02/21/2009 04:01 N/A FULL
W349L2 3* 57 12/21/2008 22:01 02/11/2009 20:41 hcart2 381757088 0
3 04/01/2009 00:32 N/A
W383L2 3* 66 11/30/2008 03:30 01/12/2009 00:33 hcart2 654227040 0
3 04/15/2009 00:33 N/A
W386L2 3* 71 11/16/2008 03:30 12/28/2008 11:32 hcart2 763563840 0
3 02/28/2009 04:02 N/A FULL
W388L2 3* 73 12/07/2008 22:03 01/26/2009 00:33 hcart2 669421792 0
35 04/29/2009 00:33 N/A
W389L2 3* 58 12/07/2008 22:01 01/19/2009 00:33 hcart2 446785728 0
33 04/22/2009 00:33 N/A
Server Host = subctux02
id rl images allocated last updated density kbytes restores
vimages expiration last read <------- STATUS ------->
--------------------------------------------------------------------------------
W381L2 3 127 09/30/2008 19:09 02/12/2009 19:07 hcart2 143630944 0
32 03/15/2009 19:07 N/A
Server Host = vapps98a
id rl images allocated last updated density kbytes restores
vimages expiration last read <------- STATUS ------->
--------------------------------------------------------------------------------
W351L2 5 2 01/18/2009 00:13 01/25/2009 00:13 hcart2 130040320 0
2 04/28/2009 00:13 N/A
W382L2 3* 5 09/28/2008 00:19 02/01/2009 00:13 hcart2 311192672 0
3 05/05/2009 00:13 N/A
W385L2 5 7 11/16/2008 00:10 02/11/2009 19:37 hcart2 185643168 0
7 05/15/2009 19:37 N/A
W387L2 3* 4 10/12/2008 00:19 02/08/2009 00:14 hcart2 248176512 0
2 05/12/2009 00:14 N/A
W390L2 5 2 11/30/2008 00:10 01/11/2009 00:13 hcart2 126568096 0
2 04/14/2009 00:13 N/A
02-13-2009 12:56 AM
Hi Omar,
here is the output of bppllist
seems like the schedule is not overwriting that of the policy's default vol pool.
Thanks
root@SUBCTUXS7:logs[556]# bppllist TEST_SUBCTUXS1_FS -U
------------------------------------------------------------
Policy Name: TEST_SUBCTUXS1_FS
Policy Type: Standard
Active: yes
Effective date: 07/23/2005 19:36:17
Client Compress: no
Follow NFS Mounts: no
Cross Mount Points: no
Collect TIR info: no
Block Incremental: no
Mult. Data Streams: no
Client Encrypt: no
Checkpoint: no
Policy Priority: 0
Max Jobs/Policy: Unlimited
Disaster Recovery: 0
Residence: SUBCTUXS1-hcart2-robot-tld-0
Volume Pool: OFFSITE
Keyword: (none specified)
HW/OS/Client: Solaris Solaris10 SUBCTUXS1
Include: /tmp
Schedule: test
Type: Full Backup
Frequency: every 1 day
Maximum MPX: 1
Synthetic: 0
PFI Recovery: 0
Retention Level: 0 (1 week)
Number Copies: 1
Fail on Error: 0
Residence: (specific storage unit not required)
Volume Pool: (same as policy volume pool)
Daily Windows:
02-13-2009 05:34 AM
Horly,
The OFFSITE pool has six media actually in the library, each are own by other Media Servers (not the desired Media Server "SUBCTUXS1"):
Server Host = SUBCTUXS7
W333L2 HCART2 TLD 0 6 - 3* 787991136 FULL
W349L2 HCART2 TLD 0 4 - 3* 381757088 ACTIVE
W386L2 HCART2 TLD 0 3 - 3* 763563840 FULL
Server Host = subctux02
W381L2 HCART2 TLD 0 9 - 3 143630944 ACTIVE
Server Host = vapps98a
W385L2 HCART2 TLD 0 7 - 5 185643168 ACTIVE
W387L2 HCART2 TLD 0 5 - 3* 248176512 ACTIVE
So, it cannot use the above media.
You can expire all of the images associated with one of the media id's if you wish using the "bpexpdate" command.
cd /usr/openv/netbackup/bin/admincmd
./bpexpdate -m <media_id> -d 0 -host <Media_server>
(need to answer "y" if you really want to get rid of all the images on it)
Most of the media are not due to expire for a while. Media id "W318L2" would expire it's last image next Wednesday (2/18), but it is currently not in the robitc library. So, not seeing the rest of the reports, you really need to add some more media or keep the images for a shorter time.Additionally, it looks like you are retaining multiple retention levels on the same media, which can cause you to use more media.
Hope this helps.
02-13-2009 05:39 AM
Sun support is crap ?
I am sorry to hear that. I was always impressed with StorageTEK support of Netbackup. Maybe things have changed since their merger.
03-03-2009 07:37 PM
hi Guys
I think this problem is actually a USER problem i.e me.
Thanks for the suggestions from SunBird and Stumpr , i have consolidated my
media/volume under 1 media server i.e subctuxs7. And unfreezed some media from other
volum pool and moved it to this volume pool. 'OFFSITE'
And configured all the my policy to use the same media server i.e subctuxs7.
The backup is now running fine.
Thanks for the help once again.
Its actually a relatively simply problem really. Cannot understand why sun veritas support cannot help, since I have provided them with pretty much the same info.
03-04-2009 01:06 PM
I was not impressed with them when I had them.
I was much happier when I got to go directly to Symantec support.