cancel
Showing results for 
Search instead for 
Did you mean: 

SUN Netbkup Support is Crap ! Need help on why policy is using the wrong volume pool .

horly
Level 3

Hi All,

A netbackup newbie here.

The SUN veritas netbackup support is crap, so as a last resort i'm turning to this Forum .

 

I think that my netbackup is writing ot the wrong volume pool

 

The problematic client is subctuxs1 and the policy name is TEST_SUBCTUXS1_FS in this case.

Its is writing to the 'SUBCTUXS1' volume pool instead of to the 'OFFSITE' volume pool,

because W325,W338L2,W302L2,....

are from the 'SUBCTUXS1' volume pool.

 

See below for the volume group info, configuration details of

the TEST_SUBCTUXS1_FS policy and the error logs from bptm/log.xxxxx

 

 

Any help or suggestions would be VERY much appreciated. 

 

Rgds

Henley

 

root@SUBCTUXS7:netbackup[562]# bpmedialist -p SUBCTUXS1 | grep ^W3

W301L2   3    246   02/22/2008 01:03  03/05/2008 01:28  hcart2   386381248     0
W302L2   3*   104   02/07/2009 01:25  02/12/2009 03:07  hcart2   633243328     0
W325L2   3    298   01/15/2009 03:03  01/23/2009 03:03  hcart2   853282112     0
W338L2   3    101   01/28/2009 01:16  02/07/2009 01:25  hcart2   902444224     0
W352L2   3    142   01/21/2009 01:09  02/12/2009 01:25  hcart2   537587136     0
W371L2   3    766   10/04/2008 03:09  02/04/2009 01:25  hcart2   548301120     0
W384L2   3    328   12/10/2008 01:13  01/15/2009 03:03  hcart2   944233504     0
W300L2   5      3   01/03/2009 13:58  01/16/2009 18:46  hcart2   188601536     2

 

But this policy is supposed to use the 'OFFSITE'  volume pool

root@SUBCTUXS7:netbackup[565]# tail db/class/TEST_SUBCTUXS1_FS/info
CHKPT_INTERVAL 0
ENABLE_PFI 0
OFFHOST_BACKUP 0
USE_ALT_CLIENT 0
USE_DATA_MOVER 0
DATA_MOVER_TYPE 2
CLASS_ID FB820DBC1DD111B2B0890003BACA00B5
RESIDENCE SUBCTUXS1-hcart2-robot-tld-0 *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL*
POOL OFFSITE NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup
#VMD5_DIGEST=9e 6e cd 35 d5 f8 51 e1 11 00 96 b3 86 c3 a5 6c

 

 

the bptm log file  shows the following errors

 

19:28:12.251 [439] <2> select_media: added 57 media id's to list that matched robot number/type and media type

19:28:12.251 [439] <2> select_media: skipping media id W325L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W338L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W301L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W302L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W302L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W352L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W352L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: skipping media id W384L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.252 [439] <2> select_media: consider allowing retention level 0 on media (W371L2) that is 3 since its enabled by NetBackup configuration
19:28:12.252 [439] <2> select_media: skipping media id W371L2, it is not in correct storage unit or volume pool
19:28:12.252 [439] <2> select_media: getting new media id for retention level 0
19:28:12.252 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=vmd address=10.39.142.136 protocol=tcp non-reserved port=13701
19:28:12.252 [439] <2> nb_bind_on_port_addr: bound to port 62467
19:28:12.254 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.254 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.357 [439] <2> vmdb_query_scratch_bypool2: server returned:  EXIT_STATUS 35
19:28:12.357 [439] <2> string_to_record: read response:  EXIT_STATUS 35
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: unable to unpack string from server: volume does not exist in database (35)
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: query_scratchbypool2 request status:  volume does not exist in database (35)
19:28:12.358 [439] <2> set_job_details: Sending Tfile jobid (98543)
19:28:12.358 [439] <2> set_job_details: LOG 1234351692 16 bptm 439 Media Manager volume pool OFFSITE has no more unassigned media in robotic device TL
D(0)

19:28:12.358 [439] <2> set_job_details: Done
19:28:12.358 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=bpdbm address=10.39.142.136 protocol=tcp non-reserved port=13721
19:28:12.359 [439] <2> nb_bind_on_port_addr: bound to port 62468
19:28:12.359 [439] <2> logconnections: BPDBM CONNECT FROM 10.39.142.130.62468 TO 10.39.142.136.13721
19:28:12.360 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.361 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.361 [439] <2> check_authentication: no authentication required
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.432 [439] <16> select_media: Media Manager volume pool OFFSITE has no more unassigned media in robotic device TLD(0)
19:28:12.432 [439] <2> bptm: EXITING with status 96 <----------
19:28:16.578 [443] <2> bptm: INITIATING (VERBOSE = 5): -count -cmd -rt 8 -rn 0 -stunit SUBCTUXS1-hcart2-robot-tld-0 -den 14 -mt 2 -masterversion 51000
0

 

 

 root@SUBCTUXS7:netbackup[562]# bpmedialist -p SUBCTUXS1 | grep ^W3
W301L2   3    246   02/22/2008 01:03  03/05/2008 01:28  hcart2   386381248     0
W302L2   3*   104   02/07/2009 01:25  02/12/2009 03:07  hcart2   633243328     0
W325L2   3    298   01/15/2009 03:03  01/23/2009 03:03  hcart2   853282112     0
W338L2   3    101   01/28/2009 01:16  02/07/2009 01:25  hcart2   902444224     0
W352L2   3    142   01/21/2009 01:09  02/12/2009 01:25  hcart2   537587136     0
W371L2   3    766   10/04/2008 03:09  02/04/2009 01:25  hcart2   548301120     0
W384L2   3    328   12/10/2008 01:13  01/15/2009 03:03  hcart2   944233504     0
W300L2   5      3   01/03/2009 13:58  01/16/2009 18:46  hcart2   188601536     2

 

But this policy is supposed to use the 'OFFSITE'  volume pool

 

root@SUBCTUXS7:netbackup[565]# tail db/class/TEST_SUBCTUXS1_FS/info
CHKPT_INTERVAL 0
ENABLE_PFI 0
OFFHOST_BACKUP 0
USE_ALT_CLIENT 0
USE_DATA_MOVER 0
DATA_MOVER_TYPE 2
CLASS_ID FB820DBC1DD111B2B0890003BACA00B5
RESIDENCE SUBCTUXS1-hcart2-robot-tld-0 *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL* *NULL*
POOL OFFSITE NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup NetBackup
#VMD5_DIGEST=9e 6e cd 35 d5 f8 51 e1 11 00 96 b3 86 c3 a5 6c

 

 

the bptm log file  shows the following errors

 

19:28:12.251 [439] <2> select_media: added 57 media id's to list that matched robot number/type and media type

19:28:12.251 [439] <2> select_media: skipping media id W325L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W338L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: skipping media id W301L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W302L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W302L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: consider allowing retention level 0 on media (W352L2) that is 3 since its enabled by NetBackup configuration
19:28:12.251 [439] <2> select_media: skipping media id W352L2, it is not in correct storage unit or volume pool
19:28:12.251 [439] <2> select_media: skipping media id W384L2, either FULL or FROZE or SUSPENDED or IMPORTED
19:28:12.252 [439] <2> select_media: consider allowing retention level 0 on media (W371L2) that is 3 since its enabled by NetBackup configuration
19:28:12.252 [439] <2> select_media: skipping media id W371L2, it is not in correct storage unit or volume pool
19:28:12.252 [439] <2> select_media: getting new media id for retention level 0
19:28:12.252 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=vmd address=10.39.142.136 protocol=tcp non-reserved port=13701
19:28:12.252 [439] <2> nb_bind_on_port_addr: bound to port 62467
19:28:12.254 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.254 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.254 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.357 [439] <2> vmdb_query_scratch_bypool2: server returned:  EXIT_STATUS 35
19:28:12.357 [439] <2> string_to_record: read response:  EXIT_STATUS 35
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: unable to unpack string from server: volume does not exist in database (35)
19:28:12.357 [439] <16> vmdb_query_scratch_bypool2: query_scratchbypool2 request status:  volume does not exist in database (35)
19:28:12.358 [439] <2> set_job_details: Sending Tfile jobid (98543)
19:28:12.358 [439] <2> set_job_details: LOG 1234351692 16 bptm 439 Media Manager volume pool OFFSITE has no more unassigned media in robotic device TL
D(0)

19:28:12.358 [439] <2> set_job_details: Done
19:28:12.358 [439] <2> nb_getsockconnected: host=SUBCTUXS7 service=bpdbm address=10.39.142.136 protocol=tcp non-reserved port=13721
19:28:12.359 [439] <2> nb_bind_on_port_addr: bound to port 62468
19:28:12.359 [439] <2> logconnections: BPDBM CONNECT FROM 10.39.142.130.62468 TO 10.39.142.136.13721
19:28:12.360 [439] <2> vauth_authentication_required: vauth_comm.c.743: no methods for address: no authentication required
19:28:12.361 [439] <2> vauth_connector: vauth_comm.c.177: no methods for address: no authentication required
19:28:12.361 [439] <2> check_authentication: no authentication required
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.774: Ignoring VxSS authentication: 2 0x00000002
19:28:12.361 [439] <2> vnet_check_vxss_client_magic_with_info: vnet_vxss_helper.c.930: Not using VxSS authentication: 2 0x00000002
19:28:12.432 [439] <16> select_media: Media Manager volume pool OFFSITE has no more unassigned media in robotic device TLD(0)
19:28:12.432 [439] <2> bptm: EXITING with status 96 <----------
19:28:16.578 [443] <2> bptm: INITIATING (VERBOSE = 5): -count -cmd -rt 8 -rn 0 -stunit SUBCTUXS1-hcart2-robot-tld-0 -den 14 -mt 2 -masterversion 51000
0

 

10 REPLIES 10

Nathan_Kippen
Level 6
Certified

Schedules can overwrite the default volume pool and storage unit for the policy.

 

Double check how your backup schedule is setup.

J_H_Is_gone
Level 6

19:28:12.358 [439] <2> set_job_details: LOG 1234351692 16 bptm 439 Media Manager volume pool OFFSITE has no more unassigned media in robotic device TL

 

 

looking at your logs all the tape in volume pool got disqualified because they were:

19:28:12.251 [439] <2> select_media: skipping media id W301L2, either FULL or FROZE or SUSPENDED or IMPORTED

or

19:28:12.252 [439] <2> select_media: skipping media id W371L2, it is not in correct storage unit or volume pool

 

1) check the the tapes to see if they are frozen or imported or suspened and clean that up to make the avail

2) the tapes in the pool show to be hcart2 - check the storage unit you have the policy set up to use and verify that storage unit is using drives set at hcart2

Omar_Villa
Level 6
Employee

Can you paste the bppllist <policy name> -U  ?

 

This will tell us if you have any different pool under your schedules also under your ITC config if you have one.

 

regards.

sunbird
Level 4

horly,

 

The "bptm" log is just listing media that it is considering for the backup, it disregards each of them and then goes to the Scratch pool to find an AVAILABLE media. It finds no scratch pool , or no Scratch pool in the robot, and fails with the "status code 96".

 

Run the "available_media" script. Look in your "OFFSITE" pool for media being the robot (robot type will be TLD) and having the status "AVAILABLE" on the far right column.

 

The schedule is looking at the right pool, the pool just does not have any "AVAIALBLE" media in the robot.

 

Hope this helps, or moves you along further.

 

Let me know.

horly
Level 3

hi sunbird, seems like you are right. The output from the available_media script tells me

that none of the media in the OFFSITE pool is avail.  Why can it use an 'ACTIVE' media ? 

is it because it 'claimed' by another client? The  client for this failed policy is 'subctuxs1' and

I dont see  this server in the server host for the bpmedialist -p OFFSITE output.

 

output of "bpmedialist -p OFFSITE" is included as well. 

 

If this is the case, can i recycle the 2 FULL media in the OFFSITE pool?

If so how can i do that ? i.e What is the command to do that ?

 

Many thanks for all the response  submitted .

  

<output of available_media>

 

OFFSITE pool

W318L2  HCART2   NONE     -       -      -       3*    561934528        ACTIVE
W349L2  HCART2   TLD      0       4      -       3*    381757088        ACTIVE
W351L2  HCART2   NONE     -       -      -       5     130040320        ACTIVE
W381L2  HCART2   TLD      0       9      -       3     143630944        ACTIVE
W382L2  HCART2   NONE     -       -      -       3*    311192672        ACTIVE
W383L2  HCART2   NONE     -       -      -       3*    654227040        ACTIVE
W385L2  HCART2   TLD      0       7      -       5     185643168        ACTIVE
W387L2  HCART2   TLD      0       5      -       3*    248176512        ACTIVE
W388L2  HCART2   NONE     -       -      -       3*    669421792        ACTIVE
W389L2  HCART2   NONE     -       -      -       3*    446785728        ACTIVE
W390L2  HCART2   NONE     -       -      -       5     126568096        ACTIVE
W333L2  HCART2   TLD      0       6      -       3*    787991136        FULL
W386L2  HCART2   TLD      0       3      -       3*    763563840        FULL

 

<output of bpmedialist -p OFFISTE>

 

 root@SUBCTUXS7:logs[555]# bpmedialist -p OFFSITE | grep -v ^\$
Server Host = SUBCTUXS7
 id     rl  images   allocated        last updated      density  kbytes restores
           vimages   expiration       last read         <------- STATUS ------->
--------------------------------------------------------------------------------
W318L2   3*    48   10/26/2008 03:39  01/18/2009 05:03  hcart2   561934528     0
                1   02/18/2009 05:03        N/A       
W333L2   3*    68   09/28/2008 03:39  12/21/2008 19:31  hcart2   787991136     0
                1   02/21/2009 04:01        N/A           FULL
W349L2   3*    57   12/21/2008 22:01  02/11/2009 20:41  hcart2   381757088     0
                3   04/01/2009 00:32        N/A       
W383L2   3*    66   11/30/2008 03:30  01/12/2009 00:33  hcart2   654227040     0
                3   04/15/2009 00:33        N/A       
W386L2   3*    71   11/16/2008 03:30  12/28/2008 11:32  hcart2   763563840     0
                3   02/28/2009 04:02        N/A           FULL
W388L2   3*    73   12/07/2008 22:03  01/26/2009 00:33  hcart2   669421792     0
               35   04/29/2009 00:33        N/A       
W389L2   3*    58   12/07/2008 22:01  01/19/2009 00:33  hcart2   446785728     0
               33   04/22/2009 00:33        N/A       
Server Host = subctux02
 id     rl  images   allocated        last updated      density  kbytes restores
           vimages   expiration       last read         <------- STATUS ------->
--------------------------------------------------------------------------------
W381L2   3    127   09/30/2008 19:09  02/12/2009 19:07  hcart2   143630944     0
               32   03/15/2009 19:07        N/A       
Server Host = vapps98a
 id     rl  images   allocated        last updated      density  kbytes restores
           vimages   expiration       last read         <------- STATUS ------->
--------------------------------------------------------------------------------
W351L2   5      2   01/18/2009 00:13  01/25/2009 00:13  hcart2   130040320     0
                2   04/28/2009 00:13        N/A       
W382L2   3*     5   09/28/2008 00:19  02/01/2009 00:13  hcart2   311192672     0
                3   05/05/2009 00:13        N/A       
W385L2   5      7   11/16/2008 00:10  02/11/2009 19:37  hcart2   185643168     0
                7   05/15/2009 19:37        N/A       
W387L2   3*     4   10/12/2008 00:19  02/08/2009 00:14  hcart2   248176512     0
                2   05/12/2009 00:14        N/A       
W390L2   5      2   11/30/2008 00:10  01/11/2009 00:13  hcart2   126568096     0
                2   04/14/2009 00:13        N/A       

horly
Level 3

Hi Omar,

here is the output of bppllist

 

seems like  the schedule is not overwriting that of the policy's default vol pool.

 

Thanks

 

 root@SUBCTUXS7:logs[556]# bppllist TEST_SUBCTUXS1_FS -U
------------------------------------------------------------

Policy Name:       TEST_SUBCTUXS1_FS

  Policy Type:         Standard
  Active:              yes
  Effective date:      07/23/2005 19:36:17
  Client Compress:     no
  Follow NFS Mounts:   no
  Cross Mount Points:  no
  Collect TIR info:    no
  Block Incremental:   no
  Mult. Data Streams:  no
  Client Encrypt:      no
  Checkpoint:          no
  Policy Priority:     0
  Max Jobs/Policy:     Unlimited
  Disaster Recovery:   0
  Residence:           SUBCTUXS1-hcart2-robot-tld-0
  Volume Pool:         OFFSITE
  Keyword:             (none specified)

  HW/OS/Client:  Solaris       Solaris10     SUBCTUXS1

  Include:  /tmp

  Schedule:          test
    Type:            Full Backup
    Frequency:       every 1 day
    Maximum MPX:     1
    Synthetic:       0
    PFI Recovery:    0
    Retention Level: 0 (1 week)
    Number Copies:   1
    Fail on Error:   0
    Residence:       (specific storage unit not required)
    Volume Pool:     (same as policy volume pool)
    Daily Windows:

 

sunbird
Level 4

Horly,

 

The OFFSITE pool has six media actually in the library, each are own by other Media Servers (not the desired Media Server "SUBCTUXS1"):

 

Server Host = SUBCTUXS7
W333L2  HCART2   TLD      0       6      -       3*    787991136        FULL
W349L2  HCART2   TLD      0       4      -       3*    381757088        ACTIVE
W386L2  HCART2   TLD      0       3      -       3*    763563840        FULL

Server Host = subctux02
W381L2  HCART2   TLD      0       9      -       3     143630944        ACTIVE

Server Host = vapps98a
W385L2  HCART2   TLD      0       7      -       5     185643168        ACTIVE
W387L2  HCART2   TLD      0       5      -       3*    248176512        ACTIVE
 

 

So, it cannot use the above media.

 

You can expire all of the images associated with one of the media id's if you wish using the "bpexpdate" command. 

 

cd  /usr/openv/netbackup/bin/admincmd

 

./bpexpdate  -m  <media_id>  -d  0  -host  <Media_server>

(need to answer "y" if you really want to get rid of all the images on it)

 

 

Most of the media are not due to expire for a while. Media id "W318L2" would expire it's last image next Wednesday (2/18), but it is currently not in the robitc library. So, not seeing the rest of the reports, you really need to add some more media or keep the images for a shorter time.Additionally, it looks like you are retaining multiple retention levels on the same media, which can cause you to use more media.

 

Hope this helps.

Stumpr2
Level 6

Sun support is crap ?

 

I am sorry to hear that. I was always impressed with StorageTEK support of Netbackup. Maybe things have changed since their merger.

horly
Level 3

hi Guys

I think  this problem is actually a USER problem i.e me.

Thanks for the suggestions  from SunBird and Stumpr , i have consolidated my

media/volume under 1 media server i.e subctuxs7.  And unfreezed some media from other

volum pool and moved it to this volume pool. 'OFFSITE'

And configured all the my policy to use the same media server i.e subctuxs7.

The backup is now running fine.

Thanks for the help once again.

Its actually a relatively simply problem really. Cannot understand why sun veritas support cannot help, since I have provided them with pretty much the same info.

 

 


J_H_Is_gone
Level 6

I was not impressed with them when I had them.

 

I was much happier when I got to go directly to Symantec support.