Forum Discussion

Ramiro-Magan's avatar
9 years ago

RMAN restore troubleshooting.

Hi guys, I am having a bit of a problem doing an RMAN restore I can not seem to find a workaround.

So you get an idea, I have the following configuration.

Master Server - Windows 2008 R2 with Netbackup 7.6.2

Media Server - windows 2003 with Netbackup 7.5.0.7

Client Solaris 10

 

So, I took a backup for an Oracle server on the media server, (it is a remote site) grabed the tapes and loaded them on the master server, did my catalog, everything looks pretty.

Now when I try to do the restore, I am using a new environment, and get the following error in my logs.

24/02/2016 12:43:39 - begin Restore
24/02/2016 12:43:40 - Info bprd(pid=11800) Found (1) files in (1) images for Restore Job ID 670624.xxx  
24/02/2016 12:43:40 - 1 images required
24/02/2016 12:43:40 - media OVA383 required
24/02/2016 12:43:41 - restoring image ar-gecs21_1456047404
24/02/2016 12:43:42 - Error bpbrm(pid=4000) bpcd on cfhq004so0368 exited with status 48: client hostname could not be found
24/02/2016 12:43:42 - Info bpbrm(pid=4000) telling media manager to start restore on client     
24/02/2016 12:43:44 - Warning bptm(pid=7580) failure logging message to client cfhq004so0368 in log /usr/openv/netbackup/logs/user_ops/dbext/logs/27976.0.1456328618:  client hostname could not be found (48)
24/02/2016 12:43:45 - Error bpbrm(pid=5572) bpcd on cfhq004so0368 exited with status 48: client hostname could not be found
24/02/2016 12:43:45 - requesting resource OVA383
24/02/2016 12:43:45 - awaiting resource OVA383 A pending request has been generated for this resource request.
     Operator action may be required. Pending Action: No action.,
     Media ID: OVA383, Barcode: OVA383L4, Density: hcart, Access Mode: Read,
     Action Drive Name: N/A, Action Media Server: myarsw00200bk01, Robot Number: N/A, Robot Type: NONE,
     Volume Group: 000_00000_TLD, Action Acs: N/A, Action Lsm: N/A
    
24/02/2016 12:43:45 - Info bpbrm(pid=5572) listening for client connection         
24/02/2016 12:43:45 - Error bpbrm(pid=5572) bpcd on cfhq004so0368 exited with status 48: client hostname could not be found
24/02/2016 12:43:45 - Error bpbrm(pid=5572) listen for client protocol error - couldn't write necessary information on /usr/openv/netbackup/logs/user_ops/dbext/logs/27976.0.1456328618 
24/02/2016 12:43:46 - Info bpbrm(pid=4000) child done, status 25         
24/02/2016 12:43:46 - Info bpbrm(pid=4000) sending message to media manager: STOP RESTORE ar-gecs21_1456047404     
24/02/2016 12:43:49 - awaiting resource OVA383 A pending request has been generated for this resource request.
     Operator action may be required. Pending Action: No action.,
     Media ID: OVA383, Barcode: OVA383L4, Density: hcart, Access Mode: Read,
     Action Drive Name: N/A, Action Media Server: myarsw00200bk01, Robot Number: N/A, Robot Type: NONE,
     Volume Group: 000_00000_TLD, Action Acs: N/A, Action Lsm: N/A
    
24/02/2016 12:44:44 - Info bpbrm(pid=4000) media manager for backup id ar-gecs21_1456047404 exited with status 150: termination requested by administrator
24/02/2016 12:44:45 - restored image ar-gecs21_1456047404 - (cannot connect on socket(25)); restore time 0:01:04
24/02/2016 12:44:45 - end Restore; elapsed time: 0:01:06
Restore error (2850)

 

---------------------------------

Among other things, what cought my eye, is that it seems to be trying to load the tape in media server myarsw00200bk01, which is not the server the tapes are loaded now, it should be myarsw00403bk01.

 

If you could point me in the correct direction I would appreciate it, as I am lost, having tried all I could thing of.

 

Thanks.

  • If I understand correctly, the source client and source media server and source client used for the backup was myarsw002..? And now you want to use media server myarsw004... to restore to client cfhq004..? Can we assume that you have taken all the steps (altnames files on master server) to enable redirected restore? You also need to tell NBU to use another media server for the restore. The default for tape restore is to send restore task to media server that performed the backup. There are 2 methods to specify another media server to do the restore : 1) Host Properties - > Master - > Universal settings -> Media Host override Put source media server in 1st field, destination media server in 2nd field. 2) change media ownership with this command on the master server : bpmedia -movedb -m ( media-id ) -oldserver (oldserver) -newserver (newserver) Double-check forward and reverse name lookup between destination media server and destination client as well as port connectivity.

4 Replies

  • I assume by "catalog" you mean "import"?

    The media server that did the import will own the media and so should be the one doing the restore so it should be in that servers library.

    When doing alternate RMAN restores there are lots of things to get right .. a little light reading for some ideas:

    http://www.veritas.com/docs/000065979
    http://www.veritas.com/docs/000032618
    https://www.veritas.com/community/forums/oracle-rman-restore-alternate-client-different-netbackup-domain

     

  • Mark, I meant I did an inventory of the library for the master server.

    Fron the articles you shared, I found that at some point in time, an altname was configured, I re configured it to point in the correct direction, but I am still out of luck.

    In fact, I am getting a new error.

     

    24/02/2016 14:07:33 - begin Restore
    24/02/2016 14:07:34 - Info bprd(pid=11860) Found (1) files in (1) images for Restore Job ID 670685.xxx  
    24/02/2016 14:07:35 - 1 images required
    24/02/2016 14:07:35 - media OVA383 required
    24/02/2016 14:07:36 - restoring image ar-gecs21_1456047404
    24/02/2016 14:12:38 - Info bpbrm(pid=2800) connect failed STATUS (18) CONNECT_FAILED        
    24/02/2016 14:12:38 - Info bpbrm(pid=2800)     status: FAILED, (44) CONNECT_TIMEOUT; system: (10036) A blocking operation is currently executing. ; FROM 0.0.0.0 TO cfhq004so0368 172.24.1.217 bpcd VIA pbx
    24/02/2016 14:12:38 - Info bpbrm(pid=2800)     status: FAILED, (44) CONNECT_TIMEOUT; system: (10060) Connection timed out.; FROM 0.0.0.0 TO cfhq004so0368 172.24.1.217 bpcd VIA vnetd
    24/02/2016 14:12:38 - Info bpbrm(pid=2800)     status: FAILED, (44) CONNECT_TIMEOUT; system: (10060) Connection timed out.; FROM 0.0.0.0 TO cfhq004so0368 172.24.1.217 bpcd
    24/02/2016 14:12:38 - Error bpbrm(pid=2800) Cannot connect to cfhq004so0368         
    24/02/2016 14:12:38 - Info bpbrm(pid=2800) telling media manager to start restore on client     
    24/02/2016 14:16:20 - Warning bptm(pid=5472) failure logging message to client cfhq004so0368 in log /usr/openv/netbackup/logs/user_ops/dbext/logs/29712.0.1456333653:  cannot connect on socket (25)
    24/02/2016 14:16:21 - requesting resource OVA383
    24/02/2016 14:16:21 - awaiting resource OVA383 A pending request has been generated for this resource request.
         Operator action may be required. Pending Action: No action.,
         Media ID: OVA383, Barcode: OVA383L4, Density: hcart, Access Mode: Read,
         Action Drive Name: N/A, Action Media Server: myarsw00200bk01, Robot Number: N/A, Robot Type: NONE,
         Volume Group: 000_00000_TLD, Action Acs: N/A, Action Lsm: N/A
        
    24/02/2016 14:16:41 - awaiting resource OVA383 A pending request has been generated for this resource request.
         Operator action may be required. Pending Action: No action.,
         Media ID: OVA383, Barcode: OVA383L4, Density: hcart, Access Mode: Read,
         Action Drive Name: N/A, Action Media Server: myarsw00200bk01, Robot Number: N/A, Robot Type: NONE,
         Volume Group: 000_00000_TLD, Action Acs: N/A, Action Lsm: N/A
        
    24/02/2016 14:17:41 - Info bpbrm(pid=7304) connect failed STATUS (18) CONNECT_FAILED        
    24/02/2016 14:17:41 - Info bpbrm(pid=7304)     status: FAILED, (44) CONNECT_TIMEOUT; system: (10036) A blocking operation is currently executing. ; FROM 0.0.0.0 TO cfhq004so0368 172.24.1.217 bpcd VIA pbx
    24/02/2016 14:17:41 - Info bpbrm(pid=7304)     status: FAILED, (44) CONNECT_TIMEOUT; system: (10060) Connection timed out.; FROM 0.0.0.0 TO cfhq004so0368 172.24.1.217 bpcd VIA vnetd
    24/02/2016 14:17:41 - Info bpbrm(pid=7304)     status: FAILED, (44) CONNECT_TIMEOUT; system: (10036) A blocking operation is currently executing. ; FROM 0.0.0.0 TO cfhq004so0368 172.24.1.217 bpcd
    24/02/2016 14:17:41 - Error bpbrm(pid=7304) Cannot connect to cfhq004so0368         
    24/02/2016 14:17:41 - Info bpbrm(pid=7304) listening for client connection         

     

    --------------------------

    What is still getting my eye is the fact that it is looking for a tape in the media server, while it is in the master server.

  • If I understand correctly, the source client and source media server and source client used for the backup was myarsw002..? And now you want to use media server myarsw004... to restore to client cfhq004..? Can we assume that you have taken all the steps (altnames files on master server) to enable redirected restore? You also need to tell NBU to use another media server for the restore. The default for tape restore is to send restore task to media server that performed the backup. There are 2 methods to specify another media server to do the restore : 1) Host Properties - > Master - > Universal settings -> Media Host override Put source media server in 1st field, destination media server in 2nd field. 2) change media ownership with this command on the master server : bpmedia -movedb -m ( media-id ) -oldserver (oldserver) -newserver (newserver) Double-check forward and reverse name lookup between destination media server and destination client as well as port connectivity.
  • Thanks Marianne, you rock!

    I totally missed the media host override. 

    Started working right away after that.