Forum Discussion

Kisad's avatar
Kisad
Level 5
12 years ago

SLP's failing with 84

I have several SLP jobs failing with error code 84, the job type is replication on the Activity monitor.

I checked the SLP's and they are configured in this manner

SLP Name->MediaServer01

Backup(Operation) -> MediaServer01(Storage) ----> Duplication(Operation) -- Monthly (Volume Pool) ->MediaServer01_TLD0 (Storage)

Here's the details from the activity monitor Job details.

10/11/2013 10:34:14 - requesting resource LCM_MediaServer01
10/11/2013 10:34:15 - granted resource  LCM_MediaServer01
10/11/2013 10:34:15 - started process RUNCMD (pid=1712)
10/11/2013 10:34:16 - requesting resource @aaaan
10/11/2013 10:34:16 - reserving resource @aaaan
10/11/2013 10:34:16 - resource @aaaan reserved
10/11/2013 10:34:16 - granted resource  MediaID=@aaaan;DiskVolume=PureDiskVolume;DiskPool=MediaServer01_onsite;Path=PureDiskVolume;StorageServer=MediaServer01;MediaServer=MediaServer01
10/11/2013 10:34:30 - Info bpdm (pid=1616) started
10/11/2013 10:34:30 - started process bpdm (pid=1616)
10/11/2013 10:34:48 - Info MediaServer01 (pid=1616) Using OpenStorage to replicate backup id clientmachine, media id @aaaan, storage server MediaServer01, disk volume PureDiskVolume
10/11/2013 10:34:48 - Info MediaServer01 (pid=1616) Replicating images to target storage server MediaServer02-offsitelocation, disk volume PureDiskVolume
10/11/2013 10:35:02 - Error  (pid=1712) ReplicationJob::Replicate: Replication failed for backup id clientmachine: media write error (84)
10/11/2013 10:35:02 - Replicate failed for backup id clientmachine with status 84
10/11/2013 10:35:04 - requesting resource @aaaan
10/11/2013 10:35:04 - granted resource  MediaID=@aaaan;DiskVolume=PureDiskVolume;DiskPool=MediaServer01;Path=PureDiskVolume;StorageServer=MediaServer01;MediaServer=MediaServer01
10/11/2013 10:35:11 - Critical bpdm (pid=1616) Storage Server Error: (Storage server: PureDisk:MediaServer01) CALaunchAIRReplicate: Failed to complete launchAIRReplicate webservice (Could not setup replication: get Remote SPAId webservice failed, could not determine whether target is PDDE or PDDO (connection actively refused) ) V-454-61
10/11/2013 10:35:11 - Error bpdm (pid=1616) <async> copy image failed: error 2060023: server is shut down
10/11/2013 10:35:11 - Error bpdm (pid=1616) copy failed: error 2060023
10/11/2013 10:35:11 - Error bpdm (pid=1616) <async> cancel failed: error 2060001: one or more invalid arguments
10/11/2013 10:35:11 - Error bpdm (pid=1616) copy cancel failed: error 2060001
10/11/2013 10:35:11 - Info MediaServer01 (pid=1616) StorageServer=PureDisk:MediaServer01; Report=PDDO Stats for (MediaServer01): scanned: 4 KB, CR sent: 0 KB, CR sent over FC: 0 KB, dedup: 100.0%

 

 

Netbackup 7.5.0.6 running on Windows 2008 R2 Enterprise SP1 

 

Thanks

5 Replies

  • 10/11/2013 10:34:16 - granted resource  MediaID=@aaaan;DiskVolume=PureDiskVolume;DiskPool=MediaServer01_onsite;Path=PureDiskVolume;StorageServer=MediaServer01;MediaServer=MediaServer01
    10/11/2013 10:34:30 - Info bpdm (pid=1616) started
     
    10/11/2013 10:35:04 - granted resource  MediaID=@aaaan;DiskVolume=PureDiskVolume;DiskPool=MediaServer01;Path=PureDiskVolume;StorageServer=MediaServer01;MediaServer=MediaServer01
    10/11/2013 10:35:11 - Critical bpdm (pid=1616) Storage Server Error: (Storage server: PureDisk:MediaServer01) CALaunchAIRReplicate: Failed to complete launchAIRReplicate webservice (Could not setup replication: get Remote SPAId webservice failed, could not determine whether target is PDDE or PDDO (connection actively refused) ) V-454-61
     
     
    both source and destinations are granted from the pure disk Volume..  its not using the tape storage unit as you specified in SLP.

     

  • Seems this is AIR replication to remote media server with different master server.

    Please check that initial config was done correctly.

  • Thanks Nagalla. I did notice that both source and destination are pure disk volume but when i checked the SLP i found tape storage unite listed in the duplication. At this point i wasnt sure what was going on. I'm wondering if SLP has tape then why is the job failure showing both as pure disk.

  • Thanks Marianne. I have very little knowledge of AIR. If i may ask ,how did you know this is AIR replication?

    On the souce netbackup master, the media server is listed in the storage servers and the disk pool is up and free space on it. 

    On the offiste netbackup master, the offsite media server is also listed in the storage servers and the disk pool too is up and has free space on it.

    Could you please give me pointers to what other things that i shoud check from netbackup prespective.

     

  • It seems to have conflicting message here.

    The SLP opertaions defines a "duplication" from a disk storage (PD) to tape storage (TLD). It is assumed you don't have any "replication" configured in the SLP.

    However you said the job was a replication type, and if that was a AIR replication (we can see the error message stating  CALaunchAIRReplicate: Failed to complete launchAIRReplicate webservice  ,that looks like AIR. 

    AIR is Auto Image Replication, where images can be replication from one master server to another (master server), configuration requires MSDP setup on both master server using SLP. In source SLP, you need a "replication" operation, and in target SLP, you need an "import" operation. If you don't have that setup,  you don't have AIR.

    We may need to look at your actual SLP configration, "nbstl -L" output.