cancel
Showing results for 
Search instead for 
Did you mean: 

Netbackup AIR is failing with error 174

ShasRaj_UK
Level 3
Partner

Hi All,

 

Then replication jobs between prod site and the DR site is failed with error 174 .I checked the configuration all seems to be fine .

AT production site the replication jobs are created but at DR  , no import jobs are refelecting .

Previously it was working but since last week the Netbackup env. has this issue.

 

At Prod :

Master server : V 7.6.0.4 (Linux)

Media server : 2 x Netbackup Appliance  5230 (2.6.0.4)

 

At DR :

Master server : V 7.6.0.4 (Linux)

Media server : 2 x Netbackup Appliance  5230 (2.6.0.4)

Error in Job Details :

20/07/2015 10:35:21 - Info bpdm(pid=49089) started            
20/07/2015 10:35:21 - started process bpdm (49089)
20/07/2015 10:38:11 - Info lappsnbua01(pid=49089) Using OpenStorage to replicate backup id LAPPWSDCPDB01_1437247536, media id @aaaai, storage server lappsnbua02, disk volume PureDiskVolume
20/07/2015 10:38:11 - Info lappsnbua01(pid=49089) Replicating images to target storage server scopsnbua01, disk volume PureDiskVolume   
20/07/2015 11:21:48 - requesting resource LCM_*Remote*Master*
20/07/2015 11:21:52 - granted resource LCM_*Remote*Master*
20/07/2015 11:21:53 - Info nbreplicate(pid=2364) Suspend window close behavior is not supported for nbreplicate    
20/07/2015 11:21:53 - Info nbreplicate(pid=2364) window close behavior: Continue processing the current image     
20/07/2015 11:21:53 - started process RUNCMD (2364)
20/07/2015 11:22:13 - requesting resource @aaaai
20/07/2015 11:22:13 - reserving resource @aaaai
20/07/2015 11:22:22 - reserved resource @aaaai
20/07/2015 11:22:22 - granted resource MediaID=@aaaai;DiskVolume=PureDiskVolume;DiskPool=dp_lappsnbua02_dedupe;Path=PureDiskVolume;StorageServer=lappsnbua02;MediaServer=lappsnbua01
20/07/2015 12:31:52 - Critical bpdm(pid=49089) Storage Server Error: (Storage server: PureDisk:lappsnbua02) CALaunchAIRReplicate: Failed to complete launchAIRReplicate webservice (Could not setup replication: Load of remote config failed (connection reset by peer) ) V-454-61
20/07/2015 12:31:52 - Error bpdm(pid=49089) <async> copy image failed: error 2060019: error occurred on network socket  
20/07/2015 12:31:52 - Error bpdm(pid=49089) copy failed: error 174         
20/07/2015 12:31:52 - Error bpdm(pid=49089) <async> cancel failed: error 2060001: one or more invalid arguments   
20/07/2015 12:31:52 - Error bpdm(pid=49089) copy cancel failed: error 174        
20/07/2015 12:31:52 - Info lappsnbua01(pid=49089) StorageServer=PureDisk:lappsnbua02; Report=PDDO Stats for (lappsnbua02): scanned: 4 KB, CR sent: 0 KB, CR sent over FC: 0 KB, dedup: 100.0%, cache disabled
20/07/2015 13:18:56 - Error nbreplicate(pid=2364) ReplicationJob::Replicate: Replication failed for backup id LAPPWSDCPDB01_1437247536: media write error (84)  
20/07/2015 13:18:56Replicate failed for backup id LAPPWSDCPDB01_1437247536 with status 84
20/07/2015 13:18:56 - end operation
no images were successfully processed(191)

3 REPLIES 3

Marianne
Level 6
Partner    VIP    Accredited Certified

Have you verified that initial setup between the 2 masters was done correctly?

Have you checked firewall ports?

ShasRaj_UK
Level 3
Partner

Thanks Marianne for you post .

Yes the setup is fine .

On ports :

Yesterday I was checking 1556 , 10082 and 10102 and I found , at DR site there are no services to listen on 10082 or 10102.

Do I have to check some different ports than these three ? 

 

watsons
Level 6

Does it happen to all images, or just some large images, if it's just partial, could be a network timeout issue..

Check bpdm logs for the actual error, match with the time that it fails with "copy failed: error 174".