cancel
Showing results for 
Search instead for 
Did you mean: 

NDMP backup failure

cnx_anup
Level 4

Can somebody through light on how to troubleshoot for the log below :

Error logs,
========

7/15/2015 2:54:34 PM - mounting XXXXX
7/15/2015 2:55:20 PM - Info bptm(pid=26144) media id XXXXX mounted on drive index 1, drivepath {3,0,1,0}, drivename IBM.ULT3580-HH5.002, copy 1
7/15/2015 2:55:20 PM - mounted; mount time: 0:00:46
7/15/2015 2:55:20 PM - positioning XXXXX to file 6
7/15/2015 2:56:28 PM - positioned XXXXX; position time: 0:01:08
7/15/2015 2:56:28 PM - begin writing
7/15/2015 8:16:18 PM - Error bpbrm(pid=27544) db_FLISTsend failed: file read failed (13)       
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) write socket err 10054 (An existing connection was forcibly closed by the remote host.
.)
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) FH send to BRM failed - Ddi - 0 0 0 0 4701543 0 0 9 13680367 Thumbs.db
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) NDMP backup failed, path = /root_vdm_1/FS_XXXXXXXX_01/MPFS_XXXXXXXX_01/FS01_QTR010       
7/15/2015 8:16:19 PM - Info bpbrm(pid=27632) child done, status 13         
7/15/2015 8:16:19 PM - Info bpbrm(pid=27632) sending message to media manager: STOP BACKUP THBANVAPFL10_1436946871     
7/15/2015 8:16:48 PM - Error ndmpagent(pid=25552) XXXXXXXX: Medium error          
7/15/2015 8:16:48 PM - Warning ndmpagent(pid=25552) XXXXXXXX: Write failed on archive volume 1      
7/15/2015 8:16:48 PM - Info ndmpagent(pid=25552) XXXXXXXX: server_archive: emctar vol 1, 1025041 files, 0 bytes read, 595648118784 bytes written
7/15/2015 8:16:48 PM - Error ndmpagent(pid=25552) XXXXXXXX: Backup is aborted.         
7/15/2015 8:16:49 PM - Info bpbrm(pid=27632) media manager for backup id THBANVAPFL10_1436946871 exited with status 150: termination requested by administrator
7/15/2015 8:16:49 PM - end writing; write time: 5:20:21
file read failed(13)

1 ACCEPTED SOLUTION

Accepted Solutions

Marianne
Moderator
Moderator
Partner    VIP    Accredited Certified

You will need logs on the media server to try and find the reason for the failure.

We can see that the backup had already written quite a bit of data when it seemed to get hung on an image thumbnail!

7/15/2015 2:56:28 PM - begin writing
7/15/2015 8:16:18 PM - Error bpbrm(pid=27544) db_FLISTsend failed: file read failed (13)       
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) write socket err 10054 (An existing connection was forcibly closed by the remote host.
.)
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) FH send to BRM failed - Ddi - 0 0 0 0 4701543 0 0 9 13680367 Thumbs.db
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) NDMP backup failed, path = /root_vdm_1/FS_XXXXXXXX_01/MPFS_XXXXXXXX_01/FS01_QTR010       

Logs needed:

ndmpagent 
bpbrm and bptm (at logging level of 3 minimum)

nbdmpagent will hopefully give more info about the file/folder where backup is getting stuck.

bptm will tell us up to when data was received from the filer.

bpbrm will show when last metadata was received and the failure was actually a timeout because no data was received from the filer.

In the meantime, change Client Read Timeout on the media server to 1800 (30 minutes).

View solution in original post

5 REPLIES 5

Marianne
Moderator
Moderator
Partner    VIP    Accredited Certified
Please always add information about the environment - posting only Activity Monitor details does not help to understand the bigger picture. Is NAS filer backing up itself to local tape drives or is filer bacķed up across the network to a media server?

cnx_anup
Level 4

EMC NAS (VNX 5300)

Backing up across the network to a media server.

This particular policy has 24 filer path, out of which only one path is failing with this error.

Marianne
Moderator
Moderator
Partner    VIP    Accredited Certified

You will need logs on the media server to try and find the reason for the failure.

We can see that the backup had already written quite a bit of data when it seemed to get hung on an image thumbnail!

7/15/2015 2:56:28 PM - begin writing
7/15/2015 8:16:18 PM - Error bpbrm(pid=27544) db_FLISTsend failed: file read failed (13)       
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) write socket err 10054 (An existing connection was forcibly closed by the remote host.
.)
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) FH send to BRM failed - Ddi - 0 0 0 0 4701543 0 0 9 13680367 Thumbs.db
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) NDMP backup failed, path = /root_vdm_1/FS_XXXXXXXX_01/MPFS_XXXXXXXX_01/FS01_QTR010       

Logs needed:

ndmpagent 
bpbrm and bptm (at logging level of 3 minimum)

nbdmpagent will hopefully give more info about the file/folder where backup is getting stuck.

bptm will tell us up to when data was received from the filer.

bpbrm will show when last metadata was received and the failure was actually a timeout because no data was received from the filer.

In the meantime, change Client Read Timeout on the media server to 1800 (30 minutes).

cruisen
Level 6
Partner Accredited

Hello cnx anup,

What are your buffer size settings, I' ve seen settings that are not compatible to the Nas server, like more than 256kb for example.

Also I would like to add, if you use the setting SIZE_DATA_BUFFER_NDMP. In my experience because of some reason will not be taking into acount. The best will be to stay at default 64kb or to synchronise with your EMC Nas server, for that you need to set parameter on the nas to be the same as on Netbackup. 

Best regards 

Cruisen

cnx_anup
Level 4

Thank you Marianne and Cruisen. Based on your recommendations we are troubleshooting the issue. I will update the thread :)