Forum Discussion

cnx_anup's avatar
cnx_anup
Level 4
10 years ago

NDMP backup failure

Can somebody through light on how to troubleshoot for the log below :

Error logs,
========

7/15/2015 2:54:34 PM - mounting XXXXX
7/15/2015 2:55:20 PM - Info bptm(pid=26144) media id XXXXX mounted on drive index 1, drivepath {3,0,1,0}, drivename IBM.ULT3580-HH5.002, copy 1
7/15/2015 2:55:20 PM - mounted; mount time: 0:00:46
7/15/2015 2:55:20 PM - positioning XXXXX to file 6
7/15/2015 2:56:28 PM - positioned XXXXX; position time: 0:01:08
7/15/2015 2:56:28 PM - begin writing
7/15/2015 8:16:18 PM - Error bpbrm(pid=27544) db_FLISTsend failed: file read failed (13)       
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) write socket err 10054 (An existing connection was forcibly closed by the remote host.
.)
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) FH send to BRM failed - Ddi - 0 0 0 0 4701543 0 0 9 13680367 Thumbs.db
7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) NDMP backup failed, path = /root_vdm_1/FS_XXXXXXXX_01/MPFS_XXXXXXXX_01/FS01_QTR010       
7/15/2015 8:16:19 PM - Info bpbrm(pid=27632) child done, status 13         
7/15/2015 8:16:19 PM - Info bpbrm(pid=27632) sending message to media manager: STOP BACKUP THBANVAPFL10_1436946871     
7/15/2015 8:16:48 PM - Error ndmpagent(pid=25552) XXXXXXXX: Medium error          
7/15/2015 8:16:48 PM - Warning ndmpagent(pid=25552) XXXXXXXX: Write failed on archive volume 1      
7/15/2015 8:16:48 PM - Info ndmpagent(pid=25552) XXXXXXXX: server_archive: emctar vol 1, 1025041 files, 0 bytes read, 595648118784 bytes written
7/15/2015 8:16:48 PM - Error ndmpagent(pid=25552) XXXXXXXX: Backup is aborted.         
7/15/2015 8:16:49 PM - Info bpbrm(pid=27632) media manager for backup id THBANVAPFL10_1436946871 exited with status 150: termination requested by administrator
7/15/2015 8:16:49 PM - end writing; write time: 5:20:21
file read failed(13)

  • You will need logs on the media server to try and find the reason for the failure.

    We can see that the backup had already written quite a bit of data when it seemed to get hung on an image thumbnail!

    7/15/2015 2:56:28 PM - begin writing
    7/15/2015 8:16:18 PM - Error bpbrm(pid=27544) db_FLISTsend failed: file read failed (13)       
    7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) write socket err 10054 (An existing connection was forcibly closed by the remote host.
    .)
    7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) FH send to BRM failed - Ddi - 0 0 0 0 4701543 0 0 9 13680367 Thumbs.db
    7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) NDMP backup failed, path = /root_vdm_1/FS_XXXXXXXX_01/MPFS_XXXXXXXX_01/FS01_QTR010       

    Logs needed:

    ndmpagent 
    bpbrm and bptm (at logging level of 3 minimum)

    nbdmpagent will hopefully give more info about the file/folder where backup is getting stuck.

    bptm will tell us up to when data was received from the filer.

    bpbrm will show when last metadata was received and the failure was actually a timeout because no data was received from the filer.

    In the meantime, change Client Read Timeout on the media server to 1800 (30 minutes).

5 Replies

  • Please always add information about the environment - posting only Activity Monitor details does not help to understand the bigger picture. Is NAS filer backing up itself to local tape drives or is filer bacķed up across the network to a media server?
  • EMC NAS (VNX 5300)

    Backing up across the network to a media server.

    This particular policy has 24 filer path, out of which only one path is failing with this error.

  • You will need logs on the media server to try and find the reason for the failure.

    We can see that the backup had already written quite a bit of data when it seemed to get hung on an image thumbnail!

    7/15/2015 2:56:28 PM - begin writing
    7/15/2015 8:16:18 PM - Error bpbrm(pid=27544) db_FLISTsend failed: file read failed (13)       
    7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) write socket err 10054 (An existing connection was forcibly closed by the remote host.
    .)
    7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) FH send to BRM failed - Ddi - 0 0 0 0 4701543 0 0 9 13680367 Thumbs.db
    7/15/2015 8:16:18 PM - Error ndmpagent(pid=25552) NDMP backup failed, path = /root_vdm_1/FS_XXXXXXXX_01/MPFS_XXXXXXXX_01/FS01_QTR010       

    Logs needed:

    ndmpagent 
    bpbrm and bptm (at logging level of 3 minimum)

    nbdmpagent will hopefully give more info about the file/folder where backup is getting stuck.

    bptm will tell us up to when data was received from the filer.

    bpbrm will show when last metadata was received and the failure was actually a timeout because no data was received from the filer.

    In the meantime, change Client Read Timeout on the media server to 1800 (30 minutes).

  • Hello cnx anup,

    What are your buffer size settings, I' ve seen settings that are not compatible to the Nas server, like more than 256kb for example.

    Also I would like to add, if you use the setting SIZE_DATA_BUFFER_NDMP. In my experience because of some reason will not be taking into acount. The best will be to stay at default 64kb or to synchronise with your EMC Nas server, for that you need to set parameter on the nas to be the same as on Netbackup. 

    Best regards 

    Cruisen

  • Thank you Marianne and Cruisen. Based on your recommendations we are troubleshooting the issue. I will update the thread :)