cancel
Showing results for 
Search instead for 
Did you mean: 

NDMP error 636(netapp)

Ivy_Yang
Level 6

hi, I came across a NDMP backup failure recently. it used to work well in the past.  we have 10000 opened bi-directionally.

but the backups always ended with 636 error....
would youplease help point the right direction?
it lasted almost 1 week.
thanks in advance!

Master: linux, NBU7.5.0.6
Media: linux, NBU7.5.0.6

on the master
[root@lx0034nbumast bin]# nbemmcmd -listhosts |grep ndmp             dsfiler05-101
[root@lx0034nbumast bin]# tpautoconf -verify dsfiler05-101
Connecting to host "dsfiler05-101" as user "ndmp"...
Waiting for connect notification message...
Opening session--attempting with NDMP protocol version 4...
Opening session--successful with NDMP protocol version 4
  host supports MD5 authentication
Getting MD5 challenge from host...
Logging in using MD5 method...
Host info is:
  host name "dsfiler05"
  os type "NetApp"
  os version "NetApp Release 7.3.2"
  host id "0135080178"
Login was successful
Host supports LOCAL backup/restore
Host supports 3-way backup/restore
Host has SnapVault Secondary license installed


on the Media server
[root@lx0003nbumed01 ~]#  nbemmcmd -listhosts |grep ndmp             dsfiler05-101
[root@lx0003nbumed01 ~]# tpautoconf -verify dsfiler05-101
Connecting to host "dsfiler05-101" as user "ndmp"...
Waiting for connect notification message...
Opening session--attempting with NDMP protocol version 4...
Opening session--successful with NDMP protocol version 4
  host supports MD5 authentication
Getting MD5 challenge from host...
Logging in using MD5 method...
Host info is:
  host name "dsfiler05"
  os type "NetApp"
  os version "NetApp Release 7.3.2"
  host id "0135080178"
Login was successful
Host supports LOCAL backup/restore
Host supports 3-way backup/restore
Host has SnapVault Secondary license installed

5 REPLIES 5

RiaanBadenhorst
Moderator
Moderator
Partner    VIP    Accredited Certified

Hello,

 

Did you create the logs for the ndmp backups to be able to troubelshoot further?

http://www.symantec.com/docs/TECH56492

 

Check this note as well, http://www.symantec.com/docs/TECH214335

 

And describe the scenario/environment a bit more.

Ivy_Yang
Level 6
thanks Riaan ! let me try the solution. will keep you posted!!!

Ivy_Yang
Level 6
our issue is still pending there...... we have a lot of NDMP backups(EMC VNX, netapp). and this NDMP was working well.....and not sure why it doesn;t work... we open case with Symantec 2 month ago and Symantec insisted we need to allow the firwall "any to any"...........but this request is denied by our network team.... so the issue is still pending there..... we remove all the backup selections but only backs up /vol/vol0 on netapp, and the size is only 5G, port 10000 is open......

watsons
Level 6

Have you really tried what Riann suggested with those technotes, with the one lowering TCP keepalive time on master & media server? Error 636 is usually caused by that.

If port 10000 is opened, I suppose your test of telnet <ndmphost> 10000 is working fine. Can you show us one of your job details with that error 636?

 

mnolan
Level 6
Employee Accredited Certified

As the author of http://www.symantec.com/docs/TECH214335 that Riian posted, I can confirm that a 636 should have nothing to do with the connection between the media server processes and the NDMP filer. This usually has everything to do with bpbrm connection and NBJM. The 636 error is generally "floating" in the detailed status (ie no timestamp) and is NBJM stating that it was checking for updates from bpbrm and found the socket closed. Further investigation into bpbrm logs may determine when the sockets actually closed.

The related article http://www.symantec.com/docs/TECH214335 goes further into the 636 issue.