cancel
Showing results for 
Search instead for 
Did you mean: 

Netbackup7.1 write fail to HP D2D device ( windows 2008 R2)

STG
Level 3

Our group installed a new HP D2D 2504i disk based appliance. I installed OST plug-in for it, and looks like Netbackup can backup and resotre small files on it. but I try to backup and restore larger image( say one server size maybe 17GB). the system halts. and looks like Netbackup lost connection with D2D storage. here is the detail message showing on the job.

22032012 104353 - Info nbjm(pid=4492) starting backup job (jobid=171) for client dac2, policy DAC_Servers, schedule Full_Backup 
22032012 104353 - Info nbjm(pid=4492) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=171, request id{5B25CE40-74EB-4358-B62E-38FDBC07A17E}) 
22032012 104353 - requesting resource D2D_Disk_Storage
22032012 104353 - requesting resource scadadc1.NBU_CLIENT.MAXJOBS.dac2
22032012 104353 - requesting resource scadadc1.NBU_POLICY.MAXJOBS.DAC_Servers
22032012 104353 - granted resource scadadc1.NBU_CLIENT.MAXJOBS.dac2
22032012 104353 - granted resource scadadc1.NBU_POLICY.MAXJOBS.DAC_Servers
22032012 104353 - granted resource MediaID=@aaaal;Path=192.168.111.66HPLSU1;MediaServer=scadadc1
22032012 104353 - granted resource D2D_Disk_Storage
22032012 104353 - estimated 17671065 Kbytes needed
22032012 104353 - Info nbjm(pid=4492) started backup job for client dac2, policy DAC_Servers, schedule Full_Backup on storage unit D2D_Disk_Storage
22032012 104354 - Info bpbrm(pid=5204) dac2 is the host to backup data from    
22032012 104354 - Info bpbrm(pid=5204) reading file list from client       
22032012 104354 - started process bpbrm (5204)
22032012 104354 - connecting
22032012 104356 - Info bpbrm(pid=5204) starting bpbkar32 on client        
22032012 104356 - Info bpbkar32(pid=4064) Backup started          
22032012 104356 - Info bptm(pid=6104) start           
22032012 104356 - Info bptm(pid=6104) using 262144 data buffer size       
22032012 104356 - Info bptm(pid=6104) setting receive network buffer to 1049600 bytes     
22032012 104356 - Info bptm(pid=6104) using 30 data buffers        
22032012 104356 - connected; connect time 000002
22032012 104358 - Info bptm(pid=6104) start backup          
22032012 104358 - Info bptm(pid=6104) backup child process is pid 5760.3016      
22032012 104358 - Info bptm(pid=5760) start           
22032012 104358 - begin writing


at about 4179968KB, 11500 files, it stopped.23% of the total job done.==> our system is running windows 2008 R2 server. and I did tune up this server base on the article http://www.symantec.com/business/support/index?page=content&id=TECH153412. Before I tune up the system will simple show

19/03/2012 16:06:55 - Critical bptm(pid=5480) image write failed: error 2060017: system call failed

any one can give me hint how to get it work?

Great thanks

Jack

13 REPLIES 13

Douglas_A
Level 6
Partner Accredited Certified

Can you give me some information on your D2D..


Plugin Version?

D2D Software version?

 

Typically this error means a connection failed when the media server attempted connection.

STG
Level 3

and I did watch the disk usage from web-browser,  the disk has a spike on the usage then it back to zero usage. then the web-browser freezed too. before I apply the tune up on windows 2008 R2, web-browser never freeze up, but I get 2060017 error.

Jack

STG
Level 3

After symantec tech called me, and we go througth some bpdown -f -v and bpup -f -v ; it is very strange that everything starts to work now. I am happen with the result. windows 2008 tune up documents for netbackup works.

Mark_Solutions
Level 6
Partner Accredited Certified

With any de-dupe / OST setup it really helps to add the following on you servers (as you may find after the re-starts they are OK for a while but then start to fail again):

<install path>\netbackup\db\config\DPS_PROXYDEFAULTRECVTMO and put a value of 800 in it

Hope this helps

STG
Level 3

I will add this file in the system right away. BTW, I am using OST plug-in, but used as basic disk. I am not sure if i am using Media Server Deduplication(MSDP). and in this situation, I am not even sure do I need to purchase extra license or not. I am contacting symantec to see what to do.

Mark_Solutions
Level 6
Partner Accredited Certified

OK - keep us up to date with how things go - that file has helped a lot of my customers with and without de-dupe

STG
Level 3

I look inside folder \db\config\, there are two files. one is behavior one is dc. Here is the behavior file:

WAKEUP_INTERVAL 10
POLICY_UPDATE_INTERVAL 10
MAX_JOBS_PER_CLIENT 1
TRIES_PER_PERIOD 2
TIME_PERIOD 12
KEEP_LOGS 28
KEEP_TIR 1
TIMEOUT 0
MHDRIVE_MOUNT_TIMEOUT 0
HOURS_AGO 5111900
PREPROCESS_INTERVAL 0
MAX_DRIVES_THIS_MASTER 0
MAX_BACKUP_COPIES 2
CLEANUP_INTERVAL 12
CLEANUP_WAIT_TIME 60

so you can see there is TIMEOUT 0  ==> I am not sure if this is the same parameter as DPS_PROXYDEFAULTRECVTMO

also if I add DPS_PROXYDEFAULTRECVTMO file. do I just put one line inside, say TIMEOUT 800 ?

thanks for the advise.
 

Mark_Solutions
Level 6
Partner Accredited Certified

The timeout setting you see is different.

Just create the file (in upper case and with no file extention - like we do for NUMBER_DATA_BUFFERS) and then open it in notepad and just put 800 in it, nothing else.

Marianne
Level 6
Partner    VIP    Accredited Certified

"I am using OST plug-in, but used as basic disk" ?????

Should be configured as Advanced Disk. Enterprise Disk license needed.

STG
Level 3

thank you Marianne and Mark, I need call symantec to get Advance Disk license.

like Mark said, last friday afternoon the backup on one slower server failed, and freezed up the media server, this morning I need to restart the media server. I looked at the log today, it looks like data buffer is not enough?  see ==> 23/03/2012 15:33:26 - Info bptm(pid=4676) waited for full buffer 55514 times, delayed 131080 times   

even through it says  partially successful, but this morning. I can't log into D2D device, and media server response is very slow. need to restart machine.

Jack

 

 

 

 

 

23/03/2012 14:57:04 - Info nbjm(pid=2712) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=181, request id:{1AB9CBCD-F201-498B-9D20-DEDB616408A6}) 
23/03/2012 14:57:04 - requesting resource D2D_Disk_Storage
23/03/2012 14:57:04 - requesting resource scadadc1.NBU_CLIENT.MAXJOBS.mgt01
23/03/2012 14:57:04 - requesting resource scadadc1.NBU_POLICY.MAXJOBS.Windows_Servers
23/03/2012 14:57:04 - granted resource scadadc1.NBU_CLIENT.MAXJOBS.mgt01
23/03/2012 14:57:04 - granted resource scadadc1.NBU_POLICY.MAXJOBS.Windows_Servers
23/03/2012 14:57:04 - granted resource MediaID=@aaaaq;Path=\\192.168.111.66\NetBackup;MediaServer=scadadc1
23/03/2012 14:57:04 - granted resource D2D_Disk_Storage
23/03/2012 14:57:04 - estimated 0 Kbytes needed
23/03/2012 14:57:04 - Info nbjm(pid=2712) started backup job for client mgt01, policy Windows_Servers, schedule Full on storage unit D2D_Disk_Storage
23/03/2012 14:57:04 - started process bpbrm (4504)
23/03/2012 14:57:04 - started
23/03/2012 14:57:50 - Info bpbrm(pid=4504) mgt01 is the host to backup data from    
23/03/2012 14:57:50 - Info bpbrm(pid=4504) reading file list from client       
23/03/2012 14:57:50 - connecting
23/03/2012 14:58:34 - Info bpbrm(pid=4504) starting bpbkar32 on client        
23/03/2012 14:58:34 - connected; connect time: 00:00:44
23/03/2012 14:58:41 - Info bpbkar32(pid=3632) Backup started          
23/03/2012 14:58:41 - Info bptm(pid=4676) start           
23/03/2012 14:58:48 - Info bptm(pid=4676) using 262144 data buffer size       
23/03/2012 14:58:48 - Info bptm(pid=4676) setting receive network buffer to 1049600 bytes     
23/03/2012 14:58:48 - Info bptm(pid=4676) using 30 data buffers        
23/03/2012 14:58:48 - Info bptm(pid=4676) start backup          
23/03/2012 14:58:49 - Info bptm(pid=4676) backup child process is pid 5484.2244      
23/03/2012 14:58:49 - Info bptm(pid=5484) start           
23/03/2012 14:58:49 - begin writing
23/03/2012 15:07:03 - Warning bpbrm(pid=4504) from client mgt01: WRN - can't open file: C:\Documents and Settings\u4mp\Local Settings\Temp\Perflib_Perfdata_f28.dat (WIN32 32: The process cannot access the file because it is being used by another process. )
23/03/2012 15:33:26 - Info bptm(pid=4676) waited for full buffer 55514 times, delayed 131080 times   
23/03/2012 15:33:27 - Info bpbrm(pid=4504) validating image for client mgt01       
23/03/2012 15:33:50 - end writing; write time: 00:35:01
the requested operation was partially successful(1)

The job was successfully completed, but some files may have been
busy or unaccessible. See the problems report or the client's logs for more details.
23/03/2012 15:33:55 - Info bpbkar32(pid=3632) done. status: 1: the requested operation was partially successful   

Mark_Solutions
Level 6
Partner Accredited Certified

This looks like the systems performance is just very poor

What is your network connection and are you using any buffer files on your media servers (\netbackup\db\config directory)?

STG
Level 3

hi, Mark:

The network connection is 1G indivisual vlan. I still contact symantec to get Enterprise Disk license. Haven't set up any buffer files yet. This server only got 8G of RAM, on Windows level is set 16G of Virtual memory. already plan to add 16G of RAM. when directly backup from media server, the D2D disk writing speed is about 75M/s, when backup the slower clents(100M network link), the D2d disk writing speed is only 35M/s. right now everything is set by default. need to check if there are some good article about data buffer setting.

Thanks

Jack

Mark_Solutions
Level 6
Partner Accredited Certified

NUMBER_DATA_BUFFERS_DISK - i use 32

SIZE_DATA_BUFFERS_DISK - i use 1048576

Storage Unit Fragment Size - i use 5000MB

Hope this helps