Random Error 636 on a specific SAP backup client windows - media linux
Good morning all,
recently i'm facing a strange issue, it is strange because at the end of the day backup the SAP FULL DB is completed well, but on the NetBackup Admin console i continue to see error 636.
This backup run every day, but, errors is random, twice or more a week, not for specific days:
5/20/2014 3:52:34 AM - Info bpbrm(pid=26401) vm767502.ams20.vzbi.caas is the host to backup data from
5/20/2014 3:52:34 AM - Info bpbrm(pid=26401) reading file list from client
5/20/2014 3:52:36 AM - Info bpbrm(pid=26401) starting bphdb on client
5/20/2014 3:52:37 AM - Info bphdb(pid=1992) Backup started
5/20/2014 4:00:00 AM - Info nbjm(pid=7131) starting backup job (jobid=439982) for client vm767502.ams20.vzbi.caas, policy MNG_AMS20_SAP_DW2_DMZ, schedule DAILY_FULL_SAP_DB
5/20/2014 4:00:00 AM - Info nbjm(pid=7131) requesting MEDIA_SERVER_WITH_ATTRIBUTES resources from RB for backup job (jobid=439982, request id:{7332B3BA-DFC2-11E3-9FF2-4BBC7D56EE44})
5/20/2014 4:00:00 AM - requesting resource ME-A2_PDSTU
5/20/2014 4:00:00 AM - requesting resource ph469702.ams20.vzbi.caas.NBU_CLIENT.MAXJOBS.vm767502.ams20.vzbi.caas
5/20/2014 4:00:00 AM - requesting resource ph469702.ams20.vzbi.caas.NBU_POLICY.MAXJOBS.MNG_AMS20_SAP_DW2_DMZ
5/20/2014 4:00:00 AM - granted resource ph469702.ams20.vzbi.caas.NBU_CLIENT.MAXJOBS.vm767502.ams20.vzbi.caas
5/20/2014 4:00:00 AM - granted resource ph469702.ams20.vzbi.caas.NBU_POLICY.MAXJOBS.MNG_AMS20_SAP_DW2_DMZ
5/20/2014 4:00:00 AM - granted resource ME-A2_PDSTU
5/20/2014 4:00:00 AM - estimated 0 Kbytes needed
5/20/2014 4:00:00 AM - Info nbjm(pid=7131) started backup (backupid=vm767502.ams20.vzbi.caas_1400551200) job for client vm767502.ams20.vzbi.caas, policy MNG_AMS20_SAP_DW2_DMZ, schedule DAILY_FULL_SAP_DB on storage unit ME-A2_PDSTU
5/20/2014 4:00:01 AM - started process bpbrm (26401)
5/20/2014 4:00:08 AM - connecting
5/20/2014 4:00:10 AM - connected; connect time: 00:00:02
read from input socket failed(636)
On the client (windows 2k8 standard 64bit) i see:
BR0280I BRBACKUP time stamp: 2014-05-20 05.47.05
BR0292I Execution of BRARCHIVE finished with return code 0
The current date is: Tue 05/20/2014
The current time is: 5:47:05.51
-END---------------------
.....
[root@ph469702]/usr/openv/netbackup/bin# /usr/openv/netbackup/bin/admincmd/bppllist MNG_AMS20_SAP_DW2_DMZ -L
Policy Name: MNG_AMS20_SAP_DW2_DMZ
Options: 0x0
template: FALSE
audit_reason: ?
Names: (none)
Policy Type: SAP (17)
Active: yes
Effective date: 08/02/2011 17:47:45
Mult. Data Stream: no
Perform Snapshot Backup: no
Snapshot Method: (none)
Snapshot Method Arguments: (none)
Perform Offhost Backup: no
Backup Copy: 0
Use Data Mover: no
Data Mover Type: 2
Use Alternate Client: no
Alternate Client Name: (none)
Use Virtual Machine: 0
Hyper-V Server Name: (none)
Enable Instant Recovery: no
Policy Priority: 100
Max Jobs/Policy: Unlimited
Disaster Recovery: 0
Collect BMR Info: no
Keyword: (none specified)
Data Classification: -
Residence is Storage Lifecycle Policy: no
Client Encrypt: no
Checkpoint: no
Residence: ME-A2_PDSTU
Volume Pool: DataStore
Server Group: *ANY*
Granular Restore Info: no
Exchange Source attributes: no
Exchange 2010 Preferred Server: (none defined)
Application Discovery: no
Discovery Lifetime: 0 seconds
ASC Application and attributes: (none defined)
Generation: 128
Ignore Client Direct: no
Enable Metadata Indexing: no
Index server name: NULL
Use Accelerator: no
Client/HW/OS/Pri/DMI: vm767502.ams20.vzbi.caas Windows-x64 Windows2008 0 1 0 0 ?
Include: C:\backup-scripts\backup-online_NetBackup.bat
Schedule: DAILY_FULL_SAP_DB
Type: FULL SSAP (0)
Calendar sched: Enabled
Allowed to retry after run day
........
Maximum MPX: 1
Synthetic: 0
Checksum Change Detection: 0
PFI Recovery: 0
Retention Level: 1 (2 weeks)
u-wind/o/d: 0 0
Incr Type: DELTA (0)
Alt Read Host: (none defined)
Max Frag Size: 0 MB
Number Copies: 1
Fail on Error: 0
Residence: (specific storage unit not required)
Volume Pool: (same as policy volume pool)
Server Group: (same as specified for policy)
Residence is Storage Lifecycle Policy: 0
Schedule indexing: 0
Daily Windows:
Day Open Close W-Open W-Close
......
Schedule: SAP
Type: UBAK SAP (2)
Frequency: 7 day(s) (604800 seconds)
Maximum MPX: 1
Synthetic: 0
Checksum Change Detection: 0
PFI Recovery: 0
Retention Level: 1 (2 weeks)
u-wind/o/d: 0 0
Incr Type: DELTA (0)
Alt Read Host: (none defined)
Max Frag Size: 0 MB
Number Copies: 1
Fail on Error: 0
Residence: (specific storage unit not required)
Volume Pool: (same as policy volume pool)
Server Group: (same as specified for policy)
Residence is Storage Lifecycle Policy: 0
Schedule indexing: 0
Daily Windows:
Day Open Close W-Open W-Close
.....
I wonder if some timeout is wrongly setup for this specific couple of client/media, since they work both into a DMZ zone and only the MASTER is into TRUST zone, with all FIREWALL ACL correctly configured.
It is true because i see other servers working fine, even on the same MEDIA, without any glitches.
I have no idea why it is impacting only this one, and only this DB backup, not even mswindows FULL or INC.
Thank you for any advice.
Regards,
Michele