Communication errors
Hello
I got:
Win2008R2 Master server
Tape Library Quantum Scalar i500
Oracle(RHEL) client machine
3 defferent Windows 2008R2 client machine
All 3 Windows client backups fails with error 98:
28.07.2016 13:00:00 - Info nbjm(pid=5476) starting backup job (jobid=494375) for client kzwbackup01.eub.kz, policy Backup_SQL_MUOR, schedule Differential-Inc
28.07.2016 13:00:00 - Info nbjm(pid=5476) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=494375, request id:{7B803017-4C6B-4FF1-B78B-C26E4B9B84DD})
28.07.2016 13:00:00 - requesting resource kzwnb01-hcart3-robot-tld-0
28.07.2016 13:00:00 - requesting resource kzwnb01.eub.kz.NBU_CLIENT.MAXJOBS.kzwbackup01.eub.kz
28.07.2016 13:00:00 - requesting resource kzwnb01.eub.kz.NBU_POLICY.MAXJOBS.Backup_SQL_MUOR
28.07.2016 13:00:00 - granted resource kzwnb01.eub.kz.NBU_CLIENT.MAXJOBS.kzwbackup01.eub.kz
28.07.2016 13:00:00 - granted resource kzwnb01.eub.kz.NBU_POLICY.MAXJOBS.Backup_SQL_MUOR
28.07.2016 13:00:00 - granted resource 000106
28.07.2016 13:00:00 - granted resource HP.ULTRIUM6-SCSI.004
28.07.2016 13:00:00 - granted resource kzwnb01-hcart3-robot-tld-0
28.07.2016 13:00:00 - estimated 161158 Kbytes needed
28.07.2016 13:00:00 - Info nbjm(pid=5476) started backup (backupid=kzwbackup01.eub.kz_1469689200) job for client kzwbackup01.eub.kz, policy Backup_SQL_MUOR, schedule Differential-Inc on storage unit kzwnb01-hcart3-robot-tld-0
28.07.2016 13:00:00 - started process bpbrm (9716)
28.07.2016 13:00:00 - started
28.07.2016 13:00:05 - Info bpbrm(pid=9716) kzwbackup01.eub.kz is the host to backup data from
28.07.2016 13:00:05 - Info bpbrm(pid=9716) reading file list for client
28.07.2016 13:00:05 - connecting
28.07.2016 13:00:08 - Info bpbrm(pid=9716) starting bpbkar32 on client
28.07.2016 13:00:08 - connected; connect time: 0:00:03
28.07.2016 13:00:10 - Info bpbkar32(pid=488) Backup started
28.07.2016 13:00:10 - Info bptm(pid=4040) start
28.07.2016 13:00:10 - Info bptm(pid=4040) using 65536 data buffer size
28.07.2016 13:00:10 - Info bptm(pid=4040) setting receive network buffer to 263168 bytes
28.07.2016 13:00:10 - Info bptm(pid=4040) using 30 data buffers
28.07.2016 13:00:10 - Info bptm(pid=4040) start backup
28.07.2016 13:00:10 - Info bptm(pid=4040) backup child process is pid 10092.7356
28.07.2016 13:00:10 - Info bptm(pid=4040) Waiting for mount of media id 000106 (copy 1) on server kzwnb01.eub.kz.
28.07.2016 13:00:10 - Info bptm(pid=10092) start
28.07.2016 13:00:10 - mounting 000106
28.07.2016 13:00:46 - Info bpbkar32(pid=488) change journal NOT enabled for <D:\sql_backup\MUOR>
28.07.2016 13:31:11 - Error bptm(pid=4040) error requesting media, TpErrno = Robot operation failed
28.07.2016 13:31:11 - Warning bptm(pid=4040) media id 000106 load operation reported an error
28.07.2016 13:31:11 - current media 000106 complete, requesting next resource Any
28.07.2016 14:01:27 - Info bptm(pid=4040) EXITING with status 98 <----------
28.07.2016 14:01:32 - Info bpbkar32(pid=488) done. status: 98: error requesting media (tpreq)
28.07.2016 14:01:32 - end writing
error requesting media (tpreq)(98)
28.07.2016 14:01:35 - Info bpbrm(pid=9020) Starting delete snapshot processing
28.07.2016 14:01:39 - Info bpfis(pid=7628) Backup started
28.07.2016 14:01:39 - Critical bpbrm(pid=9020) from client kzwbackup01.eub.kz: cannot open C:\Program Files\Veritas\NetBackup\online_util\fi_cntl\bpfis.fim.kzwbackup01.eub.kz_1469689200.1.0
28.07.2016 14:01:39 - Info bpfis(pid=7628) done. status: 1542
28.07.2016 14:01:39 - end operation
28.07.2016 14:01:39 - Info bpfis(pid=7628) done. status: 1542: An existing snapshot is no longer valid and cannot be mounted for subsequent operations
Oracle fails with error 6 (not every backup, just few times in a day)
RMAN logs shows next:
channel ORA_SBT_TAPE_1: input backup set: count=9309, stamp=916194140, piece=1
channel ORA_SBT_TAPE_1: starting piece 1 at 26-JUL-16
channel ORA_SBT_TAPE_1: backup piece /archive/archivelog/DB.LOGS.ERP.s9309.p1.t916194140
piece handle=2tr9o0qs_1_3 comment=API Version 2.0,MMS Version 5.0.0.0
channel ORA_SBT_TAPE_1: finished piece 1 at 26-JUL-16
channel ORA_SBT_TAPE_1: backup piece complete, elapsed time: 00:00:45
channel ORA_SBT_TAPE_1: input backup set: count=9297, stamp=916194139, piece=1
channel ORA_SBT_TAPE_1: starting piece 1 at 26-JUL-16
channel ORA_SBT_TAPE_1: backup piece /archive/archivelog/DB.LOGS.ERP.s9297.p1.t916194139
RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03009: failure of backup command on ORA_SBT_TAPE_1 channel at 07/26/2016 10:45:24
ORA-19506: failed to create sequential file, name="2hr9o0qr_1_3", parms=""
ORA-27028: skgfqcre: sbtbackup returned error
ORA-19511: Error received from media manager layer, error text:
VxBSACreateObject: Failed with error:
Server Status: failed to communicate with resource requester
ORA-19600: input file is backup piece (/archive/archivelog/DB.LOGS.ERP.s9297.p1.t916194139)
ORA-19601: output file is backup piece (2hr9o0qr_1_3)
Recovery Manager complete.
Backup ended at 2016-07-26 10:45:33
Can u provide me how to determine cause of that errors, which logs could help me
RMAN script is old an allways works fine, communication between master and client were checked via script(telneted ports 1556,13724,13782 every 30 secs, and no single interrupt appers in script log)
Master server were restarted few times, updated drivers for tape drives
Another windows and oracle(rhel) backups works well(about 30 clients)