Oracle RMAN backup status 13, timer expired

Question

Hi allSometimes, but not regularly, we experience one of our large Oracle backup ending with status 13.Archive logs is always successful, so is the FULLs. This applies to our diffs.We have a few thousand jobs that backup to a 5330HA cluster. Our Oracle backups runs NBU 8.1.2 on RHEL 7. Mediaserver is Appliance 3.1.2, with latest MSDP EEB bundle.Here is from Job Details of Parent job:&nbsp;16.mar.2020&nbsp;03:39:20&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;input&nbsp;datafile&nbsp;file&nbsp;number=00029&nbsp;name=+DATA1/PXCDBL_DBM/82CDF1F23C1993F2E053C443F80AE973/DATAFILE/datex_lob.2830.100126063716.mar.2020&nbsp;03:39:20&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;input&nbsp;datafile&nbsp;file&nbsp;number=00024&nbsp;name=+DATA1/PXCDBL_DBM/82CDF1F23C1993F2E053C443F80AE973/DATAFILE/system.2824.100125923116.mar.2020&nbsp;03:39:21&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;input&nbsp;datafile&nbsp;file&nbsp;number=00027&nbsp;name=+DATA1/PXCDBL_DBM/82CDF1F23C1993F2E053C443F80AE973/DATAFILE/users.2828.100125924116.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;released&nbsp;channel:&nbsp;ch0016.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;released&nbsp;channel:&nbsp;ch0116.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;RMAN-00571:&nbsp;===========================================================16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;RMAN-00569:&nbsp;===============&nbsp;ERROR&nbsp;MESSAGE&nbsp;STACK&nbsp;FOLLOWS&nbsp;===============16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;RMAN-00571:&nbsp;===========================================================16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;RMAN-03009:&nbsp;failure&nbsp;of&nbsp;backup&nbsp;command&nbsp;on&nbsp;ch00&nbsp;channel&nbsp;at&nbsp;03/16/2020&nbsp;07:25:3316.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;ORA-27192:&nbsp;skgfcls:&nbsp;sbtclose2&nbsp;returned&nbsp;error&nbsp;-&nbsp;failed&nbsp;to&nbsp;close&nbsp;file16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;ORA-19511:&nbsp;non&nbsp;RMAN,&nbsp;but&nbsp;media&nbsp;manager&nbsp;or&nbsp;vendor&nbsp;specific&nbsp;failure,&nbsp;error&nbsp;text:16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;&nbsp;&nbsp;&nbsp;Failed&nbsp;to&nbsp;process&nbsp;backup&nbsp;file&nbsp;&lt;bk_dPXCDBL_un2ur6tn8_s29410_p1_t1035171560&gt;16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;ORA-19502:&nbsp;write&nbsp;error&nbsp;on&nbsp;file&nbsp;"bk_dPXCDBL_un2ur6tn8_s29410_p1_t1035171560",&nbsp;block&nbsp;number&nbsp;1&nbsp;(block&nbsp;size=8192)16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;ORA-27030:&nbsp;skgfwrt:&nbsp;sbtwrite2&nbsp;returned&nbsp;error16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;ORA-19511:&nbsp;non&nbsp;RMAN,&nbsp;but&nbsp;media&nbsp;manage16.mar.2020&nbsp;07:25:50&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;Recovery&nbsp;Manager&nbsp;complete.16.mar.2020&nbsp;07:25:55&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;End&nbsp;of&nbsp;Recovery&nbsp;Manager&nbsp;output.16.mar.2020&nbsp;07:25:55&nbsp;-&nbsp;Info&nbsp;bphdb&nbsp;(pid=120725)&nbsp;INF&nbsp;-&nbsp;End&nbsp;Oracle&nbsp;Recovery&nbsp;Manager.Here is from job details of the failing job:16.mar.2020&nbsp;03:39:35&nbsp;-&nbsp;Info&nbsp;bptm&nbsp;(pid=233514)&nbsp;start&nbsp;backup16.mar.2020&nbsp;03:40:38&nbsp;-&nbsp;Info&nbsp;bptm&nbsp;(pid=233514)&nbsp;backup&nbsp;child&nbsp;process&nbsp;is&nbsp;pid&nbsp;23512116.mar.2020&nbsp;03:40:38&nbsp;-&nbsp;begin&nbsp;writing16.mar.2020&nbsp;06:40:39&nbsp;-&nbsp;Error&nbsp;bpbrm&nbsp;(pid=233505)&nbsp;socket&nbsp;read&nbsp;failed:&nbsp;errno&nbsp;=&nbsp;62&nbsp;-&nbsp;Timer&nbsp;expired16.mar.2020&nbsp;06:40:41&nbsp;-&nbsp;Error&nbsp;bptm&nbsp;(pid=233514)&nbsp;media&nbsp;manager&nbsp;terminated&nbsp;by&nbsp;parent&nbsp;process&nbsp;Obviousley there is a timeout involved here, but where? The backup job shows always timeout after exactly 3 hrs.&nbsp;&nbsp;

michal_mikulik1 · Answer

Hello,
&nbsp;
this kind of issue is better to solve with support, however here are some hints:
- does 3hrs correspond to any timeout in NBU configuration (Client Read Timeout etc.)?
- during these 3 hrs, Bytes Written in the corresponding Job Details is increasing, or is stuck at some value, or is stuck at zero?
- if possible, try to switch to client-side dedup, it is usually quicker thus completing below timeouts
- is it a Copilot backup, or traditional RMAN backup? Consider Copilot (incremental merge)
Regards
Michal

road · Answer

Thank you for your hints!I suspect that the job is queued in NetBackup, and are waiting until resources are available. We have 2 streams pr Oracle Intelligent Policy, with a high priority setting in the policy. Other streams running in the same policy is exiting with staus 0.&nbsp;I have asked Firewall team if they have settings that could explain the timeout after exactly 3 hrs.Client Read Timeout set to default 300 sec.No data written for failed stream.Client Side Dedupe not used, neither is Copilot.Will raise a support case when necessary.Thanks again!

liuyl · Answer

I also have the same issue ！And&nbsp; increasing the client_read_timeout to 7200 or higher in the media server side&nbsp; does not resolve this problem 。

Forum Discussion

Oracle RMAN backup status 13, timer expired

3 Replies

Related Content

Certificate Expiration

Oracle database restore from vmware type backup

NetBackup Oracle Archive Logs backup only

Oracle to Netbackup Copilot

Delete after making copies in Oracle Archive Logs backup

Recent Discussions

command: bperror

MS-SharePoint policy restore error (2804) .

How to restore a backup

How to configure RBAC

10 years old netbackup appliance database service down, ssl certification out date