cancel
Showing results for 
Search instead for 
Did you mean: 

file read failed  (13)

shahriar_sadm
Level 6

Hi Dears, Recently I have an error on just one media server and just on incremental backups.

Please see the log below:

Thanks


06/21/2016 20:50:07 - begin writing
06/21/2016 20:51:19 - Info bpbrm (pid=23871) b2c is the host to backup data from
06/21/2016 20:51:19 - Info bpbrm (pid=23871) telling media manager to start backup on client
06/21/2016 20:51:19 - Info bptm (pid=23936) using 65536 data buffer size
06/21/2016 20:51:19 - Info bptm (pid=23936) using 12 data buffers
06/21/2016 20:51:19 - Info bpbrm (pid=23871) spawning a brm child process
06/21/2016 20:51:19 - Info bpbrm (pid=23871) child pid: 8976
06/21/2016 20:51:20 - Info bpbrm (pid=23871) sending bpsched msg: CONNECTING TO CLIENT FOR b2c_1466526005
06/21/2016 20:51:20 - Info bpbrm (pid=23871) listening for client connection
06/21/2016 20:51:20 - Info bpbrm (pid=23871) INF - Client read timeout = 3600
06/21/2016 20:51:20 - Info bpbrm (pid=23871) accepted connection from client
06/21/2016 20:51:21 - Info bpbkar (pid=12512) Backup started
06/21/2016 20:51:21 - Info bpbrm (pid=23871) Sending the file list to the client
06/21/2016 20:51:29 - Info bpbkar (pid=12512) dbclient(pid=12512) wrote first buffer(size=262144)
06/21/2016 23:52:00 - end writing; write time: 3:01:53
06/21/2016 23:53:09 - Error bpbrm (pid=8976) socket read failed: errno = 62 - Timer expired
06/21/2016 23:53:09 - Info bpbrm (pid=23871) sending message to media manager: STOP BACKUP b2c_1466526005
06/21/2016 23:53:14 - Info bpbrm (pid=23871) media manager for backup id b2c_1466526005 exited with status 150: termination requested by administrator
file read failed  (13)

6 REPLIES 6

Marianne
Moderator
Moderator
Partner    VIP    Accredited Certified

The error points to a Client Read Timeout, but the timeout is already set to 3600 (1 hour).
No use increasing this value.

So, it really looks like the backup is hanging on the client.

What is Client OS and NBU version?
I remember a situation some time ago when Incremental backup was hanging when checkpoint was taken during the backup.

You will need bpbkar log on the client and bpbrm and bptm on the media to troubleshoot.
Increase logging level on client and media server to 3 (please only ever increase logging to 5 when requested by Veritas Support).

shahriar_sadm
Level 6

Hi Marianne,

This server is media server and directly connected to tape library. no client involved here, 

NBU Master version is 7.7.2 and media server is 7.5 (issue appear when my master server was 7.5 (before upgrade))

Media Server: SunOS 5.10 Generic_147147-26 sun4u sparc SUNW,SPARC-Enterprise

no Checkpoint set for this operation.

bpbkar directory is empty on media server.

I chenged log level to 3 on media server and after next error 13 I will share them. ( bptm and bpbrm )

Thnaks

 

Marianne
Moderator
Moderator
Partner    VIP    Accredited Certified

What is the policy type for this backup?

I see that a bpbkar and dbclient process is started on the client (which is also media server?)
Info bpbkar (pid=12512) dbclient(pid=12512)

Both have same PID, so probably a database backup with only dbclient process?

You then need dbclient log folder with 777 permission.

 

shahriar_sadm
Level 6

This is application backup for oracler with RMAN. and this is media server,

Now I have some error 13 on other server but some different,

(This backup is on datadomain)

P.S: this backup has 5 channels with status 13 that all of them failed at same time


06/27/2016 21:38:42 - begin writing
06/27/2016 21:40:10 - Info bpbrm (pid=6884) server1 is the host to backup data from
06/27/2016 21:40:10 - Info bpbrm (pid=6884) reading file list from client
06/27/2016 21:40:10 - Info bpbrm (pid=6884) listening for client connection
06/27/2016 21:40:11 - Info bpbrm (pid=6884) INF - Client read timeout = 3000
06/27/2016 21:40:11 - Info bpbrm (pid=6884) accepted connection from client
06/27/2016 21:40:12 - Info bphdb (pid=7381) Backup started
06/27/2016 21:40:12 - Info bpbrm (pid=6884) bptm pid: 6888
06/27/2016 21:40:12 - Info bptm (pid=6888) start
06/27/2016 21:40:13 - Info bptm (pid=6888) using 262144 data buffer size
06/27/2016 21:40:13 - Info bptm (pid=6888) using 30 data buffers
06/27/2016 21:40:16 - Info bptm (pid=6888) start backup
06/27/2016 21:40:22 - Info bphdb (pid=7381) dbclient(pid=7381) wrote first buffer(size=262144)
06/28/2016 03:08:16 - end writing; write time: 5:29:34
06/28/2016 03:09:43 - Error bpbrm (pid=6884) socket read failed: errno = 62 - Timer expired
06/28/2016 03:09:45 - Error bptm (pid=6888) media manager terminated by parent process
06/28/2016 03:09:51 - Info bphdb (pid=7381) done. status: 13: file read failed

shahriar_sadm
Level 6

Hi Marianne. I extract the logs, what to looking for in this logs? specific term?

Thanks

 

tgray21
Level 3

Search for the string "<16>" without quotes.  Find the first instance and then start reading the lines around the error to give you clues, and feel free to post them here.

Do you also have the dbclient and bphdb logs?  Since this is an Oracle RMAN backup, those will be more helpful I believe.  You create those in the same logs directory and they are also turned up in the bp.conf file with VERBOSE = 5.  Make sure to turn down logging when finished.

Regards,

Tyler