Forum Discussion

shahriar_sadm's avatar
9 years ago

file read failed  (13)

Hi Dears, Recently I have an error on just one media server and just on incremental backups.

Please see the log below:

Thanks


06/21/2016 20:50:07 - begin writing
06/21/2016 20:51:19 - Info bpbrm (pid=23871) b2c is the host to backup data from
06/21/2016 20:51:19 - Info bpbrm (pid=23871) telling media manager to start backup on client
06/21/2016 20:51:19 - Info bptm (pid=23936) using 65536 data buffer size
06/21/2016 20:51:19 - Info bptm (pid=23936) using 12 data buffers
06/21/2016 20:51:19 - Info bpbrm (pid=23871) spawning a brm child process
06/21/2016 20:51:19 - Info bpbrm (pid=23871) child pid: 8976
06/21/2016 20:51:20 - Info bpbrm (pid=23871) sending bpsched msg: CONNECTING TO CLIENT FOR b2c_1466526005
06/21/2016 20:51:20 - Info bpbrm (pid=23871) listening for client connection
06/21/2016 20:51:20 - Info bpbrm (pid=23871) INF - Client read timeout = 3600
06/21/2016 20:51:20 - Info bpbrm (pid=23871) accepted connection from client
06/21/2016 20:51:21 - Info bpbkar (pid=12512) Backup started
06/21/2016 20:51:21 - Info bpbrm (pid=23871) Sending the file list to the client
06/21/2016 20:51:29 - Info bpbkar (pid=12512) dbclient(pid=12512) wrote first buffer(size=262144)
06/21/2016 23:52:00 - end writing; write time: 3:01:53
06/21/2016 23:53:09 - Error bpbrm (pid=8976) socket read failed: errno = 62 - Timer expired
06/21/2016 23:53:09 - Info bpbrm (pid=23871) sending message to media manager: STOP BACKUP b2c_1466526005
06/21/2016 23:53:14 - Info bpbrm (pid=23871) media manager for backup id b2c_1466526005 exited with status 150: termination requested by administrator
file read failed  (13)

6 Replies

  • The error points to a Client Read Timeout, but the timeout is already set to 3600 (1 hour).
    No use increasing this value.

    So, it really looks like the backup is hanging on the client.

    What is Client OS and NBU version?
    I remember a situation some time ago when Incremental backup was hanging when checkpoint was taken during the backup.

    You will need bpbkar log on the client and bpbrm and bptm on the media to troubleshoot.
    Increase logging level on client and media server to 3 (please only ever increase logging to 5 when requested by Veritas Support).

  • Hi Marianne,

    This server is media server and directly connected to tape library. no client involved here, 

    NBU Master version is 7.7.2 and media server is 7.5 (issue appear when my master server was 7.5 (before upgrade))

    Media Server: SunOS 5.10 Generic_147147-26 sun4u sparc SUNW,SPARC-Enterprise

    no Checkpoint set for this operation.

    bpbkar directory is empty on media server.

    I chenged log level to 3 on media server and after next error 13 I will share them. ( bptm and bpbrm )

    Thnaks

     

  • What is the policy type for this backup?

    I see that a bpbkar and dbclient process is started on the client (which is also media server?)
    Info bpbkar (pid=12512) dbclient(pid=12512)

    Both have same PID, so probably a database backup with only dbclient process?

    You then need dbclient log folder with 777 permission.

     

  • This is application backup for oracler with RMAN. and this is media server,

    Now I have some error 13 on other server but some different,

    (This backup is on datadomain)

    P.S: this backup has 5 channels with status 13 that all of them failed at same time


    06/27/2016 21:38:42 - begin writing
    06/27/2016 21:40:10 - Info bpbrm (pid=6884) server1 is the host to backup data from
    06/27/2016 21:40:10 - Info bpbrm (pid=6884) reading file list from client
    06/27/2016 21:40:10 - Info bpbrm (pid=6884) listening for client connection
    06/27/2016 21:40:11 - Info bpbrm (pid=6884) INF - Client read timeout = 3000
    06/27/2016 21:40:11 - Info bpbrm (pid=6884) accepted connection from client
    06/27/2016 21:40:12 - Info bphdb (pid=7381) Backup started
    06/27/2016 21:40:12 - Info bpbrm (pid=6884) bptm pid: 6888
    06/27/2016 21:40:12 - Info bptm (pid=6888) start
    06/27/2016 21:40:13 - Info bptm (pid=6888) using 262144 data buffer size
    06/27/2016 21:40:13 - Info bptm (pid=6888) using 30 data buffers
    06/27/2016 21:40:16 - Info bptm (pid=6888) start backup
    06/27/2016 21:40:22 - Info bphdb (pid=7381) dbclient(pid=7381) wrote first buffer(size=262144)
    06/28/2016 03:08:16 - end writing; write time: 5:29:34
    06/28/2016 03:09:43 - Error bpbrm (pid=6884) socket read failed: errno = 62 - Timer expired
    06/28/2016 03:09:45 - Error bptm (pid=6888) media manager terminated by parent process
    06/28/2016 03:09:51 - Info bphdb (pid=7381) done. status: 13: file read failed

  • Hi Marianne. I extract the logs, what to looking for in this logs? specific term?

    Thanks

     

  • Search for the string "<16>" without quotes.  Find the first instance and then start reading the lines around the error to give you clues, and feel free to post them here.

    Do you also have the dbclient and bphdb logs?  Since this is an Oracle RMAN backup, those will be more helpful I believe.  You create those in the same logs directory and they are also turned up in the bp.conf file with VERBOSE = 5.  Make sure to turn down logging when finished.

    Regards,

    Tyler