cancel
Showing results for 
Search instead for 
Did you mean: 
Highlighted

readCommFile: ERR - timed out after 900 seconds while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs

Good day I have a Netbackup client version NetBackup-Solaris10 7.1.0.4.  I am running the client on Solaris 10 and have SAP installed on Oracle 11.2.0.3.

 

Netbackup file system backups complete successfully.  However, Oracle backups via backint fails with the following error in the backint log:

 

04:16:24.461 [29620] <16> readCommFile: ERR - timed out after 900 seconds while reading from /usr/openv/netbackup/logs/user_ops/dbext

/logs/vxbsa.1410228074.29611.prog.pcb_std

04:16:24.461 [29620] <32> serverResponse: ERR - could not read from comm file </usr/openv/netbackup/logs/user_ops/dbext/logs/vxbsa.14

10228074.29611.prog.pcb_std>

04:16:24.461 [29620] <16> CreateNewImage: ERR - serverResponse() failed

04:16:24.461 [29620] <16> bsa_InitiateBackup: ERR - Unable to CreateNewImage

04:16:24.495 [29611] <16> bsa_a_backup_completed: ERR - child <29620> backup failed: 1 <the requested operation was partially success

ful>

 

Other errors I find in further logs is:

 

Error bprd(pid=12172) Unable to write progress log </usr/openv/netbackup/logs/user_ops/dbext/logs/vxbsa.1410247515.2168.prog.pcb_std> on client tsrm2h03rnb. Policy=TSRM2H03RNB-SAP Sched=Weekly

Error bprd(pid=12172) CLIENT tsrm2h03rnb  POLICY TSRM2H03RNB-SAP  SCHED Weekly  EXIT STATUS 130 (system error occurred)

 

Permissions on the file system is fine.

 

Please assist.

 

Thanks

Wilhelm

1 Solution

Accepted Solutions
Highlighted
Accepted Solution!

Filesystem backups do not

Filesystem backups do not need master <-> client comms. 
It needs media server <-> client comms only.
Forward and reverse name lookup is needed as well as port 1556 (pbx) open in both directions.

Database backups initiate backup stream from the client and connects directly to bprd on the master.
The master then needs to be able to communicate with the client for updates to log file.

To test master <-> client comms, do the following:
Create bprd log folder on the master under /usr/openv/netbackup/logs . Restart NBU after bprd folder is created.
Create bpcd log on the client. Same location.

On the master, run this command:
bptestbpcd -client <client-name> -verbose -debug

Check output as well as client's bpcd log.

On the client, run this command:
bpclntcmd -pn

Check output as well as master's bprd log.
 


 


 

View solution in original post

2 Replies
Highlighted
Accepted Solution!

Filesystem backups do not

Filesystem backups do not need master <-> client comms. 
It needs media server <-> client comms only.
Forward and reverse name lookup is needed as well as port 1556 (pbx) open in both directions.

Database backups initiate backup stream from the client and connects directly to bprd on the master.
The master then needs to be able to communicate with the client for updates to log file.

To test master <-> client comms, do the following:
Create bprd log folder on the master under /usr/openv/netbackup/logs . Restart NBU after bprd folder is created.
Create bpcd log on the client. Same location.

On the master, run this command:
bptestbpcd -client <client-name> -verbose -debug

Check output as well as client's bpcd log.

On the client, run this command:
bpclntcmd -pn

Check output as well as master's bprd log.
 


 


 

View solution in original post

Highlighted

Looks like a timeout to me,

Looks like a timeout to me, you could try to increase CLIENT_READ_TIMEOUT and maybe CLIENT_CONNECT_TIMEOUT to a value higher than 900 (seconds)

Also found the bphdb & dbclient logs to be helpful in these cases

In our case the timeouts was often caused by the backup waiting for a resource

 

The standard questions: Have you checked: 1) What has changed. 2) The manual 3) If there are any tech notes or VOX posts regarding the issue