Hi N_J
I had a look at the bpdb2 logs. There are 3:
db2bid.073018_00001.txt
db2d20.073018_00001.txt
log.073018.txt
None of these logs contains complete info for a backup from beginning to end.
There is no reference whatsoever to a backup in db2d20.073018_00001.txt.
The other 2 logs contain errors and the names of user_ops log files, but neither shows the beginning or the progress of the backup.
If the backups started the previous day, we need that day's logs as well.
log.073018.txt starts with an error shortly after midnight, so we can only assume that the backup started the previous evening.
Error:
00:21:25.230 [28002] <16> readCommFile: ERR - timed out after 3600 seconds while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/28002.0.1532899283
00:21:25.231 [28002] <32> serverResponse: ERR - could not read from comm file </usr/openv/netbackup/logs/user_ops/dbext/logs/28002.0.1532899283>
00:21:25.231 [28002] <16> CreateNewImage: ERR - serverResponse() failed
00:21:25.231 [28002] <16> VxBSACreateObject: ERR - Could not create new image with file /DB2/AUDBOE14/LOGFILE/node0000/db2bid/C0000000_S0020131.LOG.
It seems as if the backup failed on an archive log file:
/DB2/AUDBOE14/LOGFILE/node0000/db2bid/C0000000_S0020131.LOG
What is worrying is the timeout after 1 hour (3600 sec).
Can you see what is logged in /usr/openv/netbackup/logs/user_ops/dbext/logs/28002.0.1532899283 ?
Can you also find the bpdb2 logs for the previous day, to see what happened from the beginning of the backup?
The hour before 00:21 is especially important.
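To work out which time window to look for in the previous day's logs, the timeout lines can be parsed and the timeout subtracted from the error time. The following is only a rough Python sketch: the regular expression is written from the readCommFile lines quoted in this post and may not match other log variants, the function names are made up for illustration, and reading the trailing number in the comm file name as a Unix epoch is an assumption on my side, not something taken from the NBU documentation.

```python
import re
from datetime import date, datetime, timedelta, timezone

# Pattern written only from the readCommFile lines quoted in this post.
TIMEOUT_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2})\.\d{3} \[(?P<pid>\d+)\] <\d+> readCommFile: "
    r"ERR - timed out after (?P<secs>\d+) seconds while reading from (?P<path>\S+)"
)


def find_comm_timeouts(lines, log_date):
    """Return one dict per readCommFile timeout, with the time the wait began."""
    found = []
    for line in lines:
        m = TIMEOUT_RE.match(line.strip())
        if m is None:
            continue
        failed_at = datetime.combine(
            log_date, datetime.strptime(m["time"], "%H:%M:%S").time()
        )
        found.append({
            "pid": int(m["pid"]),
            "comm_file": m["path"],
            "failed_at": failed_at,
            # The process gave up after waiting this long, so the wait began here.
            "wait_started": failed_at - timedelta(seconds=int(m["secs"])),
        })
    return found


def comm_file_epoch(path):
    """ASSUMPTION: the trailing number in the comm file name is a Unix epoch."""
    return datetime.fromtimestamp(int(path.rsplit(".", 1)[1]), tz=timezone.utc)


sample = [
    "00:21:25.230 [28002] <16> readCommFile: ERR - timed out after 3600 seconds "
    "while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/28002.0.1532899283",
]
# The file name log.073018.txt is read here as 30 July 2018 (mmddyy).
for hit in find_comm_timeouts(sample, date(2018, 7, 30)):
    print(hit["pid"], hit["wait_started"], comm_file_epoch(hit["comm_file"]))
```

For the line above, the wait began at 23:21:25 on the 29th, so that is the part of the previous day's bpdb2 log to look at first.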
Another timeout error, for a different archive log file, a while later:
00:54:29.610 [6764] <16> readCommFile: ERR - timed out after 3600 seconds while reading from /usr/openv/netbackup/logs/user_ops/dbext/logs/6764.0.1532901267
00:54:29.610 [6764] <32> serverResponse: ERR - could not read from comm file </usr/openv/netbackup/logs/user_ops/dbext/logs/6764.0.1532901267>
00:54:29.610 [6764] <16> CreateNewImage: ERR - serverResponse() failed
00:54:29.610 [6764] <16> VxBSACreateObject: ERR - Could not create new image with file /DB2/BOE14/LOGFILE/node0000/db2bid/C0000000_S0033833.LOG.
More user_ops logs are listed here, along with backup failures for DB2 archive logs.
The fact that the backup seems to fail on archive logs raises another question:
Is there a difference between the archive log methods for the working and non-working backups?
Does the profile that is loaded in the non-working script (/db2/BID/sqllib/db2profile) perhaps reference a different db2.conf file with a different archive log method?
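One way to check the archive log method on the DB2 side is to save the output of `db2 get db cfg for <dbname>` for a working and a non-working database and compare the log archiving parameters. The sketch below assumes the usual "(PARAMETER) = value" layout of that output. The two sample texts are hand-written stand-ins with placeholder values, not output captured from your systems, and the function names are made up for illustration. It does not look at db2.conf, which would need to be compared separately.

```python
import re

# Matches "(PARAMETER) = value" as printed by `db2 get db cfg for <dbname>`.
CFG_RE = re.compile(r"\((?P<name>[A-Z0-9_]+)\)\s*=\s*(?P<value>.*)$")


def archive_settings(cfg_text):
    """Return the LOGARCH* parameters found in saved `db2 get db cfg` output."""
    settings = {}
    for line in cfg_text.splitlines():
        m = CFG_RE.search(line)
        if m and m["name"].startswith("LOGARCH"):
            settings[m["name"]] = m["value"].strip()
    return settings


def diff_settings(working, failing):
    """Return {name: (working_value, failing_value)} for parameters that differ."""
    names = sorted(set(working) | set(failing))
    return {
        n: (working.get(n), failing.get(n))
        for n in names
        if working.get(n) != failing.get(n)
    }


# Hand-written placeholders for illustration only.
WORKING = """
 First log archive method                 (LOGARCHMETH1) = VENDOR:/path/to/vendor/library
 Options for logarchmeth1                  (LOGARCHOPT1) =
 Second log archive method                (LOGARCHMETH2) = OFF
"""
FAILING = """
 First log archive method                 (LOGARCHMETH1) = USEREXIT
 Options for logarchmeth1                  (LOGARCHOPT1) =
 Second log archive method                (LOGARCHMETH2) = OFF
"""

for name, (ok, bad) in diff_settings(
    archive_settings(WORKING), archive_settings(FAILING)
).items():
    print(f"{name}: working={ok!r} non-working={bad!r}")
```

Any parameter that shows up in the result differs between the two databases and is worth mentioning in the Support call.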
PS:
It will probably be best to log a Support call with Veritas and work with a knowledgeable support engineer to track down the issue.
You will need all of the logs listed in the NBU for DB2 manual (including bprd on the master server, and bpbrm and bptm on the media server).
Support will ask for logs at a high logging level. They have the time and the tools to sift through these massive logs.