Master server: Windows 2008, NBU 188.8.131.52. Two NBU 5220 appliances, version 2.5.3, one in each data center. AIR is enabled via SLP.
nbapp1 in DC1 takes the backup (primary copy) and replicates it to nbapp2 in DC2 (copy 2); nbapp1 then makes a tape copy (copy 3) from the primary copy.
nbapp2 in DC2 takes the backup (primary copy) and replicates it to nbapp1 in DC1 (copy 2); nbapp1 then makes a tape copy (copy 3) from copy 2.
nbapp1 backs up around 5 TB of data daily. nbapp2 backs up around 6.6 TB of data daily.
On both appliances, we have this configuration:
Settings> NetBackup DataBuffers Number Show
NUMBER_DATA_BUFFERS : 128
NUMBER_DATA_BUFFERS_DISK : 128
NUMBER_DATA_BUFFERS_FT : 16 (Default)
NUMBER_DATA_BUFFERS_RESTORE : 128
Settings> NetBackup DataBuffers Size Show
SIZE_DATA_BUFFERS : 262144 (Default)
SIZE_DATA_BUFFERS_DISK : 262144 (Default)
SIZE_DATA_BUFFERS_FT : 262144 (Default)
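As a rough aid for reading the values above, here is a minimal Python sketch that multiplies each buffer count by its buffer size. It assumes the commonly cited rule of thumb that shared memory per stream is approximately number of buffers times buffer size; the dictionary and function names are made up for illustration, and multiplexing or drive counts are not considered.

```python
# Values copied from the "DataBuffers Number/Size Show" output above.
NUMBER_DATA_BUFFERS = {
    "DATA_BUFFERS": 128,
    "DATA_BUFFERS_DISK": 128,
    "DATA_BUFFERS_FT": 16,
}
SIZE_DATA_BUFFERS = {
    "DATA_BUFFERS": 262144,
    "DATA_BUFFERS_DISK": 262144,
    "DATA_BUFFERS_FT": 262144,
}


def buffer_memory_bytes():
    """Return {setting: bytes}, assuming memory ~= number * size per stream."""
    return {
        name: NUMBER_DATA_BUFFERS[name] * SIZE_DATA_BUFFERS[name]
        for name in NUMBER_DATA_BUFFERS
    }


if __name__ == "__main__":
    for name, total in buffer_memory_bytes().items():
        # 1 MiB = 1024 * 1024 bytes
        print(f"{name}: {total} bytes ({total / (1024 * 1024):.0f} MiB)")
```

Multiplying the per-stream figure by the number of concurrent streams gives an estimate of the total; that stream count is not stated in this post.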
For the past few weeks, some backups have been found in a hung state, and others run so slowly that they spill into production hours. This happens 3-4 times a week. The catalog backup is also taking three times as long as before. Please share your suggestions on how to find a network choke/congestion problem.
The catalog backup also goes to tape. The appliance-to-tape copy gets good throughput. The problem only occurs on backups that go to nbapp1.
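One way to judge whether the network is a plausible bottleneck is to work out the average rate the daily volumes demand and compare it with the link speed. The sketch below is only arithmetic: the 10-hour backup window is an assumed figure (the post does not state one), decimal units are used (1 TB = 10^12 bytes), and deduplication, which reduces what actually crosses the wire, is ignored.

```python
def required_mb_per_s(daily_tb, window_hours):
    """Average MB/s needed to move daily_tb terabytes within window_hours."""
    total_bytes = daily_tb * 1e12          # decimal terabytes to bytes
    window_seconds = window_hours * 3600
    return total_bytes / window_seconds / 1e6


if __name__ == "__main__":
    ASSUMED_WINDOW_HOURS = 10  # hypothetical; substitute the real window
    for appliance, daily_tb in (("nbapp1", 5.0), ("nbapp2", 6.6)):
        rate = required_mb_per_s(daily_tb, ASSUMED_WINDOW_HOURS)
        print(f"{appliance}: about {rate:.1f} MB/s sustained")
```

If the job throughput reported for nbapp1 is well below the figure computed for the real window while the link is not saturated, the slowdown is more likely on the appliance side than on the network.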
You say this just started happening? What changed in the environment? Are you seeing any hardware issues on the nbapp1 appliance -- is the battery relearning or the RAID controller rebuilding a disk for example?
We upgraded from 184.108.40.206 to 7.6 GA. Nothing else changed.
Was only the master server upgraded, and not the appliance?
According to your opening post the Appliance is still on 220.127.116.11, right?
It is best to upgrade the appliance as well, to benefit from the dedupe enhancements in 7.6/2.6.
Please patch master and Appliance as well - 18.104.22.168 and 22.214.171.124.