You will see lots of errors like these during this period:
14:01:33.469  <16> start_bptm: cannot connect to live200mcs.retail2u.trcg.co.uk
14:01:33.469  <16> start_bptm: bpcd exit: cannot connect to server backup restore manager (205)
14:01:33.469  <16> get_stunits: get_num_avail_drives failed with stat 205
14:01:33.469  <16> log_in_errorDB: cannot connect to server live200mcs.retail2u.trcg.co.uk; marking storage unit LIVE200MCS_9840_SPEKE_DRIVE as unavailable
While bpsched -mainempty is running, you will notice entries like these:
19:00:12.918  <8> bpsched_main: another regular bpsched is already examining the policy configuration
19:00:12.918  <4> bpsched: scheduler exiting - regular bpsched is already running (214)
No backups will be submitted.
You really need to get rid of unavailable media servers.
Hopefully you know how to properly decommission media servers?
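If it helps to see which media servers and storage units are involved, the "marking storage unit ... as unavailable" lines can be pulled out of the bpsched log with a short script. This is only a rough sketch: the regular expression is based on the single log line quoted above, and the function name `unavailable_storage_units` is made up for illustration.

```python
import re

# Pattern modelled on the log_in_errorDB line quoted above (an assumption:
# other NetBackup versions may word this message differently).
UNAVAILABLE = re.compile(
    r"cannot connect to server (?P<server>[^\s;]+); "
    r"marking storage unit (?P<stu>\S+) as unavailable"
)

def unavailable_storage_units(log_lines):
    """Return {media_server: set_of_storage_units} found in the log lines."""
    found = {}
    for line in log_lines:
        match = UNAVAILABLE.search(line)
        if match:
            found.setdefault(match["server"], set()).add(match["stu"])
    return found

# Sample input: the lines quoted earlier in this post.
sample = [
    "14:01:33.469  <16> start_bptm: cannot connect to live200mcs.retail2u.trcg.co.uk",
    "14:01:33.469  <16> get_stunits: get_num_avail_drives failed with stat 205",
    "14:01:33.469  <16> log_in_errorDB: cannot connect to server "
    "live200mcs.retail2u.trcg.co.uk; marking storage unit "
    "LIVE200MCS_9840_SPEKE_DRIVE as unavailable",
]

for server, storage_units in unavailable_storage_units(sample).items():
    print(server, sorted(storage_units))
```

Run against a full day's log, the result is a list of candidate media servers and storage units to check before decommissioning anything.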
Bpsched wakes up every 20 minutes to schedule backups; if a previous run hasn't completed, the newly called bpsched will exit. It is not uncommon during peak hours for bpsched to keep running for hours. Trouble getting resource status will certainly cause long-running bpsched processes.
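To illustrate why one slow run means several wakeups in a row do nothing, here is a toy model of the behaviour described above. It is not how NetBackup implements its locking; the function name, the run durations and the "ran"/"exited" labels are all invented for the example. Only the 20-minute interval comes from the post.

```python
WAKE_INTERVAL_MIN = 20  # from the post above: bpsched wakes every 20 minutes

def simulate_wakeups(run_minutes, wakeups):
    """Toy model: a wakeup starts a run only if no earlier run is still going.

    run_minutes: duration in minutes of each run that actually starts, in order.
    wakeups: how many 20-minute wakeups to simulate.
    Returns a list of (minute, "ran" or "exited").
    """
    durations = iter(run_minutes)
    busy_until = 0
    timeline = []
    for n in range(wakeups):
        now = n * WAKE_INTERVAL_MIN
        if now < busy_until:
            # A previous run is still active, so this invocation exits.
            timeline.append((now, "exited"))
        else:
            # Runs beyond the supplied list are treated as instantaneous.
            busy_until = now + next(durations, 0)
            timeline.append((now, "ran"))
    return timeline

# Second run takes 70 minutes (e.g. waiting on unreachable media servers).
for minute, outcome in simulate_wakeups([5, 70, 5], wakeups=6):
    print(f"t+{minute:3d} min: {outcome}")
```

In this example the wakeups at 40, 60 and 80 minutes all exit without submitting anything, because the run that started at 20 minutes is still busy.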
I can echo the point - it is important to remove missing and inactive systems from the master's bp.conf. I had a few people's PCs included, and when more than a few went offline I could see an immediate impact and delays.
Just removing names from bp.conf won't help.
As per my post of 2 weeks ago, we see how bpsched is trying to connect to STU media servers to count UP drives:
If the Storage Units still exist for 'dead' media servers, then bpsched will still get stuck while trying to probe media servers for UP drives.
Image expiration will also be a problem because media cannot be deassigned.
Thanks for your update.
Yes, I am getting it done, but it is going to take a long time. I'm really not sure how to proceed with the mediadb, though, as none of the media servers are live now.
To alleviate your current situation (bpsched going into a hung state while waiting for a response), please delete unused Storage Units and delete unused names from the master's bp.conf.
This will take you all of 5 minutes... maybe 10 if you want to copy out a list of existing config.
Changing media server ownership of tapes can be dealt with later.
Maybe ask in a new post when you have the time to deal with it.
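For the bp.conf part of the cleanup, something like the following could list the entries that still name a dead host before anything is edited by hand. It is a sketch under assumptions: it expects the usual `KEY = value` layout with `SERVER` and `MEDIA_SERVER` entries, the function name `entries_for_hosts` is hypothetical, and the host names in the sample other than the one from the log above are placeholders. It only reports lines and changes nothing; the Storage Units themselves still have to be deleted separately.

```python
def entries_for_hosts(bp_conf_text, dead_hosts):
    """Return (line_number, line) for SERVER/MEDIA_SERVER entries naming a dead host."""
    dead = {host.lower() for host in dead_hosts}
    hits = []
    for number, line in enumerate(bp_conf_text.splitlines(), start=1):
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a KEY = value line
        is_server_entry = key.strip().upper() in {"SERVER", "MEDIA_SERVER"}
        if is_server_entry and value.strip().lower() in dead:
            hits.append((number, line.strip()))
    return hits

# Placeholder bp.conf content; only the second host comes from the log above.
sample_conf = """SERVER = master01.example.com
SERVER = live200mcs.retail2u.trcg.co.uk
MEDIA_SERVER = media02.example.com
CLIENT_NAME = master01.example.com
"""

for number, line in entries_for_hosts(sample_conf, ["live200mcs.retail2u.trcg.co.uk"]):
    print(f"line {number}: {line}")
```

Saving this output alongside a copy of the original bp.conf gives a record of what was removed.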