cancel
Showing results for 
Search instead for 
Did you mean: 

Tape continually being loaded when netbackup thinks it is complete

contra04
Level 5

Hello we have a strange issue today where we see a tape that has no data on it, that is continually being loaded. tHe backup job is now stuck at 4 gb, and has been doing this for the last 3-4 hours...

 

29/09/2011 10:33:24 - mounting ASI364
29/09/2011 10:34:25 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:35:17 - granted resource ASI364
29/09/2011 10:35:17 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:35:17 - granted resource Quantum-PX502
29/09/2011 10:35:19 - mounting ASI364
29/09/2011 10:36:26 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:38:12 - granted resource ASI364
29/09/2011 10:38:12 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:38:12 - granted resource Quantum-PX502
29/09/2011 10:38:17 - mounting ASI364
29/09/2011 10:39:30 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:40:43 - granted resource ASI364
29/09/2011 10:40:43 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:40:43 - granted resource Quantum-PX502
29/09/2011 10:40:45 - mounting ASI364
29/09/2011 10:41:52 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:43:29 - granted resource ASI364
29/09/2011 10:43:29 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:43:29 - granted resource Quantum-PX502
29/09/2011 10:43:38 - mounting ASI364
29/09/2011 10:44:40 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:45:52 - granted resource ASI364
29/09/2011 10:45:52 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:45:52 - granted resource Quantum-PX502
29/09/2011 10:45:57 - mounting ASI364
29/09/2011 10:47:08 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:49:07 - granted resource ASI364
29/09/2011 10:49:07 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:49:07 - granted resource Quantum-PX502
29/09/2011 10:49:12 - mounting ASI364

 

Any idea how to get out of this loop ? I am trying to reassign the tape to a diffrent pool - it is in the daily pool now, but it will not let me reassign it while it is in ue.  How do I fix this?

12 REPLIES 12

Andy_Welburn
Level 6

or in scratch?

If so, you could try to "freeze" ASI364 & see if this forces the job to try another tape.

I cannot understand *why* NB would continually try & load the same media. Is there possibly an underlying drive issue?

***EDIT***

Along the same lines, there could also be a specific media issue preventing it from actually loading successfully. Again, freeze the tape.

You may have to restart or suspend/resume the job.

Eject the tape from the library & give it a good inspection.

 

May also by errors logged in GUI reports, bptm logs, /usr/openv/netbackup/db/media/errors, syslogs/event viewer

contra04
Level 5

Hi thanks very much Andy. Im hitting panic now, and have been able to suspend 4 of the jobs, but all of the queued jobs then take there chance and try and grab the tape drive resource.

 

Ive pinned it down to the second drive on our Quantium Ultrium 3, and all jobs using this drive are in the loop. strange that 2 policies at the same time think they are both using the drive at the same time ? one of them does show error though:

 

This is another policy AT the same time ?? thinking that it is mounting this tape, using this drive HP.Ultrium3-SCSI.001

9/09/2011 10:28:02 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:29:09 - granted resource ASI364
29/09/2011 10:29:09 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:29:09 - granted resource Quantum-PX502
29/09/2011 10:29:14 - mounting ASI364
29/09/2011 10:30:14 - Error bptm(pid=8800) cannot open file C:\Program Files\VERITAS\NetBackup\db\media\tpreq\drive_HP.Ultrium3-SCSI.001, The device has indicated that cleaning is required before further operations are attempted. (1165)
29/09/2011 10:30:17 - Warning bptm(pid=8800) media id ASI364 load operation reported an error     
29/09/2011 10:30:18 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:31:23 - granted resource ASI364
29/09/2011 10:31:23 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:31:23 - granted resource Quantum-PX502
29/09/2011 10:31:26 - mounting ASI364
29/09/2011 10:32:34 - Error bptm(pid=8800) cannot open file C:\Program Files\VERITAS\NetBackup\db\media\tpreq\drive_HP.Ultrium3-SCSI.001, The device has indicated that cleaning is required before further operations are attempted. (1165)
29/09/2011 10:32:35 - Warning bptm(pid=8800) media id ASI364 load operation reported an error     
29/09/2011 10:32:35 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:33:22 - granted resource ASI364
29/09/2011 10:33:22 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:33:22 - granted resource Quantum-PX502
29/09/2011 10:33:24 - mounting ASI364
29/09/2011 10:34:23 - Error bptm(pid=8800) cannot open file C:\Program Files\VERITAS\NetBackup\db\media\tpreq\drive_HP.Ultrium3-SCSI.001, The device has indicated that cleaning is required before further operations are attempted. (1165)
29/09/2011 10:34:25 - Warning bptm(pid=8800) media id ASI364 load operation reported an error     
29/09/2011 10:34:25 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:35:17 - granted resource ASI364
29/09/2011 10:35:17 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:35:17 - granted resource Quantum-PX502
29/09/2011 10:35:19 - mounting ASI364
29/09/2011 10:36:19 - Error bptm(pid=8800) cannot open file C:\Program Files\VERITAS\NetBackup\db\media\tpreq\drive_HP.Ultrium3-SCSI.001, The device has indicated that cleaning is required before further operations are attempted. (1165)
29/09/2011 10:36:23 - Warning bptm(pid=8800) media id ASI364 load operation reported an error     
29/09/2011 10:36:26 - current media ASI364 complete, requesting next resource Any
29/09/2011 10:38:12 - granted resource ASI364
29/09/2011 10:38:12 - granted resource HP.Ultrium3-SCSI.001
29/09/2011 10:38:12 - granted resource Quantum-PX502
29/09/2011 10:38:17 - mounting ASI364
29/09/2011 10:39:22 - Error bptm(pid=8800) cannot open file C:\Program Files\VERITAS\NetBackup\db\media\tpreq\drive_HP.Ultrium3-SCSI.001, The device has indicated that cleaning is required before further operations are attempted. (1165)
29/09/2011 10:39:29 - Warning bptm(pid=8800) media id ASI364 load operation reported an error     
29/09/2011 10:39:30 - current media ASI364 complete, requesting next resource Any

 

Ahh ok cleaning the drive - any idea how to do it from the web page on an HP ultrium3? I know it is not recommended to do it from netbackup in software

Andy_Welburn
Level 6

the tape drive resource."

Been there. Done that!

"This is another policy AT the same time ??"

Depending on your config, there's nothing to stop, nor anything wrong with, multiple policies using the same drive/media - if they use the same volume pool & have the same retention then multiple backups will multiplex onto one drive.

"any idea how to do it from the web page on an HP ultrium3?"

Absolutely no idea! Nothing to stop you from manually inserting a cleaning cartridge into the drive (if you can gain direct access to it) OR load it into the media access port on the library & use "robtest" to move it into the drive OR inventory it into NetBackup (ensure barcode rules are correct so it gets recognised as a cleaning cartridge of the correct type) then you can right-click & "clean now" the drive.

Simplest & quickest option would be to robtest the cartridge from the MAP/CAP direct to the drive:

Robtest commands that can be used to test the SCSI functionality of a robot
http://www.symantec.com/business/support/index?page=content&id=TECH83129

so, if you load the cleaning cartridge into the first slot of the media access port - presuming you library has one - & your affected drive is drive 2 the robtest command

m p1 d2

will result in drive 2 being cleaned. m d2 p1 to move it back to allow for removal once complete.

Mark_Solutions
Level 6
Partner Accredited Certified

Hi

I have not heard of an issue with cleaning drives from NetBackup as long as it is done from Device Monitor and not from Activity Monitor.

Since NBU V6 the hex alert code has been read by NBU and appropriate action taken including cleaning.

No need to set a cleaning frequency, just let NBU handle it as needed.

Put an appropriate barcode label on a cleaning tape - such as CLN001 - and then set up a barcode rule under the advanced options in the inventory to set anything starting in CLN is a cleaning tape

Careful here - if you medi shows as HCART3 the the media type in the rule must be 1/2 inch cleaning tape 3 and it should default to being in the NONE pool.

NetBackup should then just clean the drive but if not just right click the drive and select Drive Cleaning - Clean Now

In the PX502 GUI you can go to Operations - Move and select the slot the cleaning tape is in and the drive that needs cleaning.

In relation to the original issue make sure that the Inventory in NetBackup is up to date and that no one has changed tapes whilst a tape was still in the drive.

If this happens the tape gets abandoned in the drive and you need to use robtest to move it to an empty slot and then update the NBU inventory.

The event logs on the Media Server (if it is windows) will also tell you a lot about what is happening.

Hope this helps

contra04
Level 5

Thanks a million guys, however I now have a severity 1 level red alert situation. Terrible day for backups, simply sinister.

 

I rebooted the backup server, and now I think I have corrupted the database as the database service will not start.

tried bpup, bpdown etc and the database just refuses to start. see the logs:

 

I am trying to follow this thread -http://www.symantec.com/business/support/index?page=content&id=TECH150327

however the linked document will not load!

 

do I need to create a new database then restore a catalog backup ?

 

How to do this in windows ?

29/09/2011 13:06:52.931 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:06:52.931 [ServerImpl::GetInterfaceRef] For <DeviceAllocator> version <1>
29/09/2011 13:06:52.946 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:06:52.946 V-111-1061 [DeviceAllocatorImpl::updateMachineState] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:06:52.977 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:52.993 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4928>
29/09/2011 13:06:52.993 [ServerImpl::GetInterfaceRef] For <Machine> version <1>
29/09/2011 13:06:52.993 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:06:53.009 V-111-1061 [MachineImpl::ListMachines] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4928>
29/09/2011 13:06:53.009 [MachineImpl::ListMachines] Master <harry>
29/09/2011 13:06:53.024 [MachineImpl::ListMachines] retval - <0>
29/09/2011 13:06:53.024 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:53.024 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4840>
29/09/2011 13:06:53.024 [ServerImpl::GetInterfaceRef] For <Machine> version <1>
29/09/2011 13:06:53.024 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:06:53.024 V-111-1061 [MachineImpl::ListMachines] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4840>
29/09/2011 13:06:53.024 [MachineImpl::ListMachines] Master <harry>
29/09/2011 13:06:53.040 [MachineImpl::ListMachines] retval - <0>
29/09/2011 13:06:53.056 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:53.056 [DeviceAllocatorImpl::updateMachineState] retval - <0>
29/09/2011 13:06:53.259 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:06:53.259 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:06:53.306 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:57.837 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:06:57.837 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:06:57.868 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:57.884 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:06:57.884 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:06:57.915 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:07:20.571 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:07:20.571 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:07:20.571 [Error] V-111-1203 Master server harry not found
29/09/2011 13:07:22.430 [DA_Thread_Pool::CheckIfVmscdIsNeeded] Failed to get database connection.(.\DAThreads.cpp:1518)
29/09/2011 13:08:12.429 [DA_Thread_Pool::CheckIfVmscdIsNeeded] Failed to get database connection.(.\DAThreads.cpp:1518)
29/09/2011 13:09:02.427 [DA_Thread_Pool::CheckIfVmscdIsNeeded] Failed to get database connection.(.\DAThreads.cpp:1518)
29/09/2011 13:09:13.068 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:09:13.068 [ServerImpl::GetInterfaceRef] For <DeviceConfig> version <1>
29/09/2011 13:09:13.083 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:09:13.083 V-111-1061 [DeviceConfigImpl::queryEmmDeviceConf] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:09:13.083 [DeviceConfigImpl::queryEmmDeviceConf] MachineName = < harry >, MachineType = < 1 >
29/09/2011 13:09:33.989 [FATClientORBConfig::shutServant] Calling fini() on FAT Client
29/09/2011 13:09:33.989 [FATClientORBConfig::shutServant] Done fini() on FAT Client
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:34.005 [REMORBConfig::shutServant] Calling fini() on REM
29/09/2011 13:09:34.083 [DeviceConfigImpl::queryEmmDeviceConf] Failed to get database connection.(.\DeviceConfigImpl.cpp:566)
29/09/2011 13:09:34.083 V-111-1061 [DeviceAllocatorImpl::updateMachineState] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>

contra04
Level 5

Guys Thanks for the help with cleaning

 

we now have a much more serious errror. I restarted the server and i think I have corrupted the relational database.

29/09/2011 13:06:52.915 [ExtMappingsImpl::queryExtFileVersion] retval - <0>
29/09/2011 13:06:52.931 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:06:52.931 [ServerImpl::GetInterfaceRef] For <DeviceAllocator> version <1>
29/09/2011 13:06:52.946 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:06:52.946 V-111-1061 [DeviceAllocatorImpl::updateMachineState] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:06:52.977 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:52.993 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4928>
29/09/2011 13:06:52.993 [ServerImpl::GetInterfaceRef] For <Machine> version <1>
29/09/2011 13:06:52.993 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:06:53.009 V-111-1061 [MachineImpl::ListMachines] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4928>
29/09/2011 13:06:53.009 [MachineImpl::ListMachines] Master <harry>
29/09/2011 13:06:53.024 [MachineImpl::ListMachines] retval - <0>
29/09/2011 13:06:53.024 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:53.024 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4840>
29/09/2011 13:06:53.024 [ServerImpl::GetInterfaceRef] For <Machine> version <1>
29/09/2011 13:06:53.024 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:06:53.024 V-111-1061 [MachineImpl::ListMachines] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<4840>
29/09/2011 13:06:53.024 [MachineImpl::ListMachines] Master <harry>
29/09/2011 13:06:53.040 [MachineImpl::ListMachines] retval - <0>
29/09/2011 13:06:53.056 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:53.056 [DeviceAllocatorImpl::updateMachineState] retval - <0>
29/09/2011 13:06:53.259 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:06:53.259 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:06:53.306 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:57.837 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:06:57.837 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:06:57.868 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:06:57.884 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:06:57.884 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:06:57.915 [StorageUnitImpl_2::ListStorageUnits_2] retval - <0>
29/09/2011 13:07:20.571 V-111-1061 [StorageUnitImpl_2::ListStorageUnits_2] Client signature HOST=<harry> VER=<700000> APP=<dbstunitq> PID=<5588>
29/09/2011 13:07:20.571 [StorageUnitImpl_2::ListStorageUnits_2] Master = harry
29/09/2011 13:07:20.571 [Error] V-111-1203 Master server harry not found
29/09/2011 13:07:22.430 [DA_Thread_Pool::CheckIfVmscdIsNeeded] Failed to get database connection.(.\DAThreads.cpp:1518)
29/09/2011 13:08:12.429 [DA_Thread_Pool::CheckIfVmscdIsNeeded] Failed to get database connection.(.\DAThreads.cpp:1518)
29/09/2011 13:09:02.427 [DA_Thread_Pool::CheckIfVmscdIsNeeded] Failed to get database connection.(.\DAThreads.cpp:1518)
29/09/2011 13:09:13.068 V-111-1061 [ServerImpl::GetInterfaceRef] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:09:13.068 [ServerImpl::GetInterfaceRef] For <DeviceConfig> version <1>
29/09/2011 13:09:13.083 [ServerImpl::GetInterfaceRef] retval - <0>
29/09/2011 13:09:13.083 V-111-1061 [DeviceConfigImpl::queryEmmDeviceConf] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>
29/09/2011 13:09:13.083 [DeviceConfigImpl::queryEmmDeviceConf] MachineName = < harry >, MachineType = < 1 >
29/09/2011 13:09:33.989 [FATClientORBConfig::shutServant] Calling fini() on FAT Client
29/09/2011 13:09:33.989 [FATClientORBConfig::shutServant] Done fini() on FAT Client
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:33.989 [EMMTaskBase::run_event_loop] Finished run() in this thread for <NBFSMCLIENT>
29/09/2011 13:09:34.005 [REMORBConfig::shutServant] Calling fini() on REM
29/09/2011 13:09:34.083 [DeviceConfigImpl::queryEmmDeviceConf] Failed to get database connection.(.\DeviceConfigImpl.cpp:566)
29/09/2011 13:09:34.083 V-111-1061 [DeviceAllocatorImpl::updateMachineState] Client signature HOST=<harry> VER=<700000> APP=<ltid> PID=<4772>

How can I restore this without a backup server?

 

using NB 7.0 on windows

contra04
Level 5

Thank you both for your replies, a couple of legends !

 

It has taklen me an hour to get netbackup on line after restarting the server. The database service would not start. I feared I would have to recreate the database then perform a restore. Was totally pooing myself.

When trying a clean from netbackup I get an error:

The following error occurred while attempting to clean drive HP.Ultrium3-SCSI.001 on harry. daemon failed accepting connection (59)

I have a cleaning take in slot 38, and it is named CLN01. it was part of a pool called "netbackup" Where are the rules to set the 3 inch thing?

I managed to move the tape inthe 502 GUI:

"In the PX502 GUI you can go to Operations - Move and select the slot the cleaning tape is in and the drive that needs cleaning."

 

Do you know how long this takes ? is there a status monitor to tell it is cleaning?

Andy_Welburn
Level 6

Can be accessed when you inventory robot. (See attached )

If they are "LTO3" for example they will show in the media list in the GUI as HC3_CLN.

Mark_Solutions
Level 6
Partner Accredited Certified

If the tape is already in NetBackup it may show as HCART (or HCART2 etc.) already so a bar code rule at that point is too late.

If that is the case delete it from NetBackup and then create the barcode rule and re-inventory the library so that it gets added as a new cleaning tape and goes into the NONE pool.

contra04
Level 5

Thanks guys - I deleted the tape, then changed the rules as the barcode began with L1CLN. have added a new rule, and got it to show as HC3_CLN after an inventory update. Awesome. All the help ive got here is amazing. 3 months ago I saw netbackup for the first time. And am am still scared at the mess im mopping up ;)

 

Now I get a protocol error 39 - is that because there are 2 backup jobs running on each drive.?

 

Do I need to stop all jobs then right click - clean from the device monitor?

Andy_Welburn
Level 6

NetBackup Status Code 39 Microsoft Cluster Server (MSCS) clustered Enterprise Vault database backup is not accepted in NetBackup.
Media Manager Status Code 39 network protocol error
Device Configuaration Error Code 39 Adding this drive would exceed the maximum allowed

- none seem to fit with anything to do with device cleaning as far as I can see!

Digging a bit further (didn't realise error codes seem to come from all over these days!!):

Device Management Status Code 39 Parameter is invalid

The tpclean command was called with invalid arguments, or an internal function encountered a missing reference to data it requires.

  • If a cleaning operation was requested, check the tpclean usage statement and compare with the parameters that were specified.

  • Check the installed software components and verify that they are all at a compatible release version.

Is that how you were trying to clean the drive, with the tpclean command? If so, how did you run it?

"Do I need to stop all jobs then right click - clean from the device monitor?" - probably the easiest way! You managed to get your jobs running ok then? Not sure if a cleaning job will queue as per a normal job or whether it'll just jump in as soon as possible. Generally I wait until free then right-click "clean now"

 

contra04
Level 5

Hmm both the drives are no longer in a status of "drives needs cleaning" and the tape has a remaining cleans cound of 22. I had set this to 25 in the robot rule.  Does this mean 3 cleans have happened in automatically? I hope so ! awesome if yes.

Apparently there was a backup guy employed here for 3 months who was a bit rubbish and he messed up all of the tape names. This is why the original rule setup by the old spanish master (who no longer works here) was no longer working. He had the classic rule CLN tapes are cleaning tapes. The new tape was L1CLN. About half the tapes in the robot no longer match there barcodes.  So when a new cleaning tape was put in about a year ago, it just sat there, and was classified as a normal tape.

However running a manual one results in this:

 

this is the exact error

 

Now I am quite sure that my two tape drives in the Quantum run 24 hours a day, 7 days a week.  And I have 30 percent of jobs failing because the backup windows closes. Weekly and monthly backups take 3 days to run, and there is not enough time in the day to back up the data at this organisation. Where does one begin looking at changing backup windows etc? What are some usefull command line reporting tools I can use to get a better picure of:

How much data is being backed up by each policy and when - seems to be wildly different each day in the activity monitor. Like when in the start window, something is starting - it seems to be totally random depending on which other jobs are hogging the tape drives. 

If many policys affect the start times of  other policies how do even start to get an idea of how we can makes things better ?

How can I see a log of drive time in a day, being used by which policy when ?

I feel like I have inherited a monster - and with my limited knowledge I feel like I need to get a hold of what is going on.  Is it just too much data for two LTO3 drives - or can I fine tune the start windows, I need to find a way to get the information to answer these questions.