Problem with SLP

Seve
Level 2

Our organization runs NetBackup 7.6.1.2 on Solaris 10 SPARC systems. Every backup goes through an SLP in two steps: first a backup to a 5230 media server, then a duplication to a remote 5230.

Because of a procedural mistake, the master server was stopped while SLPs were still running. As a result, we are now getting the following errors:

 

 # bperror -U -problems |more
    TIME            SERVER/CLIENT                      TEXT
11/29/2015 09:17:18 sfexpl sfexpl  Image processing for 2 images has been
                    aborted due to unrecoverable errors
11/29/2015 09:22:21 sfexpl sfexpl  Image processing for 2 images has been
                    aborted due to unrecoverable errors

 

 

Indeed, we have two backups that could not complete their SLP cycle. We identified the images that cannot be duplicated with the bpimagelist command, restricted to a specific date range because we know the interval in which the error occurred:

 # bpimagelist -idonly  -d 01/01/2015 -e 11/20/2015 -stl_incomplete
Time: Fri Nov 06 12:29:26 2015   ID: probda_1446809366   UBAK (2)
Time: Fri Nov 06 12:29:25 2015   ID: probda_1446809365   UBAK (2)
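
To reuse those IDs in later commands, the listing can be reduced to bare backup IDs. This is a minimal sketch, assuming the "Time: ... ID: ..." layout shown above; extract_backup_ids is a hypothetical helper name, not a NetBackup tool.

```shell
# Hypothetical helper: read `bpimagelist -idonly` output on stdin and
# print only the backup IDs, one per line.
extract_backup_ids() {
    # Keep the token that follows "ID:" and drop the rest of the line.
    sed -n 's/.*ID:[[:space:]]*\([^[:space:]]*\).*/\1/p'
}
```

It could be used as: bpimagelist -idonly -d 01/01/2015 -e 11/20/2015 -stl_incomplete | extract_backup_ids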

 

For each of them we checked the SLP state and found that "Storage Lifecycle State" is "1 (NOT_STARTED)":

root@sfexpl # nbstlutil list -backupid probda_1446809366 -U|less
Image:
 Master Server                 : sfexpl
 Backup ID                     : probda_1446809366
 Client                        : probda
 Backup Time                   : 1446809366 (2015/11/06 12:29:26)
 Policy                        : rman_probda_probdb_caepro
 Client Type                   : 4
 Schedule Type                 : 2
 Storage Lifecycle Policy      : SLP_RMAN_PRO
 Storage Lifecycle State       : 1 (NOT_STARTED)
 Storage Lifecycle Is Inactive : false
 Time In Process               : 0 (0:00:00)
 Data Classification ID        : (none specified)
 Version Number                : 1
 OriginMasterServer            : (none specified)
 OriginMasterServerID          : 00000000-0000-0000-0000-000000000000
 Import From Replica Time      : 0 (0:00:00)
 Required Expiration Date      : 0 (0:00:00)
 Created Date Time             : 1446809366 (2015/11/06 12:29:26)
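
When several images have to be checked, the state field can be pulled out of that listing automatically. This is a rough sketch, assuming the "Field : value" layout shown above; lifecycle_state is a hypothetical helper name.

```shell
# Hypothetical helper: read `nbstlutil list -U` output on stdin and print
# the value of the "Storage Lifecycle State" field for each image.
lifecycle_state() {
    # Split each line at the first ": " and print what follows it.
    awk -F': ' '/Storage Lifecycle State/ { print $2 }'
}
```

It could be used as: nbstlutil list -backupid probda_1446809366 -U | lifecycle_state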

 

 

We could also run nbstlutil list -l -image_incomplete, which would show the same state.

 

For each image in that state, we checked in the catalog which copies are available:

Client:            probda
Backup ID:         probda_1446809366
Policy:            rman_probda_probdb_caepro
Policy Type:       Oracle (4)
Proxy Client:      (none specified)
Creator:           oracle
Name1:             (none specified)
Sched Label:       Default-Application-Backup
Schedule Type:     UBAK (2)
Retention Level:   190 days (11)
Backup Time:       Fri Nov 06 12:29:26 2015 (1446809366)
Elapsed Time:      8 second(s)
Expiration Time:   Sat May 14 13:29:26 2016 (1463225366)
Maximum Expiration Time:   Thu Jan 01 01:00:00 1970 (0)
Compressed:        no
Client Encrypted:  no
Kilobytes:         16416
Number of Files:   1
Number of Copies:  0
Number of Fragments:   0

.......

 

As I see it, the image is recorded in the catalog but its number of copies is zero (?), and since its SLP has not completed, nbstserv keeps trying to restart its duplication without success.
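
The reasoning above hinges on the "Number of Copies" field. This is a minimal sketch of checking it in a script, assuming the long listing format shown above is piped in on stdin; has_copies is a hypothetical helper name.

```shell
# Hypothetical helper: read the long image listing on stdin and succeed
# (exit status 0) only if the image reports at least one copy.
has_copies() {
    awk -F':' '
        /^Number of Copies:/ { n = $2 + 0 }   # "  0" converts to the number 0
        END { exit (n > 0 ? 0 : 1) }
    '
}
```

For the listing above it returns a non-zero status, which matches the bpverify message that the image does not contain data.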

 

If I verify this image:

 bpverify -backupid probda_1446809366
Verify started 11/30/2015 09:31:46
INF - Skipping backup id probda_1446809366, it does not contain data.

INF - Status = found no images or media matching the selection criteria.

 

How can I cancel these backups/SLPs? I tried the following command, but without success, because the retries continue:

 

 nbstlutil cancel -backupid probda_1446809366

 

In the nbstserv log I find:

# vxlogview --prodid 51216 -o 226 -b "11/28/2015 00:00:01 AM" > /tmp/dbtserv.log

less /tmp/dbtserv.log

11/30/15 09:39:36.760 [ImageMgr::handle_timeout] Starting new session(ImageManager.cpp:315)
11/30/15 09:39:38.463 V-226-27 [ImageMgr::NewImageProcessing] No completed backup copy located for image probda_1446809365
11/30/15 09:39:38.463 V-226-27 [ImageMgr::NewImageProcessing] No completed backup copy located for image probda_1446809366
11/30/15 09:39:39.563 [Error] V-226-26 Image processing for 2 images has been aborted due to unrecoverable errors
11/30/15 09:39:39.564 [ImageMgr::handle_timeout] Session completed(ImageManager.cpp:319)
11/30/15 09:39:39.564 [ImageMgr::StartSession] next ImageMgr session scheduled for 2015/11/30 09:44:39(ImageManager.cpp:267)
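
Since the same messages repeat in every session, it can help to list each affected image only once. This is a small sketch, assuming the message text shown in the log above; images_without_copy is a hypothetical helper name.

```shell
# Hypothetical helper: read nbstserv log text on stdin and print each image
# reported as "No completed backup copy located", without duplicates.
images_without_copy() {
    sed -n 's/.*No completed backup copy located for image \([^[:space:]]*\).*/\1/p' | sort -u
}
```

It could be used as: images_without_copy < /tmp/dbtserv.log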

1 ACCEPTED SOLUTION

stucci
Level 6

Hi, I read your post even though I am Italian, and I think I understand the problem.

Use the following command to see how many copies your backup has:

bpimagelist -backupid

To check whether a backup ID has finished its SLP cycle, run the command below; if the Expire date column shows "Infinite", the duplications have not finished:

bpimagelist -backupid *********** -U

You should have at least the primary copy, and you need to try to stop the SLP cycle.

First, try to interrupt the duplication cycle:

nbstlutil inactive -backupid

Then try to cancel the copy:

nbstlutil cancel -backupid

After stopping the SLP, if you also want to delete the backup, you can expire the whole image:

bpexpdate -recalculate -backupid ******* -d 0

2 REPLIES

Genericus
Moderator

Agree with stucci - please mark his comment as the solution.

From the master server:

1. Inactivate SLP processing for the backup ID: nbstlutil inactive -wait -backupid probda_1446809366

2. Make sure it is not actively running in the Activity Monitor.

3. Cancel SLP processing for the backup ID: nbstlutil cancel -wait -backupid probda_1446809366
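
The steps above could be wrapped in a small script when more than one image is affected. This is only a sketch: the nbstlutil options are the ones quoted in this reply, while DRY_RUN, run and stop_slp_for are hypothetical names. By default the commands are printed rather than executed, so the list can be reviewed first.

```shell
# Sketch of the inactivate-then-cancel sequence for a list of backup IDs.
# With DRY_RUN=1 (the default here) the commands are only printed.
DRY_RUN=${DRY_RUN:-1}

run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

stop_slp_for() {
    for id in "$@"; do
        # Step 1: stop further SLP processing for this image.
        run nbstlutil inactive -wait -backupid "$id" || return 1
        # Step 2 is manual: check the Activity Monitor before cancelling.
        # Step 3: cancel the pending SLP operations for this image.
        run nbstlutil cancel -wait -backupid "$id" || return 1
    done
}
```

For example, stop_slp_for probda_1446809365 probda_1446809366 prints the four commands in order; setting DRY_RUN=0 on the master server would actually run them.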

NetBackup 9.1.0.1 on Solaris 11, writing to Data Domain 9800 7.7.4.0
duplicating via SLP to LTO5 & LTO8 in SL8500 via ACSLS