cancel
Showing results for 
Search instead for 
Did you mean: 

Media Server Lost connection between Master Server

Vasu_H
Level 3

Hi All,

As we have 1 master Server and 3 Media Servers, in that one particular media server we are receiving the below error while open the NetBackup Administrator Console.

unable to read configuration: cannot connect on socket (25).

Unable to connect to the EMM Server : The EMM server failed to process the request (78).

We have check the below steps:

1. We have checked the connectivity from Master Server to media server (Ping Status, nslookup....) fine.

2. We have checked the EMM status from Master Server its fine. But while checking from Media Server its not available.

3. We have checked the configuration from Master server and also compared with other media server, everything same.

Note: From Last sunday(21st march 2015) to till date all the backup jobs which is scheduled under this media server its getting failed with error code 25 and 26.

Please help me how to check further for this issue.

Regards,

Vasu H

10 REPLIES 10

Marianne
Level 6
Partner    VIP    Accredited Certified

Please post output of the following commands on the master server :

nbemmcmd -listhosts -verbose

nbemmcmd -getemmserver

(highlight problematic media server in above output)

Check if OS firewall perhaps got activated on media server .

Create bpcd log folder on client and see if master server connection request is received by media server when you run this command:

bptestbpcd -host (media-server-name) -verbose -debug

 

Post output of command and media server's bpcd log as .txt File attachment.

Vasu_H
Level 3

Hi Marianne,

 

Below are the requested details and the problematic Media Server Name: clap-nbu01.

nbemmcmd -listhosts -verbose

NBEMMCMD, Version: 7.5.0.7
The following hosts were found:
s3ocsm32
    ClusterName = ""
    MachineName = "s3ocsm32"
    FQName = "s3ocsm32"
    LocalDriveSeed = ""
    MachineDescription = ""
    MachineFlags = 0x17
    MachineNbuType = media (1)
    MachineState = active for tape and disk jobs (14)
    MasterServerName = "crw-nbu01"
    NetBackupVersion = 7.5.0.7 (750700)
    OperatingSystem = windows (11)
    ScanAbility = 5
crw-nbu02
    ClusterName = ""
    MachineName = "crw-nbu02"
    FQName = "crw-nbu02.ocs.co.uk"
    LocalDriveSeed = ""
    MachineDescription = ""
    MachineFlags = 0x15
    MachineNbuType = media (1)
    MachineState = active for tape and disk jobs (14)
    MasterServerName = "crw-nbu01"
    NetBackupVersion = 7.5.0.7 (750700)
    OperatingSystem = windows (11)
    ScanAbility = 5
clap-nbu01
    ClusterName = ""
    MachineName = "clap-nbu01"
    FQName = "clap-nbu01.ocs.co.uk"
    LocalDriveSeed = ""
    MachineDescription = ""
    MachineFlags = 0x14
    MachineNbuType = media (1)
    MachineState = active for tape jobs (10)
    MasterServerName = "crw-nbu01"
    NetBackupVersion = 7.5.0.7 (750700)
    OperatingSystem = windows (11)
    ScanAbility = 5

crw-nbu01
    ClusterName = ""
    MachineName = "crw-nbu01"
    FQName = "crw-nbu01.ocs.co.uk"
    GlobalDriveSeed = "VEND:#.:PROD:#.:IDX"
    LocalDriveSeed = ""
    MachineDescription = ""
    MachineFlags = 0x17
    MachineNbuType = master (3)
    MachineState = active for disk jobs (12)
    NetBackupVersion = 7.5.0.7 (750700)
    OperatingSystem = windows (11)
    ScanAbility = 5
S3OCSV02
    MachineName = "S3OCSV02"
    FQName = "s3ocsv02.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
S3OCSM23
    MachineName = "S3OCSM23"
    FQName = "s3ocsm23.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-vi08
    MachineName = "crw-vi08"
    FQName = "crw-vi08.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-vi07
    MachineName = "crw-vi07"
    FQName = "crw-vi07.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-vi06
    MachineName = "crw-vi06"
    FQName = "crw-vi06.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-vi05
    MachineName = "crw-vi05"
    FQName = "crw-vi05.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-vi03
    MachineName = "crw-vi03"
    FQName = "crw-vi03.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-vi02
    MachineName = "crw-vi02"
    FQName = "crw-vi02.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
Crw-vi04
    MachineName = "Crw-vi04"
    FQName = "crw-vi04.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
Clap-vc01
    MachineName = "Clap-vc01"
    FQName = "clap-vc01.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-vi01
    MachineName = "crw-vi01"
    FQName = "crw-vi01.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
Crw-vc01
    MachineName = "Crw-vc01"
    FQName = "crw-vc01.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
Crw-vdiesx02
    MachineName = "Crw-vdiesx02"
    FQName = "crw-vdiesx02.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
Crw-vdiesx01
    MachineName = "Crw-vdiesx01"
    FQName = "crw-vdiesx01.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
Crw-vdivc01
    MachineName = "Crw-vdivc01"
    FQName = "crw-vdivc01.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
Craw-mon-vc01
    MachineName = "Craw-mon-vc01"
    FQName = "craw-mon-vc01.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = virtual_machine (10)
crw-nbu01
    MachineName = "crw-nbu01"
    FQName = "crw-nbu01.ocs.co.uk"
    MachineDescription = ""
    MachineNbuType = server (6)
Command completed successfully.

 

----------------------------------------------------------------------------------------------------------------------------------------------

nbemmcmd -getemmserver

NBEMMCMD, Version: 7.5.0.7
These hosts were found in this domain: clap-nbu01, crw-nbu01, crw-nbu02, s3ocsm32

Checking with the host "clap-nbu01"... 
Checking with the host "crw-nbu01"... 
Checking with the host "crw-nbu02"... 
Checking with the host "s3ocsm32"... 

Server Type    Host Version        Host Name                     EMM Server          
MEDIA          7.5                 clap-nbu01                    crw-nbu01           
MASTER         7.5                 crw-nbu01                     crw-nbu01           
MEDIA          7.5                 crw-nbu02                     crw-nbu01           
MEDIA          7.5                 s3ocsm32                      crw-nbu01           
Command completed successfully.

-------------------------------------------------------------------------------------------------------------------------------------------------

bptestbpcd -host (media-server-name) -verbose -debug

13:34:31.670 [4680.5328] <2> setup_debug_log: switched debug log file for bpcd
13:34:31.670 [4680.5328] <2> bpcd main: VERBOSE = 0
13:34:31.670 [4680.5328] <2> logparams: C:\Program Files\Veritas\NetBackup\bin\bpcd.exe -standalone 
13:34:31.670 [4680.5328] <2> process_requests: offset to GMT 0
13:34:31.670 [4680.5328] <2> logconnections: BPCD ACCEPT FROM 172.16.0.116.54421 TO 172.16.16.84.1556 fd = 428
13:34:31.670 [4680.5328] <2> process_requests: setup_sockopts complete
13:34:31.670 [4680.5328] <2> bpcd peer_hostname: Connection from host crw-nbu01 (172.16.0.116) port 54421
13:34:31.670 [4680.5328] <2> bpcd valid_server: comparing crw-nbu01 and crw-nbu01
13:34:31.670 [4680.5328] <4> bpcd valid_server: hostname comparison succeeded
13:34:31.857 [4680.5328] <2> process_requests: output socket port number = 1
13:34:32.575 [4680.5328] <8> verify_hashes: [vnet_vnetd.c:1641] hash_str baf751426ec7852a384ef19d1e2584a9
13:34:32.575 [4680.5328] <2> process_requests: Duplicated vnetd socket on stderr
13:34:32.575 [4680.5328] <2> process_requests: <---- NetBackup 7.5 0 ------------initiated
13:34:32.575 [4680.5328] <2> process_requests: VERBOSE = 0
13:34:32.575 [4680.5328] <2> process_requests: Not using VxSS authentication with crw-nbu01
13:34:32.824 [4680.5328] <2> process_requests: BPCD_PEERNAME_RQST
13:34:32.824 [4680.5328] <2> bpcd peer_hostname: Connection from host crw-nbu01 (172.16.0.116) port 54421
13:34:33.027 [4680.5328] <2> process_requests: BPCD_HOSTNAME_RQST
13:34:33.246 [4680.5328] <2> process_requests: BPCD_CLIENTNAME_RQST
13:34:33.464 [4680.5328] <2> process_requests: BPCD_GET_VERSION_RQST
13:34:33.682 [4680.5328] <2> process_requests: BPCD_GET_PLATFORM_RQST
13:34:33.916 [4680.5328] <2> process_requests: BPCD_GET_VERSION_RQST
13:34:34.150 [4680.5328] <2> process_requests: BPCD_PATCH_VERSION_RQST
13:34:34.353 [4680.5328] <2> retrieveLocalPatchVersion: Reading from C:\Program Files\Veritas\NetBackup\bin\version.txt
13:34:34.353 [4680.5328] <2> parsePatchVersionString: parsing = >7.5.0.7
<
13:34:34.353 [4680.5328] <2> parsePatchVersionString: theRest = ><
13:34:34.603 [4680.5328] <2> process_requests: BPCD_GET_VERSION_RQST
13:34:34.806 [4680.5328] <2> process_requests: BPCD_PATCH_VERSION_RQST
13:34:34.806 [4680.5328] <2> retrieveLocalPatchVersion: Reading from C:\Program Files\Veritas\NetBackup\version.txt
13:34:34.806 [4680.5328] <2> parsePatchVersionString: parsing = >7.5.0.7
<
13:34:34.806 [4680.5328] <2> parsePatchVersionString: theRest = ><
13:34:35.040 [4680.5328] <2> process_requests: BPCD_GET_VERSION_RQST
13:34:35.258 [4680.5328] <2> process_requests: BPCD_READ_HOST_CONFIG_RQST
13:34:35.508 [4680.5328] <2> process_requests: BPCD_DISCONNECT_RQST
13:34:35.508 [4680.5328] <2> bpcd exit_bpcd: exit status 0  ----------->exiting

 

---------------------------------------------------------------------------------------------------------------------------------------------------

Please help me on this issue.

 

Regards,

Vasu H

Nicolai
Moderator
Moderator
Partner    VIP   

Please attach large amount of debug text as a text file.

Connect anti-spam often marks debug text as spam and thus need moderation from the TA.

Vasu_H
Level 3

Hi Marianne,

 

When I have posted my updates I got the below message. I have tried twice and getting same message. Please let me know how to send the details to you.

"Your comment has been queued for moderation by site administrators and will be published after approval"

Thanks & Regards,

Vasu H

revarooo
Level 6
Employee

login to clap-nbu01  and then run:

bptestbpcd -host crw-nbu01  -verbose

Also:

bpclntcmd -self

bpclntcmd -pn

 

also run this so we can see what services are running

bpps 

 

 

post up all of the output

Vasu_H
Level 3

Hi Revarooo,

Below are the details.

bptestbpcd -host crw-nbu01  -verbose

0 1 2
172.16.16.84:526 -> 172.16.0.116:13782
172.16.16.84:13724 <- 172.16.0.116:55840
PEER_NAME = clap-nbu01.ocs.co.uk
HOST_NAME = crw-nbu01
CLIENT_NAME = crw-nbu01
VERSION = 0x07500007
PLATFORM = win_x64
PATCH_VERSION = 7.5.0.7 
SERVER_PATCH_VERSION = 7.5.0.7 
MASTER_SERVER = crw-nbu01
EMM_SERVER = crw-nbu01
NB_MACHINE_TYPE = MASTER_SERVER

------------------------------------------------------------------------------------------------------------------------------------------------------

bpclntcmd -self

gethostname() returned: clap-nbu01
host clap-nbu01: clap-nbu01 at 172.16.16.84
aliases:     clap-nbu01     172.16.16.84
getfqdn(clap-nbu01) returned: Clap-nbu01.ocs.co.uk

----------------------------------------------------------------------------------------------------------------------------------------------------

bpclntcmd -pn

We didn't get any output for this command

---------------------------------------------------------------------------------------------------------------------------------------------------

bpps

* CLAP-NBU01                                             3/25/15 14:11:58.600
COMMAND           PID      LOAD             TIME   MEM                  START
vnetd            5004    0.000%            0.062  8.4M   3/25/15 10:48:36.903
bpinetd          6264    0.000%            3.603   11M   3/25/15 10:48:37.028
bpcd             4104    0.000%            0.062   10M   3/25/15 10:48:37.480
vmd              4728    0.000%            0.078   18M   3/25/15 10:48:37.605
ltid             4528    0.000%            0.093   20M   3/25/15 10:48:37.730
avrd             5736    0.000%            0.046   15M   3/25/15 10:49:01.676
tldd             7936    0.000%            0.031   15M   3/25/15 10:49:01.676
tldcd            6164    0.000%            0.046   16M   3/25/15 10:49:02.908
NBConsole        6840    0.000%            1.700   52M   3/25/15 13:23:02.119
bpbrm            7392    0.000%            0.062   15M   3/25/15 13:41:26.755
bpps              852    0.000%            0.015  7.7M   3/25/15 14:11:57.571

----------------------------------------------------------------------------------------------------------------------------------------------------

 

Thanks & Regards,

Vasu H

 

 

 

 

 

Nicolai
Moderator
Moderator
Partner    VIP   

Please read my previous post

Vasu_H
Level 3

Hi Nicolai,

I tried to attach the text file but in vain. So, I have pasted the details in this.

 

Regards,

Vasu H

revarooo
Level 6
Employee

- bpclntcmd -pn

- We didn't get any output for this command

 

then you have a problem. Enable the bprd log on the master (if not already done so) re-run the command on the media server and look for the IP of your media server incoming

 

you should see something like:

 

logconnections: BPRD ACCEPT FROM xx.xxx.xxx.xxx.58796 TO xx.xxx.xxx.xxx.13720 fd = 9

The first xx.xxx being the IP of your media server - does it show that? If not, it's a comms issue (firewall, routing etc)

 

Vasu_H
Level 3

Hi All,

 

Thank you so much for your help. I have restarted the media server, then the issue got resolved.

Once again my sincere thanks to you all.

Regards,

Vasu H