can't connect to client(58)

Ghustavo
Level 3

Hello,

I need help from your specialists to fix a failing backup job.

The details of the backup job follow:

09/07/2018 12:00:00 - Info nbjm(pid=5124) starting backup job (jobid=68462) for client srvbd1-ma.ma.trf1.gov.br, policy JFMA-OVm-Bd-06_Bd1Archiv, schedule Arch 

09/07/2018 12:00:00 - Info nbjm(pid=5124) requesting MEDIA_SERVER_WITH_ATTRIBUTES resources from RB for backup job (jobid=68462, request id:{8039333B-23E0-4C72-A76F-6BBEBA6C080E}) 

09/07/2018 12:00:00 - requesting resource dedup.stu.srvbkp1-ma

09/07/2018 12:00:00 - requesting resource srvbkp1-ma.NBU_CLIENT.MAXJOBS.srvbd1-ma.ma.trf1.gov.br

09/07/2018 12:00:00 - requesting resource srvbkp1-ma.NBU_POLICY.MAXJOBS.JFMA-OVm-Bd-06_Bd1Archiv

09/07/2018 12:00:00 - granted resource srvbkp1-ma.NBU_CLIENT.MAXJOBS.srvbd1-ma.ma.trf1.gov.br

09/07/2018 12:00:00 - granted resource srvbkp1-ma.NBU_POLICY.MAXJOBS.JFMA-OVm-Bd-06_Bd1Archiv

09/07/2018 12:00:00 - granted resource dedup.stu.srvbkp1-ma

09/07/2018 12:00:00 - estimated 0 Kbytes needed

09/07/2018 12:00:00 - Info nbjm(pid=5124) started backup (backupid=srvbd1-ma.ma.trf1.gov.br_1531148400) job for client srvbd1-ma.ma.trf1.gov.br, policy JFMA-OVm-Bd-06_Bd1Archiv, schedule Arch on storage unit dedup.stu.srvbkp1-ma

09/07/2018 12:00:00 - started process bpbrm (31624)

09/07/2018 12:00:22 - Info bpbrm(pid=31624) connect failed STATUS (18) CONNECT_FAILED       

09/07/2018 12:00:22 - Info bpbrm(pid=31624) status: FAILED, (43) CONNECT_RESET; system: (10054) An existing connection was forcibly closed by the remote host. ; FROM 0.0.0.0 TO srvbd1-ma.ma.trf1.gov.br 192.168.0.80 bpcd VIA pbx

09/07/2018 12:00:22 - Info bpbrm(pid=31624) status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0.0.0.0 TO srvbd1-ma.ma.trf1.gov.br 192.168.0.80 bpcd VIA vnetd

09/07/2018 12:00:22 - Info bpbrm(pid=31624) status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0.0.0.0 TO srvbd1-ma.ma.trf1.gov.br 192.168.0.80 bpcd

09/07/2018 12:00:22 - Error bpbrm(pid=31624) Cannot connect to srvbd1-ma.ma.trf1.gov.br        

09/07/2018 12:00:22 - Info bpbkar32(pid=0) done. status: 58: can't connect to client     

09/07/2018 12:00:22 - end writing

can't connect to client(58)

 

1. The host cache was cleared, so that any changes related to host validation are picked up.

2. Forward and reverse lookup tests were run from the master server.

C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -self

gethostname() returned: srvbkp1-ma

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at ::1

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.26

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.70

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 172.23.3.181

aliases:     srvbkp1-ma.ma.trf1.gov.br     srvbkp1-ma     ::1     192.168.0.70     192.168.0.26     172.23.3.181

getfqdn(srvbkp1-ma) returned: srvbkp1-ma.ma.trf1.gov.br

 C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -self srvbd1-ma

gethostname() returned: srvbkp1-ma

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at ::1

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.26

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.70

host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 172.23.3.181

aliases:     srvbkp1-ma.ma.trf1.gov.br     srvbkp1-ma     ::1     192.168.0.70     192.168.0.26     172.23.3.181

getfqdn(srvbkp1-ma) returned: srvbkp1-ma.ma.trf1.gov.br

 C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -pn

expecting response from server srvbkp1-ma

srvbkp1-ma.ma.trf1.gov.br srvbkp1-ma 192.168.0.26 50629

 C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -hn srvbd1-ma

host srvbd1-ma: srvbd1-ma.ma.trf1.gov.br at 192.168.0.80

aliases:     srvbd1-ma.ma.trf1.gov.br     srvbd1-ma     192.168.0.80

 

C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -ip 192.168.0.80

host 192.168.0.80: srvbd1-ma.ma.trf1.gov.br at 192.168.0.80

aliases:     srvbd1-ma.ma.trf1.gov.br     192.168.0.80

3. Result of the bptestbpcd command

C:\Program Files\Veritas\NetBackup\bin\admincmd>bptestbpcd -client srvbd1-ma -debug -verbose

13:49:03.706 [46304.44280] <2> bptestbpcd: VERBOSE = 0

13:49:03.800 [46304.44280] <2> vnet_pbxConnect: ../../libvlibs/vnet_pbx.c.652: pbxSetAddrEx/pbxConnectEx return error 10054:An existing connection

 forcibly closed by the remote host.

13:49:03.800 [46304.44280] <8> do_pbx_service: [vnet_connect.c:2081] vnet_pbxConnect() failed, status=18, errno=10054, use_vnetd=0, cr->vcr_servic

cd

13:49:03.800 [46304.44280] <8> async_connect: [vnet_connect.c:1677] do_service failed 18 0x12

13:49:04.798 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:05.812 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:07.825 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:08.823 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:10.836 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:11.850 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:15.859 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:16.889 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:24.892 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:25.890 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d

13:49:25.890 [46304.44280] <16> connect_to_service: connect failed STATUS (18) CONNECT_FAILED

        status: FAILED, (43) CONNECT_RESET; system: (10054) An existing connection was forcibly closed by the remote host. ; FROM 0.0.0.0 TO srvbd

 192.168.0.80 bpcd VIA pbx

        status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0

.0 TO srvbd1-ma 192.168.0.80 bpcd VIA vnetd

        status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0

.0 TO srvbd1-ma 192.168.0.80 bpcd

13:49:25.890 [46304.44280] <8> vnet_connect_to_bpcd: [vnet_connect.c:297] connect_to_service() failed 18 0x12

13:49:25.890 [46304.44280] <2> local_bpcr_connect: Can't connect to client srvbd1-ma

13:49:25.890 [46304.44280] <2> ConnectToBPCD: bpcd_connect_and_verify(srvbd1-ma, srvbd1-ma) failed: 25

<16>bptestbpcd main: Function ConnectToBPCD(srvbd1-ma) failed: 25

13:49:25.890 [46304.44280] <16> bptestbpcd main: Function ConnectToBPCD(srvbd1-ma) failed: 25

<16>bptestbpcd main: cannot connect on socket

13:49:25.922 [46304.44280] <16> bptestbpcd main: cannot connect on socket

<2>bptestbpcd: cannot connect on socket

13:49:25.937 [46304.44280] <2> bptestbpcd: cannot connect on socket

<2>bptestbpcd: EXIT status = 25

13:49:25.937 [46304.44280] <2> bptestbpcd: EXIT status = 25

cannot connect on socket
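
The three failed attempts in the job details above share one line format. As a rough sketch (this is not a NetBackup tool; the regular expression is inferred only from the lines shown in this post and may not match other releases or message variants, and it will not match the wrapped, truncated lines of the bptestbpcd console output), they could be split into fields in Python like this:

```python
import re

# Assumed layout, taken from the job details in this post:
# "status: FAILED, (42) CONNECT_REFUSED; system: (10061) <text> ; FROM <src> TO <host> <ip> bpcd VIA vnetd"
ATTEMPT_RE = re.compile(
    r"status: FAILED, \((?P<nbu_code>\d+)\) (?P<nbu_name>\w+); "
    r"system: \((?P<sys_code>\d+)\) (?P<sys_text>.*?) ?; "
    r"FROM (?P<src>\S+) TO (?P<host>\S+) (?P<ip>\S+) (?P<service>\w+)"
    r"(?: VIA (?P<via>\w+))?\s*$"
)


def parse_attempts(job_details):
    """Return one dict per failed connection attempt found in the text."""
    attempts = []
    for line in job_details.splitlines():
        match = ATTEMPT_RE.search(line)
        if match:
            info = match.groupdict()
            # A line without a "VIA ..." suffix is the direct connection to the service.
            info["via"] = info["via"] or "direct"
            attempts.append(info)
    return attempts
```

Passing the job details text to `parse_attempts` gives one entry per attempt, so the connection path (`via`) and the system error code (`sys_code`) can be compared side by side.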

 

7 REPLIES

Tape_Archived
Moderator
   VIP   

Please run the bpclntcmd commands on the client srvbd1-ma. You have run all the basic network-related commands on the master server; run them on the client as well and check the results.

It looks like the master server is not able to connect to the client srvbd1-ma over the bpcd port. If there is a firewall in this network, please check that the bpcd port is open so that backups can work.
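
If bpclntcmd is not handy on a host, a forward and reverse lookup comparison can be approximated with the operating system resolver. This is only a sketch under that assumption: it does not consult NetBackup's own host cache, so it complements rather than replaces `bpclntcmd -hn` and `bpclntcmd -ip`. The function name `check_lookup` and its parameters are hypothetical.

```python
import socket


def check_lookup(hostname, forward=socket.gethostbyname_ex, reverse=socket.gethostbyaddr):
    """Resolve a host name to its IPv4 addresses, then resolve each address back.

    Returns a list of (ip, reverse_name, consistent) tuples. `consistent` is True
    when the reverse name or one of its aliases has the same short host name as
    the forward result (compared case-insensitively).
    """
    canonical, aliases, addresses = forward(hostname)
    wanted = {name.split(".")[0].lower() for name in [hostname, canonical, *aliases]}
    results = []
    for ip in addresses:
        try:
            rev_name, rev_aliases, _ = reverse(ip)
        except OSError:
            # No reverse record (or resolver failure) for this address.
            results.append((ip, None, False))
            continue
        found = {name.split(".")[0].lower() for name in [rev_name, *rev_aliases]}
        results.append((ip, rev_name, bool(wanted & found)))
    return results
```

Running `check_lookup("srvbd1-ma")` on the master server and `check_lookup("srvbkp1-ma")` on the client would show whether each side resolves the other consistently. The `forward` and `reverse` parameters exist only so the function can be exercised without a live DNS.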

RahulSharma
Level 2

You need to open ports 1556, 13724 and 13782 on the client machine. Port 1556 should be bi-directional at least.

Ensure that the NetBackup services are not disabled on the client and are in a running state.

Logs required:

Please enable bpcd logs on the client and admin logs on the master, which is also your media server.

Once logs are enabled, run bptestbpcd again.

 

Marianne
Moderator
Partner    VIP    Accredited Certified

You can see that the master/media server tried to connect on 3 ports: 
bpcd VIA pbx (port 1556)
CONNECT_RESET; system: (10054) An existing connection was forcibly closed by the remote host.

bpcd VIA vnetd (port 13724)
CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it.

bpcd (port 13782)
CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it.

I don't see a lookup problem, but rather a firewall issue or some security software on the client that is preventing the port connection.
Please check for an OS firewall on the client: Windows firewall if it is a Windows client, iptables on a Linux client.
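
To separate "port blocked or nothing listening" from other failures, the three ports can be probed with a plain TCP connect from the master/media server. This is a minimal sketch, not a replacement for bptestbpcd: the names `NBU_PORTS`, `probe` and `probe_all` are hypothetical, and a bare connect cannot reproduce a reset that only happens after the connection is accepted (as the 10054 on the PBX port may be), so an "open" result does not prove that bpcd is reachable.

```python
import socket

# Ports named in this thread: PBX, vnetd and legacy bpcd.
NBU_PORTS = {"pbx": 1556, "vnetd": 13724, "bpcd": 13782}


def probe(host, port, timeout=3.0):
    """Try a TCP connect and return 'open', 'refused', 'timeout' or 'error: ...'."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # The target answered with a reset: nothing is listening, or a
        # firewall is actively rejecting the connection.
        return "refused"
    except socket.timeout:
        # No answer at all, which often points to a firewall silently dropping packets.
        return "timeout"
    except OSError as exc:
        # Name resolution failures, unreachable networks and similar.
        return f"error: {exc}"


def probe_all(host, ports=NBU_PORTS, timeout=3.0):
    """Probe every port in `ports` and return a {name: result} mapping."""
    return {name: probe(host, port, timeout) for name, port in ports.items()}
```

Calling `probe_all("srvbd1-ma")` from the master server returns one result per port. "refused" on 13724 and 13782 would match the 10061 errors in the job details, while "timeout" would suggest packets being dropped on the way.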

The bpbkar, bpcd and bpbrm folders were created on clientdb for log analysis.

The bptestbpcd tests follow:

Ping:

Hosts:

Listening ports:

After that, the backup will be launched manually.

Suggestions and procedures to be reviewed are welcome.

Marianne
Moderator
Partner    VIP    Accredited Certified

@Ghustavo

Could you please try again?

This is all we see in your last post (using Google Translate):

The bpbkar, bpcd, bpbrm folders were created on clientdb for log analysis.
Follow the bptestbpcd tests:
Ping:
Hosts:
Ports listening:
After that, the backup will be launched manually.
suggestions and procedures to be reviewed are welcome.


@Marianne 

I found that the bpbkar, bpbrm and bpcd folders had not been created on clientdb, so they were created to analyze the NetBackup processes.

The connection test results follow:

bpclntcmd_1.png, bpclntcmd_2.png, bpclntcmd_3.png, bpclntcmd_4.png, hosts.png, ping.png

Marianne
Moderator
Partner    VIP    Accredited Certified
Once again -
I don't see a lookup problem, but rather a firewall issue or some security software on the client that is preventing the port connection.
Please check for OS firewall on the client: iptables on Linux client.