07-09-2018 10:38 AM
Hello,
I need the support of your (a) specialists for backup job correction.
Follow datalhes of the backup job:
09/07/2018 12:00:00 - Info nbjm(pid=5124) starting backup job (jobid=68462) for client srvbd1-ma.ma.trf1.gov.br, policy JFMA-OVm-Bd-06_Bd1Archiv, schedule Arch
09/07/2018 12:00:00 - Info nbjm(pid=5124) requesting MEDIA_SERVER_WITH_ATTRIBUTES resources from RB for backup job (jobid=68462, request id:{8039333B-23E0-4C72-A76F-6BBEBA6C080E})
09/07/2018 12:00:00 - requesting resource dedup.stu.srvbkp1-ma
09/07/2018 12:00:00 - requesting resource srvbkp1-ma.NBU_CLIENT.MAXJOBS.srvbd1-ma.ma.trf1.gov.br
09/07/2018 12:00:00 - requesting resource srvbkp1-ma.NBU_POLICY.MAXJOBS.JFMA-OVm-Bd-06_Bd1Archiv
09/07/2018 12:00:00 - granted resource srvbkp1-ma.NBU_CLIENT.MAXJOBS.srvbd1-ma.ma.trf1.gov.br
09/07/2018 12:00:00 - granted resource srvbkp1-ma.NBU_POLICY.MAXJOBS.JFMA-OVm-Bd-06_Bd1Archiv
09/07/2018 12:00:00 - granted resource dedup.stu.srvbkp1-ma
09/07/2018 12:00:00 - estimated 0 Kbytes needed
09/07/2018 12:00:00 - Info nbjm(pid=5124) started backup (backupid=srvbd1-ma.ma.trf1.gov.br_1531148400) job for client srvbd1-ma.ma.trf1.gov.br, policy JFMA-OVm-Bd-06_Bd1Archiv, schedule Arch on storage unit dedup.stu.srvbkp1-ma
09/07/2018 12:00:00 - started process bpbrm (31624)
09/07/2018 12:00:22 - Info bpbrm(pid=31624) connect failed STATUS (18) CONNECT_FAILED
09/07/2018 12:00:22 - Info bpbrm(pid=31624) status: FAILED, (43) CONNECT_RESET; system: (10054) An existing connection was forcibly closed by the remote host. ; FROM 0.0.0.0 TO srvbd1-ma.ma.trf1.gov.br 192.168.0.80 bpcd VIA pbx
09/07/2018 12:00:22 - Info bpbrm(pid=31624) status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0.0.0.0 TO srvbd1-ma.ma.trf1.gov.br 192.168.0.80 bpcd VIA vnetd
09/07/2018 12:00:22 - Info bpbrm(pid=31624) status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0.0.0.0 TO srvbd1-ma.ma.trf1.gov.br 192.168.0.80 bpcd
09/07/2018 12:00:22 - Error bpbrm(pid=31624) Cannot connect to srvbd1-ma.ma.trf1.gov.br
09/07/2018 12:00:22 - Info bpbkar32(pid=0) done. status: 58: can't connect to client
09/07/2018 12:00:22 - end writing
can't connect to client(58)
1.The host cache cleanup has been performed, ensuring that any changes related to host validation are updated.
2.Direct / inverse search tests were performed from the Master server.
C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -self
gethostname() returned: srvbkp1-ma
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at ::1
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.26
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.70
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 172.23.3.181
aliases: srvbkp1-ma.ma.trf1.gov.br srvbkp1-ma ::1 192.168.0.70 192.168.0.26 172.23.3.181
getfqdn(srvbkp1-ma) returned: srvbkp1-ma.ma.trf1.gov.br
C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -self srvbd1-ma
gethostname() returned: srvbkp1-ma
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at ::1
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.26
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 192.168.0.70
host srvbkp1-ma: srvbkp1-ma.ma.trf1.gov.br at 172.23.3.181
aliases: srvbkp1-ma.ma.trf1.gov.br srvbkp1-ma ::1 192.168.0.70 192.168.0.26 172.23.3.181
getfqdn(srvbkp1-ma) returned: srvbkp1-ma.ma.trf1.gov.br
C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -pn
expecting response from server srvbkp1-ma
srvbkp1-ma.ma.trf1.gov.br srvbkp1-ma 192.168.0.26 50629
C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -hn srvbd1-ma
host srvbd1-ma: srvbd1-ma.ma.trf1.gov.br at 192.168.0.80
aliases: srvbd1-ma.ma.trf1.gov.br srvbd1-ma 192.168.0.80
C:\Program Files\Veritas\NetBackup\bin>bpclntcmd -ip 192.168.0.80
host 192.168.0.80: srvbd1-ma.ma.trf1.gov.br at 192.168.0.80
aliases: srvbd1-ma.ma.trf1.gov.br 192.168.0.80
3. Result of the bptestbpcd command
C:\Program Files\Veritas\NetBackup\bin\admincmd>bptestbpcd -client srvbd1-ma -debug -verbose
13:49:03.706 [46304.44280] <2> bptestbpcd: VERBOSE = 0
13:49:03.800 [46304.44280] <2> vnet_pbxConnect: ../../libvlibs/vnet_pbx.c.652: pbxSetAddrEx/pbxConnectEx return error 10054:An existing connection
forcibly closed by the remote host.
13:49:03.800 [46304.44280] <8> do_pbx_service: [vnet_connect.c:2081] vnet_pbxConnect() failed, status=18, errno=10054, use_vnetd=0, cr->vcr_servic
cd
13:49:03.800 [46304.44280] <8> async_connect: [vnet_connect.c:1677] do_service failed 18 0x12
13:49:04.798 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:05.812 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:07.825 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:08.823 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:10.836 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:11.850 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:15.859 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:16.889 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:24.892 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:25.890 [46304.44280] <8> async_connect: [vnet_connect.c:1700] getsockopt SO_ERROR returned 10061 0x274d
13:49:25.890 [46304.44280] <16> connect_to_service: connect failed STATUS (18) CONNECT_FAILED
status: FAILED, (43) CONNECT_RESET; system: (10054) An existing connection was forcibly closed by the remote host. ; FROM 0.0.0.0 TO srvbd
192.168.0.80 bpcd VIA pbx
status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0
.0 TO srvbd1-ma 192.168.0.80 bpcd VIA vnetd
status: FAILED, (42) CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it. ; FROM 0
.0 TO srvbd1-ma 192.168.0.80 bpcd
13:49:25.890 [46304.44280] <8> vnet_connect_to_bpcd: [vnet_connect.c:297] connect_to_service() failed 18 0x12
13:49:25.890 [46304.44280] <2> local_bpcr_connect: Can't connect to client srvbd1-ma
13:49:25.890 [46304.44280] <2> ConnectToBPCD: bpcd_connect_and_verify(srvbd1-ma, srvbd1-ma) failed: 25
<16>bptestbpcd main: Function ConnectToBPCD(srvbd1-ma) failed: 25
13:49:25.890 [46304.44280] <16> bptestbpcd main: Function ConnectToBPCD(srvbd1-ma) failed: 25
<16>bptestbpcd main: cannot connect on socket
13:49:25.922 [46304.44280] <16> bptestbpcd main: cannot connect on socket
<2>bptestbpcd: cannot connect on socket
13:49:25.937 [46304.44280] <2> bptestbpcd: cannot connect on socket
<2>bptestbpcd: EXIT status = 25
13:49:25.937 [46304.44280] <2> bptestbpcd: EXIT status = 25
cannot connect on socket
07-09-2018 01:18 PM - edited 07-09-2018 01:20 PM
Please run the bpclntcmd commands on client srvbd1-ma. You have run all the basic network related commands on Master sever, instead you run on the client and check results.
It looks like master sever is not able to connect client - srvbd1-ma over the bpcd port; Please check bpcd port is open for backup to work if there is firewall set in this network.
07-09-2018 09:00 PM
You need to open port 1556, 13724 and 13782 on client machine. 1556 should be bi-directional atleast.
Ensure Netbackup services are not disabled on client and are in running state.
Logs required:
Please enable bpcd logs on client and admin logs on master which is also your media server.
Once logs are enabled, run bptestbpcd again.
07-09-2018 11:37 PM
You can see that the master/media server tried to connect on 3 ports:
bpcd VIA pbx (port 1556)
CONNECT_RESET; system: (10054) An existing connection was forcibly closed by the remote host.
bpcd VIA vnetd (port 13724)
CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it.
bpcd (port 13782)
CONNECT_REFUSED; system: (10061) No connection could be made because the target machine actively refused it.
I don't see a lookup problem, rather firewall issue or some security software on the client that is preventing port connection.
Please check for OS firewall on the client: Windows firewall if Windows client, iptables on Linux client.
07-10-2018 06:11 AM
As pastas bpbkar, bpcd, bpbrm foram criadas no clientdb para análise dos logs.
Siga os testes do bptestbpcd:
Ping:
Hosts:
Portas ouvindo:
Depois disso, o backup será lançado manualmente.
sugestões e procedimentos a serem revisados são bem-vindos.
07-10-2018 06:18 AM
Could you please try again?
This is all we see in your last post (using Google Translate):
The bpbkar, bpcd, bpbrm folders were created on clientdb for log analysis.
Follow the bptestbpcd tests:
Ping:
Hosts:
Ports listening:
After that, the backup will be launched manually.
suggestions and procedures to be reviewed are welcome.
07-10-2018 09:40 AM
@MarianneChecked out that the bpbkar, bpbrm, bpcd folders had not been created on clientdb. Thus both were created for analysis of processes in netbackup.
Follow connection test results:
07-10-2018 09:48 AM