Backup error 43
Hello Guys,
I really need your help with a confusing error, status code 43: unexpected message received.
Master server: Windows Server 2012 R2, NetBackup 8.2
Media server: RHEL, NBU 8.2
Client: Windows Server 2012 R2, NBU 7.6.1.2
Policy type: MS-Windows
Snippet of the detailed status:
Aug 31, 2020 4:13:09 PM - Info nbjm (pid=4420) starting backup job (jobid=309891) for client SRV-CLIENT.domain.com, policy TEST_POLICY, schedule WFULL
Aug 31, 2020 4:13:09 PM - Info nbjm (pid=4420) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=309891, request id:{03415F63-91B6-4431-8179-F3BA8595898B})
Aug 31, 2020 4:13:09 PM - requesting resource srv-media-stu
Aug 31, 2020 4:13:09 PM - requesting resource srv-mas.domain.com.NBU_CLIENT.MAXJOBS.SRV-CLIENT.domain.com
Aug 31, 2020 4:13:09 PM - requesting resource srv-mas.domain.com.NBU_POLICY.MAXJOBS.TEST_POLICY
Aug 31, 2020 4:13:09 PM - granted resource srv-mas.domain.com.NBU_CLIENT.MAXJOBS.SRV-CLIENT.domain.com
Aug 31, 2020 4:13:09 PM - granted resource srv-mas.domain.com.NBU_POLICY.MAXJOBS.TEST_POLICY
Aug 31, 2020 4:13:09 PM - granted resource MediaID=@aaaaz;DiskVolume=PureDiskVolume;DiskPool=srv-media-dp;Path=PureDiskVolume;StorageServer=srv-media.domain.com;MediaServer=srv-media.domain.com
Aug 31, 2020 4:13:09 PM - granted resource srv-media-stu
Aug 31, 2020 4:13:09 PM - estimated 263878 kbytes needed
Aug 31, 2020 4:13:09 PM - begin Parent Job
Aug 31, 2020 4:13:09 PM - begin WOFB: Step By Condition
Operation Status: 0
Aug 31, 2020 4:13:09 PM - end WOFB: Step By Condition; elapsed time 0:00:00
Aug 31, 2020 4:13:09 PM - begin WOFB: Stream Discovery
Operation Status: 0
Aug 31, 2020 4:13:09 PM - end WOFB: Stream Discovery; elapsed time 0:00:00
Aug 31, 2020 4:13:09 PM - begin WOFB: Read File List
Operation Status: 0
Aug 31, 2020 4:13:09 PM - end WOFB: Read File List; elapsed time 0:00:00
Aug 31, 2020 4:13:09 PM - begin WOFB: Create Snapshot
Aug 31, 2020 4:13:19 PM - started process bpbrm (pid=23478)
Aug 31, 2020 4:13:19 PM - end writing
Operation Status: 43
Aug 31, 2020 4:13:19 PM - end WOFB: Create Snapshot; elapsed time 0:00:10
Aug 31, 2020 4:13:19 PM - begin WOFB: Stop On Error
Operation Status: 0
Aug 31, 2020 4:13:19 PM - end WOFB: Stop On Error; elapsed time 0:00:00
Operation Status: 43
Aug 31, 2020 4:13:19 PM - end Parent Job; elapsed time 0:00:10
unexpected message received (43)
Note: we also see this on clients with NetBackup 8.0 and on other media servers, but all other backups are running just fine on other media servers (we have a lot of media servers).
As soon as we start the backup, it gets stuck for almost 10 minutes and then fails with error code 43.
At first it didn't show which process it hangs on, because "Allow multiple data streams" was enabled; when I disabled it, the detailed status showed the bpbrm process.
But judging by the bpbrm logs, I don't think the problem comes from this process, because the log only reports this error:
13:13:50.907 [25298.25298] <2> vnet_apply_domain_constraints: Domain hint(f7185221-b5d1-44da-84a0-164dd102486d) is ignored for client connection
13:13:50.907 [25298.25298] <2> dump_proxy_info: local_proxy_info:
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----pid: 1684
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----connection_id: {987318CA-EC4C-11EA-9EA3-E54F786B6AB8}:OUTBOUND
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----ca_roots: f7185221-b5d1-44da-84a0-164dd102486d
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----ca_roots_excluded: UNCONSTRAINED
13:13:50.907 [25298.25298] <2> dump_proxy_info: remote_proxy_info:
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----pid: 3408
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----connection_id: {F0DD5121-0265-47FD-AA6F-5898F072B9E0}:INBOUND
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----ca_roots: f7185221-b5d1-44da-84a0-164dd102486d
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----ca_roots_excluded: f772adf8-78b7-4565-8713-e2d6a12d8245, aa4559d8-7cbd-4758-a0e9-8279f8a1d084, 2472d64c-15ad-4417-836d-08f452c2dc6c, 7f0fcc50-121f-4000-bbfa-e78b23188bdc, 08ca4903-d9e5-4033-a609-cce797308811, 631bc3d3-6af6-417d-96e0-05088d927652, 32af3387-6fa4-407e-aa18-0a7140cbecaa, 7162123d-ba3c-4fee-8d73-fded949ee1df
13:13:50.907 [25298.25298] <2> dump_proxy_info: remote_endpoint_info:
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----pid: 15352
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----exe_name: bpdbm
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----local_hostname: srv-mas.domain.com
13:13:50.907 [25298.25298] <2> dump_proxy_info: domain_constraints_set:
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----process_hint_server_name: srv-mas.domain.com
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----process_hint: f7185221-b5d1-44da-84a0-164dd102486d
13:13:50.907 [25298.25298] <2> dump_proxy_info: ----process_hint_reason: the domain was inherited from the local peer process
13:13:50.909 [25298.25298] <2> is_connection_INBOUND: Connection-Is-INBOUND:[FALSE]
13:13:50.910 [25298.25298] <2> logconnections: PROXY CONNECT FROM 10.X.X.1.54741 TO 10.X.X.2.1556 fd = 6
13:13:50.910 [25298.25298] <2> logconnections: BPDBM CONNECT FROM 127.0.0.1.46627 TO 127.0.0.1.40458 fd = 6
13:13:50.910 [25298.25298] <2> vnet_check_vxss_client_magic_with_info_ex: [vnet_vxss_helper.c:1137] Ignoring VxSS authentication 2 0x2
13:13:50.935 [25298.25298] <2> isDONE: reply DONE 43 <?xml version="1.0" encoding="utf-8"?><StatusMsgDoc><StatusMsg NBUStatus="43" Severity="2" VXULoid="271" VXULtid="43"><MsgString>unexpected message received</MsgString></StatusMsg></StatusMsgDoc>
13:13:50.936 [25298.25298] <2> isDONE: reply DONE 43
13:13:50.936 [25298.25298] <2> db_startrequest: protocol error 1
13:13:50.936 [25298.25298] <16> db_begin: db_startrequest() failed: unexpected message received
13:13:50.936 [25298.25298] <2> db_getCLIENT: db_begin() failed: unexpected message received
13:13:50.936 [25298.25298] <2> verify_client: ../bpbrm.c.37760: db_getCLIENT failed for CLIENT: srv-client.domain.com
13:13:50.936 [25298.25298] <2> verify_client: ../bpbrm.c.37956: db_getCLIENT failed: 43 43 0x0000002b
13:13:50.936 [25298.25298] <2> bpbrm main: ../bpbrm.c.3622: verify_client failed: 43 43 0x0000002b
13:13:50.937 [25298.25298] <2> bpbrm kill_child_process: start
13:13:50.937 [25298.25298] <2> inform_client_of_status: COMM_SOCK == INVALID_SOCKET, 0
13:13:50.937 [25298.25298] <2> bpbrm Exit: attempting to send mail to root on srv-client.domain.com
I attached a snippet of the nbjm logs (I had to change names).
I can't find anything anywhere about this. In fact, we don't know what is causing the problem: media server, master or client.
Note: we have already rebooted the media server and the client.
- Hello David,
Thanks a lot for your persistence and your help.
We actually found the root cause: it was a reverse DNS problem (media -> master). Once we corrected that, backups started working again.
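For anyone who lands here with the same status 43, below is a rough Python sketch of the kind of forward/reverse lookup check that would have pointed us to the DNS problem sooner. It is only an illustration, not a NetBackup tool: the function name check_reverse_dns is made up, it uses the operating system resolver (so /etc/hosts entries count too), and NetBackup's own name resolution may not behave exactly the same way. The idea is to run it on the media server against the master server name.

```python
import socket


def check_reverse_dns(hostname, forward=socket.gethostbyname,
                      reverse=socket.gethostbyaddr):
    """Resolve hostname to an IP, resolve that IP back to a name, and compare.

    Returns (ok, detail). The forward/reverse parameters exist only so the
    lookups can be swapped out (for example in a test); by default the
    system resolver is used.
    """
    try:
        ip = forward(hostname)
    except OSError as exc:  # socket.gaierror / socket.herror are OSError subclasses
        return False, f"forward lookup of {hostname} failed: {exc}"

    try:
        ptr_name, aliases, _addresses = reverse(ip)
    except OSError as exc:
        return False, f"reverse lookup of {ip} failed: {exc}"

    # Compare case-insensitively and ignore a trailing dot on FQDNs.
    returned = {name.lower().rstrip(".") for name in [ptr_name, *aliases]}
    wanted = hostname.lower().rstrip(".")
    if wanted in returned:
        return True, f"{hostname} -> {ip} -> {ptr_name}"
    return False, f"{hostname} -> {ip} -> {ptr_name} (name mismatch)"


# Example call (hostname taken from the logs above, where names were changed):
# ok, detail = check_reverse_dns("srv-mas.domain.com")
# print(ok, detail)
```

If the second element of the result mentions a failed reverse lookup or a name mismatch when run from the media server against the master, that is the same class of problem we had.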