ALERT - ISILON incrementals - succeed but backup no data! watch for attempt count!
This is NOT NB specific and is across all versions.
Please be aware - there is an issue with Isilon TCP/IP and NetBackup - watch out for multiple tries on incremental backups very infrequently.
Backup will complete successfully update the listing then fail and rerun successfully and backup nothing.
What you see in NetBackup - incremental isilon backups succeed with multiple attempts - BUT
1st attempts find new data - backs up data - fails
2nd (or 3rd ) attempt completes with NO error - but backs up NO data!
From Isilon support ( after 9 months of Veritas/Isilon fighting over whose fault it is... )
Summary
After review, engineering has confirmed from the Network trace investigation that this issue is observed due to rare DMA slowness with reading from the received buffer (1-2% failure jobs). At the same time, the RST flag is sent by the Isilon node due to fast socket termination by the NDMP daemon without confirmation that all data was received by the DMA client exactly at the same slowness time on the DMA side.
To improve this situation from the Isilon NDMP side, engineering is looking at the following NDMP changes:
- Сall shutdown() before close()
The Dev team has already started this improvement in the Isilon NDMP code. Also, Engineering is working on a repro of this problem to have a lab where the code improvement can be validated. The current estimation is that the fix will take up to 4 weeks, assuming no issues. ( It has been 5 weeks now and no updated ETA... )