03-24-2008 11:55 AM
08-20-2008 04:43 PM
I am having the same problem on about 5 Linux machines, all CentOS 5.2 or RHEL 5.2, 32-bit, on Dell and HP Xeon servers (Backup Exec 12d server on Windows 2003, latest versions). I had been assuming it was an LVM problem, but after following this thread and running more tests, it does seem to point to RALUS. I have even gone as far as making the LVM snapshots read-only.
--- file backupexec_pre.sh ---
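#!/bin/sh
# Create a 1000 MB read-only (-p r) snapshot of lv_web
/usr/sbin/lvcreate --snapshot -L 1000m -p r -n snap_web /dev/vg_scsi142_2/lv_web
# Mount the snapshot read-only so the backup job can read from it
mount -o ro /dev/vg_scsi142_2/snap_web /snapshot/web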
----------- END FILE ---------
---- file backupexec_post.sh ----
#!/bin/sh
# DEBUG: record any processes that still have files open under the snapshot mount
lsof /snapshot/web > /tmp/lsofweb
/bin/umount /dev/vg_scsi142_2/snap_web && \
/usr/sbin/lvremove -f /dev/vg_scsi142_2/snap_web
---------- END FILE ----------
From the command line I can run
# backupexec_pre.sh
# tar cfz /dev/null /snapshot/web
# backupexec_post.sh
and it works fine. snap_web gets removed.
A simple Backup Exec backup job that runs these two scripts and backs up files modified in the last day is not able to destroy the snapshot. I now need to reboot to remove it. After both of these backups (tar and BUE), less than 5% of the 1000m I set aside is used by the snapshot, so the snapshot is not filling up. /tmp/lsofweb is also empty.
I tried running "beremote --log-file-auto &", but I foregrounded it and lost the last half of the logs. However, since beremote was attached to my tty, I saw these messages at the end of the backup:
"
File descriptor 5 left open
...
File descriptor 13 left open
Can't remove open logical volume "snap_web"
"
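Since lsof on the mount point shows nothing, one more thing that might be worth logging from the post script is which descriptors the script itself has inherited. As far as I know, the "File descriptor N left open" lines are printed by the LVM tools when they are started with descriptors other than stdin/stdout/stderr already open, which would fit a script launched by beremote. Below is a minimal sketch, assuming a Linux /proc filesystem; the function name list_extra_fds is my own, not part of RALUS or LVM, and this only helps diagnose, it does not fix anything.

```shell
#!/bin/sh
# Hypothetical helper (not part of RALUS or LVM): print every file
# descriptor above 2 that the given PID has open, with its target,
# by reading the symlinks in /proc/<pid>/fd.
list_extra_fds() {
    pid=$1
    for entry in /proc/"$pid"/fd/*; do
        # Skip descriptors that vanished since the glob was expanded.
        [ -e "$entry" ] || [ -L "$entry" ] || continue
        fd=${entry##*/}
        # Ignore stdin (0), stdout (1), stderr (2) and non-numeric names.
        [ "$fd" -gt 2 ] 2>/dev/null || continue
        printf '%s -> %s\n' "$fd" "$(readlink "$entry")"
    done
}
```

Adding a line such as list_extra_fds "$$" > /tmp/inherited_fds near the top of backupexec_post.sh would record what the script was handed. If one of the targets turns out to be the snapshot device or a file under /snapshot/web, that would point at beremote still holding it open.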