-
Notifications
You must be signed in to change notification settings - Fork 126
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
potential race condition prevents 'unix.rm' from deleting directory #206
Comments
Thanks. Once I get to picking apart the scripts I will consider this.
As an aside, I am not sure why log files would be removed as part of the process? If the information in the log file is potentially useful it should not be deleted so the user can review the logs if needed. If the log files are not useful then why write them? I know that you are just calling Pyaflowa as part of the seisflows scripts so you don’t necessarily have control over the behavior.
From: Bryant Chow ***@***.***>
Sent: Sunday, March 31, 2024 12:13 PM
To: adjtomo/seisflows ***@***.***>
Cc: scott314159 ***@***.***>; Mention ***@***.***>
Subject: [adjtomo/seisflows] potential race condition prevents 'unix.rm' from deleting directory (Issue #206)
Raised in #205 <#205> by @scott314159 <https://github.com/scott314159> , there seems to be a race condition during the finalization command of the Pyaflowa preprocessing module where the main job tries to delete the log file directory but returns OSError: [Errno 39] Directory not empty , which traces back from shutil.rmtree.
This seems to be a know issue with shutil.rmtree, see e.g., https://stackoverflow.com/questions/11228079/python-remove-directory-error-file-exists that happens on NFS file systems where file locks are present.
My thinking is that this is a race condition where the file locks cannot be removed before the rm command tries to delete. One potential try is to put a try-except clause in unix.rm that waits a few seconds if it hits this OSError, to give the filesystem some time to remove it's lock.
—
Reply to this email directly, view it on GitHub <#206> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/BHNHRJTH4WA2L73OI5DDNH3Y3BG2NAVCNFSM6AAAAABFQTI6L6VHI2DSMVQWIX3LMV43ASLTON2WKOZSGIYTOMJRGY4TEMQ> .
You are receiving this because you were mentioned. <https://github.com/notifications/beacon/BHNHRJTU2MOWFRRIBCHDUTDY3BG2NA5CNFSM6AAAAABFQTI6L6WGG33NNVSW45C7OR4XAZNFJFZXG5LFVJRW63LNMVXHIX3JMTHIIJUE7I.gif> Message ID: ***@***.*** ***@***.***> >
|
Hey @scott314159, again, sorry for the long pause in communication here! Yeah the log message removal is intended to reduce file count, as for large-scale problems there may be (hundreds of) thousands of files created which may stress file systems, but I agree there is usually useful information there. The parameter |
Raised in #205 by @scott314159, there seems to be a race condition during the finalization command of the Pyaflowa preprocessing module where the main job tries to delete the log file directory but returns OSError: [Errno 39] Directory not empty , which traces back from
shutil.rmtree
.This seems to be a know issue with
shutil.rmtree
, see e.g., https://stackoverflow.com/questions/11228079/python-remove-directory-error-file-exists that happens on NFS file systems where file locks are present.My thinking is that this is a race condition where the file locks cannot be removed before the
rm
command tries to delete. One potential try is to put a try-except clause inunix.rm
that waits a few seconds if it hits this OSError, to give the filesystem some time to remove it's lock.seisflows/seisflows/tools/unix.py
Lines 197 to 205 in d7661ee
The text was updated successfully, but these errors were encountered: