-
-
Notifications
You must be signed in to change notification settings - Fork 31.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ValueError: I/O operation on closed file. in ZipFile destructor #81954
Comments
When running this code: from zipfile import ZipFile
import io
def foo():
pass
data = io.BytesIO()
zf = ZipFile(data, "w") I get this message: Exception ignored in: <function ZipFile.__del__ at 0x7f9005caa160>
Traceback (most recent call last):
File "/home/user/git/oss/cpython/Lib/zipfile.py", line 1800, in __del__
File "/home/user/git/oss/cpython/Lib/zipfile.py", line 1817, in close
ValueError: I/O operation on closed file. Comment out def foo: pass, and there is no error. It looks like the bug was introduced with commit ada319b (bpo-32388). |
This looks like a regression in 3.8 so I have added 3.8 regression tag. |
Still reproducible in cpython 3.10.0a3 (debian unstable) and 3.10.0a6 (pyenv). |
That's what happened. Function foo creates a reference loop. It has reference to the module dict, and the dict has reference to the function. The dict has also references to BytesIO and ZipFile objects. At shutdown stage the garbage collector is called. It founds the reference loop which contains the module dict, function foo and BytesIO and ZipFile objects. It tries to break the loop by calling the destructor of arbitrary object in the loop. In that case it is the BytesIO object. It does not help, but finally the destructor of the ZipFile object is called (either directly or after breaking the loop and destroying the module dict). The ZipFile destructor tries to close the BytesIO object, but it is already closed in its destructor. It is a dangerous situation which potentially can cause data corruption (if the underlying stream is closed before flushing buffers). Clearing the module dict before invoking garbage collecting would solve this issue. Making the garbage collector more clever and and avoid destroying objects if it does not break the loop would help too, but it is more complex problem and such change could not be backported to 3.8. Victor, were there any changes in the garbage collector or interpreter shutdown code in 3.8? |
Thanks Serhiy for the explanation, it's making sense now. Guess whatever I did back then (no idea what I was working on) was basically a mistake; I should have closed my ZipFile properly, e.g. by using context managers. So maybe it's not really a cpython bug. I ran a git-bisect back then and came up with https://hg.python.org/lookup/ada319bb6d0ebcc68d3e0ef2b4279ea061877ac8 as cause. |
I took some notes there: https://pythondev.readthedocs.io/finalization.html I proposed bpo-42671 "Make the Python finalization more deterministic" but it will break some use cases, so I'm not comfortable with this idea yet :-( |
IMO ZipFile destructor should emit a ResourceWarning if it's not closed explicitly. Nothing in the Python specification gives you any warranty that destructors will be ever called... Depending on the Python implementation and depending on many things, you never know when a destructor is called. Just call the close() method explicitly. |
FYI TextIOWrapper has a similar issue: bpo-17852. There was an attempt to fix the issue in 2017, but it had to be reverted. |
In my testing on 3.9.1 and the current
The test code runs without errors if the following check is done before seeking the if not self.fp.closed: here: The fix also requires adding With these two small changes, all test_zipfile tests pass. I'm not sure this is the proper fix, because this is a complex issue.. I will go ahead and put up a PR if this seems like a good approach. |
I forgot to add, to fix the tests I also had to add |
(Python 3.10.10 -- excuse the noise if it's been fixed in 3.11) What's the status of this? I'm getting similar warnings in my tests:
This is pretty buried -- I'm not using zipfile directly, it's being used by The odd thing is that this warning does not pop up when I run that test by itself, only when it is run as part of the suite -- which makes me thing it's maybe related to this issue with the GC and old references, but ??? Anyway, I think @akulakov has the right idea -- if you put a guard around the close() call, the warning will go away. Maybe that would hide another more important error, but it seems not unlikely that an object may be deleted having been partially closed already. |
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.
Show more details
GitHub fields:
bugs.python.org fields:
The text was updated successfully, but these errors were encountered: