_Py_wfopen no longer exported #127350

PlanetCNC · 2024-11-27T23:25:06Z

Bug report

Bug description:

_Py_wfopen in no longer exported since 3.13.
I'm using embed version and I can not use fopen or _wfopen.
Please reconsider decision to remove _Py_wfopen since it is only way to open file when used in embed mode and fopen/_wfopen is not available. Without it PyRun_FileExFlags is useless to me and my application can no longer call external scripts.

CPython versions tested on:

3.13

Operating systems tested on:

Windows

skirpichev · 2024-11-28T01:22:56Z

Hidden by #107213

CC @vstinner

vstinner · 2024-11-28T09:53:04Z

Hi,

I'm using embed version and I can not use fopen or _wfopen.

Why can't you use fopen() or _wfopen()?

PlanetCNC · 2024-11-28T10:14:14Z

In my case PyRun_FileExFlags crashes if I use FILE* from fopen() or _wfopen().

I suspect this is because different toolset versions are used to compile python313.dll and our software. That is why we don't mix them.

With _Py_wfopen everything works flawlessly.

We integrated Python into our CNC machine control software and it was working great for lots of users until now. Here is an example which is no longer possible to run:
https://www.youtube.com/watch?v=MruGmi1JoCc

Here is our module documentation:
https://cnc.zone/sdk/python/python

vstinner · 2024-11-28T10:23:54Z

In my case PyRun_FileExFlags crashes if I use FILE* from fopen() or _wfopen(). I suspect this is because different toolset versions are used to compile python313.dll and our software. That is why we don't mix them.

The FILE* is not NULL? That's strange.

You may give a try to _Py_fopen_obj() function which is the last "open" function which returns a FILE* in the Python C API. It's private, you're not supposed to use it.

PlanetCNC · 2024-11-28T10:40:06Z

No, it is not NULL.

I tried _Py_fopen_obj() but I was unable to make it work. It is "private" anyway so I expect to be removed in next version.
How about making new, nonprivate function with same implementation as _Py_wfopen ?

PlanetCNC · 2024-11-28T11:20:22Z

I managed to make _Py_fopen_obj() work. Please do not remove my last option to open file.

ZeroIntensity · 2024-11-28T12:23:54Z

In my case PyRun_FileExFlags crashes if I use FILE* from fopen() or _wfopen().

That sounds like a bug, could you file a new issue with a reproducer?

But overall, I think we just need a public API if this is the only way to do something. You shouldn't ever rely on anything prefixed with _Py as a user.

vstinner · 2024-11-29T10:49:03Z

Yeah, you should be able to call fopen() and pass the result to a Python API which expect a FILE* file.

PlanetCNC · 2024-11-29T11:39:06Z

This is not correct. FILE* should not be passed across a DLL boundary which is case when using embed Python.
Here is Microsoft article explaining this in detail.
https://learn.microsoft.com/en-us/cpp/c-runtime-library/potential-errors-passing-crt-objects-across-dll-boundaries?view=msvc-170

asvetlov · 2024-11-29T12:17:55Z

I read the article as:

10 years ago you should be careful and use the same Visual Studio version and the same multithreaded (/MT vs /MD) mode as it was used for compiling Python itself. It was always a good advice. Now CRT is cross-compatible which helps people a lot; good to know.
Release the memory by the same dynamic library that was used for allocation. For example, call fclose() in your dll for files opened with fopen(). Also it sounds reasonable.

ZeroIntensity · 2024-11-29T17:22:37Z

As far as I can tell, _Py_wfopen is just a wrapper over fopen and _wfopen anyway. It shouldn't matter where the FILE * comes from.

PlanetCNC · 2024-11-29T21:20:50Z

As far as I can tell, _Py_wfopen is just a wrapper over fopen and _wfopen anyway. It shouldn't matter where the FILE * comes from.

This is not true. Microsoft clearly states that passing passing CRT objects across DLL boundaries causes potential errors.

I spend a lot of time trying to "fix" this issue and is simply not possible. And I'm not a novice - I have more than 30 years experience in programming very complex stuff that runs on all popular operating systems. It is not that I don't know how to correctly use 'fopen' and '_wfopen'.

If there is such a problem keeping '_Py_wfopen' then functions that accept FILE* should also expect filename as PyObject and then internally call _Py_fopen_obj(). This change should be fairly easy to make.

ZeroIntensity · 2024-11-29T23:03:19Z

I'm not doubting your expertise, but note that the PyRun APIs have existed for years; if they weren't usable without the private API on Windows, I would hope that we would have noticed by now. Could this be a matter of some incorrect linker flags?

cc @zooba

PlanetCNC · 2024-11-30T16:43:27Z

I use the /MT option, which is most likely the cause of the access violation. Unfortunately, switching to /MD is not feasible for my project. This is not a simple application—it consists of 2,714,742 lines of C++ code, runs on Windows, Linux, Mac, and Raspberry Pi, and interacts with dedicated hardware.

Python has been an part of the project for almost 10 years, starting with version 3.5, offering users a way to script their own commands. If the fopen/_wfopen issue means Python can no longer be used, then so be it. However, it certainly feels like an unnecessarily restrictive and frustrating reason for such a limitation.

ZeroIntensity · 2024-11-30T19:48:11Z

I'll wait and see if Steve has anything to add, but it sounds like we should expose a public API for it.

zooba · 2024-12-02T12:45:47Z

I'd prefer to totally drop FILE * from our public API and if necessary, add an incremental parser API. Or add an API that takes the path and handles reading internally (don't we have this one?).

The best thing to do here to be portable is to open and read the file yourself, and then pass the contents to our parser. If your files are too big to read into memory, an incremental parser API would let you pass in a chunk at a time (we should have this interface internally already, as it's how we handle stdin, but it's not public).

FILE * and file descriptors are terrible values to pass across boundaries. The emulation (on Windows) is flakey at best, but even worse when you insist on static linking (which makes duplicate copies of a lot of libc-equivalent state and doesn't share them).

The other alternative which may be a good option here is to also compile Python yourself, so that you can ensure it matches (and possibly even shares) the version of the CRT. In theory passing a FILE * across statically linked C runtimes should be okay, provided the versions match, though it depends whether they try to validate anything (which they probably do).¹ Our builds aren't especially special, and it's very easy to build on Windows.

File descriptors are, of course, right out, because they will index into different FILE * arrays. ↩

PlanetCNC · 2024-12-02T13:24:09Z

I'd prefer to totally drop FILE * from our public API and if necessary, add an incremental parser API. Or add an API that takes the path and handles reading internally (don't we have this one?).

Python does not have API that takes the path and handles reading internally. For me this solution seems perfect.

The best thing to do here to be portable is to open and read the file yourself, and then pass the contents to our parser. If your files are too big to read into memory, an incremental parser API would let you pass in a chunk at a time (we should have this interface internally already, as it's how we handle stdin, but it's not public).

Incremental parser API should be fine. However to me it seems to be quite some work to make it public. Specially because all heap allocations should be done in Python because of same boundary issues. I will check how incremental parser works.

The other alternative which may be a good option here is to also compile Python yourself, so that you can ensure it matches (and possibly even shares) the version of the CRT. In theory passing a FILE * across statically linked C runtimes should be okay, provided the versions match, though it depends whether they try to validate anything (which they probably do).1 Our builds aren't especially special, and it's very easy to build on Windows.

This can cause issues with dependencies and maintenance. Perhaps even licensing.

ZeroIntensity · 2024-12-02T13:41:21Z

Thanks, Steve. I'm now convinced this is an issue, but I'm not sure how we didn't notice it for so long. Is it just that nobody uses the PyRun* file APIs? (Some code searches are saying so.)

IMO, an incremental parser sounds good, but I think we should keep it simple for users (as in, we don't need to expose the fact that it's an incremental parser; just let them pass a file path and it should just work). Something like Py_CompilePath or PyRun_FilePath should work. Luckily, none of the current FILE* APIs are stable, so we can drop them in a few versions with a deprecation. I'm happy to temporarily expose a public Py_wfopen as a bandaid, too. In the meantime @PlanetCNC, does reading the file yourself and passing them to one of the compiling APIs work?

zooba · 2024-12-02T15:06:49Z

This can cause issues with dependencies and maintenance. Perhaps even licensing.

It shouldn't create any new issues on top of what you're already doing. CPython is very flexibly licensed, despite not being one of the typical short licenses, it has basically the same requirements (i.e. none). Further, the license should apply the same if you're using our build vs. using your own, so the only difference is going to be if you make users do the install themselves (which I also don't recommend).

Python does not have API that takes the path and handles reading internally. For me this solution seems perfect.

Okay, let's add this. The implementation is really just going to _Py_wfopen and pass the FILE * into the existing API, so it shouldn't be complicated, but it will get the non-portable types out of the public API.

PlanetCNC · 2024-12-03T09:37:17Z

All PyRun_... functions that accept a FILE* parameter already include a corresponding filename parameter. Currently, if the FILE* is NULL, these functions crash. A potential solution could be to modify the implementation so that when FILE* is NULL, the filename is used internally to call _Py_fopen or _Py_wfopen. This approach would resolve the issue without requiring any changes to the existing API definitions.

zooba · 2024-12-03T21:59:31Z

A potential solution could be to modify the implementation so that when FILE* is NULL, the filename is used internally to call _Py_fopen or _Py_wfopen.

Sounds good to me.

Anyone want to open a PR?

PlanetCNC added the type-bug An unexpected behavior, bug, or error label Nov 27, 2024

skirpichev added the topic-C-API label Nov 28, 2024

ZeroIntensity added the OS-windows label Nov 29, 2024

github-actions bot mentioned this issue Dec 1, 2024

Monthly issue metrics report hugovk/test#88

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

_Py_wfopen no longer exported #127350

_Py_wfopen no longer exported #127350

PlanetCNC commented Nov 27, 2024 •

edited by github-actions bot

Loading

skirpichev commented Nov 28, 2024

vstinner commented Nov 28, 2024

PlanetCNC commented Nov 28, 2024

vstinner commented Nov 28, 2024

PlanetCNC commented Nov 28, 2024

PlanetCNC commented Nov 28, 2024

ZeroIntensity commented Nov 28, 2024

vstinner commented Nov 29, 2024

PlanetCNC commented Nov 29, 2024

asvetlov commented Nov 29, 2024

ZeroIntensity commented Nov 29, 2024

PlanetCNC commented Nov 29, 2024

ZeroIntensity commented Nov 29, 2024

PlanetCNC commented Nov 30, 2024 •

edited

Loading

ZeroIntensity commented Nov 30, 2024

zooba commented Dec 2, 2024

PlanetCNC commented Dec 2, 2024

ZeroIntensity commented Dec 2, 2024

zooba commented Dec 2, 2024

PlanetCNC commented Dec 3, 2024

zooba commented Dec 3, 2024

_Py_wfopen no longer exported #127350

_Py_wfopen no longer exported #127350

Comments

PlanetCNC commented Nov 27, 2024 • edited by github-actions bot Loading

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

skirpichev commented Nov 28, 2024

vstinner commented Nov 28, 2024

PlanetCNC commented Nov 28, 2024

vstinner commented Nov 28, 2024

PlanetCNC commented Nov 28, 2024

PlanetCNC commented Nov 28, 2024

ZeroIntensity commented Nov 28, 2024

vstinner commented Nov 29, 2024

PlanetCNC commented Nov 29, 2024

asvetlov commented Nov 29, 2024

ZeroIntensity commented Nov 29, 2024

PlanetCNC commented Nov 29, 2024

ZeroIntensity commented Nov 29, 2024

PlanetCNC commented Nov 30, 2024 • edited Loading

ZeroIntensity commented Nov 30, 2024

zooba commented Dec 2, 2024

Footnotes

PlanetCNC commented Dec 2, 2024

ZeroIntensity commented Dec 2, 2024

zooba commented Dec 2, 2024

PlanetCNC commented Dec 3, 2024

zooba commented Dec 3, 2024

PlanetCNC commented Nov 27, 2024 •

edited by github-actions bot

Loading

PlanetCNC commented Nov 30, 2024 •

edited

Loading