gh-90102: Remove isatty call during regular open #124922

cmaloney · 2024-10-03T06:29:49Z

TTYs are always character devices. If the interpreter knows a file is not a character device when it would call isatty, skip the isatty call. Inside open() in the same python library call there is a fresh stat result that contains that information. Use the stat result to skip a system call.

This shortcut was suggested by @eryksun in 2021. isatty is not necessarily constant over time, but inside the python library call which opened the fd and has yet to return the fd, it seems reasonable to rely on the fd not changing. The fd may be visible to caller code which passes in an opener or intercepts specific calls.

For gh-120754, with this change reading text with open('README.md').read() is down to 6 system calls:

openat(AT_FDCWD, "README.rst", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=8921, ...}) = 0
lseek(3, 0, SEEK_CUR)                   = 0
read(3, "This is Python version 3.14.0 al"..., 8922) = 8921
read(3, "", 1)                          = 0
close(3)                                = 0

When reading bytes with buffering=0 (disabling BufferedIO which remoevs the lseek), reading a regular file is down to 5 system calls (strace python -c "from pathlib import Path; Path('README.rst').read_bytes()")

Benchmark

Run on a 2024 MacBook Air running Squoia 15.0

Benchmark code

import pyperf
from pathlib import Path


def read_all(all_paths):
    for p in all_paths:
        # note: Using open rather than pathlib since 
        # pathlib.read_bytes() skips BufferedIO and that makes a
        # significant performance delta around the reduced isatty 
        # requirement. Reading bytes rather than text since at 
        # least one `.py` file in tree results in UnicodeDecodeError
        open(p, 'rb').read()


def read_file(path_obj):
    path_obj.read_text()


all_rst = list(Path("Doc").glob("**/*.rst"))
all_py = list(Path(".").glob("**/*.py"))
assert all_rst, "Should have found rst files"
assert all_py, "Should have found python source files"


runner = pyperf.Runner()
runner.bench_func("read_file_small", read_file, Path("Doc/howto/clinic.rst"))
runner.bench_func("read_file_large", read_file, Path("Doc/c-api/typeobj.rst"))

runner.bench_func("read_all_rst", read_all, all_rst)
runner.bench_func("read_all_py", read_all, all_py)

Before:

.....................
read_file_small: Mean +- std dev: 8.30 us +- 0.09 us
.....................
read_file_large: Mean +- std dev: 21.1 us +- 0.2 us
.....................
read_all_rst: Mean +- std dev: 4.42 ms +- 0.05 ms
.....................
read_all_py: Mean +- std dev: 19.5 ms +- 0.2 ms

After:

.....................
read_file_small: Mean +- std dev: 7.78 us +- 0.07 us
.....................
read_file_large: Mean +- std dev: 20.8 us +- 0.4 us
.....................
read_all_rst: Mean +- std dev: 4.16 ms +- 0.05 ms
.....................
read_all_py: Mean +- std dev: 18.5 ms +- 0.3 ms

For files which are recently read / in OS filecache (or on fast devices), there is a ~15% performance overhead with BufferedIO (see: #122111 where I updated Path.read_bytes() and measured the change). Unfortunately multiple attempts I've made to do small reworks to make the lseek unnecessary in BufferedIO have resulted in no performance change and a lot of complexity.

I have been developing a more broad refactoring that could reduce the BufferedIO overhead as well as several other I/O overheads while meeting current API expectations (ex. each layer of the stack re-figures out readable and writeable, every call must re-validate with many branches the fd state and arguments, _pyio needs to copy at least once in user-space because os.read can't be passed a buffer / always allocates a new one, etc.).

I'm planning to put together a talk on "Journey to the center of open().read()" and submit it to present at a San Francisco bay area python meetup since as I've been working on this I've found it's a very intricate set of operations which didn't match my mental image in some interesting ways. Hope is to then do a second talk with sample working implementation which shows how could rework internals while keeping the existing Python I/O API to reduce overheads, increase readability, solve some longstanding bugs, and possibly enable usage of io_uring for more performance improvement. Overarching goal would be to get down to one largely python native I/O implementation with improved performance from the optimizations as well as opening new performance improvement avenues.

@vstinner This replaces #121593 on top of #123412

Issue: Avoid calling isatty() for most open() calls #90102

Modules/_io/fileio.c

Lib/_pyio.py

Modules/_io/fileio.c

Co-authored-by: Victor Stinner <vstinner@python.org>

vstinner

LGTM

Update for pythongh-111178

vstinner · 2024-10-08T06:51:54Z

Merged, thank you. At the first read, I didn't understand the relationship between S_ISCHR() and isatty(). But your comment makes sense and explains it correctly.

ngnpope · 2024-10-08T07:41:28Z

Lib/_pyio.py

+ """
+ if (self._stat_atopen is not None
+ and not stat.S_ISCHR(self._stat_atopen.st_mode)):
+ return True


I think this should be:

Suggested change

return True

return False

Yep, Py_RETURN_FALSE; I got right in the C but not the pyio. Making a new PR to update....

Co-authored-by: Victor Stinner <vstinner@python.org>

cmaloney added 3 commits October 2, 2024 21:06

pythongh-90102: Shortcut isatty in _pyio open()

a1a1cb7

pythongh-90102: Shortcut isatty in _io open()

b07dac7

add news

346517e

bedevere-app bot mentioned this pull request Oct 3, 2024

Avoid calling isatty() for most open() calls #90102

Closed

bedevere-app bot added the awaiting review label Oct 3, 2024

cmaloney commented Oct 3, 2024

View reviewed changes

Modules/_io/fileio.c Show resolved Hide resolved

cmaloney added 3 commits October 2, 2024 23:52

WinConsoleIO add _isatty_openonly, simplify FileIO's

cba68bc

fileio: Fix Ubuntu compile error

110c490

Add accidentally dropped newline to comment

de661d1

vstinner reviewed Oct 3, 2024

View reviewed changes

Lib/_pyio.py Outdated Show resolved Hide resolved

Lib/_pyio.py Outdated Show resolved Hide resolved

Lib/_pyio.py Outdated Show resolved Hide resolved

Modules/_io/fileio.c Show resolved Hide resolved

cmaloney and others added 3 commits October 3, 2024 08:40

Update Lib/_pyio.py

1f96a9f

Co-authored-by: Victor Stinner <vstinner@python.org>

_open_only, tweak comments

1b12723

switch to reST lieral backquotes

1783237

vstinner approved these changes Oct 3, 2024

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting review labels Oct 3, 2024

Merge branch 'main' into cmaloney/stat_atopen_skip_isatty_t3

882867c

Update for pythongh-111178

vstinner merged commit cc9b9be into python:main Oct 8, 2024
37 checks passed

bedevere-app bot removed the awaiting merge label Oct 8, 2024

cmaloney deleted the cmaloney/stat_atopen_skip_isatty_t3 branch October 8, 2024 06:52

ngnpope reviewed Oct 8, 2024

View reviewed changes

efimov-mikhail pushed a commit to efimov-mikhail/cpython that referenced this pull request Oct 9, 2024

pythongh-90102: Remove isatty call during regular open (python#124922)

c214282

Co-authored-by: Victor Stinner <vstinner@python.org>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-90102: Remove isatty call during regular open #124922

gh-90102: Remove isatty call during regular open #124922

cmaloney commented Oct 3, 2024 •

edited

Loading

vstinner left a comment

vstinner commented Oct 8, 2024

ngnpope Oct 8, 2024

cmaloney Oct 8, 2024

cmaloney Oct 8, 2024

gh-90102: Remove isatty call during regular open #124922

gh-90102: Remove isatty call during regular open #124922

Conversation

cmaloney commented Oct 3, 2024 • edited Loading

Benchmark

vstinner left a comment

Choose a reason for hiding this comment

vstinner commented Oct 8, 2024

ngnpope Oct 8, 2024

Choose a reason for hiding this comment

cmaloney Oct 8, 2024

Choose a reason for hiding this comment

cmaloney Oct 8, 2024

Choose a reason for hiding this comment

cmaloney commented Oct 3, 2024 •

edited

Loading