You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We have a couple of issues/discussions on better interfacing builtin functions and the VM: #374 #546
Which we will want to do, but we need to make argument clinic more widely used and get it produce better code first.
There are two things we can do:
Use argument more. There are still hundreds of functions in the standard library that could use it, but don't.
Make argument clinic produce vectorcall functions and parse the format string at build time.
The above two tasks is that they are fully independent in term of implementation, but complement each other nicely in terms of performance improvements.
The other nice thing is that they can both be automated.
I've generated stats on the number of calls to the various argument parsing functions in the benchmark suite.
Compare this to the 9.3 billion bytecode instructions executed. _PyArg_CheckPositional is relatively cheap, which suggests the overhead of these parsing functions is not that big a deal on the benchmark suite, but is still measurable. It could be a lot higher for other code.
We will have better numbers when we have automated profiling.
The text was updated successfully, but these errors were encountered:
Let's make better use of argument clinic.
We have a couple of issues/discussions on better interfacing builtin functions and the VM:
#374
#546
Which we will want to do, but we need to make argument clinic more widely used and get it produce better code first.
There are two things we can do:
The above two tasks is that they are fully independent in term of implementation, but complement each other nicely in terms of performance improvements.
The other nice thing is that they can both be automated.
I've generated stats on the number of calls to the various argument parsing functions in the benchmark suite.
Stats:
_PyArg_CheckPositional: 92M calls
_PyArg_UnpackKeywords: 44M calls
Compare this to the 9.3 billion bytecode instructions executed.
_PyArg_CheckPositional
is relatively cheap, which suggests the overhead of these parsing functions is not that big a deal on the benchmark suite, but is still measurable. It could be a lot higher for other code.We will have better numbers when we have automated profiling.
The text was updated successfully, but these errors were encountered: