gh-104635: Add NO_EXCEPTION flag to opcode metadata #106394

corona10 · 2023-07-04T02:25:20Z

This PR is motivated by comments from @iritkatriel and @carljm.

> There are several code patterns that can cause multiple stores to the same locals index in near succession,
> ...
> Also need to make sure there isn't something between them that could raise an exception. We don't currently have this, we probably need to add this in the opcode_metadata.

But I'm not sure whether this is the concept Irit originally intended, so I wrote this PR as a PoC that uses a heuristic to identify opcodes that will not raise exceptions.
If I missed something, please let me know.

We can use this metadata for aggressive optimization between two opcodes that will not raise exceptions.
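As a rough illustration of the kind of optimization this metadata could enable, here is a minimal sketch of dead store elimination within a single basic block. The `(opname, arg)` tuple representation, the `NO_RAISE` set, and the function name are assumptions made for this example; they are not the compiler's actual data structures or the PR's implementation.

```python
# Hypothetical metadata: opcodes assumed, for this sketch only, never to raise.
NO_RAISE = {"LOAD_CONST", "STORE_FAST", "POP_TOP", "NOP"}


def eliminate_dead_stores(block):
    """Replace a STORE_FAST with POP_TOP when the same local is stored
    again before any instruction that might raise (or read the local)."""
    result = list(block)
    pending = {}  # local index -> position of the last unobserved store
    for pos, (opname, arg) in enumerate(block):
        if opname not in NO_RAISE:
            # Conservative barrier: anything outside the set might raise or
            # observe locals, so earlier stores must be kept.
            pending.clear()
        elif opname == "STORE_FAST":
            if arg in pending:
                # The earlier store is overwritten before being observed.
                result[pending[arg]] = ("POP_TOP", None)
            pending[arg] = pos
    return result
```

Any opcode missing from the set acts as a barrier, so an incomplete set only costs missed optimizations rather than correctness.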

Issue: dead store elimination in the compiler #104635

iritkatriel · 2023-07-05T07:27:14Z

What is the heuristic to determine whether an opcode raises?

corona10 · 2023-07-05T08:41:23Z

> What is the heuristic to determine whether an opcode raises?

I thought we should use either a blocklist or an allowlist approach for classifying opcodes.
In the current implementation, I used a blocklist approach.

If the opcode calls any API whose name begins with _Py or Py (including Py_INCREF), it is treated as possibly raising an exception.
The other case is ERROR_IF: an opcode that uses it may already have hit an error before the macro is reached.

The reason for skipping the JUMPXXX opcodes as a special case is that the current implementation does not pass the analyzer's assertion for them. I should analyze why this is happening, but before digging into it I wanted to agree on the idea behind this metadata.
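The blocklist idea described above can be sketched roughly as follows. This is a simplified stand-in that scans identifiers with a regular expression rather than the generator's real tokenizer; the function name and the regex are assumptions for illustration only.

```python
import re

# Simplified identifier scan; the real generator works on lexer tokens.
IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def looks_exception_free(body: str) -> bool:
    """Return False if the opcode body mentions ERROR_IF or any
    identifier starting with Py or _Py (compared case-insensitively)."""
    for name in IDENTIFIER.findall(body):
        lowered = name.lower()
        if lowered == "error_if":
            return False
        if lowered.startswith(("py", "_py")):
            return False
    return True
```

Note that a body calling a helper without a Py prefix, and without ERROR_IF, would be classified as exception-free by this sketch.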

carljm

It's not clear to me that "exception-raising" status of an opcode changes enough that we need to automatically determine this metadata via heuristic on the opcode implementation, rather than just manually maintain it.

I'm not necessarily opposed, if we can find a sufficiently conservative heuristic that does not fail in the unsafe direction. But I'm not sure I see any such possible heuristic, given my inline comment.

carljm · 2023-07-05T17:15:37Z

Tools/cases_generator/generate_cases.py

@@ -252,6 +252,7 @@ class InstructionFlags:
    HAS_CONST_FLAG: bool
    HAS_NAME_FLAG: bool
    HAS_JUMP_FLAG: bool
+    NO_EXCEPTION_FLAG: bool


Negative flags tend to lead to confusing double-negative code constructs. So I would prefer the flag to be CAN_RAISE_FLAG rather than NO_EXCEPTION_FLAG, and similarly throughout the PR.
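To make the readability point concrete, here is a small sketch comparing the two spellings of the same check. The dataclasses and function names are hypothetical stand-ins, not the generator's `InstructionFlags`.

```python
from dataclasses import dataclass


@dataclass
class NegativeFlags:
    no_exception: bool  # hypothetical stand-in for NO_EXCEPTION_FLAG


@dataclass
class PositiveFlags:
    can_raise: bool  # hypothetical stand-in for CAN_RAISE_FLAG


def must_keep_store_negative(between):
    # Double negative: "not all of them are no-exception".
    return not all(flags.no_exception for flags in between)


def must_keep_store_positive(between):
    # Same decision, stated directly.
    return any(flags.can_raise for flags in between)
```

Both functions make the same decision; the positive form simply avoids negating a negated name.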

carljm · 2023-07-05T17:16:59Z

Tools/cases_generator/generate_cases.py

+        if token.kind == "IDENTIFIER":
+            if token_text == "error_if":
+                return False
+            if token_text.startswith("py") or token_text.startswith("_py"):


There are static helper functions in ceval.c that are used by opcodes and not Py prefixed, and some of these can raise. E.g. match_class. We can special case the existing ones, but someone could add a new one at any time. So I don't think this heuristic is safe, and I'm not sure I see any possible heuristic that would be safe in the face of possible future changes.

I guess the match_class case should be covered by ERROR_IF used afterward to check if _PyErr_Occurred. But what if someone calls a non Py prefixed helper function and then uses a manual goto error; rather than ERROR_IF? There are some opcodes that use goto error; directly...

Maybe checking for both ERROR_IF and goto error would be adequate, and we wouldn't even need to check for use of Py prefixed API? For an opcode implementation to handle an error correctly, it seems it needs to goto error or ERROR_IF. Unless someone introduces another new macro that includes goto error, that would break the heuristic.

Existing macros that implicitly goto error include CHECK_EVAL_BREAKER, DECREF_INPUTS_AND_REUSE_FLOAT, and INSTRUMENTED_JUMP.
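The alternative heuristic suggested here could look something like the sketch below: treat an opcode as possibly raising if its body uses ERROR_IF, a literal `goto error`, or one of the macros listed above. The function name and regular expressions are assumptions for illustration; other error labels or newly added macros would not be detected, which is exactly the fragility discussed above.

```python
import re

# Macros named in the review as implicitly containing a goto error.
IMPLICIT_GOTO_ERROR_MACROS = (
    "CHECK_EVAL_BREAKER",
    "DECREF_INPUTS_AND_REUSE_FLOAT",
    "INSTRUMENTED_JUMP",
)

ERROR_IF = re.compile(r"\bERROR_IF\s*\(")
GOTO_ERROR = re.compile(r"\bgoto\s+error\b")


def can_raise(body: str) -> bool:
    """Return True if the opcode body contains a recognized error exit."""
    if ERROR_IF.search(body) or GOTO_ERROR.search(body):
        return True
    return any(
        re.search(rf"\b{re.escape(macro)}\b", body)
        for macro in IMPLICIT_GOTO_ERROR_MACROS
    )
```

Under this sketch, a call to a Py-prefixed API alone no longer marks an opcode as raising; only an explicit error exit does.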

pythongh-104635: Add NO_EXCEPTION flag to opcode metadata

11cf8ec

corona10 requested review from carljm and iritkatriel July 4, 2023 02:25

bedevere-bot added the awaiting core review label Jul 4, 2023

bedevere-bot mentioned this pull request Jul 4, 2023

dead store elimination in the compiler #104635

Open

corona10 changed the title ~~gh-104635: add NO_SIDE_EFFECT flag to opcode metadata~~ gh-104635: Add NO_EXCEPTION flag to opcode metadata Jul 4, 2023

corona10 added skip news DO-NOT-MERGE labels Jul 4, 2023

corona10 requested a review from markshannon as a code owner July 4, 2023 02:48

corona10 force-pushed the gh-104635-no-side-effect branch from 4d2ac67 to 4025d49 Compare July 4, 2023 07:07

Add application example

47b2995

corona10 force-pushed the gh-104635-no-side-effect branch from 4025d49 to 47b2995 Compare July 4, 2023 07:08

Nadasayed129 approved these changes Jul 4, 2023

remove GO_TO_INSTRUCTION case

3c59a3b

carljm reviewed Jul 5, 2023

corona10 closed this Mar 6, 2024
