[red-knot] Add control flow for try/except blocks (v3) #13729

AlexWaygood · 2024-10-13T15:52:36Z

Summary

This is a third attempt to add some understanding of exception-handler control flow to red-knot. It follows the previous attempts #13338 (which was very naive about finally blocks) and #13633 (which was overly ambitious in how it attempted to handle finally blocks).

The discussion in #13633 concluded that we aren't yet ready to add an accurate understanding of finally suites to red-knot, as it will require double-visiting of finally suites in order to accurately model the semantics, and this violates some core assumptions made by the use-def map. As such, this PR rips out a lot of the infrastructure that #13633 added for specifically dealing with finally suites, and instead adds some copious TODO comments in builder.rs and the tests.

The rest of this summary section is copied from the summary section of #13633:

The semantics of try/except blocks are very complicated! I've written up a long document outlining all the various jumps control flow could take, which can be found here. I won't try to summarise that document in this PR description. But I will give a brief description of some of the ways I've attempted to model these semantics in this PR:

Abstractions for handling try/except blocks have been added to a new builder submodule, builder/exception_handlers.rs:

TryNodeContext keeps track of state for a single try/except/else/finally block. Exactly what state we need to keep track of varies according to whether the node has a finally branch, and according to which branch of the StmtTry node we're currently visiting.
TryNodeContextStack is a stack of TryNodeContext instances. For any given scope, try blocks can be arbitrarily nested; this means that we must keep a stack of TryNodeContexts for each scope we visit.
TryNodeContextStackManager is a stack of TryNodeContextStacks. Whenever we enter a nested scope, a new TryNodeContextStack is initialised by the TryNodeContextStackManager and appended to the stack of stacks. Whenever we exit that scope, the TryNodeContextStack is popped off the stack of stacks.

The diff for this PR is quite large, but this is mostly tests. There aren't actually that many tests, but they unfortunately need to be quite verbose. This is because we may add a more sophisticated understanding of exception handlers in the future (where we would understand that e.g. x = 1 can never raise an exception), and I wanted the tests to be robust to this so that they wouldn't have to be rewritten when that happens. (This also helps readability of the tests, since we obviously know that x = 1 can never raise exceptions.) To address this, I made sure to use assignments to function calls for testing places where a raised exception could cause a jump in control flow. This will be robust to future improvements, since it will always be the case that we will consider a function call capable of raising arbitrary exceptions.

Test Plan

Tests have been added using mdtest, and can be found in crates/red_knot_python_semantic/resources/mdtest/except_handler_control_flow.md.

AlexWaygood · 2024-10-13T15:56:09Z

(aacc4e7 and onwards are the commits that differ from #13633; all commits prior to that are the same as in #13633.)

github-actions · 2024-10-13T16:13:46Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

MichaReiser · 2024-10-14T16:23:43Z

crates/red_knot_python_semantic/src/semantic_index/builder/except_handlers.rs

+
+/// The contexts of nested `try`/`except` blocks for a single scope
+#[derive(Debug, Default)]
+pub(super) struct TryNodeContextStack(RefCell<Vec<TryNodeContext>>);


Removing the RefCell and instead rely on compile time borrow checking doesn't seem to be difficult. I quickly tried it out locally and the only issue you run into is with record_definition. My recommendation there is to instead expose an iter_mut method on TryNodeStack and inline the logic into builder.rs.

Removing the RefCell should also remove the need for TryNodeContextStackRefMut because you can just return a &TryNodeContextStack or &mut TryNodeContextStack based on the mutability that is needed.

I had to bring back the RefCell in 24d1398. record_definition() is indeed the only place where normal borrowing rules are difficult to work with here, but I still don't see how to avoid using a RefCell internally :( I tried your suggestion of exposing an iter_mut method and inlining the logic into builder.rs, but I still couldn't make the borrow checker happy.

I have at least managed to remove TryNodeContextStackRefMut, however!

Yeah, the self.flow_snapshot makes this a bit awkward. There are two possibilities:

Pass the snapshot to record_definition

Subject: [PATCH] Update snapshot The old snapshot was incorrect because the format range excluded the first (all empty) line. Now the first line gets formatted also. --- Index: crates/red_knot_python_semantic/src/semantic_index/builder.rs IDEA additional info: Subsystem: com.intellij.openapi.diff.impl.patch.CharsetEP <+>UTF-8 =================================================================== diff --git a/crates/red_knot_python_semantic/src/semantic_index/builder.rs b/crates/red_knot_python_semantic/src/semantic_index/builder.rs --- a/crates/red_knot_python_semantic/src/semantic_index/builder.rs (revision 24d139891efc6030ef0f3ec2eebd1999c2e6e831) +++ b/crates/red_knot_python_semantic/src/semantic_index/builder.rs (date 1729073072841) @@ -103,9 +103,9 @@ .expect("Always to have a root scope") } - fn try_node_context_stack(&self) -> &TryNodeContextStack { + fn try_node_context_stack(&mut self) -> &mut TryNodeContextStack { self.try_node_context_stack_manager - .current_try_context_stack() + .current_try_context_stack_mut() } fn push_scope(&mut self, node: NodeWithScopeRef) { @@ -239,7 +239,9 @@ DefinitionCategory::Declaration => use_def.record_declaration(symbol, definition), DefinitionCategory::Binding => use_def.record_binding(symbol, definition), } - self.try_node_context_stack().record_definition(self); + + let snapshot = self.flow_snapshot(); + self.try_node_context_stack().record_definition(snapshot); definition } Index: crates/red_knot_python_semantic/src/semantic_index/builder/except_handlers.rs IDEA additional info: Subsystem: com.intellij.openapi.diff.impl.patch.CharsetEP <+>UTF-8 =================================================================== diff --git a/crates/red_knot_python_semantic/src/semantic_index/builder/except_handlers.rs b/crates/red_knot_python_semantic/src/semantic_index/builder/except_handlers.rs --- a/crates/red_knot_python_semantic/src/semantic_index/builder/except_handlers.rs (revision 24d139891efc6030ef0f3ec2eebd1999c2e6e831) +++ b/crates/red_knot_python_semantic/src/semantic_index/builder/except_handlers.rs (date 1729073033138) @@ -23,6 +23,12 @@ .expect("There should always be at least one `TryBlockContexts` on the stack") } + pub(super) fn current_try_context_stack_mut(&mut self) -> &mut TryNodeContextStack { + self.0 + .last_mut() + .expect("There should always be at least one `TryBlockContexts` on the stack") + } + /// Pop a new [`TryNodeContextStack`] off the stack of stacks. /// /// Each [`TryNodeContextStack`] is only valid for a single scope @@ -38,33 +44,36 @@ /// The contexts of nested `try`/`except` blocks for a single scope #[derive(Debug, Default)] -pub(super) struct TryNodeContextStack(RefCell<Vec<TryNodeContext>>); +pub(super) struct TryNodeContextStack(Vec<TryNodeContext>); impl TryNodeContextStack { /// Push a new [`TryNodeContext`] for recording intermediate states /// while visiting a [`ruff_python_ast::StmtTry`] node that has a `finally` branch. - pub(super) fn push_context(&self) { - self.0.borrow_mut().push(TryNodeContext::default()); + pub(super) fn push_context(&mut self) { + self.0.push(TryNodeContext::default()); } /// Pop a [`TryNodeContext`] off the stack. - pub(super) fn pop_context(&self) -> Vec<FlowSnapshot> { + pub(super) fn pop_context(&mut self) -> Vec<FlowSnapshot> { let TryNodeContext { try_suite_snapshots, } = self .0 - .borrow_mut() .pop() .expect("Cannot pop a `try` block off an empty `TryBlockContexts` stack"); try_suite_snapshots } /// For each `try` block on the stack, push the snapshot onto the `try` block - pub(super) fn record_definition(&self, builder: &SemanticIndexBuilder) { - for context in self.0.borrow_mut().iter_mut() { - context.record_definition(builder.flow_snapshot()); + pub(super) fn record_definition(&mut self, snapshot: FlowSnapshot) { + for context in self.0.iter_mut() { + context.record_definition(snapshot.clone()); } } + + pub(super) fn iter_mut(&mut self) -> impl Iterator<Item = &mut TryNodeContext> { + self.0.iter_mut() + } } /// Context for tracking definitions over the course of a single

do the "move" trick

let use_def = self.current_use_def_map_mut(); match category { DefinitionCategory::DeclarationAndBinding => { use_def.record_declaration_and_binding(symbol, definition); } DefinitionCategory::Declaration => use_def.record_declaration(symbol, definition), DefinitionCategory::Binding => use_def.record_binding(symbol, definition), } let mut try_stack_manager = std::mem::take(&mut self.try_node_context_stack_manager); for context in try_stack_manager.current_try_context_stack_mut().iter_mut() { context.record_definition(self.flow_snapshot()); } self.try_node_context_stack_manager = try_stack_manager; definition

Idea (1) is what I did last night and it caused a massive perf regression, because it means you end up taking a snapshot after every definition, even if you're not visiting a node inside an exception handler!

I'll try out idea (2) and see if it works

MichaReiser

Wow, the code makes this seem easy

This looks good but I would prefer if we avoid using RefCell in the context. I even think that regular borrowing will simplify the implementation by removing the need for the MutRef type.

crates/red_knot_python_semantic/src/semantic_index/builder/except_handlers.rs

crates/red_knot_python_semantic/src/semantic_index/builder.rs

AlexWaygood · 2024-10-15T20:26:23Z

Thanks @MichaReiser! Managed to get rid of the RefCell 🎉

codspeed-hq · 2024-10-15T20:31:02Z

CodSpeed Performance Report

Merging #13729 will not alter performance

_{Comparing except-handler-3 (24d1398) with main (fb1d1e3)}

Summary

✅ 32 untouched benchmarks

AlexWaygood · 2024-10-15T20:37:19Z

Merging #13729 will degrade performances by 22.53%

I don't understand how the last commit (21ab481) I pushed could have caused this. Nothing I did in that commit should have had any negative impact on performance, I don't think. This doesn't make any sense to me. (Edit: I tried squashing commits and force-pushing to trigger a rerun of the benchmarks; no difference ☹️)

crates/red_knot_python_semantic/src/semantic_index/builder.rs

AlexWaygood · 2024-10-16T09:07:12Z

Ready for re-review again!

AlexWaygood added the red-knot Multi-file analysis & type inference label Oct 13, 2024

AlexWaygood requested review from carljm and MichaReiser as code owners October 13, 2024 15:52

AlexWaygood mentioned this pull request Oct 13, 2024

[red-knot] Add control flow for try/except blocks (v2) #13633

Closed

AlexWaygood force-pushed the except-handler-3 branch from 1779602 to aacc4e7 Compare October 13, 2024 16:00

MichaReiser reviewed Oct 14, 2024

View reviewed changes

AlexWaygood force-pushed the except-handler-3 branch from e4d1f72 to 21ab481 Compare October 15, 2024 20:24

AlexWaygood requested a review from MichaReiser October 15, 2024 20:26

[red-knot] Add control flow for try/except blocks (v3)

7717eae

AlexWaygood force-pushed the except-handler-3 branch from 21ab481 to 7717eae Compare October 15, 2024 20:51

AlexWaygood commented Oct 15, 2024

View reviewed changes

crates/red_knot_python_semantic/src/semantic_index/builder.rs Outdated Show resolved Hide resolved

fix perf regression?

24d1398

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[red-knot] Add control flow for try/except blocks (v3) #13729

[red-knot] Add control flow for try/except blocks (v3) #13729

AlexWaygood commented Oct 13, 2024

AlexWaygood commented Oct 13, 2024 •

edited

Loading

github-actions bot commented Oct 13, 2024 •

edited

Loading

MichaReiser Oct 14, 2024

AlexWaygood Oct 16, 2024

MichaReiser Oct 16, 2024

AlexWaygood Oct 16, 2024

MichaReiser left a comment •

edited

Loading

AlexWaygood commented Oct 15, 2024

codspeed-hq bot commented Oct 15, 2024 •

edited

Loading

AlexWaygood commented Oct 15, 2024 •

edited

Loading

AlexWaygood commented Oct 16, 2024

[red-knot] Add control flow for try/except blocks (v3) #13729

Are you sure you want to change the base?

[red-knot] Add control flow for try/except blocks (v3) #13729

Conversation

AlexWaygood commented Oct 13, 2024

Summary

Test Plan

AlexWaygood commented Oct 13, 2024 • edited Loading

github-actions bot commented Oct 13, 2024 • edited Loading

ruff-ecosystem results

Linter (stable)

Linter (preview)

MichaReiser Oct 14, 2024

Choose a reason for hiding this comment

AlexWaygood Oct 16, 2024

Choose a reason for hiding this comment

MichaReiser Oct 16, 2024

Choose a reason for hiding this comment

AlexWaygood Oct 16, 2024

Choose a reason for hiding this comment

MichaReiser left a comment • edited Loading

Choose a reason for hiding this comment

AlexWaygood commented Oct 15, 2024

codspeed-hq bot commented Oct 15, 2024 • edited Loading

CodSpeed Performance Report

Merging #13729 will not alter performance

Summary

AlexWaygood commented Oct 15, 2024 • edited Loading

AlexWaygood commented Oct 16, 2024

AlexWaygood commented Oct 13, 2024 •

edited

Loading

github-actions bot commented Oct 13, 2024 •

edited

Loading

`ruff-ecosystem` results

MichaReiser left a comment •

edited

Loading

codspeed-hq bot commented Oct 15, 2024 •

edited

Loading

AlexWaygood commented Oct 15, 2024 •

edited

Loading