[TB Optimization] Skip subtrees based on the subtree's root node's permissions #4008

JoJoDeveloping · 2024-11-01T23:18:01Z

In #4006, we re-added the functionality for skipping subtrees. It turns out that just skipping subtrees based on their last recorded access is imprecise. In certain cases, we know we can skip subtrees purely based on the root's current permission, without having to track the last access. Specifically:

Disabled nodes can always be skipped, since the whole subtree must necessarily be invariant under all foreign accesses.
Frozen nodes can be skipped for foreign reads, since the whole subtree must necessarily be invariant under all foreign reads.
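
As a rough illustration of the two rules above, here is a minimal sketch of the skip decision. The names `Permission`, `AccessKind` and `can_skip_foreign_access` are made up for this example and are not Miri's actual types; the real check lives in `LocationState` and also takes the "latest foreign access" tracking of #4006 into account.

```rust
/// Hypothetical, simplified permission of a subtree's root node.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Permission {
    Reserved,
    Active,
    Frozen,
    Disabled,
}

/// Kind of the foreign access that arrives at the subtree.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum AccessKind {
    Read,
    Write,
}

/// Returns `true` if a foreign access of kind `access` does not need to be
/// propagated into the subtree whose root currently has permission `root`.
fn can_skip_foreign_access(root: Permission, access: AccessKind) -> bool {
    match (root, access) {
        // `Disabled` subtrees are treated as invariant under all foreign accesses.
        (Permission::Disabled, _) => true,
        // `Frozen` subtrees are treated as invariant under foreign reads only.
        (Permission::Frozen, AccessKind::Read) => true,
        // Everything else must still be visited.
        _ => false,
    }
}
```

For example, `can_skip_foreign_access(Permission::Frozen, AccessKind::Write)` is `false` in this sketch: a foreign write can still change permissions below a Frozen node.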

Note that this PR loosens the notion of "invariant" a bit. For example, a protected Reserved node can be a child of a Frozen node. When that child undergoes a foreign read, it becomes conflicted. If we skip the subtree, the child never becomes conflicted.
The reason this is still OK is that the only effect of this conflictedness is blocking child write accesses. But such accesses are already blocked by the Frozen node further up the tree. So no UB is missed; all that happens is that diagnostics are triggered at a different node.
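
The argument can be replayed on a toy model. The sketch below is not Miri's implementation: `Node`, `foreign_read`, `allows_child_write` and `scenario` are hypothetical names, the only transition modeled is "protected Reserved becomes conflicted on a foreign read", and protectors never end. It only shows that a child write through a Frozen parent is rejected whether or not the subtree was skipped.

```rust
/// Toy permissions; only the `conflicted` flag of `Reserved` is modeled.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Permission {
    Reserved { conflicted: bool },
    Active,
    Frozen,
    Disabled,
}

#[derive(Clone, Copy, Debug)]
struct Node {
    perm: Permission,
    protected: bool,
}

impl Node {
    /// Foreign read: in this toy model the only effect is that a protected
    /// `Reserved` node becomes conflicted. All other transitions are omitted.
    fn foreign_read(&mut self) {
        let protected = self.protected;
        if let Permission::Reserved { conflicted } = &mut self.perm {
            if protected {
                *conflicted = true;
            }
        }
    }

    /// Whether this node lets a child write pass through it.
    fn allows_child_write(&self) -> bool {
        match self.perm {
            Permission::Active => true,
            Permission::Reserved { conflicted } => !(conflicted && self.protected),
            Permission::Frozen | Permission::Disabled => false,
        }
    }
}

/// Indices of the nodes on the root-to-target path that reject a child write.
fn rejecting_nodes(path: &[Node]) -> Vec<usize> {
    path.iter()
        .enumerate()
        .filter(|(_, n)| !n.allows_child_write())
        .map(|(i, _)| i)
        .collect()
}

/// A Frozen node (index 0) with a protected Reserved child (index 1).
/// A foreign read arrives, then a child write is attempted through the child.
fn scenario(skip_frozen_subtree: bool) -> Vec<usize> {
    let mut path = [
        Node { perm: Permission::Frozen, protected: false },
        Node { perm: Permission::Reserved { conflicted: false }, protected: true },
    ];
    if !skip_frozen_subtree {
        for node in path.iter_mut() {
            node.foreign_read();
        }
    }
    rejecting_nodes(&path)
}
```

In this model, `scenario(false)` reports both the Frozen parent and the conflicted child as rejecting the write, while `scenario(true)` reports only the Frozen parent. The write is rejected in both cases; only the set of nodes that could be blamed differs.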

For more detailed analysis of why this is correct, see the in-code comments.

Here is a performance analysis, comparing this PR's improvements with those of #4006:

As in #4006, this is a log graph. The blue line shows performance without #4006, red is with the re-added optimization of #4006, yellow is this PR (which is stacked on top of #4006), and green is just the changes proposed here, but with the "latest foreign access tracking" machinery of #4006 removed. As can be seen, having both combined gives the greatest performance.

Finally, note that this PR is a draft, since it is stacked on top of #4006. This PR only intends to contribute one commit; the rest are included in #4006 and should be discussed there.

RalfJung · 2024-11-04T17:00:50Z

all that happens is that diagnostics are triggered at a different node.

That is at least potentially confusing. :/ But maybe the fix here should be on the diagnostic side, not the core algorithm.

Does this depend on when the GC runs, or is it deterministic?

As in #4006, this is a log graph. The blue line shows performance without #4006, red is with the re-added optimization of #4006, yellow is this PR (which is stacked on top of #4006), and green is just the changes proposed here, but with the "latest foreign access tracking" machinery of #4006 removed. As can be seen, having both combined gives the greatest performance.

It seems like most benchmarks are unchanged by this PR (compared to just #4006), only a few of them benefit. big-allocs gets slightly worse.

Do you have evidence that this is beneficial on (a non-trivial fraction of) real-world code?

JoJoDeveloping · 2024-11-04T17:02:06Z

only a few of them benefit

Indeed. Intuitively, if you use lots of shared references, you benefit.

big-allocs gets slightly worse.

True, but I'd say that this is within measurement imprecision. That test just allocates a lot, without ever touching the memory.

JoJoDeveloping · 2024-11-04T17:04:18Z

That is at least potentially confusing. :/ But maybe the fix here should be on the diagnostic side, not the core algorithm.

It depends. Arguably, the fact that it's because there's a Frozen parent could be clearer than the fact that it's because you are Reserved, conflicted and protected. But note that the diagnostics cannot do anything differently here on their own, because with this change the child node never becomes conflicted.

Does this depend on when the GC runs, or is it deterministic?

It is deterministic.

RalfJung · 2024-11-04T17:02:00Z

src/borrow_tracker/tree_borrows/tree.rs

+                // of `ReservedIM`, `Disabled`, or a not-yet-accessed "lazy" permission thing.
+                // The two former are already invariant under all foreign accesses, and for
+                // the latter it does not really matter, since they can not be used/initialized
+                // due to having a protected parent. So this only affects diagnostics, but the


should be "disabled parent", right?

RalfJung · 2024-11-04T17:02:44Z

src/borrow_tracker/tree_borrows/tree.rs

@@ -185,6 +185,30 @@ impl LocationState {
                // need to be applied to this subtree.
                _ => false,
            };
+            if self.permission.is_disabled() {
+                // A foreign access to a `Disabled` tag will have almost no observable effect.
+                // It's a theorem that `Disabled` node have no protected initialized children,


That's not an obvious theorem -- can you give a brief argument?

JoJoDeveloping · 2024-11-04

It's proven in Coq 😛.

The reason it holds is that to become Disabled, a node needs to have a foreign write access happen. But that write would have triggered the protector of any protected initialized nodes that are children of the node being disabled. And a child of a Disabled node can't become initialized later, because that would require a child access to the to-be-initialized node, which is blocked by the Disabled parent.
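
To make the statement of the theorem concrete, here is a small checker for it on an explicit tree. This is only an illustration under assumed names (`Node`, `invariant_holds` and the `initialized` flag are hypothetical, not Miri's data structures), and it checks all descendants rather than only direct children. It states the invariant; it does not prove it.

```rust
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Permission {
    Reserved,
    Active,
    Frozen,
    Disabled,
}

/// Hypothetical tree node: a permission, a protector flag, and whether the
/// node has been accessed ("initialized") at the location in question.
struct Node {
    perm: Permission,
    protected: bool,
    initialized: bool,
    children: Vec<Node>,
}

/// True if some strict descendant of `node` is both protected and initialized.
fn has_protected_initialized_descendant(node: &Node) -> bool {
    node.children
        .iter()
        .any(|c| (c.protected && c.initialized) || has_protected_initialized_descendant(c))
}

/// Checks the claimed invariant on the whole tree:
/// no `Disabled` node has a protected initialized descendant.
fn invariant_holds(node: &Node) -> bool {
    let ok_here =
        node.perm != Permission::Disabled || !has_protected_initialized_descendant(node);
    ok_here && node.children.iter().all(|c| invariant_holds(c))
}
```

A tree in which a Disabled node has a protected but not-yet-initialized ("lazy") child passes this check, which matches the case the in-code comment singles out.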

RalfJung · 2024-11-04T17:03:19Z

src/borrow_tracker/tree_borrows/tree.rs

+                // It's a theorem that `Disabled` node have no protected initialized children,
+                // and so this foreign access will never trigger any protector.
+                // Further, the children will never be able to read or write again, since they
+                // have a `Disabled` parents. Even further, all children of `Disabled` are one


The argument could end here, right? The permissions below don't matter since anyway no access is possible.

RalfJung · 2024-11-04T17:04:25Z

src/borrow_tracker/tree_borrows/tree.rs

+                // effect, the only further thing they could do is make protected `Reserved`
+                // nodes become conflicted, i.e. make them reject child writes for the further
+                // duration of their protector. But such a child write is already rejected
+                // because this node is frozen. So this only affects diagnostics, but the


Is it possible to add a testcase that demonstrates the effect on diagnostics?

JoJoDeveloping added 3 commits November 1, 2024 15:37

Add benchmark showing effectivity of subtree skipping

67fac9b

Properly fix rust-lang#3846 by resetting parents on lazy node creation

04630e0

This commit supplies a real fix, which makes retags more complicated, at the benefit of making accesses more performant.

try optimizing accesses on large trees by ignoring subtrees if the access would mostly be a NOP

73383fd

RalfJung self-assigned this Nov 4, 2024

RalfJung reviewed Nov 4, 2024
