Make `create_def` a query #142711

cjgillot · 2025-06-19T08:19:01Z

Creating a new definition using create_def currently behaves as a red node for incremental compilation: the caller query must be recomputed at the next run. This is suboptimal.

If a query's dependencies have not changed, we'd like the query engine to re-create the definitions and continue replaying the dependency graph and load cached data. This idea falls short with a subtle global state dependency: when creating definitions, we must ensure that DefPathHash are globally unique, and carry a disambiguator global state around. Because of this, a call to create_def may change its result DefPathHash due to global state, and force the caller query to be recomputed. If that happens, the caller query would need to be recomputed, but must not re-create the definitions that the query engine created for it.

This PR attempts to solve this issue by using a create_def_raw query. This query is eval_always, so it can safely read global state. The query engine sees its return value as the DefPathHash, so will detect if it unexpectedly changed. We benefit from the caching queries do.

However, we want multiple successive calls to create_def from the same queries to result in a new definition each time. For instance when creating anonymous definitions that are only identified by their parent and a disambiguator. To allow this, we add 2 extra parameters to create_def_raw that are unique to each call by the caller query: the query's DepNode and a counter of calls from within that query.

Consider this example. Some query A calls create_def twice, we generate calls to:

create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 0) This returns ::{use#0}.
create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 1) The query has different arguments, so returns a brand new ::{use::1}.
create_def(CRATE_DEF_ID, Impl) = create_def_raw(CRATE_DEF_ID, Impl, A, 2) This returns ::{impl#0}.

When replaying, if nothing changed, the query engine will call create_def_raw thrice, and A will be green.

If another query created a definition with (CRATE_DEF_ID, Impl), ::{impl#0} is taken. The 3rd call to create_def needs to use another disambiguator, that call to create_def_raw returns ::{impl#1} and be correctly red. A needs to be re-computed. The definitions for uses have already been created when walking the dep-graph. The re-computation calls use the results in create_def_raw cache, aka ::{use#0}, ::{use#1} and ::{impl#1}, and we do not have duplicate definitions.

r? @oli-obk
cc @Zoxc

rustbot · 2025-06-19T08:19:08Z

Some changes occurred to the CTFE machinery

cc @RalfJung, @oli-obk, @lcnr

Some changes occurred to the CTFE / Miri interpreter

cc @rust-lang/miri

Some changes occurred to MIR optimizations

cc @rust-lang/wg-mir-opt

cjgillot · 2025-06-19T10:30:24Z

@bors2 try @rust-timer queue

rust-bors · 2025-06-19T10:30:27Z

⌛ Trying commit 31b00c7 with merge 5c08747…

To cancel the try build, run the command @bors2 try cancel.

Make `create_def` a query Creating a new definition using `create_def` currently behaves as a red node for incremental compilation: the caller query must be recomputed at the next run. This is suboptimal. If a query's dependencies have not changed, we'd like the query engine to re-create the definitions and continue replaying the dependency graph and load cached data. This idea falls short with a subtle global state dependency: when creating definitions, we must ensure that `DefPathHash` are globally unique, and carry a disambiguator global state around. Because of this, a call to `create_def` may change its result `DefPathHash` due to global state, and force the caller query to be recomputed. If that happens, the caller query would need to be recomputed, but must not re-create the definitions that the query engine created for it. This PR attempts to solve this issue by using a `create_def_raw` query. This query is `eval_always`, so it can safely read global state. The query engine sees its return value as the `DefPathHash`, so will detect if it unexpectedly changed. We benefit from the caching queries do. However, we want multiple successive calls to `create_def` from the same queries to result in a new definition each time. For instance when creating anonymous definitions that are only identified by their parent and a disambiguator. To allow this, we add 2 extra parameters to `create_def_raw` that are unique to each call by the caller query: the query's `DepNode` and a counter of calls from within that query. Consider this example. Some query A calls create_def twice, we generate calls to: 1. `create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 0)` This returns `::{use#0}`. 2. `create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 1)` The query has different arguments, so returns a brand new `::{use::1}`. 3. `create_def(CRATE_DEF_ID, Impl) = create_def_raw(CRATE_DEF_ID, Impl, A, 2)` This returns `::{impl#0}`. When replaying, if nothing changed, the query engine will call `create_def_raw` thrice, and `A` will be green. If another query created a definition with `(CRATE_DEF_ID, Impl)`, `::{impl#0}` is taken. The 3rd call to `create_def` needs to use another disambiguator, that call to `create_def_raw` returns `::{impl#1}` and be correctly red. `A` needs to be re-computed. The definitions for `use`s have already been created when walking the dep-graph. The re-computation calls use the results in `create_def_raw` cache, aka `::{use#0}`, `::{use#1}` and `::{impl#1}`, and we do not have duplicate definitions. r? `@oli-obk` cc `@Zoxc`

bjorn3 · 2025-06-19T10:41:12Z

Consider this example. Some query A calls create_def twice, we generate calls to:

create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 0) This returns ::{use#0}.

create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 1) The query has different arguments, so returns a brand new ::{use::1}.

What happens if during a future incr comp session the first definition goes away? Will the second definition still use ::{use#1} as DefPath? If so that would cause the output of incremental compilation to not just depend on the current input (with the incr comp cache being just a cache), but rather on the current input and all previous inputs. That would break both reproducibility of the output (making it impossible for us to ever enable incr comp in release mode even if we were very confident in it's correctness) and make incr comp issues harder to debug as there is no a lot more state that needs to be taken into account. Two compilations may no longer be enough to reproduce any incr comp issue.

rust-log-analyzer · 2025-06-19T11:14:37Z

The job aarch64-gnu-llvm-19-1 failed! Check out the build log: (web) (plain)

Click to see the possible cause of the failure (guessed by this bot)

cjgillot · 2025-06-19T11:30:04Z

Consider this example. Some query A calls create_def twice, we generate calls to:

create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 0) This returns ::{use#0}.

create_def(CRATE_DEF_ID, Use) = create_def_raw(CRATE_DEF_ID, Use, A, 1) The query has different arguments, so returns a brand new ::{use::1}.

What happens if during a future incr comp session the first definition goes away? Will the second definition still use ::{use#1} as DefPath? If so that would cause the output of incremental compilation to not just depend on the current input (with the incr comp cache being just a cache), but rather on the current input and all previous inputs. That would break both reproducibility of the output (making it impossible for us to ever enable incr comp in release mode even if we were very confident in it's correctness) and make incr comp issues harder to debug as there is no a lot more state that needs to be taken into account. Two compilations may no longer be enough to reproduce any incr comp issue.

I'm not sure I fully grasp the question. The first definition going away means that A hit a red dependency before creating definitions, so the incremental engine never attempts to replay create_def(CRATE_DEF_ID, Use, A, 0). Could you detail the step-by-step scenario you have?

bjorn3 · 2025-06-19T11:42:21Z

Never mind. I assumed though you were literally using the index you pass to create_def_raw as the final disambiguator. I now get what you are actually doing.

rust-bors · 2025-06-19T12:59:27Z

☀️ Try build successful (CI)
Build commit: 5c08747 (5c087477fc537974142e0b0eb84b0975d9056214, parent: 70e2b4a4d197f154bed0eb3dcb5cac6a948ff3a3)

Zoxc · 2025-06-19T15:15:21Z

I haven't looked at the PR too closely yet, but it seems to be a different direction than what me and @oli-obk were heading in #t-compiler > query side effects: create_def.

This seems to revert parts of #140453 by adding back global state which could possibly cause non-deterministic def paths for the parallel compiler.

cjgillot · 2025-06-19T17:00:43Z

I agree this is a different direction. (TBH, I'm not sure I fully understand the direction you are taking in that thread.) The problem is so difficult that I think it worth experimenting with several competing schemes until we are convinced we have a satisfying one. This PR is heavily inspired from that zulip discussion's first message, and salsa entity scheme.

I don't agree this PR could introduce non-determinism. The manipulation of untracked state, including disambiguators, is done in eval_always queries, which is ok.

oli-obk · 2025-06-19T17:10:20Z

I don't agree this PR could introduce non-determinism. The manipulation of untracked state, including disambiguators, is done in eval_always queries, which is ok.

The nondeterminism is that two threads invoking it at the same time get arbitrary disambiguators back. This means running the compiler twice with parallel rustc produces different results. Soundness of the query system is not the issue, reproducability is. This also applies to incremental getting different results if forcing causes them to create def ids in different orders than in previous runs

cjgillot · 2025-06-19T19:18:58Z

The nondeterminism is that two threads invoking it at the same time get arbitrary disambiguators back. This means running the compiler twice with parallel rustc produces different results. Soundness of the query system is not the issue, reproducability is. This also applies to incremental getting different results if forcing causes them to create def ids in different orders than in previous runs

Is that even possible?

Uniqueness of the DefPathHash is a global property, so disambiguation must be a global mutable state. We can limit usage and reduce likelihood, but we cannot get rid of it entirely.

So if two threads can create definitions, the only way to make compilation reproducible is to ensure a reproducible order between those two threads.

oli-obk · 2025-06-20T04:49:14Z

The way we solved this is to disambiguate different queries that create DefIds by using new and different DefPathData.

Of course if we screw up and produce the same one twice we get an ICE, but then we just go fix that by adding a new variant. The combination of different parent plus different DefPathData should always be uniquely specifyable. I don't see where it would not be. E.g. for the RPITITs we count its position in the return type of its parent function to add an integral disambiguator. Parent + Rpitit DefPathData + index is always unique

rust-timer · 2025-06-20T05:03:22Z

Finished benchmarking commit (5c08747): comparison URL.

Overall result: ❌ regressions - please read the text below

Benchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @rustbot label: +perf-regression-triaged. If not, please fix the regressions and do another perf run. If its results are neutral or positive, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

	mean	range	count
Regressions ❌ (primary)	0.8%	[0.2%, 1.8%]	153
Regressions ❌ (secondary)	1.3%	[0.2%, 3.8%]	75
Improvements ✅ (primary)	-0.3%	[-0.3%, -0.3%]	1
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.8%	[-0.3%, 1.8%]	154

Max RSS (memory usage)

Results (primary 1.7%, secondary 2.4%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	1.7%	[0.5%, 8.2%]	58
Regressions ❌ (secondary)	3.4%	[0.5%, 14.7%]	25
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-2.8%	[-4.5%, -1.3%]	5
All ❌✅ (primary)	1.7%	[0.5%, 8.2%]	58

Cycles

Results (primary 2.1%, secondary 3.8%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	2.1%	[1.3%, 3.2%]	55
Regressions ❌ (secondary)	3.8%	[1.3%, 6.0%]	28
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	2.1%	[1.3%, 3.2%]	55

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 691.846s -> 691.623s (-0.03%)
Artifact size: 372.01 MiB -> 372.04 MiB (0.01%)

rustbot assigned oli-obk Jun 19, 2025

cjgillot added 2 commits June 19, 2025 10:29

Create untracked state inside create_global_ctxt.

930c5b4

Make create_def a query.

31b00c7

cjgillot force-pushed the query-create-def branch from 3d8ce06 to 31b00c7 Compare June 19, 2025 10:30

This comment has been minimized.

Sign in to view

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jun 19, 2025

This comment has been minimized.

Sign in to view

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Jun 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make `create_def` a query #142711

Make `create_def` a query #142711

cjgillot commented Jun 19, 2025

Uh oh!

rustbot commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

This comment has been minimized.

rust-bors bot commented Jun 19, 2025

Uh oh!

bjorn3 commented Jun 19, 2025

Uh oh!

rust-log-analyzer commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

bjorn3 commented Jun 19, 2025

Uh oh!

rust-bors bot commented Jun 19, 2025

Uh oh!

This comment has been minimized.

Zoxc commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

oli-obk commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

oli-obk commented Jun 20, 2025

Uh oh!

rust-timer commented Jun 20, 2025

Uh oh!

Uh oh!

Make create_def a query #142711

Are you sure you want to change the base?

Make create_def a query #142711

Conversation

cjgillot commented Jun 19, 2025

Uh oh!

rustbot commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

This comment has been minimized.

rust-bors bot commented Jun 19, 2025

Uh oh!

bjorn3 commented Jun 19, 2025

Uh oh!

rust-log-analyzer commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

bjorn3 commented Jun 19, 2025

Uh oh!

rust-bors bot commented Jun 19, 2025

Uh oh!

This comment has been minimized.

Zoxc commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

oli-obk commented Jun 19, 2025

Uh oh!

cjgillot commented Jun 19, 2025

Uh oh!

oli-obk commented Jun 20, 2025

Uh oh!

rust-timer commented Jun 20, 2025

Overall result: ❌ regressions - please read the text below

Instruction count

Max RSS (memory usage)

Cycles

Binary size

Uh oh!

Uh oh!

Make `create_def` a query #142711

Make `create_def` a query #142711