
Rollup of 5 pull requests #137111

Merged: 10 commits merged into rust-lang:master on Feb 16, 2025

Conversation

jhpratt
Member

@jhpratt jhpratt commented Feb 16, 2025

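Successful merges:

- #135797 (Import initial generated 1.85 relnotes)
- #135909 (Export kernel descriptor for amdgpu kernels)
- #136545 (nvptx64: update default alignment to match LLVM 21)
- #137092 (abi_unsupported_vector_types: say which type is the problem)
- #137097 (Ignore Self in bounds check for associated types with Self:Sized)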

r? @ghost
@rustbot modify labels: rollup


durin42 and others added 10 commits February 4, 2025 10:37
…ingjubilee

Export kernel descriptor for amdgpu kernels

The host runtime (HIP or HSA) expects a kernel descriptor object for each kernel in the ELF file. The amdgpu LLVM backend generates the object. It is created as a symbol with the name of the kernel plus a `.kd` suffix.

Add it to the exported symbols in the linker script, so that it can be found.

For reference, the symbol is created here in LLVM: https://github.com/llvm/llvm-project/blob/d5457e4c1619e5dbeefd49841e284cbc24d35cb4/llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp#L966
I also wrote [a test](Flakebi@6a9115b) for this; I’ll add it once the target is merged and working.
With this, all PRs to get working code for amdgpu are open (this + the target + the two patches adding addrspacecasts for alloca and global variables).

Tracking issue: rust-lang#135024

r? `@workingjubilee`
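
To illustrate the naming rule from the amdgpu description above (kernel name plus a `.kd` suffix), here is a minimal sketch. The function name `symbols_to_export` and the kernel names are hypothetical and are not taken from the PR; the actual change edits the exported-symbol list that rustc passes to the linker, which this does not reproduce.

```rust
/// Hypothetical helper (not rustc's code): for every kernel symbol, also
/// export the kernel descriptor symbol, which the description above says is
/// named `<kernel>.kd`.
fn symbols_to_export(kernels: &[&str]) -> Vec<String> {
    let mut exported = Vec::with_capacity(kernels.len() * 2);
    for kernel in kernels {
        // The kernel entry point itself.
        exported.push(kernel.to_string());
        // The descriptor object the host runtime (HIP or HSA) looks up.
        exported.push(format!("{kernel}.kd"));
    }
    exported
}
```

Calling `symbols_to_export(&["add", "mul"])` yields `add`, `add.kd`, `mul`, `mul.kd`, in that order. Without the `.kd` entries, the descriptors would not be in the exported set and the runtime could not find them.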
nvptx64: update default alignment to match LLVM 21

This changed in llvm/llvm-project@91cb8f5. The commit itself is mostly about some intrinsic instructions, but as an aside it also mentions something about addrspace for tensor memory, which I believe is what this string is telling us.

`@rustbot` label: +llvm-main
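
A target's data layout string has to match what the linked LLVM expects, so a change like the nvptx64 one above usually implies handling older LLVM versions too. The sketch below shows one way such a version gate could look. Everything in it is an assumption for illustration: the layout string, the `p6:32:32` component, and the function name are not quoted from this thread.

```rust
/// ASSUMPTION: an illustrative nvptx64-style layout string, not copied from the PR.
const NEW_LAYOUT: &str = "e-p6:32:32-i64:64-i128:128-v16:16-v32:32-n16:32:64";
/// ASSUMPTION: the component taken to be new in LLVM 21.
const ADDED_COMPONENT: &str = "p6:32:32";

/// Hypothetical helper: return the layout string suitable for the given LLVM
/// major version by dropping the component that older versions do not emit.
fn layout_for_llvm(llvm_major: u32, layout: &str) -> String {
    if llvm_major >= 21 {
        return layout.to_string();
    }
    layout
        .split('-')
        .filter(|component| *component != ADDED_COMPONENT)
        .collect::<Vec<_>>()
        .join("-")
}
```

Filtering whole `-`-separated components, rather than doing a substring replace, keeps the helper from touching a component that merely contains the same text.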
…pes-better-error, r=compiler-errors

abi_unsupported_vector_types: say which type is the problem
…r=oli-obk

Ignore Self in bounds check for associated types with Self:Sized

Fixes rust-lang#137053

This is morally a fix of rust-lang#112319, since the `Self: Sized` check was just missing here.

r? oli-obk
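
For context on the `Self: Sized` change above: an associated type declared with `where Self: Sized` does not have to be named when the trait is used as `dyn Trait`. The sketch below shows that baseline behavior only; the trait and type names are made up, and it does not reproduce rust-lang#137053. As I read the title, the fix extends the same exemption to the check on bounds that mention `Self`, but that reading is an assumption.

```rust
trait Shape {
    // Only meaningful for sized implementors, so `dyn Shape` can skip it.
    type Scaled: Clone
    where
        Self: Sized;

    fn area(&self) -> f64;
}

#[derive(Clone)]
struct Square(f64);

impl Shape for Square {
    type Scaled = Square;

    fn area(&self) -> f64 {
        self.0 * self.0
    }
}

/// `dyn Shape` is written without specifying `Scaled`.
fn total_area(shapes: &[Box<dyn Shape>]) -> f64 {
    shapes.iter().map(|shape| shape.area()).sum()
}
```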
@rustbot rustbot added the A-run-make (Area: port run-make Makefiles to rmake.rs), S-waiting-on-review (Status: Awaiting review from the assignee but also interested parties.), T-compiler (Relevant to the compiler team, which will review and decide on the PR/issue.), T-release (Relevant to the release subteam, which will review and decide on the PR/issue.), and rollup (A PR which is a rollup) labels Feb 16, 2025
@jhpratt
Member Author

jhpratt commented Feb 16, 2025

@bors r+ rollup=never p=5

@bors
Contributor

bors commented Feb 16, 2025

📌 Commit 7884e17 has been approved by jhpratt

It is now in the queue for this repository.

@bors bors added the S-waiting-on-bors label (Status: Waiting on bors to run and complete tests. Bors will change the label on completion.) and removed the S-waiting-on-review label (Status: Awaiting review from the assignee but also interested parties.) Feb 16, 2025
@bors
Contributor

bors commented Feb 16, 2025

⌛ Testing commit 7884e17 with merge 4229b80...

@bors
Contributor

bors commented Feb 16, 2025

☀️ Test successful - checks-actions
Approved by: jhpratt
Pushing 4229b80 to master...

@bors bors added the merged-by-bors label (This PR was explicitly merged by bors.) Feb 16, 2025
@bors bors merged commit 4229b80 into rust-lang:master Feb 16, 2025
7 checks passed
@rustbot rustbot added this to the 1.87.0 milestone Feb 16, 2025
@rust-timer
Collaborator

📌 Perf builds for each rolled up PR:

| PR# | Message | Perf Build Sha |
|---|---|---|
| #135797 | Import initial generated 1.85 relnotes | f5744b69629dc1c7305b493cf32a0a52b0f81244 |
| #135909 | Export kernel descriptor for amdgpu kernels | 083650f597d28e14b2ac115892b8e55692dfa2eb |
| #136545 | nvptx64: update default alignment to match LLVM 21 | 3fabdcabb79ab3fe9ef801fa50fb0d9b9f967034 |
| #137092 | abi_unsupported_vector_types: say which type is the problem | 5b6d67e3f3ffc320ab2ab2a67a34348efc6e711b |
| #137097 | Ignore Self in bounds check for associated types with Self:… | 29845da28b0d7b98caec2faa564e4efd53f80b76 |

previous master: c3fe9e7e4d

In the case of a perf regression, run the following command for each PR you suspect might be the cause: `@rust-timer build $SHA`

@rust-timer
Collaborator

Finished benchmarking commit (4229b80): comparison URL.

Overall result: ✅ improvements - no action needed

@rustbot label: -perf-regression

Instruction count

This is the most reliable metric that we have; it was used to determine the overall result at the top of this comment. However, even this metric can sometimes exhibit noise.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | -0.4% | [-0.4%, -0.4%] | 1 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | -0.4% | [-0.4%, -0.4%] | 1 |

Max RSS (memory usage)

This benchmark run did not return any relevant results for this metric.

Cycles

Results (primary -2.0%, secondary -5.7%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | -2.0% | [-3.1%, -1.2%] | 6 |
| Improvements ✅ (secondary) | -5.7% | [-8.2%, -3.3%] | 10 |
| All ❌✅ (primary) | -2.0% | [-3.1%, -1.2%] | 6 |

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 790.836s -> 790.759s (-0.01%)
Artifact size: 350.03 MiB -> 350.03 MiB (-0.00%)
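
The percentage on the Bootstrap line is a plain relative change between the two timings. A small sketch of that arithmetic follows; the function names are hypothetical, and the two-decimal formatting is an assumption chosen to match how the figure is displayed above.

```rust
/// Relative change from `before` to `after`, in percent.
fn percent_change(before: f64, after: f64) -> f64 {
    (after - before) / before * 100.0
}

/// Render the change with two decimals and a percent sign.
fn format_change(before: f64, after: f64) -> String {
    format!("{:.2}%", percent_change(before, after))
}
```

For the bootstrap times, `format_change(790.836, 790.759)` gives `-0.01%`: the difference is -0.077s on a base of 790.836s, roughly -0.0097%.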

@jhpratt jhpratt deleted the rollup-izd98sx branch March 7, 2025 09:22