[pull] main from llvm:main #5634

pull · 2025-09-12T01:14:13Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.3)

Can you help keep this open source service alive? 💖 Please sponsor : )

b50ad94 added umul_with_overflow simplifications to InstSimplifyFolder (used by SCEVExpander) and 9b1b937 added dead instruction cleanup to SCEVExpander. Remove special handling of umul by 1, handled automatically due to the changes above.

This worfklow is still being tested, since I can't figure out how to trigger it without actually merging it to main.

Adds tests and targets for the remaining inttypes functions.

Adds tests and targets for all the ctype functions.

This patch implements the `llvm.loop.estimated_trip_count` metadata discussed in [[RFC] Fix Loop Transformations to Preserve Block Frequencies](https://discourse.llvm.org/t/rfc-fix-loop-transformations-to-preserve-block-frequencies/85785). As the RFC explains, that metadata enables future patches, such as PR #128785, to fix block frequency issues without losing estimated trip counts.

Early-cancellation should have been removed from libcxx-run-benchmarks.yml in the first place, not libcxx-build-and-test.yaml.

Restrict to VGPR only (VRegSrc_32) for mfma scale operands to workaround a hardware design defect: For all Inline/SGPR constants, SP HW use bits [30:23] as the scale. TODO: We may still be able to allow Inline Constants/SGPR, with a proper shift, to obtain a potentially better performance. Fixes: SWDEV-548629

…158055) Fix regression after 62f2641. Previous patch handled the register case, but the memory case snuck another use of ptr_rc_tailcall hidden inside i64mem_TC

In loop of `writeAndReadCoverageRegions`, `OutputFunctions[I].Filenames` references to contents of `Filenames` after returning from `readCoverageRegions` but `Filenames` will be cleared in next call of `readCoverageRegions`, causes dangling reference. The lifetime of the contents of `Filenames` must be equal or longer than `OutputFunctions[I]`, thus it has been moved into `OutputFunctions[I]` (typed `OutputFunctionCoverageData`).

…dress (#154841)

…ED. (#157629) This PR adds the support for treating a function return value to be safe if the function is annotated with NS_RETURNS_RETAINED or CF_RETURNS_RETAINED.

…arking job

This PR implements `SymbolFileNativePDB::AddSymbols` which adds public symbols to the symbol table. These symbols are found in the publics stream. It contains mangled names coupled with addresses. Addresses are a pair of (segment, offset). If I understood correctly, then the segment is the section ID from the COFF header. Sections are already [constructed](https://github.com/llvm/llvm-project/blob/c48ec7fb60b5e0b4100731d75f82ea63c0ec7b45/lldb/source/Plugins/ObjectFile/PECOFF/ObjectFilePECOFF.cpp#L1048) using this 1-based index ([MS docs](https://learn.microsoft.com/en-us/windows/win32/debug/pe-format#section-table-section-headers)). This allows us to use `section_list->FindSectionByID`.

Fixes #148052 . When removing EH Pad blocks, the value defined by them becomes poison. These poison values are then used by `catchret` and `cleanupret`, which is invalid. This commit replaces those unreachable `catchret` and `cleanupret` instructions with `unreachable`.

Reverts #157691 Change was relanded in #157885

The lldb-remote-linux-ubuntu bot (and only this bot) is still failing for TestCortexMExceptionUnwind.py because the Target triple is somehow inheriting a non-Darwin OS. I marked this API test skipUnlessDarwin but this bot can be identified more specifically by a skipIfRemote test. There's no benefit to running this test remotely anyway; it doesn't execute any code.

This patch updates the llvm-debuginfod tests to work with the lit internal shell. One test was missing env before environment variables and another was using a brace expansion.

This reverts commit 5125f47. This was reverted because it broke some debuginfod tests. Those have been fixed now in #158141. Relanding now and hoping it sticks this time.

Like other functions which results in abort, treat asm brk instruction as trivial.

…58148)

This solves a tricky issue where we can't install the libc++ dependencies until after we've checked out the monorepo.

…58158) `mlir/test/Dialect/Vector/td/unroll-elements.mlir` is fed as a data dependency into`mlir/test/Dialect/Vector/vector-to-elements-lowering.mlir` added in [#157142](#157142). The Bazel rule here automatically picks up all mlir files as tests, which leads to `vector-to-elements-lowering` failing.

…to build-at-commit

Check if operands are ConstantInt to avoid crashing on constant expression after #156659.

… MIR (#158122) This patch adds serialization of AArch64MachineFunctionInfo::HasStackFrame into MIR.

…158328) Reverts #157656 There are multiple reports that this is causing miscompiles in the MSan test suite after bootstrapping and that this is causing miscompiles in rustc. Let's revert for now, and work to capture a reproducer next week.

Now that ulimit is implemented for the internal shell, we can make sure that the clang tests utilizing ulimit actually work. One just needs the removal of its shell requirement while the other one needs some rework to avoid bash for loops. These are writtein in Python for about the same amount of complexity. Reviewers: ilovepi, cmtice, AaronBallman, Sirraide, petrhosek Reviewed By: ilovepi Pull Request: #157977

### Context #99710 introduced `.loc_label` so we can terminate a line sequence. However, it did not advance PC properly. This is problematic for 1-instruction functions as it will have zero-length sequence. The test checked in that PR shows the problem: ``` # CHECK-LINE-TABLE: Address Line Column File ISA Discriminator OpIndex Flags # CHECK-LINE-TABLE-NEXT: ------------------ ------ ------ ------ --- ------------- ------- ------------- # CHECK-LINE-TABLE-NEXT: 0x00000028: 05 DW_LNS_set_column (1) # CHECK-LINE-TABLE-NEXT: 0x0000002a: 00 DW_LNE_set_address (0x0000000000000000) # CHECK-LINE-TABLE-NEXT: 0x00000035: 01 DW_LNS_copy # CHECK-LINE-TABLE-NEXT: 0x0000000000000000 1 1 1 0 0 0 is_stmt # CHECK-LINE-TABLE-NEXT: 0x00000036: 00 DW_LNE_end_sequence # CHECK-LINE-TABLE-NEXT: 0x0000000000000000 1 1 1 0 0 0 is_stmt end_sequence ``` Both rows having PC 0x0 is incorrect, and parsers won't be able to parse them. See more explanation why this is wrong in #154851. ### Design This PR attempts to fix this by advancing the PC to the next available Label, and advance to the end of the section if no Label is available. ### Implementation - `emitDwarfLineEndEntry` will advance PC to the `CurrLabel` - If `CurrLabel` is null, its probably a fake LineEntry we introduced in #110192. In that case look for the next Label - If still not label can be found, use `null` and `emitDwarfLineEndEntry` is smart enough to advance PC to the end of the section - Rename `LastLabel` to `PrevLabel`, "last" can mean "previous" or "final", this is ambigous. - Updated the tests to emit a correct label. ### Note This fix should render #154986 and #154851 obsolete, they were temporary fixes and don't resolve the root cause. --------- Signed-off-by: Peter Rong <[email protected]>

This reverts commit 615d07e. This was causing some MacOS buildbolt failures.

It is a follow-up for #131982 (comment) and #126898 (comment). The names do not match the behaviors, and the revision updates the names. Signed-off-by: hanhanW <[email protected]>

Tests exercizing TBAA metadata (both purposefully and not), and previously generated via UTC, have been regenerated and updated to version 6.

…stead of uint (#154827) This PR is updating Root Signature YAML to use enums, this is a required change to remove the use of to_underlying from DirectXContainer binary file. Closes: [#150676](#150676)

This PR turns on all Standalone tests for Windows except for the plugins (which aren't enabled by default).

Make IntrinsicsToAttributesMap's func. and arg. fields be able to have adaptive sizes based on input other than hardcoded 8bits/8bits. This will ease the pressure for adding new intrinsics in private downstreams. func. attr bitsize will become 7(127/128) vs 8(255/256)

- Introduced a new method `IsNVPTX()` in `ArchSpec` to check for NVPTX architecture. - Implemented the corresponding method in `ArchSpec.cpp` to utilize the existing triple architecture checks.

…efore (#157821) This change was motivated by CK where many VMCNT(0)'s were generated due to instructions lacking !alias.scope metadata. The two causes of this were: 1) LowerLDSModule not tacking on scope metadata on a single LDS variable 2) IPSCCP pass before inliner replacing noalias ptr derivative with a global value, which made inliner unable to track it back to the noalias ptr argument. However, it turns out that IPSCCP losing the scope information was largely ineffectual as ScopedNoAliasAA was able to handle asymmetric condition, where one MemLoc was missing scope, and still return NoAlias result. AMDGPU however was checking for existence of scope in SIInsertWaitcnts and conservatively treating it as aliasing all and inserted VMCNT(0) before DS_READs, forcing it to wait for all previous LDS DMA instructions. Since we know that ScopedNoAliasAA can handle asymmetry, we should also allow AA query to determine if two MIs may alias. Passed PSDB. Previous attempt to address the issue in IPSCCP, likely stalled: #154522 This solution may be preferrable over that as issue only affects AMDGPU.

Reverts #157529 Sorry, I missed that the missed that the LLVM test was using clang - layering dictates thats not OK. Please readjust the test case to work like the existing test coverage (or perhaps the existing test coverage is sufficient?) and post a new PR.

Dependabot cannot configure the branch prefix, which means it fails everytime it tries to run because we only allow user/ branches. This is in preparation for using Renovate which supports custom branch prefixes and has other advantages, like the ability to run/get setup without any assisstance from a repository admin unlike dependabot. This makes it significantly more hackable for the rest of the community.

This test errors when trying to append to the `%t` file when run in an environment where the source tree is mounted read-only, since `cp` preserves the read-only file permission.

When materializing integer ranges of splat tensors or vector as constants, they should use DenseElementsAttr of the shaped type, not IntegerAttrs of the element types, since this can violate the invariants of tensor/vector ops. Co-authored-by: Jeff Niu <[email protected]>

This is required for the upcoming bazel 9.x release where this rule is no longer automatically available.

@qedawkins

…157736) This commit: - Introduces a new `InParallelOpInterface`, along with the `ParallelCombiningOpInterface`, represent the parallel updating operations we have in a parallel loop of `scf.forall`. - Change the name of `ParallelCombiningOpInterface` to `InParallelOpInterface` as the naming was quite confusing. - `ParallelCombiningOpInterface` now is used to generalize operations that insert into shared tensors within parallel combining regions. Previously, only `tensor.parallel_insert_slice` was supported directly in `scf.InParallelOp` regions. - `tensor.parallel_insert_slice` now implements `ParallelCombiningOpInterface`. This change enables future extensions to support additional parallel combining operations beyond `tensor.parallel_insert_slice`, which have different update semantics, so the `in_parallel` region can correctly and safely represent these kinds of operation without potential mistakes such as races. Author credits: @qedawkins

…4379)" This reverts commit 1bafd02. This breaks the LLDB data formatters which means these failures show up on every premerge run. Reverting for now until fixing the LLDB formatters can be coordinated with a relanding.

There was a recent patch that added in some tests to the lit test suite that use split-file. An explicit dependency in CMake was not added, which led to check-lit not working if being run without doing a full build first. This patch explicitly adds the dependency inside the CMake file to fix this configuration.

…o sg (#155443) This PR adds patterns to distribute vector.step and vector.shape_cast op from wg to sg and it also enables constant, broadcast and elementwise ops to handle the slice attribute

…ata` (#154635) `MD_prof` is safe to keep when e.g. hoisting instructions. Issue #147390

…ing (#155296) Issue #152767

Modernized it to using `update_test_checks` which addresses an ambgiuty in the previous test formulation, where a profile metadaat of value `i32 1` would have (incorrectly matched.

…singleton. (#158012)

We were using the step security fork after the tj-actions/changed-files supply chain attack given Github disabled the repo and all our actions were failing during that time. Switch away from the fork back to the main repository to avoid an extra level of indirection until we can probably just stop using this action/roll our own.

fhahn and others added 30 commits September 11, 2025 20:09

[libc++] Add a workflow that builds benchmarks when commenting on a PR

162755b

This worfklow is still being tested, since I can't figure out how to trigger it without actually merging it to main.

[libc][bazel] Add tests and targets for inttypes (#158127)

b4650a4

Adds tests and targets for the remaining inttypes functions.

[libc][bazel] add tests and targets for ctype (#158124)

0ab2df2

Adds tests and targets for all the ctype functions.

[libc++] Get rid of concurrency in the libc++ benchmarking job

e08588d

[libc++] Remove concurrency in the right libc++ job

bd6e217

Early-cancellation should have been removed from libcxx-run-benchmarks.yml in the first place, not libcxx-build-and-test.yaml.

[cmake] Add missing shared library dependency after f3efbce

4ae520b

[Utils] Add support for split-file to diff_test_updater (#157765)

9eb17cc

[libc++] Fix sed pattern to extract benchmarks from the comment body

73e64e5

X86: Fix win64 tail call regression for tail call to loaded pointer (#…

eb3b7dd

…158055) Fix regression after 62f2641. Previous patch handled the register case, but the memory case snuck another use of ptr_rc_tailcall hidden inside i64mem_TC

[SimplifyCFG] Set branch weights when merging conditional store to ad…

3097688

…dress (#154841)

[WebKit checkers] Recognize NS_RETURNS_RETAINED and CF_RETURNS_RETAIN…

b64ed9d

…ED. (#157629) This PR adds the support for treating a function return value to be safe if the function is annotated with NS_RETURNS_RETAINED or CF_RETURNS_RETAINED.

[libc++] Use the correct pull-request base and targets for the benchm…

82218fb

…arking job

[bazel] Redo LSP changes to fix bazel build (#158150)

e0817c6

Reverts #157691 Change was relanded in #157885

[llvm-debuginfod] Update tests to work with internal shell (#158141)

e87dc2e

This patch updates the llvm-debuginfod tests to work with the lit internal shell. One test was missing env before environment variables and another was using a brace expansion.

Reapply "[llvm] Use lit internal shell by default"

73b24d2

This reverts commit 5125f47. This was reverted because it broke some debuginfod tests. Those have been fixed now in #158141. Relanding now and hoping it sticks this time.

[WebKit checkers] Treat asm brk as trivial (#155046)

2f9a458

Like other functions which results in abort, treat asm brk instruction as trivial.

[libc++] Properly extract the PR head and base from the Github event

6272540

[RISCV] Add helper functions to detect CLZ/CTZ/CPOP-like support. (#1…

7d249cf

…58148)

[libc++] Install dependencies right before they're needed

c939551

This solves a tricky issue where we can't install the libc++ dependencies until after we've checked out the monorepo.

[libc++] Make sure we forward the git repository from test-at-commit …

d2f67c3

…to build-at-commit

[ConstFold] Don't crash on ConstantExprs when folding get_active_lane_m.

607a813

Check if operands are ConstantInt to avoid crashing on constant expression after #156659.

[AArch64][MIR] Serialize AArch64MachineFunctionInfo::HasStackFrame to…

23d1ec6

… MIR (#158122) This patch adds serialization of AArch64MachineFunctionInfo::HasStackFrame into MIR.

rnk and others added 30 commits September 12, 2025 10:15

Revert "[lit] Implement ulimit builtin"

330068a

This reverts commit 615d07e. This was causing some MacOS buildbolt failures.

[mlir][NFC] Correct pattern names to match the behaviors. (#158177)

8eba28b

It is a follow-up for #131982 (comment) and #126898 (comment). The names do not match the behaviors, and the revision updates the names. Signed-off-by: hanhanW <[email protected]>

[llvm] Regenerate test checks including TBAA semantics (NFC)

3706070

Tests exercizing TBAA metadata (both purposefully and not), and previously generated via UTC, have been regenerated and updated to version 6.

[DirectX] Updating Root Signature YAML representation to use Enums in…

5fd3aad

…stead of uint (#154827) This PR is updating Root Signature YAML to use enums, this is a required change to remove the use of to_underlying from DirectXContainer binary file. Closes: [#150676](#150676)

[MLIR] enable Standalone example test for Windows (#158183)

1a6b2b6

This PR turns on all Standalone tests for Windows except for the plugins (which aren't enabled by default).

[LLDB][NVIDIA] Add NVPTX architecture support (#158334)

4826039

- Introduced a new method `IsNVPTX()` in `ArchSpec` to check for NVPTX architecture. - Implemented the corresponding method in `ArchSpec.cpp` to utilize the existing triple architecture checks.

[bazel] Fix buildifier in tblgen.bzl (#158351)

1756b6e

[gn build] Port 220d705

01d85e7

[gn build] Port 8c0f3b6

d161d37

[gn build] Port f3efbce

e1efb51

[PGO][test] Ensure test input is writeable after copying. (#158356)

b5516da

This test errors when trying to append to the `%t` file when run in an environment where the source tree is mounted read-only, since `cp` preserves the read-only file permission.

[bazel] Add rules_shell for sh_binary rule (#158365)

f645d20

This is required for the upcoming bazel 9.x release where this rule is no longer automatically available.

[MLIR][XeGPU] Distribute vector.step & vector.shape_cast op from wg t…

8e17f80

…o sg (#155443) This PR adds patterns to distribute vector.step and vector.shape_cast op from wg to sg and it also enables constant, broadcast and elementwise ops to handle the slice attribute

[libc] Change __builtin_memcpy to inline_memcpy. (#158345)

f019e23

[IR] Add MD_prof to the Keep list of `dropUBImplyingAttrsAndMetad…

9e33997

…ata` (#154635) `MD_prof` is safe to keep when e.g. hoisting instructions. Issue #147390

[NFC] Leave a comment in Local.cpp about debug info & sample profil…

8f25ea2

…ing (#155296) Issue #152767

[InstCombine] Make test resilient to metadata presence (#157607)

0d4a615

Modernized it to using `update_test_checks` which addresses an ambgiuty in the previous test formulation, where a profile metadaat of value `i32 1` would have (incorrectly matched.

[WebKit checkers] Treat function pointers with "Singleton" suffix as …

8ee31ab

…singleton. (#158012)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[pull] main from llvm:main #5634

[pull] main from llvm:main #5634

pull bot commented Sep 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

[pull] main from llvm:main #5634

Are you sure you want to change the base?

[pull] main from llvm:main #5634

Conversation

pull bot commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

pull bot commented Sep 12, 2025 •

edited

Loading