merge main into amd-staging #597

ronlieb · 2025-11-15T14:18:28Z

No description provided.

Some cases are relying on SIFixSGPRCopies to force VALU reg_sequence inputs with SGPR inputs to use all VGPR inputs, but this doesn't always happen if the reg_sequence isn't invalid. Make sure we use a vgpr up-front here so we don't rely on something later.

As in title. Without this, fpext behaves in SelectionDAG as always having no fast-math flags.

) This PR improves the lowering of vectors of fp16 when using fpext. Previously vectors of fp16 were scalarized leading to lots of extra instructions. Now, vectors of fp16 will be lowered when extended to fp64 via the preexisting lowering logic for extends. To make use of the existing logic, we need to add elements until we reach the next power of 2.
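The padding step described above can be sketched in isolation. This is only an illustration of "add elements until we reach the next power of 2", not the actual lowering code: `paddedElementCount` and `padToPowerOf2` are hypothetical names, fp16 lanes are modelled as raw 16-bit patterns, and filling the extra lanes with zero is an assumption (the real lowering may use undefined lanes instead).

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: smallest power of two that is >= n.
// For n == 0 this returns 1.
inline std::size_t paddedElementCount(std::size_t n) {
  std::size_t p = 1;
  while (p < n)
    p <<= 1;
  return p;
}

// Hypothetical helper: widen a vector of fp16 bit patterns to the next
// power-of-two length. Zero fill is an assumption made for this sketch.
inline std::vector<std::uint16_t> padToPowerOf2(std::vector<std::uint16_t> v) {
  v.resize(paddedElementCount(v.size()), 0);
  return v;
}
```

Under these assumptions a 3-element vector is widened to 4 lanes and a 5-element vector to 8, while a 4-element vector is left unchanged.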

Handle this for consistency with the zext case.

…lvm#168168) This probably should have turned into a regular integer constant earlier. This is to defend against future regressions.

The main improvement is to the mfma tests. There are some mild regressions scattered around, and a few major ones. The worst regressions are in some of the bitcast tests; these are cases where the SGPR argument list runs out and uses VGPRs, and the copies-from-VGPR are misidentified as divergent. Most of the shufflevector tests are also regressions. These end up with cleaner MIR, but then get poor regalloc decisions.

Implement support for the OffsetOfExpr

Upstream ExtVectorElementExpr with result Vector type

…#166724) Per [LWG554](https://cplusplus.github.io/LWG/issue554), the rationale is that even if `true / false` traps, the values causing the trap are the converted `int` values produced by the usual arithmetic conversions, not the original `bool` values. This is also true for all other non-promoted integer types. As a result, `std::numeric_limits<I>::traps` should be `false` if `I` is a non-promoted integer type. Fixes llvm#166053.
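The promotion argument can be checked at compile time. The sketch below only demonstrates that division on `bool`, `signed char`, and `short` operands is performed on `int`; `dividesAsInt` and `reportsTraps` are hypothetical names, and the value returned by `reportsTraps` depends on the standard library in use, so nothing is asserted about it here.

```cpp
#include <limits>
#include <type_traits>

// Hypothetical trait: true when T / T is computed as int, i.e. the operands
// are promoted before the division takes place.
template <class T>
constexpr bool dividesAsInt =
    std::is_same<decltype(T{} / T{}), int>::value;

static_assert(dividesAsInt<bool>, "bool operands are promoted to int");
static_assert(dividesAsInt<signed char>, "signed char operands are promoted to int");
static_assert(dividesAsInt<short>, "short operands are promoted to int");
static_assert(!dividesAsInt<long long>, "long long is not promoted");

// Hypothetical accessor: reports what the library in use says about traps.
template <class T>
constexpr bool reportsTraps() {
  return std::numeric_limits<T>::traps;
}
```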

…llvm#165779) (llvm#168034) Refer to llvm#158276 for previous hotfix. In Z3, boolean expressions are incompatible with bitvec operators. However, C expressions like `-(5 && a)` will generate such symbolic expressions, which will be further used as an integer. To be compatible with such usages, this fix converts such expressions to integer using the existing `fromCast`.
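For reference, this is the kind of source expression involved: the result of `&&` is a boolean, and the unary minus then uses it as an integer. The function name `negatedLogicalAnd` is hypothetical and the snippet shows only the language semantics, not the analyzer or Z3 side of the fix.

```cpp
// Hypothetical example function: `5 && a` yields a boolean, which is
// promoted to int (0 or 1) before the negation is applied.
inline int negatedLogicalAnd(int a) {
  return -(5 && a);
}
```

With this definition, `negatedLogicalAnd(0)` is `0` and any non-zero argument gives `-1`.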

Update test to capture unnamed VPValues in variables, making it easier to update with future VPlan changes.

…n. (llvm#167965) Extend willNotFreeBetween to perform simple checking across blocks to support the case where CtxI is in a successor of the block that contains the assume, but the assume's parent is the single predecessor of CtxI's block. This enables using `__builtin_assume_dereferenceable` to vectorize std::find_if and co in practice. End-to-end reproducer: https://godbolt.org/z/6jbsd4EjT PR: llvm#167965
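A minimal sketch of the usage pattern this targets, assuming a fixed-size buffer. It is not the linked reproducer: `findFirstNegative`, `kCount`, and the `ASSUME_DEREFERENCEABLE` wrapper macro are hypothetical, and the macro expands to nothing on compilers that do not provide the Clang builtin. Whether the loop is actually vectorized depends on the compiler version and flags.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical wrapper: use the Clang builtin when available, else a no-op.
#if defined(__has_builtin)
#  if __has_builtin(__builtin_assume_dereferenceable)
#    define ASSUME_DEREFERENCEABLE(p, n) __builtin_assume_dereferenceable((p), (n))
#  endif
#endif
#ifndef ASSUME_DEREFERENCEABLE
#  define ASSUME_DEREFERENCEABLE(p, n) ((void)0)
#endif

// Hypothetical fixed element count for this sketch.
constexpr std::size_t kCount = 8;

// Promise that all kCount elements are readable, then search them.
// The caller must really pass a buffer of at least kCount ints; a false
// assumption is undefined behavior.
inline const int *findFirstNegative(const int *data) {
  ASSUME_DEREFERENCEABLE(data, kCount * sizeof(int));
  return std::find_if(data, data + kCount, [](int v) { return v < 0; });
}
```

The early-exit loop inside `std::find_if` may read elements past the first match once vectorized, which is why the optimizer needs to know the whole range is dereferenceable.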

hopefully resolves oclConformance fails

z1-cciauto · 2025-11-15T14:19:42Z

PSDB Link: https://compiler-ci.amd.com/job/compiler-psdb-amd-staging/2821

ronlieb · 2025-11-15T18:52:44Z

!PSDB

z1-cciauto · 2025-11-15T18:54:27Z

PSDB Link: https://compiler-ci.amd.com/job/compiler-psdb-amd-staging/2822

arsenm and others added 16 commits November 14, 2025 20:19

[SelectionDAGBuilder] Propagate fast-math flags to fpext (llvm#167574)

e7b41df

As in title. Without this, fpext behaves in SelectionDAG as always having no fast-math flags.

AMDGPU: Use vgpr to implement divergent i32->i64 anyext (llvm#168167)

d8f6e10

Handle this for consistency with the zext case.

AMDGPU: Consider isVGPRImm when forming constant from build_vector (l…

9fecebf

…lvm#168168) This probably should have turned into a regular integer constant earlier. This is to defend against future regressions.

MCNopsFragment,MCBoundaryAlignFragment: Use parent MCSubtargetInfo

d9dfe75

MCAsmBackend: Remove unneeded MCAssembler parameter

29e3c2e

[CIR] Implement support for OffsetOfExpr (llvm#167726)

30c8465

Implement support for the OffsetOfExpr

[CIR] ExtVectorElementExpr with result Vector type (llvm#167925)

22f550b

Upstream ExtVectorElementExpr with result Vector type

[VPlan] Strip outdated comment in optimizeForVFAndUF (NFC) (llvm#168068)

85db928

[LV] Use variables in CHECK lines for unnamed VPValues in test.

ca26cf8

Update test to capture unnamed VPValues in variables, making it easier to update with future VPlan changes.

merge main into amd-staging

b51530b

hopefully resolves oclConformance fails

ronlieb requested review from a team and dpalermo November 15, 2025 14:18

dpalermo approved these changes Nov 15, 2025

ronlieb merged commit f473089 into amd-staging Nov 15, 2025
11 of 12 checks passed

ronlieb deleted the amd/merge/upstream_merge_20251115081655 branch November 15, 2025 23:13

13 participants