Fix Conv LHS packing padding/uninitialized ptrs V2 #27215

hariharans29 · 2026-01-30T21:23:02Z

Description

Refer to V1 of the fix here: #27214

This PR includes all fixes from the V1 PR + logic to invalidate the lhs cache pointers in case the pad buffer's underlying buffer has changed due to a resize. The ARM team will look at potentially enhancing this logic after the 1.24.0 release.

Motivation and Context

Fix #26669

Copilot

Pull request overview

Addresses non-deterministic correctness issues by hardening KleidiAI Conv LHS packing/padding behavior and adding an additional CUDA ConvTranspose bias validation.

Changes:

Initialize all entries in the KleidiAI Conv LHS indirection table to padding pointers to avoid uninitialized reads for partial tiles.
Replace the shared static padding buffer with a thread_local buffer and invalidate cached LHS pointer tables when the padding buffer reallocates.
Add CUDA ConvTranspose bias shape validation against the computed output channel count.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
`onnxruntime/core/providers/cuda/nn/conv_transpose.cc`	Adds runtime validation that bias is a 1-D tensor matching `num_output_channels`.
`onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp`	Initializes LHS pointer table padding entries and updates padding-buffer + cache invalidation behavior.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

onnxruntime/core/providers/cuda/nn/conv_transpose.cc

onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp

hariharans29 added 2 commits January 30, 2026 12:10

Mirror exisitng fix

d832337

Fix Conv LHS packing padding/uninitialized ptrs V2

6617a81

hariharans29 changed the title ~~Fix Conv LHS packing padding/uninitialized ptrs~~ Fix Conv LHS packing padding/uninitialized ptrs V2 Jan 30, 2026

Subtle change

fc61001

hariharans29 requested a review from Copilot January 30, 2026 21:25

Copilot started reviewing on behalf of hariharans29 January 30, 2026 21:26 View session

Copilot AI reviewed Jan 30, 2026

View reviewed changes

onnxruntime/core/providers/cuda/nn/conv_transpose.cc Outdated Show resolved Hide resolved

onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp Show resolved Hide resolved

onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp Show resolved Hide resolved

Revert accidental CUDA change

494a307

hariharans29 added the release:1.24.0 label Jan 30, 2026

hariharans29 added 2 commits January 30, 2026 13:40

Fix compilation issue

cf7d4a3

Cosmetic change

87c1104

hariharans29 requested a review from Copilot January 30, 2026 21:46

Copilot started reviewing on behalf of hariharans29 January 30, 2026 21:46 View session

Copilot AI reviewed Jan 30, 2026

View reviewed changes

onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp Show resolved Hide resolved

hariharans29 mentioned this pull request Jan 30, 2026

[cpu] Loading certain models leads to global error state on M4 Max #26669

Open

edgchen1 approved these changes Jan 30, 2026

View reviewed changes

adrianlizarraga approved these changes Jan 30, 2026

View reviewed changes

hariharans29 enabled auto-merge (squash) January 30, 2026 23:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Conv LHS packing padding/uninitialized ptrs V2 #27215

Fix Conv LHS packing padding/uninitialized ptrs V2 #27215

hariharans29 commented Jan 30, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fix Conv LHS packing padding/uninitialized ptrs V2 #27215

Are you sure you want to change the base?

Fix Conv LHS packing padding/uninitialized ptrs V2 #27215

Conversation

hariharans29 commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

hariharans29 commented Jan 30, 2026 •

edited

Loading