
Conversation

@MagellaX MagellaX commented Oct 11, 2025

Summary by cubic

Improved masked-row handling in Triton fused online attention so masks and softmax match PyTorch SDPA and avoid NaNs on fully masked rows. Backward now receives the correct tile sizes; tests use a portable SDPA math context for parity.

  • Bug Fixes
    • Added explicit validity tracking in online softmax; cast q/k/v and mask to float32; apply corrections and exp only to valid rows (see the sketch after this list).
    • Convert boolean attention_mask to -inf bias and load mask as qk dtype; keep fully masked rows at zero output with safe denominators.
    • Replaced tl.isfinite with lse > -inf in backward and gated grad_output by row validity.
    • Supplied TILE_M/TILE_N to the backward kernel and removed TILE_K from signatures and call sites.
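
For intuition, here is a minimal plain-PyTorch sketch of the masked-row semantics described above (illustrative only, not the Triton kernel; the function name and variables are hypothetical):

    import torch

    def masked_softmax_attention_reference(q, k, v, attention_mask=None):
        """SDPA-like masked attention where fully masked rows give zero output, not NaN."""
        q, k, v = q.float(), k.float(), v.float()                 # compute in float32
        scale = q.shape[-1] ** -0.5
        scores = torch.matmul(q, k.transpose(-2, -1)) * scale

        if attention_mask is not None:
            if attention_mask.dtype == torch.bool:
                # Boolean mask (True = attend) becomes an additive -inf bias, matching SDPA.
                bias = torch.zeros_like(scores)
                bias.masked_fill_(~attention_mask, float("-inf"))
            else:
                bias = attention_mask.to(scores.dtype)
            scores = scores + bias

        # A row is valid if at least one key position remains attendable.
        valid = torch.isfinite(scores).any(dim=-1, keepdim=True)

        row_max = scores.max(dim=-1, keepdim=True).values
        row_max = torch.where(valid, row_max, torch.zeros_like(row_max))   # avoid -inf - -inf = NaN
        probs = torch.exp(scores - row_max)
        probs = torch.where(valid, probs, torch.zeros_like(probs))
        denom = probs.sum(dim=-1, keepdim=True).clamp_min(1e-20)           # safe denominator
        out = torch.matmul(probs / denom, v)
        return torch.where(valid, out, torch.zeros_like(out))              # fully masked rows -> zeros

The online-softmax kernel applies the same corrections tile by tile; the sketch only shows the end-to-end numerics.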

@MagellaX MagellaX merged commit 03e006a into main Oct 11, 2025
3 checks passed

@cubic-dev-ai cubic-dev-ai bot left a comment


1 issue found across 2 files

Prompt for AI agents (1 issue)

Understand the root cause of the following issue and fix it.


<file name="tests/test_attention.py">

<violation number="1" location="tests/test_attention.py:32">
The test suite in `tests/test_attention.py` is missing a test case for the core bug being fixed: handling fully masked rows. The new logic in `fused_online_attention.py` is designed to prevent NaNs in this scenario, but without a dedicated test, the fix is not verified and could regress.</violation>
</file>


from stream_attention.core.star_attention import StarAttention


def _math_sdpa_ctx():

@cubic-dev-ai cubic-dev-ai bot Oct 11, 2025


The test suite in tests/test_attention.py is missing a test case for the core bug being fixed: handling fully masked rows. The new logic in fused_online_attention.py is designed to prevent NaNs in this scenario, but without a dedicated test, the fix is not verified and could regress.

Prompt for AI agents
Address the following comment on tests/test_attention.py at line 32:

<comment>The test suite in `tests/test_attention.py` is missing a test case for the core bug being fixed: handling fully masked rows. The new logic in `fused_online_attention.py` is designed to prevent NaNs in this scenario, but without a dedicated test, the fix is not verified and could regress.</comment>

<file context>
@@ -20,6 +27,16 @@
 from stream_attention.core.star_attention import StarAttention
+
+
+def _math_sdpa_ctx():
+    if sdpa_kernel_ctx is not None and SDPBackend is not None:
+        return sdpa_kernel_ctx(SDPBackend.MATH)
</file context>
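
A regression test along the lines the review asks for might look like the sketch below (hypothetical: the fused-attention import path, constructor, call signature, and tolerances are assumptions, not taken from the repo):

    import contextlib

    import pytest
    import torch
    import torch.nn.functional as F

    try:  # newer PyTorch exposes sdpa_kernel; fall back gracefully on older releases
        from torch.nn.attention import sdpa_kernel, SDPBackend
    except ImportError:
        sdpa_kernel, SDPBackend = None, None


    def _math_sdpa_ctx():
        """Portable context forcing the math SDPA backend when available."""
        if sdpa_kernel is not None and SDPBackend is not None:
            return sdpa_kernel(SDPBackend.MATH)
        return contextlib.nullcontext()


    @pytest.mark.skipif(not torch.cuda.is_available(), reason="Triton kernel requires CUDA")
    def test_fully_masked_rows_are_zero_and_nan_free():
        from stream_attention.core.fused_online_attention import FusedOnlineAttention  # assumed path

        torch.manual_seed(0)
        batch, heads, seq, dim = 2, 4, 64, 32
        q = torch.randn(batch, heads, seq, dim, device="cuda", dtype=torch.float16)
        k, v = torch.randn_like(q), torch.randn_like(q)

        # Boolean mask whose last 16 query rows cannot attend to any key position.
        mask = torch.ones(batch, heads, seq, seq, device="cuda", dtype=torch.bool)
        mask[..., -16:, :] = False

        attn = FusedOnlineAttention()                      # assumed constructor
        out = attn(q, k, v, attention_mask=mask)           # assumed signature

        assert not torch.isnan(out).any(), "fully masked rows must not produce NaNs"
        assert torch.all(out[..., -16:, :] == 0)           # behavior described in this PR

        # Parity on the remaining rows against math-backend SDPA.
        with _math_sdpa_ctx():
            ref = F.scaled_dot_product_attention(q.float(), k.float(), v.float(), attn_mask=mask)
        torch.testing.assert_close(out[..., :-16, :].float(), ref[..., :-16, :], rtol=2e-2, atol=2e-2)

Only the rows that still attend to something are compared against SDPA, because the math backend itself yields NaN for fully masked rows.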