Esdebug5 #5286

zasdfgbnm · 2025-10-01T19:15:44Z

No description provided.

github-actions · 2025-10-01T19:17:03Z

Review updated until commit 918e1b5

Description

Simplify MatmulOp output handling by removing redundant striding logic
Refactor resetAllocationDomainAndContiguity to use tensor directly
Update argument preparation to support expression evaluation outputs
Introduce inferAllocationSizesAndStrides without validation for flexibility

Changes walkthrough 📝

Relevant files

Bug fix

nodes.cpp `Simplify MatmulOp output handling` csrc/ir/nodes.cpp Remove manual stride handling in MatmulOp::evaluate Directly return matmul_out without strided copy	+1/-11

Enhancement

fusion_cache_utils.cpp `Refactor allocation domain reset with tensor input` csrc/runtime/fusion_cache_utils.cpp Update resetAllocationDomainAndContiguity to take at::Tensor instead of sizes/strides Use inferAllocationSizesAndStrides to get sizes and strides Simplify contiguity update logic with direct indexing	+15/-75
fusion_kernel_runtime.cpp `Support expression evaluation in kernel preparation` csrc/runtime/fusion_kernel_runtime.cpp Add expression evaluation path for ExprEval scheduler Bind inputs and evaluate outputs using ExpressionEvaluator Pass is_expr_eval flag to updateWithSegmentOutputs	+25/-7
tensor_metadata.cpp `Split inference and validation of allocation sizes` csrc/tensor_metadata.cpp Rename inferAndValidateAllocationSizesAndStrides to inferAllocationSizesAndStrides Move validation to separate function call Add new inferAndValidate wrapper that calls infer function	+12/-1

Documentation

tensor_metadata.h `Declare allocation sizes inference functions` csrc/tensor_metadata.h Add new inferAllocationSizesAndStrides function declaration Document purpose of non-validating inference function Keep inferAndValidateAllocationSizesAndStrides for compatibility	+9/-0

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

🧪 No relevant tests

⚡ Recommended focus areas for review

Output Striding

The PR removes logic that ensures the output tensor has correct strides by constructing a strided tensor when the inferred output is not contiguous. This may lead to incorrect memory layout assumptions in downstream operations.

return {matmul_out};

Contiguity Update

The refactored resetAllocationDomainAndContiguity function no longer handles mismatched allocation domains or updates contiguity in all cases, potentially leading to incorrect contiguity metadata on TensorView.

    const at::Tensor& tensor) {
  const auto [sizes, strides] = inferAllocationSizesAndStrides(tensor, tv, ExpressionEvaluator());
  auto contiguity_without_reduction = computeContiguity(sizes, strides);
  std::vector<std::optional<bool>> contiguity;
  int64_t index = 0;
  for (auto id : tv->getMaybeAllocationDomain()) {
    if (id->isReduction()) {
      contiguity.push_back(std::nullopt);
    } else if (!id->isBroadcast() &&
        !contiguity_without_reduction[index].has_value()) {
      contiguity.push_back(true);
      index++;
    } else {
      contiguity.push_back(contiguity_without_reduction[index++]);
    }
  }
  tv->setContiguity(contiguity);
}

ExprEval Output Handling

When using ExprEval, output sizes and strides are computed via evaluation, but the updated contiguity logic may not be consistently applied due to changes in how resetAllocationDomainAndContiguity is called.

if (is_expr_eval) {
  // For expr evaluated fusion, the striding rules follow that of ATen.
  ExpressionEvaluator eval_fusion;
  for (auto [i, v] : enumerate(group_to_run->inputs())) {
    auto tensor_pv = args_manager.checkTensorMap(v);
    eval_fusion.bind(fusion_to_run->inputs()[i], tensor_pv);
  }
  for (auto v : fusion_to_run->outputs()) {
    auto result = eval_fusion.evaluate(v);
    group_runtime_outputs.push(result);
  }

zasdfgbnm · 2025-10-07T18:26:46Z

!test

zasdfgbnm added 13 commits September 25, 2025 15:06

save

6495aaf

save

24f83b8

save

c51c4c9

save

9c1aa2d

save

dde4577

save

189d5ac

save

2c69fc7

try

c243d94

save

a0464c7

save

e739ca0

save

06405d7

save

25481b4

save

2497dc2

zasdfgbnm added 16 commits October 2, 2025 11:13

save

99bfdbd

save

feea268

save

9626aef

save

c6b2607

save

8f1de8f

save

68c952f

save

ef34555

save

55f1cc9

save

88d39c2

save

cc5af01

save

55c7bb3

save

5cd5741

save

169dac2

save

5d3a8fd

save

bc2b169

save

25d6f52

zasdfgbnm added 3 commits October 2, 2025 18:43

save

caebbbd

try

12d8989

save

33d7049

zasdfgbnm mentioned this pull request Oct 3, 2025

Should DID parallelized dimensions have contiguity nullopt? #5310

Open

zasdfgbnm added 5 commits October 8, 2025 16:54

save

b5fdb13

save

a72f408

save

0586123

save

0aaf9eb

save

918e1b5

zasdfgbnm marked this pull request as ready for review October 9, 2025 00:01

zasdfgbnm merged commit 395927b into eval-segment Oct 9, 2025
7 of 9 checks passed

zasdfgbnm deleted the esdebug5 branch October 9, 2025 00:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Esdebug5 #5286

Esdebug5 #5286

Uh oh!

zasdfgbnm commented Oct 1, 2025

Uh oh!

github-actions bot commented Oct 1, 2025 •

edited

Loading

Uh oh!

zasdfgbnm commented Oct 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Esdebug5 #5286

Esdebug5 #5286

Uh oh!

Conversation

zasdfgbnm commented Oct 1, 2025

Uh oh!

github-actions bot commented Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes walkthrough 📝

PR Reviewer Guide 🔍

Uh oh!

zasdfgbnm commented Oct 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Oct 1, 2025 •

edited

Loading