
Conversation

Contributor

catch-twenty-two commented Jul 24, 2025

This adds functionality to broadcast two tensors with potentially different static ranks to a common rank and then perform an operation on them. Before adding more operations via macros, I wanted to make sure this would be helpful to the community; as of now I use it in the following way:

I have not fully incorporated this into the Burn project yet, since I first wanted feedback on whether or not it would be useful.

    // Needs roughly these imports where the test lives (exact paths may vary):
    // use burn_ndarray::{NdArray, NdArrayDevice};
    // use burn_tensor::Tensor;
    #[test]
    fn test_broadcast_multi_dims_values() {
        let device = &NdArrayDevice::default();
        type B = NdArray<f32>;

        let a = Tensor::<B, 3>::from_data(
            [
                [[2, 8, 7, 2], [9, 14, 13, 12], [9, 14, 13, 12]],
                [[2, 8, 7, 2], [9, 14, 13, 12], [9, 14, 13, 12]],
            ],
            device,
        );

        let b = Tensor::<B, 2>::from_data([[4, 11, 10, 5]], device);

        // Broadcast both operands to the common rank (3) before the element-wise op.
        let (a, b) = broadcast!(a:Tensor<B, 3>, b:Tensor<B, 2>);

        let a_add_b = a.add(b);

        Tensor::<B, 3>::from_data(
            [
                [[6, 19, 17, 7], [13, 25, 23, 17], [13, 25, 23, 17]],
                [[6, 19, 17, 7], [13, 25, 23, 17], [13, 25, 23, 17]],
            ],
            device,
        )
        .into_data()
        .assert_eq(&a_add_b.to_data(), true);
    }

Functionality can easily be added to incorporate operators, which would then be used as in the following example:

let a = Tensor::<B, 6>::empty([7, 6, 2, 3, 1, 9], device);
let b = Tensor::<B, 4>::empty([2, 1, 7, 1], device);

let a = add_broadcast!(a: Tensor<B, 6>, b: Tensor<B, 4>);

Note: Macros were used because the broadcast operation needs to know the rank of the resulting tensors (a const generic) at compile time.
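
For reference, the expansion for the test above could look roughly like the following sketch (the code this PR actually generates may differ in details). It relies on Burn's Tensor::unsqueeze::<D>(), which left-pads the shape with size-1 dimensions so that the backend's usual element-wise broadcasting on size-1 dimensions then applies:

    // Hypothetical expansion of `broadcast!(a: Tensor<B, 3>, b: Tensor<B, 2>)`.
    // `b` has the lower rank, so it is unsqueezed from rank 2 to rank 3: its
    // shape goes from [1, 4] to [1, 1, 4], which then broadcasts against
    // `a`'s [2, 3, 4] in the subsequent element-wise `add`.
    let (a, b): (Tensor<B, 3>, Tensor<B, 3>) = (a, b.unsqueeze::<3>());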

Checklist

  • Confirmed that cargo run-checks command has been executed.
  • Made sure the book is up to date with changes in this PR.

Related Issues/PRs

#3344
#1499

Changes

I've added the logic to correctly reshape the tensors via a broadcast function so that operations can then be performed on them.

Testing

Multiple unit tests; the change has also been integrated into my own code base.


codecov bot commented Jul 24, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 63.43%. Comparing base (38874eb) to head (6df1934).

❌ Your project check has failed because the head coverage (63.43%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #3417   +/-   ##
=======================================
  Coverage   63.43%   63.43%           
=======================================
  Files         981      981           
  Lines      109705   109705           
=======================================
+ Hits        69589    69592    +3     
+ Misses      40116    40113    -3     


Member

laggui left a comment


I think having broadcasting for tensors of different rank would address a real gap in usability 🙂

But we should be thoughtful about the right approach (and I haven't really thought about it yet).

Some general comments on the current macro:

  • it only applies left padding to both operands, but broadcasting can be a bit more complex
  • the syntax is a little verbose with the type annotations vs type inference
  • having a per-op macro like add_broadcast! is helpful to apply different broadcasting rules, but can lead to macro explosion; we can probably have an element-wise broadcast and matmul broadcast to cover almost all cases

Contributor Author

catch-twenty-two commented Jul 30, 2025

Thanks for the review!

I think having broadcasting for tensors of different rank would address a real gap in usability 🙂

So, that kinda sounds like a yay, not a nay on usefulness? 😁

But we should be thoughtful about the right approach (and I haven't really thought about it yet).

Some general comments on the current macro:

  • it only applies left padding to both operands, but broadcasting can be a bit more complex

Could you give me a quick example? I thought I had all the bases covered here as far as the general way PyTorch handles broadcasting. If there are some corner cases I'd love to account for them.

https://docs.pytorch.org/docs/stable/notes/broadcasting.html

  • the syntax is a little verbose with the type annotations vs type inference

Agreed! I couldn't think of a better way to pass the generic constants to the actual function (as far as I know, this is the only way to do it?) in order to compare which one is larger (which has to be done at compile time) while still having it look intuitively like the rest of the Burn/Rust tensor type syntax:

let a: Tensor<B, D, K> = etc.

Any ideas would be greatly appreciated, as I know that once this syntax is incorporated into the API, you are pretty much stuck with it.

I've since updated it a bit, but it's still verbose:

broadcast!(
    a: Tensor<Backend, RANK_A>,
    b: Tensor<RANK_B>
)

  • having a per-op macro like add_broadcast! is helpful to apply different broadcasting rules, but can lead to macro explosion; we can probably have an element-wise broadcast and matmul broadcast to cover almost all cases

Agreed, this is why I only did it for one; I just wanted to show an example of what could be done. Personally I really dislike macros: I feel that they obfuscate the code and make debugging difficult, so the fewer the better! I'll look into incorporating matmul.

Member

laggui commented Aug 5, 2025

Sorry for the late response, the notification got lost in the recent flood 😅

So, that kinda sounds like a yay, not a nay on usefulness? 😁

Yeah I definitely see the usefulness! Just gotta nail the usage/implementation 🙂

Could you give me a quick example? I thought I had all the bases covered here as far as the general way PyTorch handles broadcasting. If there are some corner cases I'd love to account for them.

Actually this kinda ties into the third point regarding elemwise and matmul broadcast rules. For elemwise I believe it's simply left padding; it's matmul that has different rules.
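
To make the distinction concrete, here's a rough sketch of the two rule sets (PyTorch-style semantics, shape logic only, not Burn API; the helper name below is just for illustration):

    // A sketch of the shape rules under discussion (PyTorch-style semantics).
    // `elemwise_broadcast_shape` is a hypothetical helper, not part of Burn.

    /// Element-wise rule: left-pad the shorter shape with 1s, then every pair
    /// of dimensions must be equal or one of them must be 1.
    fn elemwise_broadcast_shape(a: &[usize], b: &[usize]) -> Option<Vec<usize>> {
        let rank = a.len().max(b.len());
        // Dimension `i` of a shape once it has been left-padded to `rank`.
        let dim = |s: &[usize], i: usize| {
            if i + s.len() >= rank { s[i + s.len() - rank] } else { 1 }
        };
        (0..rank)
            .map(|i| match (dim(a, i), dim(b, i)) {
                (x, y) if x == y => Some(x),
                (1, y) => Some(y),
                (x, 1) => Some(x),
                _ => None,
            })
            .collect()
    }

    fn main() {
        // Element-wise: [2, 3, 4] (+) [1, 4] -> [2, 3, 4]
        assert_eq!(elemwise_broadcast_shape(&[2, 3, 4], &[1, 4]), Some(vec![2, 3, 4]));
        // Matmul instead keeps the last two dims as matrix dims and applies the
        // element-wise rule only to the leading batch dims:
        // [7, 1, 3, 4] @ [2, 4, 5] -> batch [7, 1] x [2] -> [7, 2], matrix 3x5,
        // giving [7, 2, 3, 5].
    }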

Not entirely sure yet about the best approach, I'd have to think about the best way to tackle the syntax and rules 🤔
