
[Web] BiRefNet_T not working on webgpu #21968

Open
guschmue opened this issue Sep 3, 2024 · 12 comments
@guschmue
Contributor

guschmue commented Sep 3, 2024

Describe the issue

https://huggingface.co/onnx-community/BiRefNet_T does not work on webgpu

To reproduce

See https://huggingface.co/onnx-community/BiRefNet_T

Urgency

No response

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

1.19

Execution Provider

'webgpu' (WebGPU)

@guschmue guschmue added platform:web issues related to ONNX Runtime web; typically submitted using template ep:WebGPU ort-web webgpu provider labels Sep 3, 2024
@prathikr prathikr self-assigned this Sep 3, 2024
@guschmue
Contributor Author

guschmue commented Sep 13, 2024

I looked briefly at this: the model takes a lot of memory for activations and will not work with wasm32.
If I run it on webgpu I see it using close to 7GB of GPU memory. In theory, if your GPU has that kind of memory it should work, BUT the model uses GatherND and ScatterND, which we have not implemented for webgpu, so those nodes fall back to wasm and then run out of memory on the wasm side.
We could implement GatherND and ScatterND, which should make the model work on high-end GPUs, but most users won't have those high-end GPUs.
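
For anyone trying to confirm the fallback locally, here is a minimal sketch (the model path is illustrative) that turns on verbose logging so the node-to-EP placement, including any wasm fallback for GatherND/ScatterND, shows up in the console:

```js
// Illustrative sketch: enable verbose logging to see which nodes land on
// the WebGPU EP and which ones fall back to wasm/CPU.
import * as ort from 'onnxruntime-web/webgpu';

ort.env.debug = true;
ort.env.logLevel = 'verbose';

// Top-level await: assumes this runs in an ES module.
const session = await ort.InferenceSession.create('./birefnet_t.onnx', {
  executionProviders: ['webgpu'],
  logSeverityLevel: 0, // 0 = verbose for this session as well
});
```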

@tidus2102

Hi @guschmue, thanks for your support. It would be great if you could make it work on webgpu (at least with high-end GPUs). About the memory: do you mean we can only run this model with wasm64?

@fs-eire
Contributor

fs-eire commented Oct 8, 2024

If we can get GatherND and ScatterND working on WebGPU, then by using external data side-loading it should be possible to make it work in wasm32. And there is an ongoing effort to support a wasm64 build.
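
For reference, a rough sketch of what external data side-loading looks like with the onnxruntime-web session options (the file names are illustrative and must match the external-data location recorded in the model):

```js
import * as ort from 'onnxruntime-web/webgpu';

// Side-load the weights so they never have to fit inside the wasm32 heap;
// ORT fetches the external-data file separately.
const session = await ort.InferenceSession.create('./model.onnx', {
  executionProviders: ['webgpu'],
  externalData: [
    { path: 'model.onnx_data', data: './model.onnx_data' },
  ],
});
```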

@xenova
Contributor

xenova commented Oct 9, 2024

Possibly related: https://huggingface.co/onnx-community/DepthPro-ONNX also throws an error for me:

An uncaught WebGPU validation error was raised: The number of storage buffers (36) in the Compute stage exceeds the maximum per-stage limit (8).

  • While validating binding counts
  • While validating [BindGroupLayoutDescriptor]
  • While calling [Device].CreateComputePipeline([ComputePipelineDescriptor "Concat"]).

Another error:

Uncaught (in promise) Error: [WebGPU] Kernel "[Concat] /encoder/Concat_23" failed. Error: non concat dimensions must match


The model works correctly in Node.js (CPU)
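
For context, the per-stage limit in that first error can be inspected directly with the WebGPU API, independent of ORT (a small sketch):

```js
// The default WebGPU device limit for storage buffers per shader stage is 8;
// the adapter may support more, but a device only gets a higher limit if it
// is explicitly requested when the device is created.
const adapter = await navigator.gpu.requestAdapter();
console.log(
  'maxStorageBuffersPerShaderStage:',
  adapter.limits.maxStorageBuffersPerShaderStage,
);
```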

@xenova
Contributor

xenova commented Nov 14, 2024

https://huggingface.co/briaai/RMBG-2.0 is a new birefnet-based model for state-of-the-art background removal. Would be useful to test too!

@tidus2102

Hi, just kindly checking whether there have been any updates or progress regarding this issue. Thank you!

@fs-eire
Contributor

fs-eire commented Dec 2, 2024

The "The number of storage buffers exceeds the maximum per-stage limit" error is a known issue for Concat. This issue is being tracked.

@tidus2102

If we can get GatherND and ScatterND working on WebGPU, then by using external data side-loading it should be possible to make it work in wasm32. And there is an ongoing effort to support a wasm64 build.

Hi, any update on this?

@fs-eire
Contributor

fs-eire commented Dec 4, 2024

#22847 and #22755 introduced implementations of GatherND and ScatterND. Please allow a day or two for the pipeline to publish a nightly package.
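
Assuming the usual channel for ORT web nightlies (the `dev` dist-tag on npm), picking up the new ops would look roughly like this once the package is published:

```js
// Assumed install command for the nightly build:
//   npm install onnxruntime-web@dev
import * as ort from 'onnxruntime-web/webgpu';

// With the GatherND/ScatterND implementations in, these nodes should stay
// on the WebGPU EP instead of falling back to wasm.
const session = await ort.InferenceSession.create('./birefnet_t.onnx', {
  executionProviders: ['webgpu'],
});
```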

@tidus2102

tidus2102 commented Dec 27, 2024

Hi, I've just tested the latest ORT nightly dev build, but I still get the memory error when running inference with the BiRefNet_lite ONNX model on Chrome 131.0.6778.205 arm64 - macOS 15.2 (24C101).
Here is the sample code.

Please help to check again. Thank you!

@xenova
Contributor

xenova commented Mar 2, 2025

the model uses GatherND and ScatterND, which we have not implemented for webgpu, so those nodes fall back to wasm and then run out of memory on the wasm side.
We could implement GatherND and ScatterND, which should make the model work on high-end GPUs, but most users won't have those high-end GPUs.

Now that we have GatherND and ScatterND ops implemented, should this be working?

@xenova
Contributor

xenova commented Mar 7, 2025

I've released a few more birefnet models we can use for testing: https://huggingface.co/models?library=transformers.js&other=birefnet. Unfortunately, it's still an issue today 😅
