
dealing with data too large for a single buffer #6138

Open · wants to merge 15 commits into trunk
Conversation


@alphastrata alphastrata commented Aug 20, 2024

Connections
Discussion thread on Matrix.

Description
The aim of this new example is to demonstrate taking a large input dataset, splitting it into chunks so it can be moved onto the GPU, and then treating those chunks as a single contiguous data structure once they are on the GPU.
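As a rough sketch of that idea (not the PR's actual code): assuming a `device: &wgpu::Device`, the `bytemuck` crate, and a hypothetical `MAX_BUFFER_SIZE` chunk size, the upload side could look something like this, with the shader then indexing across the consecutive bindings as if they were one array:

```rust
use wgpu::util::DeviceExt; // brings `create_buffer_init` into scope

// Hypothetical chunk size; the real example keeps a similar `const` in
// `src/big_compute_buffers/mod.rs`.
const MAX_BUFFER_SIZE: u64 = 1 << 27; // 134_217_728 bytes

/// Split a large `f32` dataset into several storage buffers, each no larger
/// than `MAX_BUFFER_SIZE`, so that no single allocation exceeds the limit.
fn upload_in_chunks(device: &wgpu::Device, data: &[f32]) -> Vec<wgpu::Buffer> {
    let elems_per_chunk = MAX_BUFFER_SIZE as usize / std::mem::size_of::<f32>();
    data.chunks(elems_per_chunk)
        .map(|chunk| {
            device.create_buffer_init(&wgpu::util::BufferInitDescriptor {
                label: Some("input chunk"),
                contents: bytemuck::cast_slice(chunk),
                usage: wgpu::BufferUsages::STORAGE | wgpu::BufferUsages::COPY_SRC,
            })
        })
        .collect()
}
```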

Testing
Explain how this change is tested.

Checklist

  • Run cargo fmt.
  • Run cargo clippy. If applicable, add:
    • --target wasm32-unknown-unknown
    • --target wasm32-unknown-emscripten
  • Run cargo xtask test to run tests.
  • Add change to CHANGELOG.md. See simple instructions inside file.

@alphastrata alphastrata marked this pull request as ready for review August 20, 2024 22:28
@alphastrata alphastrata requested a review from a team as a code owner August 20, 2024 22:28
@alphastrata alphastrata changed the title from "DRAFT: dealing with data too large for a single buffer" to "dealing with data too large for a single buffer" on Aug 22, 2024
@cwfitzgerald cwfitzgerald (Member) left a comment

Sorry for the long wait time for a review!

Frankly, as it exists right now, we cannot accept this example. While it physically shows one strategy for dealing with large data sets, after reading it the user doesn't get a good idea of why that strategy should be used and what problems they are avoiding, compared to the more naive strategy of using larger and larger buffers. Through inline code comments and verbiage in the readme, a reader who has no idea about any of these topics (or even the details of memory allocation) should be able to understand why this is an effective strategy to utilize.

Some things I think it should touch on:

  • Large buffers may fail to allocate due to fragmentation
  • Growing/shrinking a dataset held in a single buffer requires copying the entire buffer contents, whereas paginated data just requires rebuilding a bind group (see the sketch just below).

I'm not going to close this, as I do think this can be transformed into something that would be great to have.

Added a few incidental comments.
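
To make the second bullet concrete, here is a minimal sketch of what "just rebuild the bind group" could look like for chunked data, assuming the chunk buffers already exist and the bind group layout declares one storage-buffer binding per chunk (the function and binding scheme are illustrative, not the PR's actual code):

```rust
/// Growing the dataset with chunked ("paginated") buffers: allocate only the
/// new chunk, then rebuild the bind group over all chunks. Nothing already on
/// the GPU gets copied. With one giant buffer you would instead have to
/// allocate a bigger buffer and copy the old contents into it.
fn rebuild_bind_group(
    device: &wgpu::Device,
    layout: &wgpu::BindGroupLayout,
    chunks: &[wgpu::Buffer],
) -> wgpu::BindGroup {
    let entries: Vec<wgpu::BindGroupEntry> = chunks
        .iter()
        .enumerate()
        .map(|(i, buffer)| wgpu::BindGroupEntry {
            binding: i as u32,
            resource: buffer.as_entire_binding(),
        })
        .collect();

    device.create_bind_group(&wgpu::BindGroupDescriptor {
        label: Some("chunked data bind group"),
        layout,
        entries: &entries,
    })
}
```

Note that the layout (and the shader's binding declarations) still has to accommodate the number of chunks, for example by being built for a fixed maximum chunk count.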

@@ -0,0 +1 @@


Empty file? This example definitely needs tests

Comment on lines +14 to +18
As the maximum supported buffer size varies wildly per system, this will likely fail when you first run it; in that case, read the error and update these `const`s accordingly:
>`src/big_compute_buffers/mod.rs`
```rust
const MAX_BUFFER_SIZE: u64 = 1 << 27; // 134_217_728 bytes (~134 MB)
const MAX_DISPATCH_SIZE: u32 = (1 << 16) - 1; // 65_535
```

These defaults should work everywhere; they're the minimums required by WebGPU.
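
One way to avoid hand-editing those `const`s would be to read the limits off the device at runtime; a minimal sketch, assuming a `device: &wgpu::Device` is already in hand (the function name is illustrative):

```rust
/// Derive the chunk size and dispatch bound from the device instead of
/// hard-coded constants. `wgpu::Limits::downlevel_defaults()` gives a
/// conservative baseline if fixed values are preferred.
fn chunking_limits(device: &wgpu::Device) -> (u64, u32) {
    let limits = device.limits();
    // Largest storage-buffer binding the device accepts; the WebGPU default
    // is 134_217_728 bytes, matching the example's `1 << 27`.
    let max_chunk_bytes = limits.max_storage_buffer_binding_size as u64;
    // Upper bound on workgroups per dispatch dimension; the default is 65_535.
    let max_dispatch = limits.max_compute_workgroups_per_dimension;
    (max_chunk_bytes, max_dispatch)
}
```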

@alphastrata (Author)

Cheers, I'll keep working on it.
