
[Perf] Pipeline-friendly shard task submission in CacheStore #888

Merged
mag1c-h merged 3 commits into ModelEngine-Group:develop from mag1c-h:dev-cache-load-pipeline
Apr 2, 2026

[Perf] Pipeline-friendly shard task submission in CacheStore#888
mag1c-h merged 3 commits intoModelEngine-Group:developfrom
mag1c-h:dev-cache-load-pipeline

Conversation

@mag1c-h (Collaborator) commented Apr 2, 2026

Purpose

Decouple shard-level backend task submission to enable pipelining between the dispatch and transfer stages.

  • The backend's Load() is asynchronous, so multiple shards can be in flight concurrently
  • The transfer stage can start H2D copies on completed shards while waiting for slower ones
  • This reduces latency when shard I/O times are imbalanced

Modifications

  • Submit each shard to the backend independently (not batched)
  • Push ShardTasks to the running queue immediately
  • Give each ShardTask its own backend task handle so it can be waited on independently via Wait()
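The submission pattern described above can be sketched as follows. This is a minimal illustration, not the actual CacheStore code: the names `ShardTask`, `backend_load`, and `submit_shards` are hypothetical, and `concurrent.futures` stands in for the backend's async task handles.

```python
# Hypothetical sketch of per-shard pipelined submission: each shard is
# submitted independently and its task is queued immediately, so the
# consumer (transfer stage) can wait on each shard on its own.
from concurrent.futures import ThreadPoolExecutor, Future
from queue import Queue
import time


class ShardTask:
    """Pairs a shard id with its own backend task handle so each
    shard can be waited on independently."""

    def __init__(self, shard_id: int, handle: Future):
        self.shard_id = shard_id
        self.handle = handle

    def wait(self) -> bytes:
        return self.handle.result()


def backend_load(shard_id: int) -> bytes:
    # Stand-in for the backend's asynchronous shard I/O.
    time.sleep(0.005 * (shard_id % 3))  # simulate imbalanced shard I/O times
    return bytes([shard_id])


def submit_shards(shard_ids, backend_pool: ThreadPoolExecutor, running_queue: Queue):
    # Submit each shard independently (not batched) and push the
    # ShardTask to the running queue right away, so downstream work
    # can start on fast shards while slow ones are still loading.
    for sid in shard_ids:
        handle = backend_pool.submit(backend_load, sid)
        running_queue.put(ShardTask(sid, handle))


# Transfer stage: drain the queue, waiting per shard.
pool = ThreadPoolExecutor(max_workers=4)
running: Queue = Queue()
submit_shards(range(6), pool, running)
results = [running.get().wait() for _ in range(6)]
pool.shutdown()
```

Because each ShardTask carries its own handle, the consumer blocks only on the shard at the head of the queue rather than on the whole batch, which is what allows the H2D transfer of completed shards to overlap with the remaining backend I/O.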

Test

|        | TensorSize | ShardNumber | BlockNumber | Load 100% from backend | Load 100% from Cache |
|--------|------------|-------------|-------------|------------------------|----------------------|
| Before | 64KB       | 64          | 1024        | 314ms                  | 264ms                |
| After  | 64KB       | 64          | 1024        | 274ms                  | 262ms                |

@mag1c-h mag1c-h force-pushed the dev-cache-load-pipeline branch from 5ad2ea6 to 74256e2 on April 2, 2026 03:10
@mag1c-h mag1c-h marked this pull request as ready for review April 2, 2026 03:33
@mag1c-h mag1c-h requested a review from ygwpz as a code owner April 2, 2026 03:33
@mag1c-h mag1c-h merged commit d5520f6 into ModelEngine-Group:develop Apr 2, 2026
17 of 18 checks passed
@mag1c-h mag1c-h deleted the dev-cache-load-pipeline branch April 2, 2026 06:44