[opt] Enhance NPU CPU affinity resolution with NUMA fallback #865

Merged
Infinite666 merged 2 commits into ModelEngine-Group:develop from wangwenxin0312:dev_opt_npu
Mar 25, 2026
wangwenxin0312 (Contributor) commented Mar 25, 2026

Purpose

NUMA info: use the real hardware topology to map each device to its NUMA node, following the chain NPU -> PCIe -> NUMA -> cpulist.
NUMA info v2 (fallback): when topology information is unavailable, distribute devices evenly across the NUMA nodes reported by lscpu.
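
For readers unfamiliar with the lookup chain, here is a minimal sketch of the two strategies. It is not the code from this PR. It assumes a Linux host, that the NPU's PCIe address (BDF) has already been obtained by some vendor tool, and that the NUMA node count is passed in by the caller (for example, parsed from lscpu). All function names are hypothetical, and the contiguous-block split in the fallback is only one possible reading of "evenly distribute".

```python
from pathlib import Path
from typing import List, Optional, Tuple


def parse_cpulist(text: str) -> List[int]:
    """Expand a kernel cpulist string such as "0-3,8-11" into CPU ids."""
    cpus: List[int] = []
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            lo, hi = part.split("-", 1)
            cpus.extend(range(int(lo), int(hi) + 1))
        else:
            cpus.append(int(part))
    return cpus


def numa_node_from_pci(bdf: str, sysfs: Path = Path("/sys")) -> Optional[int]:
    """PCIe -> NUMA: read the node from sysfs, or return None if unknown."""
    try:
        raw = (sysfs / "bus/pci/devices" / bdf / "numa_node").read_text()
        node = int(raw.strip())
    except (OSError, ValueError):
        return None
    # The kernel reports -1 when the device has no NUMA affinity.
    return node if node >= 0 else None


def fallback_numa_node(device_id: int, num_devices: int, num_nodes: int) -> int:
    """Fallback: split devices over nodes in contiguous, near-equal blocks."""
    per_node = max(1, -(-num_devices // num_nodes))  # ceiling division
    return min(device_id // per_node, num_nodes - 1)


def cpus_for_device(
    device_id: int,
    bdf: Optional[str],
    num_devices: int,
    num_nodes: int,
    sysfs: Path = Path("/sys"),
) -> Tuple[int, List[int]]:
    """Resolve (numa_node, cpu_ids) for a device, using the fallback if needed."""
    node = numa_node_from_pci(bdf, sysfs) if bdf else None
    if node is None:
        node = fallback_numa_node(device_id, num_devices, num_nodes)
    try:
        # NUMA -> cpulist
        text = (sysfs / "devices/system/node" / f"node{node}" / "cpulist").read_text()
    except OSError:
        return node, []
    return node, parse_cpulist(text)
```

The returned CPU ids could then be handed to `os.sched_setaffinity` to pin the worker process. Taking `sysfs` as a parameter is a choice made here only so the lookup can be exercised against a fake directory tree.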

Modifications

ucm/integration/vllm/device.py

Infinite666 merged commit 3062203 into ModelEngine-Group:develop on Mar 25, 2026
9 checks passed