[ascend] refactor code #4176

yao-fengchen · 2025-12-02T08:06:03Z

No description provided.

lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py

jinminxi104 · 2025-12-04T06:35:31Z

lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py

-            AscendOpsBackend.enable_graph = ascend_graph_runner.enable_graph
-            return ascend_graph_runner
+        from lmdeploy.pytorch.backends.cuda.graph_runner import CUDAGraphRunner
+        return CUDAGraphRunner(model, model_config, cache_config, backend_config, device)


let's make a new aclgraphrunner instead

Copilot

Pull request overview

This PR refactors the Ascend backend code by removing Ascend310P-specific support and simplifying the codebase to focus solely on Ascend910 devices. The refactoring also restructures the update_step_context method for better code organization.

Key changes:

Removed all Ascend310P-specific logic including NZ format weight transformations, block shape calculations, and attention mask handling
Simplified SocVersion class by replacing the cached device_name() method with a class attribute
Refactored update_step_context method to use helper functions for better code organization and readability

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File	Description
lmdeploy/pytorch/backends/dlinfer/linear.py	Removed Ascend310P-specific weight transformation logic for NZ format
lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py	Removed Ascend310P support from block shape methods, simplified SocVersion class, refactored update_step_context with helper functions, removed enable_aclgraph method, updated build_graph_runner to use only CUDAGraphRunner, removed Ascend310P compile mode settings
lmdeploy/pytorch/backends/dlinfer/ascend/graph_runner.py	Removed Ascend310P-specific get_logits compilation logic

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py

yao-fengchen added 4 commits December 1, 2025 07:32

refactor ascend op_backend

e388a94

refactor mask

080dc12

format code

0840a82

remove 310P judge

24cb832

jinminxi104 reviewed Dec 4, 2025

View reviewed changes

lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py Outdated Show resolved Hide resolved

remove unused code

9a9604a

jinminxi104 reviewed Dec 4, 2025

View reviewed changes

jinminxi104 requested a review from Copilot December 4, 2025 08:03

Copilot started reviewing on behalf of jinminxi104 December 4, 2025 08:04 View session

Copilot finished reviewing on behalf of jinminxi104 December 4, 2025 08:06

Copilot AI reviewed Dec 4, 2025

View reviewed changes

update code

bd877fe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ascend] refactor code #4176

[ascend] refactor code #4176

yao-fengchen commented Dec 2, 2025

Uh oh!

Uh oh!

jinminxi104 Dec 4, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[ascend] refactor code #4176

Are you sure you want to change the base?

[ascend] refactor code #4176

Conversation

yao-fengchen commented Dec 2, 2025

Uh oh!

Uh oh!

jinminxi104 Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants