Skip to content

Conversation

@wenhuach21
Copy link
Contributor

No description provided.

@wenhuach21 wenhuach21 requested review from Copilot, xin3he and yiliu30 and removed request for Copilot and xin3he January 4, 2026 05:50
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR fixes device placement issues in the Qwen3-VL model implementation and improves memory management during calibration. The changes address device mismatches that could cause runtime errors when models are distributed across multiple devices.

Key changes:

  • Updates Transformers version check for Qwen3-VL MoE compatibility
  • Corrects device placement to use self.model.device instead of self.device
  • Implements balanced memory allocation for multi-device model deployment

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File Description
auto_round/modelling/qwen3_vl_moe.py Updates version comparison logic for Transformers 5.0 compatibility
auto_round/compressors/mllm/compressor.py Fixes device placement for tensor operations during calibration
auto_round/compressors/base.py Adds balanced memory allocation and improves device placement consistency

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@wenhuach21 wenhuach21 changed the title fix qen3-vl device issue fix calibration device issue in some scenarios Jan 4, 2026
@wenhuach21 wenhuach21 merged commit 8c0f01e into main Jan 4, 2026
29 checks passed
@wenhuach21 wenhuach21 deleted the fix_qwen3_vl branch January 4, 2026 07:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants