Remove quantization_config in config.json from original deepseek models #753

Edwardf0t1 · 2026-01-09T01:44:30Z

What does this PR do?

Type of change: Bug fix

Overview: Original DeepSeek checkpoints may include a quantization_config field in config.json
(describing the source checkpoint's quantization). When we export ModelOpt quantization
configs to hf_quant_config.json, leaving the original quantization_config in place can
be confusing, since the exported checkpoint then carries two quantization descriptions.
This PR adds a function that removes the field from config.json.

Usage

# Add a code snippet demonstrating how to use this
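
The PR description does not show the function it adds, so the following is only a minimal sketch of the removal step described in the overview, assuming the exported checkpoint directory contains a plain JSON config.json. The name `remove_quantization_config` and its return value are hypothetical and are not taken from the ModelOpt code base.

```python
import json
from pathlib import Path


def remove_quantization_config(export_dir: str) -> bool:
    """Drop `quantization_config` from <export_dir>/config.json, if present.

    Hypothetical helper for illustration only; not the function added by this PR.
    Returns True if the field was removed, False otherwise.
    """
    config_path = Path(export_dir) / "config.json"
    if not config_path.is_file():
        return False

    with config_path.open(encoding="utf-8") as f:
        config = json.load(f)

    # Nothing to do when the source checkpoint carried no quantization info.
    if "quantization_config" not in config:
        return False

    del config["quantization_config"]

    # Rewrite config.json with all other fields left unchanged.
    with config_path.open("w", encoding="utf-8") as f:
        json.dump(config, f, indent=2)
    return True
```

Under these assumptions, the helper would be called once on the export directory after hf_quant_config.json has been written, so that hf_quant_config.json is the only quantization description left in the checkpoint.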

Testing

Before your PR is "Ready for review"

Make sure you read and follow Contributor guidelines and your commits are signed.
Is this change backward compatible?: Yes
Did you write any new necessary tests?: Yes/No
Did you add or update any necessary documentation?: Yes/No
Did you update Changelog?: Yes/No

Additional Information

Resolves nvbug https://nvbugspro.nvidia.com/bug/5736665

Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>

codecov · 2026-01-09T01:55:26Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 74.66%. Comparing base (307fe71) to head (52a2230).
⚠️ Report is 19 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #753   +/-   ##
=======================================
  Coverage   74.66%   74.66%           
=======================================
  Files         192      192           
  Lines       18975    18975           
=======================================
  Hits        14167    14167           
  Misses       4808     4808

remove quantization_config in config.json from original deepseek models

30c6110

Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>

Edwardf0t1 requested a review from a team as a code owner January 9, 2026 01:44

Edwardf0t1 requested review from cjluo-nv, meenchen and sugunav14 January 9, 2026 01:44

simplify

52a2230

Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>

cjluo-nv approved these changes Jan 13, 2026

Edwardf0t1 merged commit 0f05d67 into main Jan 15, 2026
35 checks passed

Edwardf0t1 deleted the zhiyu/fix-deepseek-config branch January 15, 2026 02:14
