add FP8 sweep and step_size flag #758

Fridah-nv · 2026-01-09T22:45:38Z

What does this PR do?

Type of change: ?

Overview: ?

Usage

# Add a code snippet demonstrating how to use this

Testing

Before your PR is "Ready for review"

Make sure you read and follow Contributor guidelines and your commits are signed.
Is this change backward compatible?: Yes/No
Did you write any new necessary tests?: Yes/No
Did you add or update any necessary documentation?: Yes/No
Did you update Changelog?: Yes/No

Additional Information

Signed-off-by: Fridah-nv <[email protected]>

copy-pr-bot · 2026-01-09T22:45:52Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

realAsma · 2026-01-09T23:07:12Z

modelopt/torch/quantization/calib/mse.py

-        for step, multiplier in enumerate(multipliers):
-            candidate_amax = self._initial_amax * multiplier
+        for step, candidate in enumerate(candidates):
+            if self._fp8_scale_sweep:


realAsma · 2026-01-09T23:45:09Z

modelopt/torch/quantization/calib/mse.py

+                # For FP8 scale sweep, use FP8 values as multipliers of initial_amax
+                # This ensures we search in a reasonable range relative to max calibration
+                multiplier = candidate
+                candidate_amax = self._initial_amax * multiplier


In this case, isn't candidate_amax = (fp8_by_448 * 6.0)?

Suggested change

-                # For FP8 scale sweep, use FP8 values as multipliers of initial_amax
-                # This ensures we search in a reasonable range relative to max calibration
-                multiplier = candidate
-                candidate_amax = self._initial_amax * multiplier
+                candidate_amax = (candidate * global_amax).view_as(self._initial_amax)

Why?

fp8_scale = FP8((block_eqv_amax / 6.0) * (448 / (global_amax / 6.0)))

So if we reverse-calculate block_eqv_amax from fp8_scale, we get:

block_eqv_amax = (fp8_scale / 448.0) * global_amax

With candidate = fp8_scale / 448.0, this is:

candidate_amax = candidate * global_amax
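The round trip in the derivation above can be checked numerically. The following is a minimal sketch, not the code in this PR: it assumes the 6.0 is the FP4 (E2M1) maximum, assumes the sweep covers every positive finite FP8 E4M3 value divided by 448, and uses plain Python floats instead of tensors. The helper names `decode_e4m3`, `fp8_scale_from_amax`, and `amax_from_fp8_scale` are hypothetical, and the `.view_as(self._initial_amax)` reshape from the suggestion is not modeled.

```python
import math

FP8_E4M3_MAX = 448.0  # largest finite FP8 E4M3 value
FP4_MAX = 6.0         # assumed meaning of the 6.0 in the comment (FP4 E2M1 max)


def decode_e4m3(code: int) -> float:
    """Decode a non-negative FP8 E4M3 bit pattern (0..126) to a float."""
    exponent, mantissa = code >> 3, code & 0b111
    if exponent == 0:  # subnormal: no implicit leading 1
        return (mantissa / 8.0) * 2.0 ** -6
    return (1.0 + mantissa / 8.0) * 2.0 ** (exponent - 7)


def fp8_scale_from_amax(block_amax: float, global_amax: float) -> float:
    """Forward formula from the comment, without the final FP8 rounding."""
    return (block_amax / FP4_MAX) * (FP8_E4M3_MAX / (global_amax / FP4_MAX))


def amax_from_fp8_scale(fp8_scale: float, global_amax: float) -> float:
    """Inverse formula: the block amax that a given FP8 scale encodes."""
    return (fp8_scale / FP8_E4M3_MAX) * global_amax


# Hypothetical sweep: every positive finite E4M3 value, normalized by 448.
candidates = [decode_e4m3(code) / FP8_E4M3_MAX for code in range(1, 127)]

global_amax = 3.5  # arbitrary illustrative value
candidate_amaxes = [c * global_amax for c in candidates]

# Each candidate amax maps back onto the FP8 scale it was derived from.
for c, amax in zip(candidates, candidate_amaxes):
    assert math.isclose(fp8_scale_from_amax(amax, global_amax), c * FP8_E4M3_MAX)
    assert math.isclose(amax_from_fp8_scale(c * FP8_E4M3_MAX, global_amax), amax)
```

Under these assumptions the largest candidate is exactly 1.0, so the sweep tops out at global_amax rather than at a multiple of the per-block initial amax, which is the difference the suggested change is pointing at.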


add FP8 sweep and step_size flag

61004e3

Signed-off-by: Fridah-nv <[email protected]>

Fridah-nv requested a review from a team as a code owner January 9, 2026 22:45

Fridah-nv requested review from jingyu-ml and removed request for a team January 9, 2026 22:45

Fridah-nv marked this pull request as draft January 9, 2026 22:45

realAsma reviewed Jan 9, 2026


fix FP8 amax calculation

4017e8d

Signed-off-by: Fridah-nv <[email protected]>
