
[CI/Build][REDO] Add is_quant_method_supported to control quantization test configurations #5466

Merged

Conversation

@mgoin mgoin commented Jun 12, 2024

There were many separate places that checked the CUDA compute capability before running quantization tests, so it is best practice to have a single function serve as the source of truth for checking support.

Instead of each quantization test having duplicated code such as:

import torch

from vllm.model_executor.layers.quantization import QUANTIZATION_METHODS

aqlm_not_supported = True

if torch.cuda.is_available():
    # get_device_capability() returns (major, minor), e.g. (8, 0) -> 80
    capability = torch.cuda.get_device_capability()
    capability = capability[0] * 10 + capability[1]
    aqlm_not_supported = (capability <
                          QUANTIZATION_METHODS["aqlm"].get_min_capability())

It can be replaced with:

from tests.quantization.utils import is_quant_method_supported

aqlm_not_supported = not is_quant_method_supported("aqlm")
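To illustrate the logic the helper centralizes, here is a minimal, torch-free sketch of the capability comparison. The function and parameter names (`capability_to_int`, `device_capability`, `min_capabilities`) are assumptions for illustration; the real helper in tests/quantization/utils.py queries torch and the QUANTIZATION_METHODS registry directly.

```python
def capability_to_int(capability):
    """Convert a (major, minor) compute capability tuple to an int,
    e.g. (8, 0) -> 80, matching the capability[0] * 10 + capability[1]
    pattern used in the duplicated test code."""
    return capability[0] * 10 + capability[1]


def is_quant_method_supported(method, device_capability, min_capabilities):
    """Return True if the device meets the method's minimum capability.

    device_capability: (major, minor) tuple, or None if no CUDA device.
    min_capabilities: mapping of method name -> minimum capability int
                      (stands in for QUANTIZATION_METHODS[...].get_min_capability()).
    """
    if device_capability is None:
        # No CUDA device available, so no GPU quantization method is supported.
        return False
    return capability_to_int(device_capability) >= min_capabilities[method]
```

For example, AQLM requiring capability 70 would be supported on an A100 (capability (8, 0) -> 80) but not on a P100 (capability (6, 0) -> 60).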

@simon-mo simon-mo enabled auto-merge (squash) June 12, 2024 21:15
@simon-mo simon-mo merged commit 23ec72f into vllm-project:main Jun 13, 2024
117 of 119 checks passed
robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Jun 16, 2024
joerunde pushed a commit to joerunde/vllm that referenced this pull request Jun 17, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jun 27, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 8, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024