Skip to content

Enable FP8/MXFP8 Ops with requests and CUDA alignment #2207

@CuiYifeng

Description

@CuiYifeng

🚀 The feature, motivation and pitch

Plan to enable the following ops for FP8/MXFP8:

Alternatives

No response

Additional context

No response

Metadata

Metadata

Labels

No labels
No labels

Type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions