[TASK] Add A New Quant Ball for FP32-MXFP Conversion

## Deliverables
- Add an MXFP ball RTL implementation in the prototype lib (under the arch path).
- A Pull Request (PR) containing a test written in C for this operation and a README to introduce your design.
- Report the performance results in this issue.

## Task Description 
- MXFP is a lower-precision floating-point representation designed to reduce data size and simplify computations in the following process. Using MXFP can improve throughput and hardware efficiency in bandwidth-sensitive workloads, while still maintaining acceptable numerical quality for many ML scenarios.
- You can learn this format and its variants, starting from this paper, "With Shared Microexponents, A Little Shifting Goes a Long Way". 
- As we envisage, an FP32 matrix will be loaded into the banks, and then a your customised MXFP instruction will read the data from one bank into the ball you are to implement, before outputting it to another bank.
- You can refer to the previous Pull Request (#6) for the detailed implementation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TASK] Add A New Quant Ball for FP32-MXFP Conversion #26

Deliverables

Task Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[TASK] Add A New Quant Ball for FP32-MXFP Conversion #26

Description

Deliverables

Task Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions