This crate implements a half-precision floating point f16 type for Rust implementing the IEEE
754-2008 standard binary16
a.k.a "half" format, as well as a bf16 type implementing the
bfloat16 format.
The f16 and bf16 types attempt to match existing Rust floating point type functionality where possible, and provides both conversion operations (such as to/from f32 and f64) and basic
arithmetic operations. Hardware support for these operations will be used whenever hardware support
is available—either through instrinsics or targeted assembly—although a nightly Rust toolchain may
be required for some hardware.
This crate provides no_std support so can easily be used in embedded code where a smaller float format is most useful.
Requires Rust 1.81 or greater. If you need support for older versions of Rust, use previous versions of this crate.
See the crate documentation for more details.
-
alloc— Enable use of thealloccrate when not using thestdlibrary.This enables the
vecmodule, which contains zero-copy conversions for theVectype. This allows fast conversion between rawVec<u16>bits andVec<f16>orVec<bf16>arrays, and vice versa. -
std— Enable features that depend on the Ruststdlibrary, including everything in theallocfeature.Enabling the
stdfeature enables runtime CPU feature detection of hardware support. Without this feature detection, harware is only used when compiler target supports them. -
serde- ImplementSerializeandDeserializetraits forf16andbf16. This adds a dependency on theserdecrate. -
num-traits— EnableToPrimitive,FromPrimitive,ToBytes,FromBytes,Num,Float,FloatCore,Signed, andBoundedtrait implementations from thenum-traitscrate. -
bytemuck— EnableZeroableandPodtrait implementations from thebytemuckcrate. -
rand_distr— Enable sampling from distributions likeStandardUniformandStandardNormalfrom therand_distrcrate. -
rkyv-- Enable zero-copy deserializtion withrkyvcrate. -
aribtrary-- Enable fuzzing support witharbitrarycrate by implementingArbitrarytrait. -
nightly-- Enable nightly-only features (currentlyloongarch64intrinsics).
The following list details hardware support for floating point types in this crate. When using std
library, runtime CPU target detection will be used. To get the most performance benefits, compile
for specific CPU features which avoids the runtime overhead and works in a no_std environment.
| Architecture | CPU Target Feature | Notes |
|---|---|---|
x86/x86_64 |
f16c |
This supports conversion to/from f16 only (including vector SIMD) and does not support any bf16 or arithmetic operations. |
aarch64 |
fp16 |
This supports all operations on f16 only. |
loongarch64 |
lsx |
(nightly feature only) This supports conversion to/from f16 only (including vector SIMD) and does not support any bf16 or arithmetic operations. |
All files in this library are dual-licensed and distributed under the terms of either of:
- MIT License (http://opensource.org/licenses/MIT)
- Apache License, Version 2.0 (http://www.apache.org/licenses/LICENSE-2.0)
at your option.
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.