Parametric floating-point unit with support for standard RISC-V formats and operations as well as transprecision formats, written in SystemVerilog.
Maintainer: Luca Bertaccini [email protected]
Principal Author: Stefan Mach [email protected]
The FPU is a parametric design that allows generating FP hardware units for various use cases. Even though mainly designed for use in RISC-V processors, the FPU or its sub-blocks can easily be utilized in other environments. Our design aims to be compliant with IEEE 754-2008 and provides the following features:
Any IEEE 754-2008 style binary floating-point format can be supported, including single-, double-, quad- and half-precision (binary32
, binary64
, binary128
, binary16
).
Formats can be defined with arbitrary number of exponent and mantissa bits through parameters and are always symmetrically biased.
Multiple FP formats can be supported concurrently, and the number of formats supported is not limited.
Multiple integer formats with arbitrary number of bits (as source or destionation of conversions) can also be defined.
- Addition/Subtraction
- Multiplication
- Fused multiply-add in four flavours (
fmadd
,fmsub
,fnmadd
,fnmsub
) - Division1,2
- Square root1,2
- Minimum/Maximum3
- Comparisons
- Sign-Injections (
copy
,abs
,negate
,copySign
etc.) - Conversions among all supported FP formats
- Conversions between FP formats and integers (signed & unsigned) and vice versa
- Classification
Multi-format FMA operations (i.e. multiplication in one format, accumulation in another) are optionally supported.
Optionally, packed-SIMD versions of all the above operations can be generated for formats narrower than the FPU datapath width. E.g.: Support for double-precision (64bit) operations and two simultaneous single-precision (32bit) operations.
It is also possible to generate only a subset of operations if e.g. divisions are not needed.
1Some compliance issues with IEEE 754-2008 are currently known to exist for the PULP DivSqrt unit (Rounding mismatches have been reported in GitHub issues. This can lead to results being off by 1ulp, and the inexact flag not being properly raised in these cases as well)
2Two DivSqrt units are supported: the multi-format PULP DivSqrt unit and a 32-bit unit integrated from the T-Head OpenE906. The PulpDivsqrt
parameter can be set to 1 or 0 to select the former or the latter unit, respectively.
3Implementing IEEE 754-201x minimumNumber
and maximumNumber
, respectively
All IEEE 754-2008 rounding modes are supported, namely
roundTiesToEven
roundTiesToAway
roundTowardPositive
roundTowardNegative
roundTowardZero
All IEEE 754-2008 status flags are supported, namely
- Invalid operation (
NV
) - Division by zero (
DZ
) - Overflow (
OF
) - Underflow (
UF
) - Inexact (
NX
)
FPnew currently depends on the following:
lzc
andrr_arb_tree
from thecommon_cells
repository (https://github.com/pulp-platform/common_cells.git)- optional: Divider and square-root unit from the
fpu-div-sqrt-mvp
repository (https://github.com/pulp-platform/fpu_div_sqrt_mvp.git)
These two repositories are included in the source code directory as git submodules, use
git submodule update --init --recursive
if you want to load these dependencies there.
Consider using Bender for managing dependencies in your projects. FPnew comes with Bender support!
The top-level module of the FPU is called fpnew_top
and can be directly instantiated in your design.
Make sure you compile the package fpnew_pkg
ahead of any files making references to types, parameters or functions defined there.
It is discouraged to import
all of fpnew_pkg
into your source files. Instead, explicitly scope references into the package like so: fpnew_pkg::foo
.
// FPU instance
fpnew_top #(
.Features ( fpnew_pkg::RV64D ),
.Implementation ( fpnew_pkg::DEFAULT_NOREGS ),
.TagType ( logic ),
.PulpDivsqrt ( 1'b1 )
) i_fpnew_top (
.clk_i,
.rst_ni,
.operands_i,
.rnd_mode_i,
.op_i,
.op_mod_i,
.src_fmt_i,
.dst_fmt_i,
.int_fmt_i,
.vectorial_op_i,
.simd_mask_i,
.tag_i,
.in_valid_i,
.in_ready_o,
.flush_i,
.result_o,
.status_o,
.tag_o,
.out_valid_o,
.out_ready_i,
.busy_o
);
More in-depth documentation on the FPnew configuration, interfaces and architecture is provided in docs/README.md
.
In case you find any issues with FPnew that have not been reported yet, don't hesitate to open a new issue here on Github. Please, don't use the issue tracker for support questions. Instead, consider contacting the maintainers or consulting the PULP forums.
In case you would like to contribute to the project, please refer to the contributing guidelines in docs/CONTRIBUTING.md
before opening a pull request.
HDL source code can be found in the src
directory while documentation is located in docs
.
A changelog is kept at docs/CHANGELOG.md
.
This repository loosely follows the GitFlow branching model.
This means that the master
branch is considered stable and used to publish releases of the FPU while the develop
branch contains features and bugfixes that have not yet been properly released.
Furthermore, this repository tries to adhere to SemVer, as outlined in the changelog.
FPnew is released under the SolderPad Hardware License, which is a permissive license based on Apache 2.0. Please refer to the SolderPad license file for further information.
The T-Head E906 DivSqrt unit, integrated into FPnew in vendor/opene906
, is reseased under the Apache License, Version 2.0. Please refer to the Apache 2.0 license file for further information.
If you use FPnew in your work, you can cite us:
FPnew Publication
@article{mach2020fpnew,
title={Fpnew: An open-source multiformat floating-point unit architecture for energy-proportional transprecision computing},
author={Mach, Stefan and Schuiki, Fabian and Zaruba, Florian and Benini, Luca},
journal={IEEE Transactions on Very Large Scale Integration (VLSI) Systems},
volume={29},
number={4},
pages={774--787},
year={2020},
publisher={IEEE}
}
If you use FPnew SDOTP in your work, you can cite us:
SDOTP Publication
@inproceedings{bertaccini2022minifloat,
title={MiniFloat-NN and ExSdotp: An ISA Extension and a Modular Open Hardware Unit for Low-Precision Training on RISC-V Cores},
author={Bertaccini, Luca and Paulin, Gianna and Fischer, Tim and Mach, Stefan and Benini, Luca},
booktitle={2022 IEEE 29th Symposium on Computer Arithmetic (ARITH)},
pages={1--8},
year={2022},
organization={IEEE}
}
This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 732631.
For further information, visit oprecomp.eu.