gateware floating point instructions on the kernel CPU #535

sbourdeauducq · 2016-08-05T03:25:09Z

dhslichter · 2016-08-05T21:18:03Z

Awesome, thanks for adding this. Can you post this on the m-labs website in the ARTIQ extensions list as well?

sbourdeauducq · 2016-08-20T03:58:49Z

I intend to remove this page at some point and replace it with a link to https://github.com/m-labs/artiq/issues?utf8=%E2%9C%93&q=label%3Atype%3Afor-contract%20

sbourdeauducq · 2017-03-18T04:03:53Z

@dhslichter mentioned basic math functions (sin, cos, log, exp). Should those be gateware-accelerated, or are slower software implementations (using gateware FP for floating addition, multiplication, etc.) OK?

jordens · 2017-03-18T10:47:40Z

I would really prefer not to stray too far off the beaten path here, i.e. for hard FP and vectorization (which is an extremely complicated thing to reinvent) we should concentrate on the corresponding OpenRISC instruction sets and implement them, instead of inventing new instruction sets. Also, accelerated trigonometry is of similar complexity to vectorization.
Since hard FP and trigonometry have a multiplicative impact on performance, focusing on "standard" hard FP first seems prudent.

dhslichter · 2017-03-20T05:07:12Z

I agree with @jordens; the most important thing to do here would be to implement a "standard" hard FP instruction set. Basically the thing to do would be to implement all the instructions defined in the openrisc standard (ORFPX32) but not actually implemented in mor1kx. The sin/cos/log/exp functions could then be constructed from these with (hopefully) reasonable performance, and we can decide later if they truly require hardware acceleration themselves.
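To illustrate the idea of building sin from basic FP operations, here is a minimal sketch in plain Python. It is not ARTIQ or compiler-rt code: the name `soft_sin`, the choice of a Taylor polynomial, and the number of terms are all assumptions made for illustration. The function only uses add, subtract, multiply, compare and a float-to-int conversion, which are the kinds of operations a "standard" hard FP instruction set would accelerate.

```python
import math

PI = math.pi
HALF_PI = 0.5 * math.pi
TWO_PI = 2.0 * math.pi
INV_TWO_PI = 1.0 / TWO_PI

# Taylor coefficients of sin(r)/r as a polynomial in r*r,
# highest order first, for Horner evaluation (illustrative choice).
_SIN_COEFFS = (
    -1.0 / 39916800.0,  # -1/11!
    1.0 / 362880.0,     # +1/9!
    -1.0 / 5040.0,      # -1/7!
    1.0 / 120.0,        # +1/5!
    -1.0 / 6.0,         # -1/3!
    1.0,
)


def soft_sin(x):
    """Approximate sin(x) using only add, multiply, compare and float->int."""
    # Range reduction to roughly [-pi, pi]: round x / (2*pi) to the nearest integer.
    k = int(x * INV_TWO_PI + (0.5 if x >= 0.0 else -0.5))
    r = x - k * TWO_PI
    # Fold into [-pi/2, pi/2] using sin(pi - r) == sin(r).
    if r > HALF_PI:
        r = PI - r
    elif r < -HALF_PI:
        r = -PI - r
    # Horner evaluation: one multiply and one add per coefficient.
    r2 = r * r
    acc = 0.0
    for c in _SIN_COEFFS:
        acc = acc * r2 + c
    return acc * r
```

With six coefficients the polynomial is accurate to roughly 1e-7 on the reduced range, at a cost of about ten multiplies and additions per call. A production version would more likely use minimax coefficients and a more careful range reduction for large arguments.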

jordens · 2017-05-11T12:29:06Z

@whitequark could you add a short assessment of the state/quality/existence of support for or1k FP in the toolchain and the biggest roadblocks to this issue?

artiq compiler
llvm-or1k
llvmlite/new-pyllvm
compiler-rt
artiq runtime (probably need nothing specific here)
testbenches

whitequark · 2017-05-11T13:17:29Z

artiq compiler
llvmlite/new-pyllvm
artiq runtime
testbenches

No changes required as the LLVM instructions are the same for soft or hard FP.

compiler-rt

No changes needed as we will be using less compiler-rt code afterwards.

llvm-or1k

The OR1K backend supports the complete OR1K FPU single-precision instruction set except the MAC instruction. Extending to double precision (which we use) is trivial. Ditto for MAC.

jordens · 2017-05-11T13:26:59Z

@sbourdeauducq would we use the mor1kx FPU, port some other FPU, write a new one ourselves, or go some completely different route?

sbourdeauducq · 2017-05-11T14:09:53Z

Take the usable parts of the mor1kx FPU (e.g. its interface into the CPU), rewrite the others.

sbourdeauducq · 2021-02-25T10:43:46Z

It seems VexRiscv has an FPU now, and I would expect it to be of high quality.

sbourdeauducq · 2021-09-12T11:15:53Z

Funded by Duke.

sbourdeauducq · 2021-11-08T09:39:04Z

speedup: #1777 (comment)

sbourdeauducq added the type:needs-funding label Aug 5, 2016

jordens mentioned this issue Nov 1, 2016

Hard cores sinara-hw/sinara#47

Closed

sbourdeauducq changed the title ~~hard FP support on the kernel CPU~~ gateware floating point support on the kernel CPU Nov 2, 2016

sbourdeauducq changed the title ~~gateware floating point support on the kernel CPU~~ gateware floating point instructions on the kernel CPU Nov 2, 2016

jordens added the area:speed label May 11, 2017

jordens mentioned this issue Mar 2, 2018

Urukul FTW and attenuator update rate #939

Closed

jordens mentioned this issue Oct 24, 2018

Irregular delays when handling Novogorny sample output #1110

Closed

sbourdeauducq removed the type:needs-funding label Sep 12, 2021

sbourdeauducq assigned occheung Sep 12, 2021

occheung mentioned this issue Nov 8, 2021

Add floating-point instruction support to kernel for non-Kasli-v1.x targets using VexRiscv core #1777

Merged

8 tasks

sbourdeauducq closed this as completed in #1777 Nov 8, 2021
