RFC-0030: FP8 dtype introduction to PyTorch #51

Merged 2 commits on Aug 14, 2023

Conversation

australopitek (Contributor)

This RFC proposes adding 8-bit floating point data types to PyTorch.

Since the fp8 data type seems to be a natural evolution of the currently used fp16/bf16 types, reducing the computational cost of large DL models, it is worth standardizing. A few attempts at this have been made recently:

* Nvidia, Arm and Intel - https://arxiv.org/pdf/2209.05433.pdf
* GraphCore and AMD - https://arxiv.org/pdf/2206.02915.pdf
A reviewer commented:

For completeness, these formats are proposed by Graphcore, AMD, and Qualcomm.

australopitek (Contributor Author) replied:

I'll correct it once more comments come in.

@YinglinSun

Curious what the progress on fp8 support looks like? Thanks!

@australopitek (Contributor Author)

@jakeh-gc, is there any progress in the fp8 datatypes area?

jakeh-gc commented Jun 8, 2023

@australopitek I've been working more on the XLA side. The only activity I've seen in PyTorch was pytorch/pytorch#97798, which didn't get merged.

albanD (Contributor) commented Jun 15, 2023

Hey!
Wrote an update at pytorch/pytorch#91577 (comment) if you want to comment there!

timljj commented Jun 27, 2023

Hi @australopitek, in your md file, you mentioned that for E5M2 "there are many models that can be trained only with this variant".

May I know what models/type of models you are referring to?

Also, does your statement mean that those models could not be trained with E4M3?

@australopitek (Contributor Author)

Hi @timljj,
my wording was not precise. I meant that some models can be trained with E5M2 without the "help" of E4M3, i.e. E5M2 alone is sufficient (despite being "applicable mainly for gradients in the backward pass of training").
Such experiments are described in https://arxiv.org/pdf/1905.12334.pdf for ResNet-18, ResNet-34, ResNet-50, GNMT, and the Transformer.
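
For reference, the dynamic-range difference between the two variants is easy to inspect now that the dtypes exist in PyTorch. A minimal sketch, assuming PyTorch >= 2.1 where torch.float8_e4m3fn and torch.float8_e5m2 are available:

```python
import torch

# Print the numeric limits of both fp8 variants (PyTorch >= 2.1 assumed).
for dt in (torch.float8_e4m3fn, torch.float8_e5m2):
    fi = torch.finfo(dt)
    print(f"{dt}: max={fi.max}, eps={fi.eps}")

# E4M3 tops out around 448 with finer precision, while E5M2 reaches ~57344
# with coarser precision, which is why E5M2 suits the wide-ranging gradients
# of the backward pass.
```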

maxpain commented Aug 12, 2023

Any updates?

@australopitek (Contributor Author)

@maxpain,
float8 dtypes have already been delivered to PyTorch (pytorch/pytorch#104242); I guess they will be released in PT 2.1.
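
A minimal sketch of what that PR enables, assuming PyTorch >= 2.1 (or a recent nightly):

```python
import torch

# Cast a fp32 tensor to the new fp8 dtypes and back (PyTorch >= 2.1 assumed).
x = torch.randn(4, 4)
x_e4m3 = x.to(torch.float8_e4m3fn)  # 4 exponent bits, 3 mantissa bits
x_e5m2 = x.to(torch.float8_e5m2)    # 5 exponent bits, 2 mantissa bits

# Most kernels are not implemented for fp8 yet, so upcast before computing.
y = x_e4m3.to(torch.float32) @ x_e5m2.to(torch.float32)
```
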
@albanD,
should we commit this RFC for the sake of completeness?

albanD (Contributor) commented Aug 14, 2023

Yes, I think this one is good.
Note that this only covers the basic (unscaled) data types, so we only expect "power users" to be able to use it at this point. End-user-facing UX is being looked into but is not ready right now.
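
To illustrate what "unscaled" means in practice, here is a hypothetical sketch of the per-tensor scaling that power users currently have to manage by hand; the helper names are illustrative, not a PyTorch API:

```python
import torch

def quantize_e4m3(t: torch.Tensor):
    # Hypothetical helper: scale values so they fit within E4M3's maximum
    # finite magnitude (~448), then keep the fp8 tensor and its scale together.
    scale = t.abs().max() / torch.finfo(torch.float8_e4m3fn).max
    return (t / scale).to(torch.float8_e4m3fn), scale

def dequantize(t_fp8: torch.Tensor, scale: torch.Tensor):
    # Upcast and undo the scaling before further computation.
    return t_fp8.to(torch.float32) * scale

t = torch.randn(1024) * 100
t_fp8, scale = quantize_e4m3(t)
print((dequantize(t_fp8, scale) - t).abs().max())  # quantization error
```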
