[Tracker] WIP features for torchao 0.3 #252

Open · 12 of 19 tasks
supriyar opened this issue May 17, 2024 · 6 comments

supriyar (Contributor) commented May 17, 2024

Focus - benchmarking, documentation, tutorials, prototype to beta

Due date: June 13, 2024

Spillover from 0.2.0

Benchmarking

Documentation

Tutorials

  • Tutorial for affine quantization dtype and unified quant primitives - found lots of subtle differences, especially w.r.t. preserving zeros and tinygemm (@jerryzh168); see the sketch below
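
To make the "preserving zeros" subtlety concrete, here is a minimal pure-PyTorch sketch (illustrative only, not the torchao primitives): forcing 0.0 into the quantization range and using an integer zero_point guarantees that a real 0.0 round-trips exactly, which tinygemm-style floating-point zero_points do not.

```python
# Minimal sketch of "preserve zeros" in asymmetric affine quantization.
# Pure PyTorch with illustrative names; not the torchao quant primitives.
import torch

def choose_qparams(x: torch.Tensor, quant_min: int = -8, quant_max: int = 7):
    # Force 0.0 into the representable range so it lands on the integer grid.
    min_val = torch.minimum(x.min(), torch.zeros(()))
    max_val = torch.maximum(x.max(), torch.zeros(()))
    scale = (max_val - min_val) / (quant_max - quant_min)
    # An integer zero_point is what guarantees exact-zero round-trips;
    # tinygemm instead keeps the zero_point in the floating-point domain.
    zero_point = torch.round(quant_min - min_val / scale)
    return scale, zero_point

def quantize(x, scale, zero_point, quant_min=-8, quant_max=7):
    return torch.clamp(torch.round(x / scale) + zero_point, quant_min, quant_max)

def dequantize(q, scale, zero_point):
    return (q - zero_point) * scale

x = torch.randn(4, 4)
x[0, 0] = 0.0
scale, zp = choose_qparams(x)
assert dequantize(quantize(x, scale, zp), scale, zp)[0, 0] == 0.0  # zero preserved
```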

Core

  • QAT workflow (@andrewor14)
  • Dedup the implementations of quant primitives (@jerryzh168)
  • Dedup the implementations of quant APIs (@jerryzh168)
  • Deduplicate int4 workflows
  • Factory function and implements decorator for affine quantization dtype
  • Bit packing interfaces (@msaroufim) - see the packing sketch after this list
  • float6 kernels (@gau-nernst)
  • int3/int5 kernels (@msaroufim)
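
For the bit packing item above, a minimal pure-PyTorch sketch of the idea: packing two int4 values into one uint8. The function names and nibble layout are illustrative assumptions, not the torchao interface.

```python
# Sketch of sub-byte bit packing: two int4 values per uint8.
# Names and layout are assumptions for illustration, not the torchao API.
import torch

def pack_int4(x: torch.Tensor) -> torch.Tensor:
    # x: uint8 tensor with values in [0, 15] and an even number of elements.
    assert x.dtype == torch.uint8 and x.numel() % 2 == 0
    pairs = x.reshape(-1, 2)
    return (pairs[:, 0] << 4) | pairs[:, 1]  # high nibble | low nibble

def unpack_int4(packed: torch.Tensor) -> torch.Tensor:
    high = (packed >> 4) & 0xF
    low = packed & 0xF
    return torch.stack([high, low], dim=-1).reshape(-1)

vals = torch.randint(0, 16, (8,), dtype=torch.uint8)
assert torch.equal(unpack_int4(pack_int4(vals)), vals)  # lossless round-trip
```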
jeromeku (Collaborator) commented May 30, 2024

@msaroufim

  • Generally, what needs to be done to compose a new dtype with FSDP?
  • What other (high priority) dtypes are on the ao roadmap for integration with FSDP?
  • Is there a universal representation for asymmetrically/symmetrically quantized types in torch? I.e., a sub-byte/byte type with scale/zero_point that can be used regardless of the quantization method?
  • Is development of fp8 primitives for training and inference primarily in pytorch/float8_experimental or are there specific torchao initiatives focused on fp8?

Happy to contribute on any of these fronts.

jerryzh168 (Contributor) commented

@jeromeku for:

  • Is there a universal representation for asymmetrically/symmetrically quantized types in torch? I.e., a sub-byte/byte type with scale/zero_point that can be used regardless of the quantization method?

Yes, it's called AffineQuantizedTensor; we are putting all the variants (symmetric/asymmetric, per_tensor/per_channel/per_group/per_token, int8/int4/int3/...) under this tensor subclass.

Here is a model-level API walkthrough using the tensor subclass: https://github.com/pytorch/ao/tree/main/torchao/quantization#quantization-flow

Currently I'm working on replacing the existing APIs with it (#294); after that I plan to publish a more detailed tutorial on how to implement a new data representation with a tensor subclass, using this as an example.
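
A minimal sketch of that model-level flow, for reference. The entry points below (quantize_, int4_weight_only) match what later torchao releases export; since #294 was still migrating the APIs at the time, treat the exact names as assumptions.

```python
# Sketch of the model-level quantization flow; quantize_/int4_weight_only
# are the names in later torchao releases and an assumption here.
import torch
from torchao.quantization import quantize_, int4_weight_only

# int4 weight-only uses the tinygemm kernel, which needs bfloat16 + CUDA.
model = torch.nn.Sequential(torch.nn.Linear(1024, 1024)).to(torch.bfloat16).cuda()

# Swaps each Linear weight for an AffineQuantizedTensor in place; the module
# structure is unchanged, only the weight representation differs.
quantize_(model, int4_weight_only())

x = torch.randn(1, 1024, dtype=torch.bfloat16, device="cuda")
out = model(x)
```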

msaroufim (Member) commented

@jeromeku regarding your other questions

  • To compose new dtypes with FSDP, you can follow the playbook in [FSDP2][NF4Tensor][2/n] implement torch.chunk and other ops #150 - a sketch of the general shape follows this list. We'll look to write some docs (@weifengpy), but let us know if you have any questions in the meantime.
  • Regarding high-priority dtypes, I'm not 100% sure yet since it depends on what researchers do - I'm personally biased toward getting bitnet to work.
  • Regarding fp8 training, we're looking to centralize the fp8 work in this repo, and @vkuzo is going to be moving bits over time.
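
Here is a minimal sketch of the general shape of that playbook: a __torch_dispatch__ wrapper subclass that registers handlers for the aten ops FSDP exercises when sharding (split, detach, ...). MyQuantTensor, its int8 payload, and the op table are hypothetical stand-ins, not the actual NF4Tensor code from #150.

```python
# Sketch of the #150 playbook: implement just the aten ops FSDP needs on a
# __torch_dispatch__ wrapper subclass. All names here are hypothetical.
import torch

ATEN_TABLE = {}

def implements(aten_ops):
    # Register a handler for a list of aten ops (mirrors torchao's pattern).
    def decorator(fn):
        for op in aten_ops:
            ATEN_TABLE[op] = fn
        return fn
    return decorator

class MyQuantTensor(torch.Tensor):
    @staticmethod
    def __new__(cls, int_data, scale):
        return torch.Tensor._make_wrapper_subclass(
            cls, int_data.shape, dtype=scale.dtype, device=int_data.device
        )

    def __init__(self, int_data, scale):
        self.int_data = int_data  # quantized payload
        self.scale = scale

    @classmethod
    def __torch_dispatch__(cls, func, types, args, kwargs=None):
        if func in ATEN_TABLE:
            return ATEN_TABLE[func](func, args, kwargs or {})
        raise NotImplementedError(f"{func} not implemented for MyQuantTensor")

@implements([torch.ops.aten.detach.default])
def _detach(func, args, kwargs):
    t = args[0]
    return MyQuantTensor(t.int_data, t.scale)

@implements([torch.ops.aten.split.Tensor])
def _split(func, args, kwargs):
    # FSDP shards parameters via split/chunk on dim 0; forward to the payload.
    t, split_size = args[0], args[1]
    return [MyQuantTensor(chunk, t.scale) for chunk in t.int_data.split(split_size)]

t = MyQuantTensor(torch.randint(-128, 127, (8, 4), dtype=torch.int8), torch.tensor(0.1))
shards = torch.split(t, 4)  # dispatches to _split via __torch_dispatch__
```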

bhack commented Jun 13, 2024

Are we going to support dynamic inputs?

msaroufim (Member) commented

Hi @bhack! I've seen you on a lot of threads. Could you share a bit more about what you mean by dynamic inputs? Are you referring to dynamic shapes?

bhack commented Jun 14, 2024

Yes, dynamic input shapes, but not mainly on the batch dimension - e.g. images with different widths and heights.
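
For reference, this is what dynamic non-batch dims look like on the torch.compile side, via torch._dynamo.mark_dynamic on the height/width dims; whether torchao's quantized kernels can consume such symbolic shapes is exactly the open question here.

```python
# Sketch: marking image H/W (not batch) as dynamic for torch.compile.
import torch

model = torch.nn.Conv2d(3, 16, kernel_size=3, padding=1)
compiled = torch.compile(model)

x = torch.randn(1, 3, 224, 320)
# Treat dims 2 and 3 (height, width) as symbolic instead of specializing.
torch._dynamo.mark_dynamic(x, 2)
torch._dynamo.mark_dynamic(x, 3)
out = compiled(x)

y = torch.randn(1, 3, 192, 256)
torch._dynamo.mark_dynamic(y, 2)
torch._dynamo.mark_dynamic(y, 3)
out2 = compiled(y)  # reuses the dynamic-shape graph, no recompile
```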

msaroufim unpinned this issue Jun 29, 2024