Add doc for serialization/deserialization of torchao optimized models #524

Merged: 1 commit into pytorch:main on Jul 18, 2024

Conversation

jerryzh168 (Contributor) commented:

Summary:
Addressing the following questions:

  1. What happens if I save a quantized model?
  2. What happens if I load a quantized model? (describing details like `assign=True`)

Specifically:

  1. Do you need ao as a dependency when you're loading a quantized model?
  2. Is the saved quantized model smaller on disk than the unquantized one?

Test Plan:
.


pytorch-bot (bot) commented on Jul 17, 2024:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/524

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 4eec577 with merge base 6dd82d8:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label on Jul 17, 2024. (This label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed.)
Inline review comments on docs/source/ser_deser.rst (outdated, now resolved):

What happens when serializing an optimized model?
=================================================
To serialize an optimized model, we just need to call `torch.save(m.state_dict(), f)`, because in torchao we use tensor subclasses to represent different dtypes and to support different optimization techniques like quantization and sparsity. So after optimization, the only thing that is updated is the weight tensor; the model structure is not changed at all. For example:
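(The example itself is not quoted in this thread; below is a minimal sketch of the pattern the paragraph describes, assuming torchao's `quantize_` / `int8_weight_only` API. The toy model and the filename are illustrative; the config used in the merged doc may differ.)

```python
import torch
from torchao.quantization import quantize_, int8_weight_only

# A toy model; any nn.Module works the same way.
m = torch.nn.Sequential(torch.nn.Linear(32, 64), torch.nn.Linear(64, 32))

# quantize_ swaps each linear weight for a tensor-subclass-backed
# quantized weight in place; the module structure is untouched.
quantize_(m, int8_weight_only())

# Serialization is then just the usual state_dict save.
torch.save(m.state_dict(), "quantized_model_state_dict.pt")
```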
A reviewer (Member) commented:
The subclass point is not well explained. I think what you're trying to say, more plainly, is that at model save/load time we swap in the quantized weights.

Which means we're instantiating the full-precision model, which means we probably also want to explain why people might want to instantiate a model on CPU to later transfer to GPU.

jerryzh168 (Contributor, Author) replied:

We recommend people initialize the model on the meta device; this is explained in the deserialization section.
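(For context, a minimal sketch of that recommendation, reusing the hypothetical model and filename from the save example above; the wording in the actual doc may differ.)

```python
import torch
import torchao  # must be installed so the quantized tensor subclass can be unpickled

# Build the model skeleton on the meta device: no real memory is
# allocated for the original high-precision weights.
with torch.device("meta"):
    m = torch.nn.Sequential(torch.nn.Linear(32, 64), torch.nn.Linear(64, 32))

state_dict = torch.load("quantized_model_state_dict.pt")

# assign=True replaces the meta parameters with the loaded quantized
# tensors outright, instead of copying into the existing parameters
# (which would fail on meta tensors).
m.load_state_dict(state_dict, assign=True)
```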

jerryzh168 (Contributor, Author) added:

Does the example help with the explanation? Otherwise, let me know what else I can change.

msaroufim (Member) left a comment:

very nice!

msaroufim merged commit 891a588 into pytorch:main on Jul 18, 2024. 13 checks passed.
dbyoung18 pushed a commit to dbyoung18/ao that referenced this pull request on Jul 31, 2024: "Add doc for serialization/deserialization of torchao optimized models (pytorch#524)"
