
Quantize vit_b_16 tutorial - Part 1 #60

Merged 7 commits into main on Mar 22, 2024

Conversation

cpuhrsch (Contributor)

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 15, 2024
V0315 17:55:08.077000 140041589225280 torch/_inductor/graph.py:1258] [0/0] [__output_code] from torch._inductor.wrapper_benchmark import compiled_module_main
V0315 17:55:08.077000 140041589225280 torch/_inductor/graph.py:1258] [0/0] [__output_code] compiled_module_main('None', benchmark_compiled_module)
V0315 17:55:08.077000 140041589225280 torch/_inductor/graph.py:1258] [0/0] [__output_code]
I0315 17:55:08.079000 140041589225280 torch/_inductor/graph.py:1264] [0/0] [__output_code] Output code written to: /tmp/torchinductor_cpuhrsch/2i/c2ixftylrwvvc3swfutdqklg6xb2w47xlwmfdmtgktp4yb4kzkro.py
Member
Maybe check in this file instead?
/tmp/torchinductor_cpuhrsch/2i/c2ixftylrwvvc3swfutdqklg6xb2w47xlwmfdmtgktp4yb4kzkro.py

Contributor Author

Yes, I can make it a .py file. The generated name isn't great, though; torch should have a more official way of writing these out to a tidy location.


# Store the output code for further inspection
TORCH_LOGS='output_code' python run_vit_b.py 2> bfloat16_code
TORCH_LOGS='output_code' python run_vit_b_quant.py 2> quant_code
Member @msaroufim, Mar 16, 2024

I think what people might expect to see here is the fused quant and dequant in the generated code, so we could add some comments to the checked-in code for people to inspect. At least, that's what I'd expect people to look for if they want to make sure torch.compile and quantization compose together.

Contributor Author

I'll try to add more pictures and also isolate the kernel. It's much clearer from the traces.


## Quantization code - start
from torchao.quantization import quant_api
quant_api.change_linear_weights_to_int8_dqtensors(model)
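
For readers who want the intuition behind `change_linear_weights_to_int8_dqtensors`, here is a minimal NumPy sketch of symmetric per-channel int8 weight quantization with dequantize-on-the-fly matmul. This illustrates the idea only, not torchao's implementation; all names below are made up for the example.

```python
import numpy as np

def quantize_per_channel(w):
    # Symmetric int8 quantization with one scale per output channel (row).
    scale = np.abs(w).max(axis=1, keepdims=True) / 127.0
    q = np.clip(np.round(w / scale), -128, 127).astype(np.int8)
    return q, scale

def dequant_matmul(x, q, scale):
    # Dequantize the weight on the fly, then apply the linear layer: x @ W.T
    return x @ (q.astype(np.float32) * scale).T

rng = np.random.default_rng(0)
w = rng.standard_normal((4, 8)).astype(np.float32)  # toy linear weight
x = rng.standard_normal((2, 8)).astype(np.float32)  # toy activations

q, s = quantize_per_channel(w)
# Quantization is lossy, so the result only approximates x @ w.T.
err = np.abs(dequant_matmul(x, q, s) - x @ w.T).max()
```

In the real kernels the point is that torch.compile can fuse this dequantize step into the matmul, which is exactly what the reviewers suggest highlighting in the generated output code.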
Contributor

@cpuhrsch not sure if we discussed this already: do you have any thoughts/comments on putting these under the unified quantization API (https://github.com/pytorch-labs/ao/blob/main/torchao/quantization/quant_api.py#L52)?
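
To make the unified-API suggestion concrete, here is a hypothetical sketch of a name-keyed registry behind a single `quantize(model, method)` entry point. Every identifier below is invented for illustration and is not torchao's actual quant_api; the model is a plain dict standing in for an nn.Module.

```python
# Registry mapping method names to quantization transforms (illustrative only).
_QUANT_METHODS = {}

def register_method(name):
    def deco(fn):
        _QUANT_METHODS[name] = fn
        return fn
    return deco

@register_method("int8_dynamic")
def _int8_dynamic(model):
    # Stand-in for swapping linear weights to int8 dynamically-quantized tensors.
    model["weights_dtype"] = "int8"
    return model

def quantize(model, method="int8_dynamic"):
    # Single entry point that dispatches to the registered transform.
    try:
        return _QUANT_METHODS[method](model)
    except KeyError:
        raise ValueError(f"unknown quantization method: {method}") from None

model = {"weights_dtype": "float32"}  # toy stand-in for an nn.Module
model = quantize(model)
```

The design point is that tutorials could then call one stable function with a method name, rather than a per-technique helper like `change_linear_weights_to_int8_dqtensors`.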

@cpuhrsch cpuhrsch changed the title Quantize vit_b_16 tutorial Quantize vit_b_16 tutorial - Part 1 Mar 22, 2024
@cpuhrsch cpuhrsch marked this pull request as ready for review March 22, 2024 23:29
@cpuhrsch cpuhrsch merged commit 645d654 into main Mar 22, 2024
3 checks passed
dbyoung18 pushed a commit to dbyoung18/ao that referenced this pull request Jul 31, 2024