[DCU] fix DCU w8a8c8 GEMM shape #9115

YanhuiDua · 2024-09-10T03:39:16Z

PR types

Bug fixes

PR changes

Others

Description

fix DCU GEMM shape when quant_type == "a8w8c8"

paddle-bot · 2024-09-10T03:39:20Z

Thanks for your contribution!

codecov · 2024-09-10T04:10:56Z

Codecov Report

Attention: Patch coverage is 0% with 7 lines in your changes missing coverage. Please review.

Project coverage is 53.33%. Comparing base (2f31866) to head (81c30d8).
Report is 6 commits behind head on develop.

Files with missing lines	Patch %	Lines
...erimental/transformers/fused_transformer_layers.py	0.00%	4 Missing ⚠️
...dlenlp/experimental/transformers/llama/modeling.py	0.00%	1 Missing ⚠️
...enlp/experimental/transformers/mixtral/modeling.py	0.00%	1 Missing ⚠️
...dlenlp/experimental/transformers/qwen2/modeling.py	0.00%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9115      +/-   ##
===========================================
- Coverage    53.34%   53.33%   -0.01%     
===========================================
  Files          652      652              
  Lines       105401   105404       +3     
===========================================
  Hits         56222    56222              
- Misses       49179    49182       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

yuanlehome · 2024-09-10T06:44:17Z

paddlenlp/experimental/transformers/llama/modeling.py

@@ -631,6 +631,7 @@ def __init__(self, config: LlamaConfig):
            )

        else:
+            print("self.quant_type: ", self.quant_type)


注释删掉

YanhuiDua · 2024-09-10T06:45:21Z

paddlenlp/experimental/transformers/qwen2/modeling.py

@@ -372,7 +372,7 @@ def __init__(self, config: Qwen2Config):
                use_neox_rotary_style=self.use_neox,
                cachekv_int8_type=config.cachekv_int8_type,
                rank_id=config.tensor_parallel_rank,
-                trans_qkvw=(False if paddle.is_compiled_with_rocm() and self.quant_type == "a8w8" else True),
+                trans_qkvw=(False if paddle.is_compiled_with_rocm() and "a8w8" in self.quant_type else True),


这个就是为了修复w8a8c8时的gemm shape ～

CLAassistant · 2024-09-11T12:06:36Z

All committers have signed the CLA.

yuanlehome reviewed Sep 10, 2024

View reviewed changes

YanhuiDua commented Sep 10, 2024

View reviewed changes

yuanlehome approved these changes Sep 10, 2024

View reviewed changes

YanhuiDua closed this Sep 11, 2024

YanhuiDua reopened this Sep 11, 2024

YanhuiDua force-pushed the fix_dcu_gemm branch from f2f33dc to f5b2995 Compare September 11, 2024 12:13

[DCU] fix DCU w8a8c8 GEMM shape

81c30d8

YanhuiDua force-pushed the fix_dcu_gemm branch from f5b2995 to 81c30d8 Compare September 11, 2024 12:16

qingqing01 approved these changes Sep 11, 2024

View reviewed changes

qingqing01 merged commit 73a3db9 into PaddlePaddle:develop Sep 11, 2024
6 of 12 checks passed

YanhuiDua deleted the fix_dcu_gemm branch September 11, 2024 13:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DCU] fix DCU w8a8c8 GEMM shape #9115

[DCU] fix DCU w8a8c8 GEMM shape #9115

YanhuiDua commented Sep 10, 2024 •

edited

Loading

paddle-bot bot commented Sep 10, 2024

codecov bot commented Sep 10, 2024 •

edited

Loading

yuanlehome Sep 10, 2024

YanhuiDua Sep 10, 2024

YanhuiDua Sep 10, 2024

CLAassistant commented Sep 11, 2024 •

edited

Loading

[DCU] fix DCU w8a8c8 GEMM shape #9115

[DCU] fix DCU w8a8c8 GEMM shape #9115

Conversation

YanhuiDua commented Sep 10, 2024 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Sep 10, 2024

codecov bot commented Sep 10, 2024 • edited Loading

Codecov Report

yuanlehome Sep 10, 2024

Choose a reason for hiding this comment

YanhuiDua Sep 10, 2024

Choose a reason for hiding this comment

YanhuiDua Sep 10, 2024

Choose a reason for hiding this comment

CLAassistant commented Sep 11, 2024 • edited Loading

YanhuiDua commented Sep 10, 2024 •

edited

Loading

codecov bot commented Sep 10, 2024 •

edited

Loading

CLAassistant commented Sep 11, 2024 •

edited

Loading