
[Inference LLM] support static c8 #8833

Merged
merged 8 commits into from
Aug 5, 2024

Conversation

yuanlehome
Collaborator

PR types

New features

PR changes

Models

Description

Support static c8 (cache-KV int8) quantization for inference.


paddle-bot bot commented Jul 30, 2024

Thanks for your contribution!


codecov bot commented Jul 30, 2024

Codecov Report

Attention: Patch coverage is 0% with 61 lines in your changes missing coverage. Please review.

Project coverage is 55.49%. Comparing base (ee4944e) to head (7f157ba).
Report is 243 commits behind head on develop.

Files with missing lines Patch % Lines
...dlenlp/experimental/transformers/llama/modeling.py 0.00% 18 Missing ⚠️
...erimental/transformers/fused_transformer_layers.py 0.00% 10 Missing ⚠️
...ddlenlp/experimental/transformers/qwen/modeling.py 0.00% 10 Missing ⚠️
...dlenlp/experimental/transformers/bloom/modeling.py 0.00% 5 Missing ⚠️
...enlp/experimental/transformers/chatglm/modeling.py 0.00% 5 Missing ⚠️
...p/experimental/transformers/chatglm_v2/modeling.py 0.00% 5 Missing ⚠️
...addlenlp/experimental/transformers/gpt/modeling.py 0.00% 5 Missing ⚠️
...enlp/experimental/transformers/generation_utils.py 0.00% 3 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #8833      +/-   ##
===========================================
+ Coverage    55.44%   55.49%   +0.05%     
===========================================
  Files          631      631              
  Lines        98542    98554      +12     
===========================================
+ Hits         54632    54697      +65     
+ Misses       43910    43857      -53     

☔ View full report in Codecov by Sentry.

@@ -435,7 +435,7 @@ def __init__(self, config: LlamaConfig):
ffn1_bias_attrs = None
ffn2_bias_attrs = None

-        if self.quant_type == "a8w8":
+        if "a8w8" in self.quant_type:
Collaborator

If `in` is used for substring matching on self.quant_type, is the full set of possible quant_type values documented anywhere, for the benefit of later development?

Collaborator Author

This part is fine; it deliberately leaves room for future extension. The full set of possible quant_type values follows the pattern a_w_c_, where each _ is a bit-width digit.

Collaborator

Sounds good; this can also be covered in the documentation later.
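The a_w_c_ naming scheme described above can be sketched as a small parser. This is a hypothetical helper for illustration, not code from the PR; the function name and return shape are assumptions:

```python
import re

# Hypothetical helper illustrating the a_w_c_ quant_type scheme
# discussed above; not part of the PR itself.
def parse_quant_type(quant_type: str) -> dict:
    """Parse strings like "a8w8" or "a8w8c8" into per-component bit widths.

    Components: a = activation, w = weight, c = cache KV.
    Components absent from the string map to None.
    """
    names = {"a": "activation", "w": "weight", "c": "cachekv"}
    bits = {"activation": None, "weight": None, "cachekv": None}
    for letter, width in re.findall(r"([awc])(\d+)", quant_type):
        bits[names[letter]] = int(width)
    return bits
```

Under this scheme, the substring test from the diff (`"a8w8" in quant_type`) matches both plain "a8w8" and the new static-c8 variant "a8w8c8".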

self.weight_only_quant_bits = config.weight_only_quant_bits

if self.quant_type is not None and "weight_only_int" in self.quant_type:
if config.quant_type == "weight_only_int8":
Collaborator

Could this be implemented by checking whether weight_only appears in the string? Is the naming convention documented anywhere?

Collaborator Author

Done, added~
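The substring-based check the reviewer suggests can be sketched as follows. The helper name is hypothetical; the real code reads config.quant_type inline:

```python
# Hypothetical helper; illustrates the substring check rather than
# equality against a single literal like "weight_only_int8".
def is_weight_only(quant_type) -> bool:
    # The substring check covers both weight_only_int8 and
    # weight_only_int4, and guards against quant_type being None.
    return quant_type is not None and "weight_only_int" in quant_type
```

This mirrors the `self.quant_type is not None and "weight_only_int" in self.quant_type` condition shown in the diff above.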

DrownFish19 previously approved these changes Jul 30, 2024
Collaborator

DrownFish19 left a comment

LGTM

@wawltor wawltor merged commit a6a7870 into PaddlePaddle:develop Aug 5, 2024
9 of 12 checks passed