[LLM] support Qwen2 #8338

DrownFish19 · 2024-04-28T08:45:04Z

PR types

New features

PR changes

Models

Description

add QWen1.5 Moe model.
add Qwen2 model.
support same prefix for different models, such as QWen and QWen2Moe with same prefix QWen. The longest name will match each model name before others.
support sft and lora.

support models are listed as follows:

Model (qwen-1.5)
Qwen/Qwen1.5-0.5B
Qwen/Qwen1.5-0.5B-Chat
Qwen/Qwen1.5-1.8B
Qwen/Qwen1.5-1.8B-Chat
Qwen/Qwen1.5-4B
Qwen/Qwen1.5-4B-Chat
Qwen/Qwen1.5-7B
Qwen/Qwen1.5-7B-Chat
Qwen/Qwen1.5-14B
Qwen/Qwen1.5-14B-Chat
Qwen/Qwen1.5-32B
Qwen/Qwen1.5-32B-Chat
Qwen/Qwen1.5-72B
Qwen/Qwen1.5-72B-Chat
Qwen/Qwen1.5-110B
Qwen/Qwen1.5-110B-Chat
Qwen/Qwen1.5-MoE-A2.7B
Qwen/Qwen1.5-MoE-A2.7B-Chat

Model (qwen2)
Qwen/Qwen2-0.5B
Qwen/Qwen2-0.5B-Instruct
Qwen/Qwen2-1.5B
Qwen/Qwen2-1.5B-Instruct
Qwen/Qwen2-7B
Qwen/Qwen2-7B-Instruct
Qwen/Qwen2-72B
Qwen/Qwen2-72B-Instruct
Qwen/Qwen2-57B-A14B
Qwen/Qwen2-57B-A14B-Instruct

…-moe

paddle-bot · 2024-04-28T08:45:09Z

Thanks for your contribution!

…P into dev_add_qwen1.5-moe

codecov · 2024-05-06T03:46:48Z

Codecov Report

Attention: Patch coverage is 40.33276% with 1040 lines in your changes missing coverage. Please review.

Project coverage is 54.42%. Comparing base (909be01) to head (48ae2ab).
Report is 240 commits behind head on develop.

Files with missing lines	Patch %	Lines
paddlenlp/transformers/qwen2/modeling.py	14.41%	588 Missing ⚠️
paddlenlp/transformers/qwen2_moe/modeling.py	72.29%	197 Missing ⚠️
paddlenlp/transformers/qwen2/modeling_pp.py	0.00%	112 Missing ⚠️
paddlenlp/transformers/qwen2/tokenizer.py	22.38%	104 Missing ⚠️
paddlenlp/transformers/qwen2/configuration.py	13.33%	39 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8338      +/-   ##
===========================================
- Coverage    54.67%   54.42%   -0.26%     
===========================================
  Files          624      632       +8     
  Lines        97709    99450    +1741     
===========================================
+ Hits         53427    54128     +701     
- Misses       44282    45322    +1040

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ZHUI · 2024-05-08T11:33:23Z

llm/qwen2moe/lora_argument.json

+{
+    "model_name_or_path": "qwen/Qwen1.5-MoE-A2.7B",
+    "dataset_name_or_path": "./data",
+    "output_dir": "./checkpoints/qwen2moe_lora_ckpts",


确认是否ok，并同步更新 readme 文档

ZHUI · 2024-05-08T11:34:31Z

paddlenlp/transformers/qwen2moe/__init__.py

+from .configuration import QWen2MoeConfig
+from .modeling import QWen2MoeForCausalLM
+from .tokenizer import QWen2MoeTokenizer


Suggested change

from .configuration import QWen2MoeConfig

from .modeling import QWen2MoeForCausalLM

from .tokenizer import QWen2MoeTokenizer

from .configuration import *

from .modeling import *

from .tokenizer import*

ZHUI · 2024-05-08T11:35:09Z

paddlenlp/transformers/__init__.py

+from .qwen2moe.modeling import *
+from .qwen2moe.configuration import *
+from .qwen2moe.tokenizer import *


Suggested change

from .qwen2moe.modeling import *

from .qwen2moe.configuration import *

from .qwen2moe.tokenizer import *

from .qwen2moe import *

ZHUI · 2024-05-08T11:37:21Z

tests/transformers/qwen2moe/__init__.py

@@ -0,0 +1,13 @@
+# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.


这个文件需要吗？

…P into dev_add_qwen1.5-moe

wawltor

LGTM

DrownFish19 added 23 commits April 17, 2024 10:58

add Qwen2Moe

36ab9a7

update default config

3913e11

Merge remote-tracking branch 'paddlenlp/develop' into dev_add_qwen1.5…

0aa1aca

…-moe

update QWen2Moe modeling

a29e90d

update modeling

d514dff

update ckpt name

1e98323

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

f81bb43

support same prefix model name for auto modeling

37dd2d5

update qwen2moe testing

d12938a

update qwen2moe modeling and config

8cc49fc

update qwen2moe import

9c8222e

fix mlp hidden_size

4d6ff87

update qkv bias convert

f350a2f

update modeling init_weight

c53690d

update _get_name_mappings

9d12995

update _get_name_mappings and _init_weight

dba0f74

add tokenizer

e487606

update modeling

cd9c753

update modeling

10407c4

update tokenizer

beb0f4c

update modeling and tokenizer

beefee9

fix index_add_ error

82ba345

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

d522ee4

DrownFish19 added 4 commits April 28, 2024 11:08

fix

4a1b2e3

Merge branch 'dev_add_qwen1.5-moe' of github.com:DrownFish19/PaddleNL…

526a9db

…P into dev_add_qwen1.5-moe

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

0c9d5ec

update comments

2bb3aba

update lora weights

f203983

DrownFish19 added 6 commits May 29, 2024 10:49

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

c766eb5

update Copyright

5ddc326

update Moe to MoE

de1db67

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

10a194c

update comment

87f0276

update Copyright

8d9970b

ZHUI reviewed Jun 3, 2024

View reviewed changes

DrownFish19 added 12 commits June 3, 2024 15:45

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

89994a6

update readme and json

d57a5b1

update __init__.py

bfb65a1

add qwen-1.5

4b96dd0

update QWen to Qwen

b274f12

update Qwen2MoE to Qwen2Moe

1054f06

update readme

056b04c

update qwen2moe sft and lora json

ab08c17

update qwen2moe base name

ad02fdc

update qwen2

23e39fc

update

36b3897

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

6455445

ZHUI previously approved these changes Jun 11, 2024

View reviewed changes

DrownFish19 changed the title ~~[LLM] support QWen1.5-Moe~~ [LLM] support QWen2 Jun 11, 2024

DrownFish19 added 2 commits June 11, 2024 08:14

update readme

b140df6

Merge branch 'dev_add_qwen1.5-moe' of github.com:DrownFish19/PaddleNL…

c08c9a6

…P into dev_add_qwen1.5-moe

DrownFish19 dismissed ZHUI’s stale review via c08c9a6 June 11, 2024 08:14

DrownFish19 added 2 commits June 11, 2024 08:16

update readme

e6de5f3

update readme

48ae2ab

DrownFish19 changed the title ~~[LLM] support QWen2~~ [LLM] support Qwen2 Jun 11, 2024

wawltor approved these changes Jun 11, 2024

View reviewed changes

wawltor merged commit 4609d07 into PaddlePaddle:develop Jun 11, 2024
8 of 12 checks passed

DrownFish19 deleted the dev_add_qwen1.5-moe branch June 12, 2024 01:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLM] support Qwen2 #8338

[LLM] support Qwen2 #8338

DrownFish19 commented Apr 28, 2024 •

edited

Loading

paddle-bot bot commented Apr 28, 2024

codecov bot commented May 6, 2024 •

edited

Loading

ZHUI May 8, 2024

ZHUI May 8, 2024

ZHUI May 8, 2024

ZHUI May 8, 2024

wawltor left a comment

		@@ -0,0 +1,13 @@
		# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.

[LLM] support Qwen2 #8338

[LLM] support Qwen2 #8338

Conversation

DrownFish19 commented Apr 28, 2024 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Apr 28, 2024

codecov bot commented May 6, 2024 • edited Loading

Codecov Report

ZHUI May 8, 2024

Choose a reason for hiding this comment

ZHUI May 8, 2024

Choose a reason for hiding this comment

ZHUI May 8, 2024

Choose a reason for hiding this comment

ZHUI May 8, 2024

Choose a reason for hiding this comment

wawltor left a comment

Choose a reason for hiding this comment

DrownFish19 commented Apr 28, 2024 •

edited

Loading

codecov bot commented May 6, 2024 •

edited

Loading