Add model MegatronBert #1678

Beacontownfc · 2022-02-15T03:34:52Z

Description
Add new model MegatronBert
Model weights:
Link: https://pan.baidu.com/s/1DNoxmqxtRiMycHfVnvwJwg
Extraction code: olie

examples/language_model/megatronbert/README.md

Steffy-zxf · 2022-02-21T08:03:14Z

paddlenlp/transformers/megatronbert/modeling.py

+                 max_position_embeddings=512,
+                 hidden_dropout_prob=0.1,
+                 position_embedding_type="absolute"):
+        super().__init__()


To stay consistent with the PaddleNLP code style, I suggest writing:

super(MegatronBertEmbeddings, self).__init__()
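
A minimal sketch of the two spellings, for reference. The classes `Base`, `ImplicitStyle`, and `ExplicitStyle` are hypothetical stand-ins rather than PaddleNLP code. In Python 3 both forms call the same parent initializer, so the suggestion is about style consistency, not behavior:

```python
class Base:
    def __init__(self):
        self.initialized = True


class ImplicitStyle(Base):
    def __init__(self):
        # Python 3 zero-argument form, as written in the submitted diff.
        super().__init__()


class ExplicitStyle(Base):
    def __init__(self):
        # Explicit form suggested in the review for consistency.
        super(ExplicitStyle, self).__init__()


implicit = ImplicitStyle()
explicit = ExplicitStyle()
```

Both instances end up with `initialized` set by `Base.__init__`.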

Steffy-zxf · 2022-02-21T08:04:37Z

examples/language_model/megatronbert/args.py

+        help="Path to pre-trained model or shortcut name of model.")
+    parser.add_argument(
+        "--output_dir",
+        default="/root/paddlejob/workspace/output",


default=None,
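
A small sketch of what the suggested `default=None` could look like in practice. The help text, the `resolve_output_dir` helper, and its `fallback` value are assumptions for illustration; the PR does not specify them. The point is that the parser no longer hard-codes a machine-specific path:

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--output_dir",
        default=None,  # no machine-specific path baked into the script
        type=str,
        help="Directory where checkpoints and predictions are written.")
    return parser


def resolve_output_dir(args, fallback="output"):
    # `fallback` is a hypothetical choice made only for this sketch.
    return args.output_dir if args.output_dir is not None else fallback


args_default = build_parser().parse_args([])
args_custom = build_parser().parse_args(["--output_dir", "ckpt/"])
```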

ZHUI · 2022-02-24T07:23:17Z

examples/language_model/megatronbert/args.py

@@ -0,0 +1,135 @@
+import argparse


ZHUI · 2022-02-24T07:27:33Z

examples/language_model/megatronbert/run_squad.py

+import paddle
+
+from paddle.io import DataLoader
+from args import parse_args


Please order the imports according to PEP 8: https://www.python.org/dev/peps/pep-0008/#imports

Imports should be grouped in the following order: Standard library imports. Related third party imports. Local application/library specific imports. You should put a blank line between each group of imports.

ZHUI · 2022-02-24T07:30:12Z

paddlenlp/transformers/megatronbert/__init__.py

@@ -0,0 +1,2 @@
+from .modeling import *


This file can be left empty.

ZHUI · 2022-02-24T07:31:02Z

paddlenlp/transformers/megatronbert/modeling.py

+import paddle
+from paddle import nn
+from .. import PretrainedModel, register_base_model
+import paddle.nn.functional as F


Import order.

ZHUI · 2022-02-24T07:32:52Z

paddlenlp/transformers/megatronbert/modeling.py

+    return x * F.sigmoid(x)
+
+
+def gelu_new(x):


https://www.paddlepaddle.org.cn/documentation/docs/zh/api/paddle/nn/functional/gelu_cn.html#gelu

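gelu with approximate=True?

It is the approximate computation; I followed the way FNet and BigBird write it.

It can be replaced with the gelu API linked above, which should perform somewhat better.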
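
To illustrate what "approximate" means here, the following is a plain-Python sketch that assumes `gelu_new` follows the usual tanh approximation of GELU (the full function body is not shown in this thread). It compares that form against the exact erf-based definition; it is not the Paddle implementation, which the reviewer suggests calling via `gelu` with `approximate=True`:

```python
import math


def gelu_exact(x):
    # Exact GELU: x * Phi(x), where Phi is the standard normal CDF.
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x):
    # Tanh approximation, the form commonly named `gelu_new`.
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


# Largest absolute difference on a grid over [-5, 5] with step 0.1.
max_gap = max(
    abs(gelu_exact(v / 10.0) - gelu_tanh(v / 10.0)) for v in range(-50, 51))
```

The two curves stay close over this range, which is why a fused built-in approximate kernel can stand in for a hand-written `gelu_new`.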

ZHUI

LGTM

ZHUI · 2022-03-10T07:39:41Z

examples/language_model/megatronbert/README.md

+    --learning_rate=1e-5 \
+    --output_dir=output/
+    --device=gpu
+    --num_train_epochs=2


The shell `\` line continuations are broken here.

ZHUI · 2022-03-10T07:40:36Z

examples/language_model/megatronbert/README.md

+    --learning_rate=1e-5 \
+    --output_dir=output/
+    --device=gpu
+    --num_train_epochs=2


ZHUI · 2022-03-10T07:41:13Z

examples/language_model/megatronbert/README.md

+```shell
+python -m paddle.distributed.launch run_glue.py \
+    --task_name=mnli \
+    --output_dir=output/


ZHUI · 2022-03-10T07:41:27Z

examples/language_model/megatronbert/args.py

@@ -0,0 +1,150 @@
+# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.


ZHUI · 2022-03-10T07:42:42Z

examples/language_model/megatronbert/args.py

+        "--seed", type=int, default=42, help="random seed for initialization")
+    parser.add_argument(
+        '--device',
+        choices=['cpu', 'gpu'],


There is no xpu option here, so please remove the xpu description from the README above.
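
A short sketch of how `choices` enforces this, so the README and the parser stay in sync. The `default="gpu"` value and the help text are assumptions for illustration, since the diff hunk above does not show them:

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--device",
    default="gpu",  # assumed default, not shown in the hunk above
    type=str,
    choices=["cpu", "gpu"],  # only the devices that were verified
    help="Device to run on.")

accepted = parser.parse_args(["--device", "cpu"]).device

try:
    parser.parse_args(["--device", "xpu"])
    rejected = False
except SystemExit:
    # argparse prints a usage error and exits on an invalid choice.
    rejected = True
```

Any device documented in the README but missing from `choices` would fail at argument parsing, before any training starts.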

ZHUI · 2022-03-10T07:44:03Z

examples/language_model/megatronbert/run_glue.py

+        "--device",
+        default="gpu",
+        type=str,
+        choices=["cpu", "gpu", "xpu", "npu"],


Please remove "xpu" and "npu"; they have not been verified.

ZHUI · 2022-03-10T07:46:15Z

examples/language_model/megatronbert/run_squad.py

@@ -0,0 +1,353 @@
+# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.


This should be 2022; please check it across all files.

ZHUI · 2022-03-10T07:48:52Z

paddlenlp/transformers/megatronbert/modeling.py

+        return input_tensor + hidden_states
+
+
+# Based on transformers.models.bert.modeling_bert.BertLayer. Added LayerNorm.


Remove this comment?

ZHUI · 2022-03-10T07:50:13Z

paddlenlp/transformers/megatronbert/modeling.py

+
+        Args:
+            vocab_size (int):
+                Vocabulary size of `inputs_ids` in `ConvBertModel`. Also is the vocab size of token embedding matrix.


This says ConvBertModel; please search the whole file for "conv" and fix every occurrence.

ZHUI · 2022-03-10T07:52:56Z

paddlenlp/transformers/megatronbert/tokenizer.py

@@ -0,0 +1,102 @@
+# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.


ZHUI

LGTM

yingyibiao · 2022-03-11T08:48:38Z

paddlenlp/transformers/megatronbert/modeling.py

+from paddle import nn
+import paddle.nn.functional as F
+
+from ...ops import einsum


Use paddle.einsum.

Beacontownfc added 2 commits February 15, 2022 11:31

add megatronbert

5e3a0cb

fix link

38d1410

ZHUI self-requested a review February 16, 2022 02:28

Beacontownfc and others added 5 commits February 17, 2022 15:25

Mormalized code

d41c5b2

add doc string

061afce

update modeling

b743422

recover

52b1658

recover

92bef9b

ZeyuChen added the contributions label Feb 20, 2022

Steffy-zxf reviewed Feb 21, 2022

Beacontownfc and others added 3 commits February 21, 2022 17:25

Modify according to Steffy

e7b9c3c

delete prefix

e417f7e

Update modeling.py

e9cbbc2

ZHUI reviewed Feb 24, 2022

Beacontownfc and others added 5 commits February 24, 2022 16:17

Modifiy according to Zhui

71beda9

Modify gelu_new

a70b27d

Update tokenizer.py

f2e39e7

Update modeling.py

0cbd767

update modeling

4b8ab0b

ZHUI previously approved these changes Mar 10, 2022

update megatronbert

48184ff

Beacontownfc dismissed ZHUI’s stale review via 48184ff March 11, 2022 00:58

Beacontownfc and others added 2 commits March 11, 2022 09:03

Merge branch 'develop' into megatronbert

14026f7

recover

e01d195

ZHUI previously approved these changes Mar 11, 2022

Beacontownfc and others added 2 commits March 11, 2022 14:09

Merge branch 'develop' into megatronbert

bd5086d

Merge branch 'develop' into megatronbert

6f223d5

yingyibiao reviewed Mar 11, 2022

ZeyuChen assigned yingyibiao Mar 11, 2022

Update modeling.py

89a79f3

Beacontownfc dismissed ZHUI’s stale review via 89a79f3 March 11, 2022 14:27

Beacontownfc added 2 commits March 11, 2022 22:27

Merge branch 'develop' into megatronbert

bbd382e

Update README.md

dd3d35f

yingyibiao approved these changes Mar 13, 2022

yingyibiao merged commit 48dee51 into PaddlePaddle:develop Mar 13, 2022

yingyibiao mentioned this pull request Mar 17, 2022

PaddleNLP 2.2.5 Release Note Candidate #1772

Closed

guoshengCS mentioned this pull request Apr 29, 2022

PaddleNLP v2.3rc Release Note Candidate #2031

Closed
