Renamed `bsz` to `bs` for consistency; removed dead code #299
Conversation
ghstack-source-id: 0b273e8f81013c1c632f0c505b7229d51af3e488 Pull Request resolved: #299
@@ -132,7 +132,6 @@ class Attention(nn.Module):
     Attributes:
         n_kv_heads (int): Number of key and value heads.
         n_heads (int): Number of query heads.
-        n_local_kv_heads (int): Number of local key and value heads.
Not an attribute (there is only one occurrence of `n_local_kv_heads` if you search in this file).
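For context, a minimal sketch of the distinction the comment is drawing; the class skeleton and numbers here are illustrative, not the actual torchtitan code:

```python
import torch.nn as nn


class Attention(nn.Module):
    """Illustrative skeleton, not the actual torchtitan class.

    Attributes:
        n_kv_heads (int): Number of key and value heads.
        n_heads (int): Number of query heads.
    """

    def __init__(self, n_heads: int = 8, n_kv_heads: int = 4):
        super().__init__()
        # Only values stored on self belong in the Attributes section.
        self.n_heads = n_heads
        self.n_kv_heads = n_kv_heads

    def local_kv_heads(self, world_size: int = 1) -> int:
        # Bound only inside a method: a local variable, not an attribute,
        # so it should not be listed in the class docstring.
        n_local_kv_heads = self.n_kv_heads // world_size
        return n_local_kv_heads
```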
@@ -183,12 +182,12 @@ def forward(
         torch.Tensor: Output tensor after attention.

     """
     bsz, seqlen, _ = x.shape
All inline comments in this method use `bs` for batch size, so we can make this `bs` for consistency.
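A minimal sketch of the suggested rename; the standalone function here is hypothetical, not the actual `forward` method:

```python
import torch


def attention_shapes(x: torch.Tensor) -> tuple[int, int]:
    # bs = batch size, matching the naming used by the inline comments
    # in the rest of the method; `_` discards the model dimension.
    bs, seqlen, _ = x.shape
    return bs, seqlen


print(attention_shapes(torch.zeros(2, 16, 64)))  # (2, 16)
```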
@@ -421,7 +420,7 @@ def forward(self, tokens: torch.Tensor):
         torch.Tensor: Output logits after applying the Transformer model.

     """
     _bsz, seqlen = tokens.shape
Similarly, `_bsz` is unused, so just remove it.
If it helps readability to know that `tokens.shape` is (batch size, sequence length), I can keep it and maybe rename it to `_bs`?
Although not used, it improves code readability: it tells how many dimensions `tokens` has and what they are, so IMO I'd keep it. Also, the unusedness is already indicated by the `_` prefix.
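A minimal sketch of the convention under discussion; the tensor here is illustrative:

```python
import torch

tokens = torch.zeros(2, 16, dtype=torch.long)

# The leading underscore tells readers (and most linters) that the name is
# intentionally unused, while the unpacking itself documents that `tokens`
# is 2-D: (batch size, sequence length).
_bs, seqlen = tokens.shape
print(seqlen)  # 16
```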
> If it helps readability to know that `tokens.shape` is (batch size, sequence length), I can keep it and maybe rename it to `_bs`?
Just saw this message; yeah, I agree.
Changed it to `_bs`.
One comment inline.
some minor cleanups
ghstack-source-id: bbedad3819ab9ef90b233209c34dd1dbc846b06a Pull Request resolved: #299
Stack from ghstack (oldest at bottom):
- Renamed `bsz` to `bs` for consistency; removed dead code #299