
[Proposal] Add frequency-based RoPE support for Llama 3.1 models #719

Closed
1 task done
frances720 opened this issue Sep 9, 2024 · 3 comments
Proposal

Add support for frequency-based RoPE (Rotary Position Embedding) smoothing in the TransformerLens library to match Llama 3.1’s architecture.

Motivation

Llama 3.1 uses frequency-based smoothing in its positional embeddings to handle long-range dependencies more effectively. However, the current version of TransformerLens does not support this feature, limiting the ability to properly analyze Llama 3.1 models.

Pitch

Implement frequency-based RoPE smoothing to enhance positional encoding in Llama 3.1 models. This would improve TransformerLens’s compatibility with Llama 3.1 and provide a better tool for analyzing long-sequence tasks.
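For context, below is a minimal sketch of the frequency smoothing Llama 3.1 applies to its RoPE inverse frequencies. The constants (`factor=8`, `low_freq_factor=1`, `high_freq_factor=4`, original context length 8192) come from Meta's published `rope_scaling` config for Llama 3.1; the function name and structure are illustrative and not part of the TransformerLens API.

```python
import math

def llama31_scale_rope_freqs(
    inv_freqs,
    factor=8.0,
    low_freq_factor=1.0,
    high_freq_factor=4.0,
    original_context_len=8192,
):
    """Apply Llama 3.1-style frequency smoothing to RoPE inverse frequencies.

    High-frequency (short-wavelength) components are kept as-is,
    low-frequency (long-wavelength) components are scaled down by `factor`
    to stretch the effective context window, and components in between are
    linearly interpolated between the two regimes.
    """
    low_freq_wavelen = original_context_len / low_freq_factor
    high_freq_wavelen = original_context_len / high_freq_factor
    scaled = []
    for freq in inv_freqs:
        wavelen = 2 * math.pi / freq
        if wavelen < high_freq_wavelen:
            # Short wavelengths: leave the frequency untouched.
            scaled.append(freq)
        elif wavelen > low_freq_wavelen:
            # Long wavelengths: scale the frequency down by `factor`.
            scaled.append(freq / factor)
        else:
            # Transition band: smooth interpolation between the two regimes.
            smooth = (original_context_len / wavelen - low_freq_factor) / (
                high_freq_factor - low_freq_factor
            )
            scaled.append((1 - smooth) * freq / factor + smooth * freq)
    return scaled
```

Standard RoPE would use the raw inverse frequencies directly, which is why the current TransformerLens implementation diverges from Llama 3.1's positional encoding.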

Alternatives

Continue using TransformerLens with standard RoPE, but this would not fully support Llama 3.1’s unique architecture.

Checklist

  • I have checked that there is no similar issue in the repo (required)
@frances720 (Author)

I have a PR for it, but when I ran `git push --set-upstream origin frances/llama31_rope`, it returned a 403 error.

@bryce13950 (Collaborator)

@frances720 Sorry for the late reply! It looks like you may be trying to push your branch directly to the TransformerLens repo; you need to open your PR from your own fork. If you need help with this, you can reach me on the Slack channel. Let me know if you need an invite!

@bryce13950 (Collaborator)

This has been resolved in a recent release.
