This repository contains a PyTorch implementation of Low-Rank Adaptation (LoRA), applied to the task of classifying MNIST digits. The implementation demonstrates how LoRA can be integrated into a neural network and used to fine-tune it on a specific task, enabling efficient training with reduced memory usage.
LoRA introduces two small matrices, A and B, whose product approximates the weight update matrix ΔW. The inner dimension r of these matrices is a hyperparameter that controls the rank, and therefore the expressiveness, of the approximation. This technique modifies the standard training process by updating only these smaller matrices rather than the entire weight matrix, which can significantly reduce memory usage and computational cost.
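For concreteness, here is a minimal sketch of how such a LoRA-augmented linear layer can be written in PyTorch. The class and argument names (`LoRALinear`, `rank`, `alpha`) are illustrative assumptions and may not match the layers defined in this repository:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Linear layer with a frozen base weight and a trainable low-rank update B @ A."""
    def __init__(self, in_features, out_features, rank=4, alpha=1.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)  # freeze the pretrained weight
        self.base.bias.requires_grad_(False)
        # A: (rank, in_features), B: (out_features, rank); B @ A approximates ΔW
        self.lora_A = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, rank))  # ΔW starts at zero
        self.scaling = alpha / rank

    def forward(self, x):
        # Frozen base projection plus the scaled low-rank correction
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling
```

Because B is initialized to zero, the layer behaves exactly like the frozen base layer at the start of fine-tuning, and only the A and B matrices receive gradient updates.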
First, clone the repository to your local machine:
git clone https://github.com/lightmatmul/LoRA-from-scratch.git
cd LoRA-from-scratch
Then, install the required dependencies:
pip install -r requirements.txt
To run the training and testing scripts, use the following command:
python main.py
This command will:
- Train a neural network on the MNIST dataset (simulating LLM pretraining).
- Fine-tune the network on a poorly performing digit with and without LoRA.
- Test the two fine-tuned models to compare their performance and demonstrate LoRA's efficiency (see the sketch below).
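As a rough illustration of where that efficiency comes from, the sketch below freezes a small fully connected classifier and counts the trainable parameters with and without a rank-4 LoRA adapter. The layer sizes and rank are placeholder assumptions, not the values used in `main.py`:

```python
import torch
import torch.nn as nn

# Illustrative 784-256-10 MLP standing in for the pretrained MNIST classifier.
base = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))

# Full fine-tune: every parameter stays trainable.
full_params = sum(p.numel() for p in base.parameters() if p.requires_grad)

# LoRA fine-tune: freeze the base weights and add rank-4 adapters to the first layer.
for p in base.parameters():
    p.requires_grad_(False)
rank = 4
lora_A = nn.Parameter(torch.randn(rank, 784) * 0.01)
lora_B = nn.Parameter(torch.zeros(256, rank))
lora_params = lora_A.numel() + lora_B.numel()

print(f"full fine-tune:  {full_params} trainable parameters")
print(f"LoRA fine-tune:  {lora_params} trainable parameters")
```

Only the small A and B matrices are passed to the optimizer in the LoRA setting, which is why its memory footprint during fine-tuning is a small fraction of the full fine-tune's.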