A minimal implementation for instruction tuning and inference with LLaMA2 on a single NVIDIA A100 GPU. Applied techniques include Low-rank Adaptation (LoRA), Automatic Mixed-Precision Training, Gradient Scaling, Gradient Accumulation, and Gradient Checkpointing.
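Of the techniques above, LoRA is what makes single-GPU tuning feasible: the pretrained weight is frozen and only a low-rank update is trained. A minimal sketch (the class and hyperparameters here are illustrative, not the repo's actual code, which may use a library such as `peft`):

```python
# Hedged sketch of Low-rank Adaptation: the frozen base weight W gets a
# trainable low-rank update B @ A, so only r * (in + out) parameters train.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)           # freeze the pretrained layer
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        # B is zero-initialized, so training starts from the pretrained behavior.
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale

layer = LoRALinear(nn.Linear(4096, 4096), r=8)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
# With r=8 and a 4096x4096 layer: 2 * 8 * 4096 = 65,536 trainable parameters
# versus ~16.8M frozen ones.
```

Wrapping only the attention projections this way is what shrinks the trainable-parameter count from billions to the ~2M shown in the results table below.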
To download the LLaMA2 weights and tokenizer, visit the Meta website and accept the license.

## Instructions
Tested on:
- gcc/11.3.0
- cuda/11.8.0
- python/3.9.12
- pytorch/2.1.0
### Inference

Change `model_path`, `tokenizer_path`, and `lora_weights_path` in `inference.py`, then run:

```
python inference.py
```

### Finetuning

```
python finetune.py
```
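The mixed-precision, gradient-scaling, and gradient-accumulation pieces of the finetuning loop can be sketched as follows (a minimal stand-in model and illustrative names, not the repo's actual training code):

```python
# Hedged sketch: an AMP training step with gradient scaling and accumulation.
import torch
import torch.nn as nn

device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(16, 4).to(device)      # stand-in for the LoRA-wrapped LLaMA2
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
scaler = torch.cuda.amp.GradScaler(enabled=(device == "cuda"))
accum_steps = 4                          # optimizer steps every 4 micro-batches

data = [(torch.randn(8, 16), torch.randn(8, 4)) for _ in range(8)]
for step, (x, y) in enumerate(data):
    x, y = x.to(device), y.to(device)
    # Autocast runs the forward pass in reduced precision where it is safe.
    with torch.autocast(device_type=device,
                        dtype=torch.float16 if device == "cuda" else torch.bfloat16):
        loss = nn.functional.mse_loss(model(x), y) / accum_steps
    # GradScaler scales the loss so fp16 gradients do not underflow.
    scaler.scale(loss).backward()
    if (step + 1) % accum_steps == 0:
        scaler.step(optimizer)
        scaler.update()
        optimizer.zero_grad(set_to_none=True)
```

Dividing the loss by `accum_steps` keeps the effective gradient equal to that of one large batch, which is why accumulation adds almost no memory in the table below.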
Results with `n_layers = 8` (number of transformer blocks; default is 32) and `epochs = 5`:
| Configuration | Trainable Parameters | GPU Memory Usage (MiB) | Training Time (seconds) |
|---|---|---|---|
| Original | 1,881,214,976 | 38,401 | / |
| + Low-rank Adaptation | 2,097,152 | 10,377 | 70.31 |
| + Auto-mix-precision Training & Gradient Scaling | 2,097,152 | 13,079 | 25.96 |
| + Gradient Accumulation | 2,097,152 | 13,089 | 25.12 |
| + Gradient Checkpointing | 2,097,152 | 9,409 | 45.98 |