Skip to content

Commit

Permalink
update GPU benchmark
Browse files Browse the repository at this point in the history
  • Loading branch information
jstzwj committed Mar 2, 2024
1 parent c6caefe commit 6440063
Showing 1 changed file with 8 additions and 4 deletions.
12 changes: 8 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@ Easy Language Model Trainer

## Let's benchmark GPUs LLM training with katheryne

### Training settings for 7B Llama (100 steps)
```
Stage: Pretrain
Model: meta-llama/Llama-2-7b-hf
Expand All @@ -17,7 +18,10 @@ dtype: bf16
lora: {"r": 16, "target_modules": ["q_proj", "v_proj"]}
```

| GPU | Time | Memory |
|-------------|-------|--------------|
| RTX 3090 | | 14,928MiB |
| Tesla A800 | | |
| GPU | Time | Memory |
|------------------------------|---------|--------------|
| NVIDIA A800 80GB PCIe | 00:41 | 14,850MiB |
| NVIDIA GeForce RTX 4090 | 01:02 | 15,078MiB |
| Iluvatar BI-V150 | 01:09 | 22,798MiB |
| NVIDIA GeForce RTX 3090 | 01:36 | 14,928MiB |

0 comments on commit 6440063

Please sign in to comment.