Skip to content

Commit

Permalink
update data
Browse files Browse the repository at this point in the history
  • Loading branch information
jstzwj committed Mar 2, 2024
1 parent 6440063 commit 98fe3ce
Showing 1 changed file with 11 additions and 8 deletions.
19 changes: 11 additions & 8 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,24 +4,27 @@ Easy Language Model Trainer

## Let's benchmark GPUs LLM training with katheryne

### Training settings for 7B Llama (100 steps)
### Training settings - 7B Llama (100 steps)
```
Stage: Pretrain
Model: meta-llama/Llama-2-7b-hf
Dataset: bigscience-data/roots_zh-cn_wikipedia
per_device_train_batch_size: 2
accumulate_grad_batches: 64
accumulate_grad_batches: 1
max_seq_len: 512
max_steps: 100
gradient_checkpointing: true
dtype: bf16
lora: {"r": 16, "target_modules": ["q_proj", "v_proj"]}
```

| GPU | Time | Memory |
|------------------------------|---------|--------------|
| NVIDIA A800 80GB PCIe | 00:41 | 14,850MiB |
| NVIDIA GeForce RTX 4090 | 01:02 | 15,078MiB |
| Iluvatar BI-V150 | 01:09 | 22,798MiB |
| NVIDIA GeForce RTX 3090 | 01:36 | 14,928MiB |
| GPU | Time | GPU Memory | Memory Usage |
|------------------------------|---------|----------------|----------------|
| NVIDIA A800 80GB PCIe | 00:41 | 80G | 14,850MiB |
| NVIDIA GeForce RTX 4090 | 01:02 | 24G | 15,078MiB |
| Iluvatar BI-V150 | 01:09 | 32G | 22,798MiB |
| NVIDIA RTX A6000 | 01:13 | 48G | 14,944MiB |
| NVIDIA A40 | 01:16 | 48G | 15,809MiB |
| NVIDIA GeForce RTX 3090 | 01:36 | 24G | 14,928MiB |


0 comments on commit 98fe3ce

Please sign in to comment.