www.harmdevries.com[B!]新着記事・評価 - はてなブックマーク

『www.harmdevries.com』

Go smol or go home | Harm de Vries
3 users
www.harmdevries.com

If you have access to a big compute cluster and are planning to train a Large Language Model (LLM), you will need to make a decision on how to allocate your compute budget. This involves selecting the number of model parameters $N$ and the number of training tokens $D$. By applying the scaling laws, you can get guidance on how to reach the best model performance for your given compute budget, and
- 学び
- 2023/05/07 15:46

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx