Replies: 1 comment
-
We are indeed interested in creating smaller MoE models, but it is unlikely that we will train from scratch, as this is extremely computationally expensive. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Does will train from draft for a little moe llama ?
Beta Was this translation helpful? Give feedback.
All reactions