Introduction
Google TPU Trillium (v6e) appears to have reached GA.
Is v6e for training?
The blog post above included the following graph.
Figure 3. Source data: MLPerf™ 4.1 Training Closed results for Trillium (Preview) and v5p on GPT3-175b training task.
The training comparison against v5p is v5p-4096 vs. 4 x Trillium-256.
The v5p specs are:
The Trillium (v6e) specs are:
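For BF16, v5p x 2 == v6e at the chip level. But v5p reaches 459 TFLOPs with 2 cores, while v6e reaches 918 TFLOPs with 1 core, so per core it is 4x. The MXU is said to have grown from 128x128 to 256x256, which is also 4x, so I wonder whether v6e runs at the same clock frequency as v5p.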
As for HBM bandwidth, v5p has 2765 GB/s, or 1382.5 GB/s per core. v6e has 1536 GB/s with a single core, so it is also 1536 GB/s per core. So even though BF16 throughput per core went up 4x (2x per chip), HBM bandwidth per core grew by only about 10%.
It looks much the same for v5e => v6e.
v5e has 197 BF16 TFLOPS and 819 GB/s of HBM bandwidth.
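
To double-check the per-core arithmetic above, here is a rough sketch in Python. The chip-level numbers are the ones quoted in this post. The `Chip` class and `growth` function are names made up for this illustration, and treating v5e as a single-core chip is an assumption based on how its numbers are used here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chip:
    """Hypothetical container for the per-chip figures quoted in this post."""
    name: str
    bf16_tflops: float  # peak BF16 TFLOPS per chip
    hbm_gbps: float     # HBM bandwidth per chip, in GB/s
    cores: int          # cores per chip (1 for v5e is an assumption)

    @property
    def tflops_per_core(self) -> float:
        return self.bf16_tflops / self.cores

    @property
    def hbm_gbps_per_core(self) -> float:
        return self.hbm_gbps / self.cores


V5E = Chip("v5e", 197.0, 819.0, 1)
V5P = Chip("v5p", 459.0, 2765.0, 2)
V6E = Chip("v6e", 918.0, 1536.0, 1)


def growth(old: Chip, new: Chip) -> tuple[float, float]:
    """Return (compute growth, HBM bandwidth growth) per core, new over old."""
    return (
        new.tflops_per_core / old.tflops_per_core,
        new.hbm_gbps_per_core / old.hbm_gbps_per_core,
    )


if __name__ == "__main__":
    for old in (V5P, V5E):
        compute_x, bandwidth_x = growth(old, V6E)
        print(f"{old.name} -> {V6E.name}: "
              f"compute x{compute_x:.2f}, HBM x{bandwidth_x:.2f}")
```

Under these assumptions, per-core compute grows by roughly 4x or more in both comparisons, while per-core HBM bandwidth grows by well under 2x.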
Conclusion
Comparing v5p => v6e, and not just v5e => v6e, changes the picture a little, I think.
Per core (BF16), with HBM GB/s per TFLOPS in parentheses
- v5e : 197 TFLOPS / HBM 819 GB/s (4.157)
- v5p : 229.5 TFLOPS / HBM 1382.5 GB/s (6.024)
- v6e : 918 TFLOPS / HBM 1536 GB/s (1.673)
Hmm, maybe the next one will be v7e rather than v6p.
Related blog posts