Tutorials

Large Language Models

JetStream MaxText inference on v6e

A guide to setting up and using JetStream with MaxText for inference on v6e.

JetStream PyTorch inference on v6e

A guide to setting up and using JetStream with PyTorch for inference on v6e.

vLLM inference on v6e

A guide to setting up and using vLLM for inference on v6e.

Serve an LLM using TPUs on GKE with vLLM

A guide to serving large language models (LLMs) with vLLM on Tensor Processing Units (TPUs) in Google Kubernetes Engine (GKE).

Diffusion Models

MaxDiffusion inference on v6e

A guide to setting up and using MaxDiffusion for inference on v6e.

Image Classification

Training ResNet on Cloud TPU (PyTorch)

A ResNet image classification model using PyTorch, optimized to run on Cloud TPU.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.