Menu Contact Sales Sign in to Oracle Cloud

GPU Instances

Oracle Cloud Infrastructure (OCI) Compute provides industry-leading scalability and cost-performance for bare metal and virtual machine (VM) instances powered by NVIDIA GPUs for mainstream graphics, AI inference, AI training, digital twins, and HPC.

Talk with a GPU expert

Announcing the world’s first zettascale AI supercomputer

Inworld Innovates Video Game Experience with OCI AI Infrastructure (2:50)

Learn more about the newest accelerators on OCI

OCI Supercluster with NVIDIA H200 Tensor Core and AMD MI300X can support tens of thousands of GPUs, with added benefits such as hardware acceleration, bare metal instances with no hypervisor overhead, and much more.

NVIDIA H200 on OCI

AMD MI300X on OCI

Why use OCI for GPU instances?

Scalability

131,072

Maximum number of GPUs in an OCI Supercluster¹

Performance

3,200

Up to 3,200 Gb/sec of RDMA cluster network bandwidth²

Value

220%

GPUs for other CSPs can be up to 220% more expensive³

Choice

VM/BM

Rightsizing with VM and performance with bare metal instances

1. OCI Supercluster scales up to 131,072 NVIDIA B200 GPUs (planned); more than 100,000 NVIDIA B200 GPUs in NVIDIA GB200 Superchips (planned); 65,536 H200 GPUs; 32,768 NVIDIA A100 GPUs; 16,384 NVIDIA H100 GPUs; and 16,384 AMD MI300X GPUs.

2. For bare metal instances with NVIDIA H100 GPUs and AMD MI300X GPUs.

3. Based on on-demand pricing as of June 5, 2024.

GPU instances—key features

OCI is the only major cloud provider to offer bare metal instances with NVIDIA and AMD GPUs for high performance that’s free of virtualization overhead. For checkpointing during AI training, our instances provide the most local storage per node (61.4 TB with H100 GPUs). For a balance of performance and price, OCI VMs with NVIDIA GPUs are consistently cheaper than AWS and Azure.

High performance NVIDIA and AMD GPUs

NVIDIA Tensor Core GPUs

OCI offers the highest value and performance for bare metal and virtual machine compute instances powered by NVIDIA H100 Tensor Core GPUs, L40S GPUs, A100 Tensor Core GPUs, A10 Tensor Core GPU, and older-generation NVIDIA GPUs. OCI plans to offer instances with NVIDIA H200 and Blackwell GPUs.

NVIDIA superchips

OCI offers the NVIDIA GH200 Grace Hopper Superchip and plans to offer the GB200 Grace Blackwell Superchip for LLM inference.

AMD Instinct GPUs

OCI offers AMD Instinct MI300X GPUs with 192 GB of memory at a competitive price.

High performance cluster networking

Oracle’s ultralow-latency cluster networking, based on remote direct memory access (RDMA), provides microsecond-level latency.

OCI delivers stellar generative AI performance in MLPerf Inference v4.0 benchmarks

Bandwidth against cluster nodes; 1 node = 8 NVIDIA A100 GPUs

Deploy on VMs, bare metal instances, and Kubernetes clusters

VM instances

For VMs, choose from NVIDIA’s Hopper, Ampere, and older GPU architectures with one to four cores, 16 to 64 GB of GPU memory per VM, and up to 48 Gb/sec of network bandwidth.

Bare metal instances

Use OCI Supercluster with bare metal instances that include AMD Instinct GPUs, NVIDIA Blackwell GPUs or Superchips, NVIDIA Hopper GPUs or Superchips, and NVIDIA Ampere GPUs.

Kubernetes orchestration

Take advantage of managed Kubernetes, service mesh, and container registry to orchestrate AI and machine learning (ML) training and inference with containers.

Graphics rendering with NVIDIA A10 GPU shapes on OCI

Choose from a variety of VM and bare metal compute instances

Comparing the performance of NVIDIA V100 and A10 GPUs

Superior GPU and infrastructure pricing

Lower GPU pricing around the world

Competing GPU instances from AWS and Azure can be consistently more expensive.

Block storage price advantage

AWS, Azure, and Google Cloud Platform can be up to 6X more expensive.

Better Kubernetes pricing

AWS, Azure, and Google Cloud Platform can be up to 2X more expensive.

Industry-leading networking prices

Public bandwidth transferred out on OCI can be up to an order of magnitude cheaper than AWS, Azure, and Google Cloud Platform.

Comparing prices of cloud vendors across regions

Access readily available software

Access software and disk images

Oracle Cloud Marketplace provides software and disk images for data science, analytics, artificial intelligence (AI), and machine learning (ML) models so customers can quickly gain insight from their data.

NVIDIA AI Enterprise

Get access to NVIDIA AI Enterprise, an end-to-end software platform for data science and production AI, including generative AI, computer vision, and speech AI.

NVIDIA DGX Cloud

NVIDIA DGX Cloud on OCI is an AI-training-as-a-service platform, offering a serverless experience for developers that’s optimized for generative AI.

NVIDIA GPU Cloud Machine Image

Use NVIDIA GPU Cloud Machine Image for hundreds of GPU-optimized applications for machine learning, deep learning, and high performance computing covering a wide range of industries and workloads.

NVIDIA RTX Virtual Workstation

Deliver powerful workstation performance wherever employees need it by running NVIDIA RTX Virtual Workstation on Oracle Cloud.

Control your AI computing environment and data

Distributed cloud

When combined with GPU compute, OCI’s distributed cloud helps organizations run AI and cloud services where and how they’re needed.

Sovereign cloud

Support data residency within a region or country, including the EU, the US, the UK, and Australia.

Learn how etisalat by e& intends to deploy NVIDIA H100 GPU clusters within its OCI Dedicated Region

OCI Dedicated Region

Deploy a complete cloud region in your data center with OCI Dedicated Region to retain full control of your data and applications.

Oracle Alloy

Become a partner for Oracle Alloy and deliver your cloud services to address specific market needs.

Microservices and containers

Container registry

Developers building applications using containers leverage a highly available, Oracle-managed private container registry service for storing and sharing container images. Push or pull Docker images to and from the registry using the Docker V2 API and the standard Docker command line interface (CLI). Images can be pulled directly into a Kubernetes deployment.

Oracle Functions

Functions as a service (FaaS) lets developers run serverless applications that integrate with Oracle Cloud Infrastructure, Oracle Cloud Applications, and third-party services. Gain developer efficiency along with the community of the open source Fn Project.

GPU instances—use cases

AI infrastructure for deep learning training and inferencing

Train AI models using OCI Data Science, bare metal instances, cluster networking based on RDMA, and NVIDIA GPUs.

Learn about GPUs for AI innovators

Virtual desktop infrastructure (VDI)

OCI Compute powered by NVIDIA GPUs provide consistent high performance for VDI.

Explore virtual desktops and HPC

CFD and high performance computing using GPU instances

OCI enables computer-aided engineering and computational fluid dynamics for fast predictions of the aerodynamic properties of objects.

See how Punch Torino deployed HPC on OCI (3:18)

CFD and high performance computing using GPU instances

GPU instances—customers

Explore more customer stories

November 18, 2024

Now Generally Available: The Largest, Fastest AI Supercomputer in the Cloud

Sagar Zanwar, Principal Product Manager, OCI
Akshai Parthasarathy, Product Marketing Director, OCI

We’re excited to announce the general availability of Oracle Cloud Infrastructure (OCI) Supercluster with NVIDIA H200 Tensor Core GPUs. The largest AI supercomputer available in the cloud, our latest Supercluster scales up to an industry-leading 65,536 GPUs.

Read the complete post

Featured blogs

September 26, 2024 Announcing General Availability of OCI Compute with AMD MI300X GPUs: BM.GPU.MI300X.8
September 11, 2024 Announcing the General Availability of OCI Compute with NVIDIA L40S GPUs for AI, Simulation, and Digital Twin Workloads
September 10, 2024 Announcing the General Availability of OCI Compute with NVIDIA L40S GPU for Medium-Scale AI Workloads, Omniverse, and Visualization

Get started with GPU instances

Try Oracle AI and get a 30-day trial

Oracle offers a free pricing tier for most AI services as well as a free trial account with US$300 in credits to try additional cloud services. AI services are a collection of offerings, including generative AI, with prebuilt machine learning models that make it easier for developers to apply AI to applications and business operations.

Try Oracle AI for free

Which Oracle AI and ML services offer a free pricing tier?
- OCI Speech
- OCI Language
- OCI Vision
- OCI Document Understanding
- Machine Learning in Oracle Database
- OCI Data Labeling
You also only have to pay compute and storage charges for OCI Data Science.

Additional resources

Learn more about AI infrastructure, AI services and generative AI, and compute.

Explore AI infrastructure

Documentation
Related pages

See how much you can save with OCI

Oracle Cloud pricing is simple, with consistent low pricing worldwide, supporting a wide range of use cases. To estimate your low rate, check out the cost estimator and configure the services to suit your needs.

Try Cost Estimator

Experience the difference

1/4 the outbound bandwidth costs
3X the compute price-performance
Same low price in every region
Low pricing without long term commitments

Access a GPU and AI expert

Get help building your next GPU solution or deploying your AI workload on OCI AI infrastructure.