GPU Instances

Oracle Cloud Infrastructure (OCI) Compute provides industry-leading scalability and cost-performance for bare metal and virtual machine (VM) instances powered by NVIDIA GPUs for mainstream graphics, AI inference, AI training, digital twins, and HPC.

Inworld Innovates Video Game Experience with OCI AI Infrastructure (2:50)
Learn more about the newest accelerators on OCI

OCI Supercluster with NVIDIA H200 Tensor Core and AMD MI300X can support tens of thousands of GPUs, with added benefits such as hardware acceleration, bare metal instances with no hypervisor overhead, and much more.

Why use OCI for GPU instances?

Scalability

131,072

Maximum number of GPUs in an OCI Supercluster1

Performance

3,200

Up to 3,200 Gb/sec of RDMA cluster network bandwidth2

Value

220%

GPUs for other CSPs can be up to 220% more expensive3

Choice

VM/BM

Rightsizing with VM and performance with bare metal instances

1. OCI Supercluster scales up to 131,072 NVIDIA B200 GPUs (planned); more than 100,000 NVIDIA B200 GPUs in NVIDIA GB200 Superchips (planned); 65,536 H200 GPUs; 32,768 NVIDIA A100 GPUs; 16,384 NVIDIA H100 GPUs; and 16,384 AMD MI300X GPUs.

2. For bare metal instances with NVIDIA H100 GPUs and AMD MI300X GPUs.

3. Based on on-demand pricing as of June 5, 2024.

GPU instances—key features

OCI is the only major cloud provider to offer bare metal instances with NVIDIA and AMD GPUs for high performance that’s free of virtualization overhead. For checkpointing during AI training, our instances provide the most local storage per node (61.4 TB with H100 GPUs). For a balance of performance and price, OCI VMs with NVIDIA GPUs are consistently cheaper than AWS and Azure.

High performance NVIDIA and AMD GPUs

NVIDIA Tensor Core GPUs

OCI offers the highest value and performance for bare metal and virtual machine compute instances powered by NVIDIA H100 Tensor Core GPUs, L40S GPUs, A100 Tensor Core GPUs, A10 Tensor Core GPU, and older-generation NVIDIA GPUs. OCI plans to offer instances with NVIDIA H200 and Blackwell GPUs.

NVIDIA superchips

OCI offers the NVIDIA GH200 Grace Hopper Superchip and plans to offer the GB200 Grace Blackwell Superchip for LLM inference.

AMD Instinct GPUs

OCI offers AMD Instinct MI300X GPUs with 192 GB of memory at a competitive price.

High performance cluster networking

Oracle’s ultralow-latency cluster networking, based on remote direct memory access (RDMA), provides microsecond-level latency.

Deploy on VMs, bare metal instances, and Kubernetes clusters

VM instances

For VMs, choose from NVIDIA’s Hopper, Ampere, and older GPU architectures with one to four cores, 16 to 64 GB of GPU memory per VM, and up to 48 Gb/sec of network bandwidth.

Bare metal instances

Use OCI Supercluster with bare metal instances that include AMD Instinct GPUs, NVIDIA Blackwell GPUs or Superchips, NVIDIA Hopper GPUs or Superchips, and NVIDIA Ampere GPUs.

Kubernetes orchestration

Take advantage of managed Kubernetes, service mesh, and container registry to orchestrate AI and machine learning (ML) training and inference with containers.

Superior GPU and infrastructure pricing

Lower GPU pricing around the world

Competing GPU instances from AWS and Azure can be consistently more expensive.

Block storage price advantage

AWS, Azure, and Google Cloud Platform can be up to 6X more expensive.

Better Kubernetes pricing

AWS, Azure, and Google Cloud Platform can be up to 2X more expensive.

Industry-leading networking prices

Public bandwidth transferred out on OCI can be up to an order of magnitude cheaper than AWS, Azure, and Google Cloud Platform.

Reduce networking and storage costs
Comparing prices of cloud vendors across regions

Access readily available software

Access software and disk images

Oracle Cloud Marketplace provides software and disk images for data science, analytics, artificial intelligence (AI), and machine learning (ML) models so customers can quickly gain insight from their data.

NVIDIA AI Enterprise

Get access to NVIDIA AI Enterprise, an end-to-end software platform for data science and production AI, including generative AI, computer vision, and speech AI.

NVIDIA DGX Cloud

NVIDIA DGX Cloud on OCI is an AI-training-as-a-service platform, offering a serverless experience for developers that’s optimized for generative AI.

NVIDIA GPU Cloud Machine Image

Use NVIDIA GPU Cloud Machine Image for hundreds of GPU-optimized applications for machine learning, deep learning, and high performance computing covering a wide range of industries and workloads.

NVIDIA RTX Virtual Workstation

Deliver powerful workstation performance wherever employees need it by running NVIDIA RTX Virtual Workstation on Oracle Cloud.

Control your AI computing environment and data

Distributed cloud

When combined with GPU compute, OCI’s distributed cloud helps organizations run AI and cloud services where and how they’re needed.

Sovereign cloud

Support data residency within a region or country, including the EU, the US, the UK, and Australia.

OCI Dedicated Region

Deploy a complete cloud region in your data center with OCI Dedicated Region to retain full control of your data and applications.

Oracle Alloy

Become a partner for Oracle Alloy and deliver your cloud services to address specific market needs.

Microservices and containers

Container registry

Developers building applications using containers leverage a highly available, Oracle-managed private container registry service for storing and sharing container images. Push or pull Docker images to and from the registry using the Docker V2 API and the standard Docker command line interface (CLI). Images can be pulled directly into a Kubernetes deployment.

Oracle Functions

Functions as a service (FaaS) lets developers run serverless applications that integrate with Oracle Cloud Infrastructure, Oracle Cloud Applications, and third-party services. Gain developer efficiency along with the community of the open source Fn Project.

GPU instances—use cases

AI infrastructure for deep learning training and inferencing

Train AI models using OCI Data Science, bare metal instances, cluster networking based on RDMA, and NVIDIA GPUs.


AI training and inferencing This diagram describes two stages of deep learning model development: model training and model inferencing. In model training on the left, the untrained neural network is input to a training algorithm enabled by OCI Data Science, bare metal compute, local storage, and cluster networking. The output of the training algorithm is a trained model with a new capability. The model inferencing step is described on the right. Consider a trained model such as DALL-E 2, which can take text inputs and generate images. A text input is fed into the trained model, and an image output from the model is provided.

Virtual desktop infrastructure (VDI)

OCI Compute powered by NVIDIA GPUs provide consistent high performance for VDI.


Virtual desktop infrastructure Virtual desktop infrastructure

CFD and high performance computing using GPU instances

OCI enables computer-aided engineering and computational fluid dynamics for fast predictions of the aerodynamic properties of objects.


CFD and high performance computing using GPU instances CFD and high performance computing using GPU instances
November 18, 2024

Now Generally Available: The Largest, Fastest AI Supercomputer in the Cloud

Sagar Zanwar, Principal Product Manager, OCI
Akshai Parthasarathy, Product Marketing Director, OCI

We’re excited to announce the general availability of Oracle Cloud Infrastructure (OCI) Supercluster with NVIDIA H200 Tensor Core GPUs. The largest AI supercomputer available in the cloud, our latest Supercluster scales up to an industry-leading 65,536 GPUs.

Read the complete post

Get started with GPU instances

Try Oracle AI and get a 30-day trial

Oracle offers a free pricing tier for most AI services as well as a free trial account with US$300 in credits to try additional cloud services. AI services are a collection of offerings, including generative AI, with prebuilt machine learning models that make it easier for developers to apply AI to applications and business operations.

  • Which Oracle AI and ML services offer a free pricing tier?

    • OCI Speech
    • OCI Language
    • OCI Vision
    • OCI Document Understanding
    • Machine Learning in Oracle Database
    • OCI Data Labeling

    You also only have to pay compute and storage charges for OCI Data Science.

See how much you can save with OCI

Oracle Cloud pricing is simple, with consistent low pricing worldwide, supporting a wide range of use cases. To estimate your low rate, check out the cost estimator and configure the services to suit your needs.

Experience the difference

  • 1/4 the outbound bandwidth costs
  • 3X the compute price-performance
  • Same low price in every region
  • Low pricing without long term commitments

Access a GPU and AI expert

Get help building your next GPU solution or deploying your AI workload on OCI AI infrastructure.

  • They can answer questions such as

    • How do I get started with Oracle Cloud?
    • What kinds of AI workloads can I run on OCI?
    • What types of AI services does OCI offer?