NVIDIA DGX Cloud Lepton

Connecting developers to a global network of GPU compute.

Overview

The World’s GPU Compute at Your Fingertips

Meet NVIDIA DGX™ Cloud Lepton, the AI platform that puts global GPU infrastructure at your command. Designed for developers, DGX Cloud Lepton connects you to a network of high-performance GPUs across multiple cloud providers with a unified experience.

Whether you're fine-tuning or deploying at scale, you can now find the right GPUs in the right region—and run your workloads without the operational overhead. Build and launch AI applications where your data lives. Enable sovereign AI and move faster, with less friction.

Connect to the Global GPU Supply

NVIDIA DGX Cloud Lepton bridges the gap between AI demand and global GPU supply. Develop and then deploy across multiple clouds.

Learn More About NVIDIA DGX Cloud

DGX Cloud is a unified AI platform of fully managed services and software for training, deploying, and optimizing AI.

How It Works

What Is NVIDIA DGX Cloud Lepton?

NVIDIA DGX Cloud Lepton is an AI platform that connects developers with global GPU compute. It provides a unified experience for discovering and using GPU resources, along with integrated AI services that streamline the deployment life cycle across multiple clouds.

DGX Cloud Lepton brings together a global network of GPU cloud providers, GPU marketplaces, and cloud providers to streamline discovery, development, and deployment of AI workloads in a single, developer-friendly platform.

Features and Benefits

Go from Prototype to Production Faster with a Virtual Global AI Factory

Start building with instant access to NVIDIA’s accelerated APIs at build.nvidia.com—including serverless endpoints, prebuilt NVIDIA Blueprints, and GPU-backed compute. When it’s time to scale, NVIDIA DGX Cloud Lepton powers seamless customization and deployment across a global network of GPU cloud providers.

Frictionless Deployment Across Any GPU Cloud

Deploy AI applications across multi-cloud and hybrid environments with minimal operational burden, leveraging integrated services for inference, testing, and training workloads.

Access to Global Compute for Agility and Sovereignty

Quickly access GPU resources in specific regions, achieving compliance with data sovereignty regulations and meeting low-latency requirements for sensitive workloads.

Unmatched Flexibility and Productivity

Boost productivity with a unified experience across development, training, and inference, including the ability to discover best-fit GPUs through the DGX Cloud Lepton marketplace.

Predictable Performance

Through cloud partners in the DGX Cloud Lepton marketplace, experience enterprise-grade performance, reliability, and security.

Ecosystem

DGX Cloud Lepton Partners

Access NVIDIA accelerated computing globally in your choice of regions through a vast network of cloud providers.

Next Steps

Ready to Get Started?

Apply now for early access to DGX Cloud Lepton.

Explore NVIDIA DGX Cloud Documentation

Access technical documentation about NVIDIA DGX Cloud.

Legacy Lepton Access

Using Lepton’s legacy solutions? You can access them here.