"What Would I Do With 8 GPUs?" — 10 Years Later… Millions ft Charlie Boyle from Nvidia

Uploaded: 2026-04-06T19:06:19.485675+00:00
Duration: 20 min 33 s
Description: Nvidia's Charlie Boyle reflects on 10 years of DGX evolution, from customers puzzling over 8 GPUs to million-GPU AI factories built around the new Vera Rubin platform.

Key Points

1. DGX-1 launched 10 years ago to skeptical customers. The top question in year one was 'what am I going to do with 8 GPUs?' Since then, the industry has scaled to millions of deployed GPUs across global AI factories.
2. The AI factory concept evolved from the A100 SuperPOD. Nvidia's DGX SuperPOD (32 A100s connected via InfiniBand, with partner storage and a performance guarantee) became the blueprint for today's full AI factory architecture.
3. Vera Rubin introduces a CPU-GPU recipe for agentic workloads. The new pod spec pairs one Vera CPU rack (256 Grace CPUs) with every eight Vera Rubin GPU racks, supplying the CPU capacity for the unit testing and reasoning steps that agentic AI requires.
4. The Groq acquisition adds ultra-low-latency inference to the AI factory stack. The LPX Groq rack complements Vera Rubin for latency-sensitive, monetizable inference: it serves fast responses to end users while Rubin handles large-scale compute.
5. STX is a new storage tier built for agentic data velocity. Agents can generate terabytes of temporary data per task at machine speed. STX provides a sandboxed, RDMA-accessible, rack-level storage tier with partner-built software stacks, and it enforces deny-all security policies through existing enterprise ACLs.
6. Grace Blackwell delivered 50–100x speedups over Hopper for real customer workloads. One customer's job that took 18–36 hours on a Hopper SuperPOD runs in 20 minutes on Grace Blackwell, so the same system can serve proportionally more customers at lower cost.
7. Vera Rubin achieves EDPP1 flat power draw, unlocking near-100% data center utilization. New circuitry at the chip, rack, and PSU level allows software-settable power caps, raising usable provisioned power from ~60% toward 100%. It also lets utilities remotely and automatically dial down non-critical workloads, a capability validated at Nvidia's planned gigawatt-scale DSX facility in Northern Virginia.
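
The figures in points 6 and 7 can be sanity-checked with quick arithmetic. The sketch below is only an illustration built from the numbers quoted above; the function names are hypothetical, and the ~60% and 100% utilization values are treated as exact for the sake of the calculation.

```python
# Back-of-the-envelope check of the quoted numbers.
# All function names are hypothetical helpers, not part of any Nvidia tooling.

def speedup(old_minutes: float, new_minutes: float) -> float:
    """Ratio of the old runtime to the new runtime."""
    return old_minutes / new_minutes

# Point 6: a job that took 18-36 hours on Hopper runs in 20 minutes.
low = speedup(18 * 60, 20)    # 18 hours expressed in minutes
high = speedup(36 * 60, 20)   # 36 hours expressed in minutes
print(f"Speedup range: {low:.0f}x to {high:.0f}x")

def usable_power_gain(before_fraction: float, after_fraction: float) -> float:
    """How much more provisioned power becomes usable after the change."""
    return after_fraction / before_fraction

# Point 7: usable provisioned power rises from ~60% toward 100%.
# Assumption: the endpoints are taken as exactly 0.60 and 1.00.
gain = usable_power_gain(0.60, 1.00)
print(f"Usable power gain: about {gain:.2f}x")
```

The runtime ratio works out to 54x at the low end and 108x at the high end, which is consistent with the rounded 50–100x claim. The power figures imply roughly two-thirds more usable capacity within the same provisioned envelope, if the upper bound is actually reached.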
