NVIDIA announces Vera Rubin POD featuring 1,152 GPUs across 40 racks, delivering 60 exaflops and 10x better inference performance per watt than Blackwell. (ReadNVIDIA announces Vera Rubin POD featuring 1,152 GPUs across 40 racks, delivering 60 exaflops and 10x better inference performance per watt than Blackwell. (Read

NVIDIA Unveils Vera Rubin POD 40-Rack AI Supercomputer for Agentic Workloads

2026/03/17 03:48
Okuma süresi: 3 dk
Bu içerikle ilgili geri bildirim veya endişeleriniz için lütfen crypto.news@mexc.com üzerinden bizimle iletişime geçin.

NVIDIA Unveils Vera Rubin POD 40-Rack AI Supercomputer for Agentic Workloads

Iris Coleman Mar 16, 2026 19:48

NVIDIA announces Vera Rubin POD featuring 1,152 GPUs across 40 racks, delivering 60 exaflops and 10x better inference performance per watt than Blackwell.

NVIDIA Unveils Vera Rubin POD 40-Rack AI Supercomputer for Agentic Workloads

NVIDIA just dropped the specs on its most ambitious AI infrastructure play yet. The Vera Rubin POD packs 1,152 Rubin GPUs across 40 racks, delivering 60 exaflops of compute power and 10 petabytes per second of total scale-up bandwidth. Production units ship in the second half of 2026.

The numbers here are staggering: 1.2 quadrillion transistors, nearly 20,000 NVIDIA dies, all engineered to function as a single coherent supercomputer. NVIDIA claims 4x better training performance and 10x better inference performance per watt compared to its current Blackwell architecture—with token costs dropping to one-tenth of current levels.

Five Purpose-Built Rack Systems

The POD combines five distinct rack-scale systems, each targeting specific bottlenecks in modern AI workloads:

Vera Rubin NVL72 serves as the core compute engine. Each rack integrates 72 Rubin GPUs and 36 Vera CPUs connected through NVLink 6, which pushes 3.6 TB/s bandwidth per GPU—more total bandwidth than the entire global internet, according to NVIDIA. The system targets all four AI scaling laws: pretraining, post-training, test-time scaling, and agentic scaling.

Groq 3 LPX racks tackle the latency problem. With 256 language processing units per rack using SRAM-only architecture, these pair with NVL72 to deliver what NVIDIA claims is 35x more tokens and 10x more revenue opportunity for trillion-parameter models versus Blackwell.

Vera CPU racks provide sandbox environments for agent testing. A single rack sustains over 22,500 concurrent reinforcement learning environments—critical for validating agentic AI outputs before deployment.

BlueField-4 STX racks introduce what NVIDIA calls "AI-native storage" through the CMX context memory platform. By offloading KV cache to dedicated high-bandwidth storage, the system claims 5x higher tokens-per-second and 5x better power efficiency than traditional approaches.

Spectrum-6 SPX networking racks tie everything together with 102.4 Tb/s switches featuring co-packaged optics.

The Token Economics Argument

NVIDIA frames this around a specific market reality: token consumption now exceeds 10 quadrillion annually, and the shift from human-AI to AI-AI interactions will accelerate that growth dramatically. Modern agentic systems generate massive reasoning token volumes while expanding KV cache requirements—exactly the bottleneck this architecture targets.

Third-party SemiAnalysis InferenceMax benchmarks cited by NVIDIA show current Blackwell systems already deliver 50x better performance per watt and 35x lower cost per token compared to H200. Vera Rubin aims to extend that lead.

Thermal and Power Engineering

The third-generation MGX rack architecture introduces Intelligent Power Smoothing with 6x more rack-level energy storage (400 joules per GPU) than previous generations. This reduces peak current demands by up to 25% and eliminates the need for massive battery packs.

All racks operate at 45°C warm-water inlet temperatures, enabling data centers in many climates to use ambient air cooling. NVIDIA claims this frees enough power to add 10% more racks in the same facility power budget.

Looking Ahead

Beyond the initial POD configuration, NVIDIA previewed Vera Rubin Ultra NVL576 scaling to 576 GPUs across eight racks, and the next-generation Kyber architecture targeting NVL1152 with 144 GPUs per rack. The roadmap suggests NVIDIA sees multi-rack NVLink domains as the future of AI infrastructure—not just bigger GPUs, but fundamentally different system architectures.

For enterprises planning AI infrastructure investments, the message is clear: the economics of AI compute are shifting from chip-level to facility-level optimization. Those building out data centers now face a choice between current-generation systems and waiting for Vera Rubin availability in late 2026.

Image source: Shutterstock
  • nvidia
  • ai infrastructure
  • vera rubin
  • data centers
  • enterprise ai
Piyasa Fırsatı
D. Energy Logosu
D. Energy Fiyatı(WATT)
$0.19973
$0.19973$0.19973
-0.29%
USD
D. Energy (WATT) Canlı Fiyat Grafiği
Sorumluluk Reddi: Bu sitede yeniden yayınlanan makaleler, halka açık platformlardan alınmıştır ve yalnızca bilgilendirme amaçlıdır. MEXC'nin görüşlerini yansıtmayabilir. Tüm hakları telif sahiplerine aittir. Herhangi bir içeriğin üçüncü taraf haklarını ihlal ettiğini düşünüyorsanız, kaldırılması için lütfen crypto.news@mexc.com ile iletişime geçin. MEXC, içeriğin doğruluğu, eksiksizliği veya güncelliği konusunda hiçbir garanti vermez ve sağlanan bilgilere dayalı olarak alınan herhangi bir eylemden sorumlu değildir. İçerik, finansal, yasal veya diğer profesyonel tavsiye niteliğinde değildir ve MEXC tarafından bir tavsiye veya onay olarak değerlendirilmemelidir.

Ayrıca Şunları da Beğenebilirsiniz

Why LYNO’s Presale Could Trigger the Next Wave of Crypto FOMO After SOL and PEPE

Why LYNO’s Presale Could Trigger the Next Wave of Crypto FOMO After SOL and PEPE

The post Why LYNO’s Presale Could Trigger the Next Wave of Crypto FOMO After SOL and PEPE appeared on BitcoinEthereumNews.com. Cryptocirca has never been bereft of hype cycles and fear of missing out (FOMO). The case of Solana (SOL) and Pepe (PEPE) is one of the brightest examples that early investments into the correct projects may yield the returns that are drifting. Today there is an emerging rival in the limelight—LYNO. LYNO is in its presale stage, and already it is being compared to former breakout tokens, as many investors are speculating that LYNO will be the next big thing to ignite the market in a similar manner. Early Bird Presale: Lowest Price LYNO is in the Early Bird presale and costs only $0.050 for each token; the initial round will rise to $0.055. To date, approximately 629,165.744 tokens have been sold, with approximately $31,458.287 of that amount going towards the $100,000 project goal.  The crypto presales allow investors the privilege to acquire tokens at reduced prices before they become available to the general market, and they tend to bring substantial returns in the case of great fundamentals. The final goal of the project: 0.100 per token. This gradual development underscores increasing investor confidence and it brings a sense of urgency to those who wish to be first movers. LYNO’s Edge in a Competitive Market LYNO isn’t just another presale token—it’s a powerful AI-driven cross-chain arbitrage platform designed to deliver real utility and long-term growth. Operating across 15+ blockchains, LYNO’s AI engine analyzes token prices, liquidity, volume, and gas fees in real-time to identify the most profitable trade routes. It integrates with bridges like LayerZero, Wormhole, and Axelar, allowing assets to move instantly across networks, so no opportunity is missed.  The platform also includes community governance, letting $LYNO holders vote on protocol upgrades and fee structures, staking rewards for long-term investors, buyback-and-burn mechanisms to support token value, and audited smart…
Paylaş
BitcoinEthereumNews2025/09/18 16:11
The $55 Oil Trade Is Still on the Table, but Brent’s Chart Has Conditions

The $55 Oil Trade Is Still on the Table, but Brent’s Chart Has Conditions

The post The $55 Oil Trade Is Still on the Table, but Brent’s Chart Has Conditions appeared on BitcoinEthereumNews.com. The oil price surged on April 2 as Brent
Paylaş
BitcoinEthereumNews2026/04/02 18:30
Covéa Chooses Shift Technology as Strategic Partner for Fraud and Risk Management

Covéa Chooses Shift Technology as Strategic Partner for Fraud and Risk Management

Covéa has selected Shift Technology as a long-term partner to support a consistent and shared view of risk from policy inception through to claims settlement The
Paylaş
ffnews2026/04/02 07:00

Trade GOLD, Share 1,000,000 USDT

Trade GOLD, Share 1,000,000 USDTTrade GOLD, Share 1,000,000 USDT

0 fees, up to 1,000x leverage, deep liquidity