Tag: Vera CPU

  • Nvidia Rubin AI Cuts Token Costs 10x vs Blackwell, Musk Praises

    Nvidia Rubin AI Cuts Token Costs 10x vs Blackwell, Musk Praises

    Key Takeaways

    1. Nvidia’s Rubin AI architecture features six subsystems, including the Vera CPU and new GPU, designed for enhanced AI inference at lower costs.
    2. The Rubin platform reduces token costs by ten times, requiring only a quarter of the GPUs needed for the Blackwell edition.
    3. The architecture competes with China’s low-cost AI models by addressing both performance and cost concerns.
    4. The Vera CPU is designed for efficient data movement and supports various workloads while maintaining full Arm compatibility.
    5. The Vera CPU offers 88 custom cores and 1.2 TB/s memory bandwidth, improving efficiency over the previous Blackwell platform.


    Nvidia has unveiled its next-generation Rubin AI computational architecture, which aims to align with China’s AI strategy by providing AI inference capabilities at significantly lower costs compared to the existing Blackwell edition.

    Architecture Overview

    As the rumors about Nvidia Rubin suggested, this platform consists of six processing subsystems that work in harmony: the Vera CPU, the new Nvidia Rubin GPU, the third-gen NVLink 6 Switch, the ConnectX-9 SuperNIC, the BlueField-4 DPU, and the Spectrum-6 Ethernet Switch. These chips utilize advanced TSMC foundry nodes and introduce interface optimizations designed to greatly reduce token costs and training times.

    Cost Efficiency

    Nvidia’s approach of “codesign” across these six new chips allows for model training using only a quarter of the GPUs required in the current Blackwell platform, reducing token costs by ten times. Elon Musk has also promised a tenfold decrease in token costs for Tesla’s upcoming AI5 computer, but it won’t begin mass production until next year. Musk referred to Nvidia Rubin as the “rocket engine for AI,” which will facilitate the scaling of edge models.

    Competitive Landscape

    China boasts impressively low AI token prices by open-sourcing models like DeepSeek and linking multiple midrange AI GPUs, such as the Huawei 910C. The Nvidia Rubin architecture addresses both performance and cost concerns for running AI models, making it a competitive option.

    Highlighting the Vera CPU

    One of the most fascinating aspects of the Rubin platform is the new Nvidia Vera CPU, which is “engineered for data movement and agentic reasoning across accelerated systems, with full confidential computing support.” This CPU can function alongside an Nvidia GPU or operate independently, handling “analytics, cloud, orchestration, storage, and high-performance computing (HPC) workloads” while maintaining full Arm compatibility.

    Vera CPU Specifications

    The Vera CPU boasts 88 custom cores and an impressive 1.2 TB/s of LPDDR5X memory bandwidth, all while consuming minimal power. The integration of the NVLink-C2C connectivity interface allows synchronized CPU-GPU memory access, contributing to the Rubin platform’s efficiency, which is significantly better than the Blackwell-based predecessor.

    Purchase Information

    You can find the Nvidia DGX Spark personal AI supercomputer available for purchase on Amazon.

    Source:
    Link