Nvidia has unveiled Eos, a revolutionary supercomputer for data centers, at the Supercomputing 2023 trade show. This supercomputer, known as an "AI factory," is designed to push the boundaries of artificial intelligence development. Eos represents a new era in AI acceleration and has been named after the Greek goddess of dawn.
Impressive Performance
Eos is powered by 576 Nvidia DGX H100 systems, which are integrated with Quantum-2 InfiniBand networking and specialized software. This impressive setup enables Eos to achieve a remarkable 18.4 exaflops of FP8 AI performance. It is a significant advancement from Nvidia's previous supercomputing projects, SaturnV and Selene, showcasing the advanced DGX SuperPOD architecture. This architecture allows for the rapid scaling of AI data center solutions to meet high-performance demands.
Hardware Configuration
At the core of Eos are 4,608 H100 GPUs, distributed across each DGX H100 system's eight H100 Tensor Core APUs. This hardware configuration is specifically designed to handle extensive workloads, including training large language models, running AI recommenders, conducting large-scale analytics, performing quantum simulations, and more.
Optimized for AI Tasks
Eos's architecture is finely tuned for AI tasks that require ultra-low latency and high throughput in massive computing clusters. The supercomputer's networking capabilities, with speeds reaching up to 400GB/s, are crucial for handling the large datasets necessary for training AI models.
Specialized Software Integration
Eos also integrates specialized software to enhance AI development and deployment. Base Command facilitates AI workflow, cluster management, and provides libraries for compute, storage, and network acceleration. AI Enterprise, a cloud-native platform, aims to expedite AI application development and positions itself as the "operating system" for enterprise-level AI. Eos's capabilities have earned it the ninth position on the TOP500 list of the world's fastest supercomputers.