Nvidia Blackwell AI Chips Overheat in Server Racks Issues

Nvidia Blackwell AI Chips Overheat in Server Racks Issues

Nvidia's Blackwell AI chips might be delayed again due to overheating issues in server racks that can hold up to 72 GPUs. According to The Information (via Reuters), Nvidia has asked its suppliers to modify the design of these high-capacity racks and is collaborating with them to enhance thermal efficiency.

Collaboration with Partners

A spokesperson from Nvidia stated to Reuters, "Nvidia is working with leading cloud service providers as an integral part of our engineering team and process. The engineering iterations are normal and expected." This highlights the company's ongoing commitment to address the challenges faced during development.

Previous Delays

This isn't the first instance of delays related to Blackwell. Back in August, Bloomberg shared that Nvidia had to adjust the chip design to ensure it would be more compatible with the Hopper H100 data centers. This history of modifications raises questions about the stability of the release timeline.

Concerns from Major Companies

In March, Nvidia had assured that the new chips would be on the market by the second quarter of the year, but those plans changed due to the recent setbacks. The Information reports that companies like Microsoft, Google, and Meta are now anxious about how these delays could impact their schedules for deploying the new chips in their data centers, which may also slow down the release of next versions of their AI-based products.

As reported by Reuters and Bloomberg, the situation remains critical for Nvidia and its partners.

Source: Link,Link


Leave a Comment

Scroll to Top