【Market View】NVIDIA Blackwell Platform and ASIC Chip Upgrades to Boost Liquid Cooling Penetration to Over 20% in 2025, Says TrendForce


Published Sep. 23, 2024, 15:30 (GMT+8)

TrendForce’s latest reports indicate that the launch of NVIDIA’s Blackwell platform, expected in 4Q24, is set to significantly drive the adoption of liquid cooling solutions. Liquid cooling penetration is projected to grow from around 10% in 2024 to over 20% in 2025. Rising global ESG awareness and the accelerated deployment of AI servers by CSPs are prompting the move from air cooling to liquid cooling systems.

NVIDIA remains the dominant supplier in the global AI server market in 2024. In the GPU AI server market specifically, NVIDIA commands an overwhelming lead with a nearly 90% market share, while AMD follows at a distant 8%.

TrendForce notes that initial shipments of NVIDIA’s Blackwell platform are limited this year, as the supply chain is still finalizing product testing and validation, including high-speed transmission and cooling design optimizations. The high energy consumption of the new platform, especially the GB200 full-rack solution, will require more efficient cooling, thereby driving the penetration of liquid cooling solutions. However, the current adoption of liquid cooling within existing server ecosystems remains low, and ODMs will need to navigate a learning curve to address challenges like leakage and insufficient cooling performance.

TrendForce forecasts that in 2025, Blackwell’s share in the high-end GPU market could exceed 80%, attracting power supply manufacturers and cooling solution providers to enter the AI liquid cooling market, thereby creating a new competitive landscape in the industry.

Taiwanese suppliers expected to provide quick disconnects by 1H25, as Google leads liquid cooling deployment

In recent years, major CSPs, including Google, AWS, and Microsoft, have accelerated the deployment of AI servers, primarily utilizing NVIDIA GPUs and custom-designed ASICs. TrendForce reveals that NVIDIA’s GB200 NVL72 racks, with a TDP of approximately 140 kW, will require liquid cooling solutions to address heat dissipation, with Liquid-to-Air (L2A) technology expected to become the mainstream approach. Other Blackwell server architectures, such as HGX and MGX, have lower density and will continue to rely on air cooling solutions.

Google is the most proactive in adopting liquid cooling solutions among CSPs developing their own AI ASICs, using both air and liquid cooling for its TPU chips. BOYD and Cooler Master are the primary suppliers of cold plates for Google. In China, Alibaba is aggressively expanding its liquid-cooled data centers, while other Chinese CSPs continue to use air cooling solutions for their AI ASICs.

TrendForce highlights that CSPs are specifying key suppliers for liquid cooling components in NVIDIA’s GB200 racks. Currently, Asia Vital Components and Cooler Master lead in providing cold plates, while Cooler Master and Auras supply manifolds, and Vertiv and Delta Electronics provide coolant distribution units. Quick disconnect (QD) components, which are critical for preventing leakage, are primarily supplied by international companies such as CPC, Parker Hannifin, Danfoss, and Stäubli. However, Taiwanese companies like LOTES and Fositek are in the validation phase, and by the first half of 2025, they are expected to join the list of QD suppliers to help alleviate the current shortage.