Rising AI Infrastructure Demand Highlights Industry Shift Toward Cost-Effective Solutions as DeepSeek Gains Traction
January 30, 2025 | TrendForceEstimated reading time: 2 minutes
TrendForce’s latest investigations have revealed that the recent release of DeepSeek-V3 and DeepSeek-R1 underscores an industry-wide shift toward more cost-effective AI infrastructure. This development is expected to prompt end users to conduct more rigorous evaluations of AI infrastructure investments, focusing on adopting more efficient software computing models to reduce reliance on hardware such as GPUs. CSPs are also likely to expand the adoption of in-house ASIC infrastructure to lower deployment costs. Consequently, actual demand for GPU-based AI chips and semiconductors could see notable changes from 2025 onward.
TrendForce notes that the global AI server market has experienced rapid growth since 2023. By 2025, AI servers are projected to account for over 15% of total server shipments; by 2028, they will be nearing 20%. Major CSPs have aggressively expanded their AI infrastructure in response to escalating AI training demands.
Starting in 2025, the focus will shift toward edge AI inference. Companies will adopt next-generation GPU platforms such as NVIDIA Blackwell and accelerate the development of proprietary ASICs, as seen with AWS. This strategic move aims to enhance cost efficiency and meet the needs of specialized AI applications. Meanwhile, Chinese CSPs and AI firms like DeepSeek are prioritizing the development of more efficient AI chips and algorithms to foster diversified AI applications in the face of U.S. chip export restrictions.
Historically, the AI industry has relied on scaling models, increasing data volume, and enhancing hardware performance for growth. However, escalating costs and efficiency challenges have prompted a shift in strategy. DeepSeek has adopted model distillation techniques to compress large models, improve inference speed, and reduce hardware dependencies. By optimizing the performance of NVIDIA Hopper downscaled chips, DeepSeek maximizes computational resource utilization.
DeepSeek’s competitive advantage stems from its high-performance hardware selection, innovative distillation techniques, and an open API strategy. This approach balances technological innovation and commercial viability while reinforcing the AI industry’s push toward greater efficiency.
TrendForce notes that China's AI market is expected to develop in two key directions in light of ongoing U.S. chip export restrictions. First, AI-related companies will accelerate investments in domestic AI chips and supply chains. Large Chinese CSPS, for instance, will continue procuring available H20 chips while also ramping up the development of proprietary ASICs for deployment in their data centers.
Second, China will leverage its existing internet infrastructure to compensate for hardware limitations with software-based solutions. DeepSeek exemplifies this approach by breaking from conventional methods and adopting model distillation technology to enhance AI applications.
Overall, as the U.S. government potentially tightens AI and semiconductor restrictions on China, domestic AI firms will be compelled to accelerate the development of proprietary AI chips and HBM hardware. While these solutions may not match the performance of NVIDIA’s GPUs, they are primarily designed to support China’s domestic data center infrastructure, where individual chip performance is no longer the sole priority. Additionally, companies like DeepSeek are advancing AI multimodal models, aiming to achieve similar performance in specific application areas at lower training costs to expedite commercialization.
Suggested Items
Real Time with... IPC APEX EXPO 2025: Creative Approaches to Measuring Thermal Warpage
03/31/2025 | Real Time with...IPC APEX EXPONeil Hubble discusses his research on measuring thermal warpage which focuses on challenges in testing small, thin samples. He introduces non-destructive testing methods that effectively measure without damaging components. Neil highlights the industry's growing interest in AI and outlines future technology goals, including improved resolution and automation to enhance production efficiency.
Zenaida Valianu, IPC, Earns IPC Excellence in Education Award at IPC APEX EXPO 2025
03/31/2025 | IPCThe IPC Excellence in Education award was presented to Zenaida (Zenny) Valianu, IPC, at IPC APEX EXPO 2025, recognizing her significant contributions to workforce development and leadership.
IPC APEX EXPO 2025 Review: Expecting the Unexpected
03/31/2025 | Tom Kastner, GP VenturesOne of the best things about trade shows is not the scheduled meetings but the chance meetings that come up unexpectedly. Just because you happened to go down one aisle instead of the next, you bump into an old acquaintance that you have not seen for years, or you happen to talk to the guy in line next to you to get a $5 Pepsi, and it turns into a great, new connection.
Gartner Forecasts Worldwide GenAI Spending to Reach $644 Billion in 2025
03/31/2025 | Gartner, Inc.Worldwide generative AI (GenAI) spending is expected to total $644 billion in 2025, an increase of 76.4% from 2024, according to a forecast by Gartner, Inc.
L3Harris Completes Sale of Commercial Aviation Solutions Business to TJC for $800 Million
03/31/2025 | BUSINESS WIREL3Harris Technologies has completed the previously announced sale of its Commercial Aviation Solutions (CAS) business to an affiliate of TJC L.P. for $800 million. The entire $800 million cash purchase price was paid to L3Harris at the closing of the transaction.