Rising AI Infrastructure Demand Highlights Industry Shift Toward Cost-Effective Solutions as DeepSeek Gains Traction
January 30, 2025 | TrendForceEstimated reading time: 2 minutes
TrendForce’s latest investigations have revealed that the recent release of DeepSeek-V3 and DeepSeek-R1 underscores an industry-wide shift toward more cost-effective AI infrastructure. This development is expected to prompt end users to conduct more rigorous evaluations of AI infrastructure investments, focusing on adopting more efficient software computing models to reduce reliance on hardware such as GPUs. CSPs are also likely to expand the adoption of in-house ASIC infrastructure to lower deployment costs. Consequently, actual demand for GPU-based AI chips and semiconductors could see notable changes from 2025 onward.
TrendForce notes that the global AI server market has experienced rapid growth since 2023. By 2025, AI servers are projected to account for over 15% of total server shipments; by 2028, they will be nearing 20%. Major CSPs have aggressively expanded their AI infrastructure in response to escalating AI training demands.
Starting in 2025, the focus will shift toward edge AI inference. Companies will adopt next-generation GPU platforms such as NVIDIA Blackwell and accelerate the development of proprietary ASICs, as seen with AWS. This strategic move aims to enhance cost efficiency and meet the needs of specialized AI applications. Meanwhile, Chinese CSPs and AI firms like DeepSeek are prioritizing the development of more efficient AI chips and algorithms to foster diversified AI applications in the face of U.S. chip export restrictions.
Historically, the AI industry has relied on scaling models, increasing data volume, and enhancing hardware performance for growth. However, escalating costs and efficiency challenges have prompted a shift in strategy. DeepSeek has adopted model distillation techniques to compress large models, improve inference speed, and reduce hardware dependencies. By optimizing the performance of NVIDIA Hopper downscaled chips, DeepSeek maximizes computational resource utilization.
DeepSeek’s competitive advantage stems from its high-performance hardware selection, innovative distillation techniques, and an open API strategy. This approach balances technological innovation and commercial viability while reinforcing the AI industry’s push toward greater efficiency.
TrendForce notes that China's AI market is expected to develop in two key directions in light of ongoing U.S. chip export restrictions. First, AI-related companies will accelerate investments in domestic AI chips and supply chains. Large Chinese CSPS, for instance, will continue procuring available H20 chips while also ramping up the development of proprietary ASICs for deployment in their data centers.
Second, China will leverage its existing internet infrastructure to compensate for hardware limitations with software-based solutions. DeepSeek exemplifies this approach by breaking from conventional methods and adopting model distillation technology to enhance AI applications.
Overall, as the U.S. government potentially tightens AI and semiconductor restrictions on China, domestic AI firms will be compelled to accelerate the development of proprietary AI chips and HBM hardware. While these solutions may not match the performance of NVIDIA’s GPUs, they are primarily designed to support China’s domestic data center infrastructure, where individual chip performance is no longer the sole priority. Additionally, companies like DeepSeek are advancing AI multimodal models, aiming to achieve similar performance in specific application areas at lower training costs to expedite commercialization.
Suggested Items
TRI at SMTA Wisconsin Expo 2025
04/24/2025 | TRITest Research, Inc. (TRI), the leading test and inspection systems provider for the electronics manufacturing industry, will join the SMTA Wisconsin Expo 2025.
Driving Sustainability in PCB Design
04/24/2025 | Marcy LaRont, I-Connect007Filbert (Fil) Arzola is an electrical engineer at Raytheon. He’s smart, entertaining, and passionate about PCB design. As it turns out, he’s also passionate about “Mother Earth,” as he calls her. Born and raised in Southern California, he freely admits that he turns the water off when he brushes his teeth and yells at his brother for throwing batteries in the garbage. But when looking at the issue of sustainability and PCB design, he urges his audiences to ponder what sustainability looks like. Can PCB designers, he asks, make any impact on sustainability at all?
Real Time with... IPC APEX EXPO: Silicon Geometry's Signal Integrity Impact on PCBs
04/24/2025 | Marcy LaRont, I-Connect007At IPC APEX EXPO 2025, Kris Moyer addressed the importance of understanding the impact of silicon geometry reduction on signal integrity and PCB performance. Kris says signal integrity considerations are necessary for so many designs today, regardless of clock frequency. He discusses valuable insights from attendees regarding embedded resistor technology and the effects of radiation on smaller silicon features in aerospace applications.
Recognizing the IPC Committee Volunteer Awards in 2025
04/24/2025 | Patty Goldman, I-Connect007In 2023, IPC created two new awards to honor standards development committee volunteers: The Hillman-Lambert Award for Volunteer of the Year and the Kessler-Goldman Award for Committee Leader of the Year. The awards were named after Hall of Fame members whose service to standards development is well-known and who have demonstrated the skills and principles that the awardees should also have.
Explore Thermal Management Solutions in Latest Podcast Series—New Episode Now Available
04/23/2025 | I-Connect007I-Connect007 is excited to share the latest episode in our new podcast series! In this episode, Ryan returns to discuss practical strategies for managing heat, starting early in the design planning and specification phases. After all, prevention means there’s less to mitigate later.