Intel Unveils Next-Generation AI Solutions with the Launch of Xeon 6 and Gaudi 3
September 25, 2024 | Intel | Estimated reading time: 3 minutes
As AI continues to revolutionize industries, enterprises are increasingly in need of infrastructure that is both cost-effective and available for rapid development and deployment. To meet this demand head-on, Intel today launched Xeon 6 with Performance-cores (P-cores) and Gaudi 3 AI accelerators, bolstering the company’s commitment to deliver powerful AI systems with optimal performance per watt and lower total cost of ownership (TCO).
“Demand for AI is leading to a massive transformation in the data center, and the industry is asking for choice in hardware, software and developer tools,” said Justin Hotard, Intel executive vice president and general manager of the Data Center and Artificial Intelligence Group. “With our launch of Xeon 6 with P-cores and Gaudi 3 AI accelerators, Intel is enabling an open ecosystem that allows our customers to implement all of their workloads with greater performance, efficiency and security.”
Introducing Intel Xeon 6 with P-cores and Gaudi 3 AI accelerators
Intel’s latest advancements in AI infrastructure include two major updates to its data center portfolio:
Intel® Xeon® 6 with P-cores: Designed to handle compute-intensive workloads with exceptional efficiency, Xeon 6 delivers twice the performance of its predecessor. It features an increased core count, double the memory bandwidth and AI acceleration capabilities embedded in every core. This processor is engineered to meet the performance demands of AI from edge to data center and cloud environments.
Intel® Gaudi® 3 AI Accelerator: Specifically optimized for large-scale generative AI, Gaudi 3 has 64 Tensor processor cores (TPCs) and eight matrix multiplication engines (MMEs) to accelerate deep neural network computations. It includes 128 gigabytes (GB) of HBM2e memory for training and inference, and 24 Ethernet ports, each running at 200 gigabits per second (Gb/s), for scalable networking. Gaudi 3 is also compatible with the PyTorch framework and with Hugging Face transformer and diffuser models. Intel recently announced a collaboration with IBM to deploy Intel Gaudi 3 AI accelerators as a service on IBM Cloud. Through this collaboration, Intel and IBM aim to lower the total cost of ownership of deploying and scaling AI while enhancing performance.
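To put the Gaudi 3 figures quoted above in perspective, the short Python sketch below works through two back-of-the-envelope calculations: the nominal aggregate Ethernet bandwidth per accelerator, and a rough upper bound on how many model parameters fit in the on-package memory. The function names are illustrative, not part of any Intel software. The memory estimate assumes decimal gigabytes and counts weights only, ignoring activations, optimizer state and runtime overhead, so real-world capacity will be lower.

```python
# Back-of-the-envelope arithmetic using only figures quoted in the article.
# Function names are hypothetical helpers, not an Intel API.

PORTS = 24               # Ethernet ports per Gaudi 3 accelerator
PORT_SPEED_GBPS = 200    # nominal line rate per port, gigabits per second
HBM_CAPACITY_GB = 128    # HBM2e capacity, gigabytes (assumed decimal: 1 GB = 1e9 bytes)


def aggregate_ethernet_gbps(ports: int = PORTS, speed_gbps: int = PORT_SPEED_GBPS) -> int:
    """Nominal aggregate line rate in Gb/s, ignoring protocol overhead."""
    return ports * speed_gbps


def max_params_billions(bytes_per_param: int, capacity_gb: int = HBM_CAPACITY_GB) -> float:
    """Upper bound on parameters (in billions) whose weights alone fit in memory."""
    return capacity_gb / bytes_per_param


if __name__ == "__main__":
    total = aggregate_ethernet_gbps()
    print(f"Aggregate Ethernet: {total} Gb/s ({total / 1000} Tb/s, {total / 8} GB/s)")
    for label, width in [("16-bit", 2), ("8-bit", 1)]:
        print(f"{label} weights: up to ~{max_params_billions(width):.0f}B parameters")
```

By this estimate, the 24 ports add up to a nominal 4.8 terabits per second per accelerator, and the 128 GB of memory could hold the weights of a model with roughly 64 billion parameters at 16-bit precision.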
Enhancing AI Systems with TCO Benefits
Deploying AI at scale involves considerations such as flexible deployment options, competitive price-performance ratios and accessible AI technologies. Intel’s robust x86 infrastructure and extensive open ecosystem position it to support enterprises in building high-value AI systems with an optimal TCO and performance per watt. Notably, 73% of GPU-accelerated servers use Intel Xeon as the host CPU.
Intel partners with leading OEMs including Dell Technologies and Supermicro to develop co-engineered systems tailored to specific customer needs for effective AI deployments. Dell Technologies is currently co-engineering RAG-based solutions leveraging Gaudi 3 and Xeon 6.
Bridging the Gap from Prototypes to Production with Co-Engineering Efforts
Transitioning generative AI (Gen AI) solutions from prototypes to production-ready systems presents challenges in real-time monitoring, error handling, logging, security and scalability. Intel addresses these challenges through co-engineering efforts with OEMs and partners to deliver production-ready retrieval-augmented generation (RAG) solutions.
These solutions are built on the Open Platform for Enterprise AI (OPEA) and integrate OPEA-based microservices into a scalable RAG system optimized for Xeon and Gaudi AI systems. They are designed to let customers easily integrate applications from Kubernetes, Red Hat OpenShift AI and Red Hat Enterprise Linux AI.
Expanding Access to Enterprise AI Applications
Intel’s Tiber portfolio offers business solutions to tackle challenges such as access, cost, complexity, security, efficiency and scalability across AI, cloud and edge environments. The Intel® Tiber™ Developer Cloud now provides preview systems of Intel Xeon 6 for tech evaluation and testing. Additionally, select customers will gain early access to Intel Gaudi 3 for validating AI model deployments, with Gaudi 3 clusters to begin rolling out next quarter for large-scale production deployments.
New service offerings include SeekrFlow, an end-to-end AI platform from Seekr for developing trusted AI applications. The latest updates feature Intel Gaudi software’s newest release and Jupyter notebooks loaded with PyTorch 2.4 and Intel oneAPI and AI tools 2024.2, which include new AI acceleration capabilities and support for Xeon 6 processors.