Intel Contributes AI Acceleration to PyTorch 2.0
March 17, 2023 | Intel

In the PyTorch 2.0 release, contributions from Intel using Intel® Extension for PyTorch, oneAPI Deep Neural Network Library (oneDNN) and additional support for Intel® CPUs enable developers to optimize inference and training performance for artificial intelligence (AI).
As part of the PyTorch 2.0 compilation stack, the TorchInductor CPU backend optimization by Intel Extension for PyTorch and PyTorch ATen CPU achieved up to 1.7 times faster FP32 inference performance when benchmarked with TorchBench, HuggingFace and timm.¹ This update gives graph compilation a notable performance improvement over PyTorch eager mode.
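As a rough illustration of the graph compilation path described above, the sketch below compiles a small model with `torch.compile` and the TorchInductor backend, then compares its output against eager mode. The model, its layer sizes, and the helper names `build_model`, `compile_model` and `max_abs_diff` are hypothetical and not taken from Intel's benchmarks; no speedup is implied for this toy case.

```python
import torch
import torch.nn as nn


def build_model() -> nn.Module:
    """Hypothetical FP32 model standing in for a real inference workload."""
    return nn.Sequential(
        nn.Linear(64, 128),
        nn.ReLU(),
        nn.Linear(128, 10),
    ).eval()


def compile_model(model: nn.Module, backend: str = "inductor"):
    # torch.compile is lazy: the backend only runs on the first call
    # to the returned module, not when this function returns.
    return torch.compile(model, backend=backend)


def max_abs_diff(model: nn.Module, compiled, x: torch.Tensor) -> float:
    """Largest element-wise difference between eager and compiled outputs."""
    with torch.no_grad():
        return (model(x) - compiled(x)).abs().max().item()


if __name__ == "__main__":
    model = build_model()
    compiled = compile_model(model)  # TorchInductor backend
    x = torch.randn(8, 64)
    print("max abs diff vs eager:", max_abs_diff(model, compiled, x))
```

With the default `inductor` backend, code generation on CPU typically needs a working C++ compiler on the machine. Passing `backend="eager"` exercises graph capture alone, without code generation, which can help separate tracing problems from backend problems.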
Other optimizations include:
- Improved message passing between adjacent neural network nodes to support graph neural networks in PyTorch Geometric (PyG), for enhanced inference and training performance on Intel CPUs.
- New x86 quantization backend – a combination of FBGEMM (Facebook General Matrix-Matrix Multiplication) and oneDNN backends – replaces FBGEMM as the default quantization backend for x86 CPU platforms to enable better end-to-end int8 inference performance.
- Extended use of oneDNN with the oneDNN Graph API to maximize efficient code generation on AI hardware by automatically identifying the graph partitions to be accelerated through fusion. The BFloat16 (BF16) and Float32 data types are supported, and only inference workloads can be optimized; BF16 is optimized only on machines with AVX512_BF16 ISA support.
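
The oneDNN Graph item above can be tried out roughly as follows, assuming the TorchScript route: fusion is switched on with `torch.jit.enable_onednn_fusion`, the model is traced and frozen, and a few warm-up runs give the fuser a chance to partition the graph. The small convolutional model and the helper names `build_conv_model` and `optimize_with_onednn_graph` are hypothetical, and the sketch covers Float32 inference only.

```python
import torch
import torch.nn as nn


def build_conv_model() -> nn.Module:
    """Hypothetical Float32 model with a fusable Conv2d + ReLU pattern."""
    return nn.Sequential(
        nn.Conv2d(3, 8, kernel_size=3, padding=1),
        nn.ReLU(),
        nn.Conv2d(8, 8, kernel_size=3, padding=1),
        nn.ReLU(),
    ).eval()


def optimize_with_onednn_graph(model: nn.Module, example_input: torch.Tensor):
    # Enable oneDNN Graph fusion for TorchScript (a process-wide setting).
    torch.jit.enable_onednn_fusion(True)
    model.eval()  # only inference workloads are optimized
    with torch.no_grad():
        traced = torch.jit.trace(model, example_input)
        frozen = torch.jit.freeze(traced)
        # Warm-up runs: the JIT profiles the graph and applies fusion
        # during the first few invocations.
        for _ in range(2):
            frozen(example_input)
    return frozen


if __name__ == "__main__":
    x = torch.randn(1, 3, 32, 32)
    fused = optimize_with_onednn_graph(build_conv_model(), x)
    with torch.no_grad():
        print(tuple(fused(x).shape))
```

Whether any partition is actually fused depends on the PyTorch build and the CPU; the call sequence is the same either way, and the output should match eager mode within floating-point tolerance.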
Copyright © 2025 I-Connect007 | IPC Publishing Group Inc. All rights reserved.