-
-
News
News Highlights
- Books
Featured Books
- design007 Magazine
Latest Issues
Current IssueCreating a Culture of Collaboration
PCB designers could learn quite a bit from NASA and the private companies that develop spacecraft: Every one of these vehicles is a testament to the value of collaboration among disparate stakeholders. Without a collaborative culture, the rocket might never get off the ground.
Breaking High-speed Material Constraints
Do you need specialty materials for your high-speed designs? Maybe not. Improvements in resins mean designers of high-speed boards can sometimes use traditional laminate systems. Learn more in this issue.
Level Up Your Design Skills
This month, our contributors discuss the PCB design classes available at IPC APEX EXPO 2024. As they explain, these courses cover everything from the basics of design through avoiding over-constraining high-speed boards, and so much more!
- Articles
- Columns
Search Console
- Links
- Events
||| MENU - design007 Magazine
Intel oneDNN AI Optimizations Enabled as Default in TensorFlow
May 26, 2022 | IntelEstimated reading time: 2 minutes
In the latest release of TensorFlow 2.9, the performance improvements delivered by the Intel® oneAPI Deep Neural Network Library (oneDNN) are turned on by default. This applies to all Linux x86 packages and for CPUs with neural-network-focused hardware features (like AVX512_VNNI, AVX512_BF16, and AMX vector and matrix extensions that maximize AI performance through efficient compute resource usage, improved cache utilization and efficient numeric formatting) found on 2nd Gen Intel Xeon Scalable processors and newer CPUs. These optimizations enabled by oneDNN accelerate key performance-intensive operations such as convolution, matrix multiplication and batch normalization, with up to 3 times performance improvements compared to versions without oneDNN acceleration.
“Thanks to the years of close engineering collaboration between Intel and Google, optimizations in the oneDNN library are now default for x86 CPU packages in TensorFlow. This brings significant performance acceleration to the work of millions of TensorFlow developers without the need for them to change any of their code. This is a critical step to deliver faster AI inference and training and will help drive AI Everywhere,” said Wei Li, Intel vice president and general manager of AI and Analytics.
oneDNN performance improvements becoming available by default in the official TensorFlow 2.9 release will enable millions of developers who already use TensorFlow to seamlessly benefit from Intel software acceleration, leading to productivity gains, faster time to train and efficient utilization of compute. Additional TensorFlow-based applications, including TensorFlow Extended, TensorFlow Hub and TensorFlow Serving also have the oneDNN optimizations. TensorFlow has included experimental support for oneDNN since TensorFlow 2.5.
oneDNN is an open source cross-platform performance library of basic deep learning building blocks intended for developers of deep learning applications and frameworks. The applications and frameworks that are enabled by it can then be used by deep learning practitioners. oneDNN is part of?oneAPI, an open, standards-based, unified programming model for use across CPUs as well as GPUs and other AI accelerators.
While there is an emphasis placed on AI accelerators like GPUs for machine learning and, in particular, deep learning, CPUs continue to play a large role across all stages of the AI workflow. Intel’s extensive software-enabling work makes AI frameworks, such as the TensorFlow platform, and a wide range of AI applications run faster on Intel hardware that is ubiquitous across most personal devices, workstations and data centers. Intel’s rich portfolio of optimized libraries, frameworks and tools serves end-to-end AI development and deployment needs while being built on the foundation of oneAPI.
The oneDNN-driven accelerations to TensorFlow deliver remarkable performance gains that benefit applications spanning natural language processing, image and object recognition, autonomous vehicles, fraud detection, medical diagnosis and treatment and others.
Deep learning and machine learning applications have exploded in number due to increases in processing power, data availability and advanced algorithms. TensorFlow has been one of the world’s most popular platforms for AI application development with over 100 million downloads. Intel-optimized TensorFlow is available both as a standalone component and through the Intel oneAPI AI Analytics Toolkit, and is already being used across a broad range of industry applications including the Google Health project, animation filmmaking at Laika Studios, language translation at Lilt, natural language processing at IBM Watson and many others.
Suggested Items
indie Semiconductor Introduces Class-leading Computer Vision Processor Family
06/28/2024 | BUSINESS WIREindie Semiconductor, Inc., an Autotech solutions innovator, announced the addition of a highly innovative iND880xx product line to its fast-growing vision processor portfolio, developed to address the demanding specifications of Advanced Driver Assistance Systems (ADAS) and driver viewing use cases, such as Surround View systems and Electronic Mirrors.
For Networking, Edge, and IoT Applications: Swissbit Expands PCIe Portfolio
06/27/2024 | SwissbitStorage specialist Swissbit continues to strategically expand its PCIe offerings. The new N2000 (Gen3 PCIe) and N3000 (Gen4 PCIe) product families primarily address applications that require PCIe performance as well as a balanced ratio of low power consumption and reduced heat generation.
Indium Corporation’s New Type 6 Jetting Solder Paste Hits Market
06/26/2024 | Indium CorporationIndium Corporation® is pleased to introduce a new jetting solder paste to join its PicoShot® series of products. PicoShot® NC-6M is a no-clean, halogen-free, Type 6 powder-size material specifically formulated to be compatible with Mycronic jetting systems and Mycronic’s “small dot” ejector.
Cadence Expands System IP Portfolio with Network on Chip to Optimize Electronic System Connectivity
06/26/2024 | Cadence Design Systems, Inc.The Cadence Janus NoC manages these simultaneous high-speed communications efficiently with minimal latency, enabling customers to achieve their PPA targets faster and with lower risk.
Alphawave Semi Tapes Out Industry-First, Multi-Protocol I/O Connectivity Chiplet for High-Performance Compute and AI Infrastructure
06/19/2024 | BUSINESS WIREAlphawave Semi, a global leader in high-speed connectivity and compute silicon for the world’s technology infrastructure, today announced the successful tape-out of the industry’s first off-the-shelf multi-protocol I/O connectivity chiplet on TSMC’s 7nm process.