Exciting Advances From NVIDIA’s GPU
May 3, 2021 | Dan Feinberg, I-Connect007
Estimated reading time: 3 minutes
NVIDIA’s GPU Technology Conference (GTC) was, as expected, a showcase of new developments, as well as an opportunity for engineers and developers to learn, enhance skills, and discuss new ideas.
Hearing about all the amazing new developments and the accelerating expansion of AI into virtually all aspects of modern society gave those who attended a better idea of just how much AI is changing, and will continue to change, their work and our world.
AI is driving new computer platforms and is using advanced new supercomputers with 10x the power of the most powerful one available today, at a fraction of the cost. Consider this again: 10x the power at 10% of the cost of just a year or two ago. Advances in autonomous transportation seem to be accelerating again after a few years’ hiatus, with AI playing a huge part alongside hardware development.
For more details and demonstrations, I suggest watching the keynote with NVIDIA CEO Jensen Huang.
Some of the most interesting announcements in the keynote involved GPUs focused on gaming graphics, along with demonstrations of their capabilities, and there was so much more. If you accept all that was announced, you can expect to quickly see amazing advances in new computing platforms. In addition to the supercomputer and fully available autonomous transportation, expect to see other great leaps forward in computing: devices straight out of ’90s science fiction, such as Jarvis, a new fully functional translator that handles five languages. Huang promoted Jarvis as a GPU-accelerated deep learning AI platform for speech recognition and generation, language understanding, and translation. “Jarvis interacts in about 100 milliseconds,” he said.
The conference unveiled a new product for high-performance computing (HPC) clients: NVIDIA’s first-ever data center CPU, named “Grace” after the pioneering computer scientist Grace Hopper. “We are thrilled to announce the Swiss National Supercomputing Center will build a supercomputer powered by Grace and our next-generation GPU,” Huang said. NVIDIA states that Grace, which is based on Arm architecture, provides 10x better performance than the fastest servers on the market today by focusing on complex artificial intelligence and HPC workloads. NVIDIA's first data center CPU, however, is not intended to compete directly against Intel's Xeon lineup or AMD's EPYC processors. NVIDIA made the point that it continues to provide full support for all CPUs, including x86 and Arm architectures.
For now, Grace is designed specifically to be "tightly coupled" with NVIDIA's GPUs to remove bottlenecks for complex giant-model AI and HPC applications, compared to today's high-end NVIDIA DGX-based systems, which run on x86 CPUs. Grace is built on a 5-nanometer manufacturing process, which I am sure will grab Intel’s attention. NVIDIA is planning on making Grace available within two years.
Other announcements included:
- BlueField-3, a powerful new 400 Gbps data center infrastructure processor with 16 Arm Cortex-A78 cores. It has 22 billion transistors and will allow network processing and storage at the aforementioned 400 Gbps. Also discussed was NVIDIA Triton Inference Server 2.9, which maximizes performance and simplifies production deployment at scale.
- TensorRT 8.0, the latest version of NVIDIA’s high-performance deep learning inference SDK. TensorRT includes a deep learning inference optimizer and runtime that deliver low latency and high throughput for deep learning inference applications. The core of NVIDIA TensorRT is a C++ library that facilitates high-performance inference on NVIDIA GPUs (graphics processing units). It is designed to complement training frameworks such as TensorFlow, Caffe, PyTorch, and MXNet. It focuses specifically on running an existing network quickly and efficiently on a GPU for the purpose of generating a result.
Huang spent the better part of an hour describing his vision of a near-term future filled with autonomous machines, super-powerful AI, fully computer-controlled factories staffed by robots, and unlimited virtual worlds. The presentation ran from silicon to supercomputers to AI software, all in one. Grace and BlueField are key parts of a data center roadmap consisting of three chips: CPU, GPU, and DPU.
Huang said, “Each chip architecture has a two-year rhythm with likely a kicker in between. One year will focus on x86 platforms, the next on Arm platforms. Every year will see new exciting products from us. Three chips, yearly leaps, one architecture.”
Over the 25 years that I have covered NVIDIA, the company has grown from a modest computer GPU supplier into a giant multi-technology company. It is still considered the leading computer GPU supplier globally, but it is now so much more, and this keynote address did not hesitate to state that the company feels the best is still to come.