NVIDIA A100 Marks Dawn of Next Decade in Accelerated Cloud Computing

November 3, 2020 | NVIDIA Newsroom

Amazon Web Services’ first GPU instance debuted 10 years ago, with the NVIDIA M2050. At that time, CUDA-based applications were focused primarily on accelerating scientific simulations, with the rise of AI and deep learning still a ways off.

Since then, AWS has added to its stable of cloud GPU instances, which has included the K80 (p2), K520 (g2), M60 (g3), V100 (p3/p3dn) and T4 (g4).

With its new P4d instance generally available today, AWS is paving the way for another bold decade of accelerated computing powered by the latest NVIDIA A100 Tensor Core GPU.

The P4d instance delivers AWS’s highest-performance, most cost-effective GPU-based platform for machine learning training and high-performance computing applications. The instances reduce the time to train machine learning models by up to 3x with FP16 and up to 6x with TF32 compared to the default FP32 precision.
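For readers less familiar with the precision formats named above, the following is a minimal Python sketch of how they differ. It relies on the standard bit layouts (FP32: 8 exponent bits and 23 mantissa bits; TF32: 8 exponent bits and 10 mantissa bits; FP16: 5 exponent bits and 10 mantissa bits). The helper names `to_tf32`, `to_fp16` and `fits_fp16` are hypothetical, and the TF32 conversion here simply truncates the low mantissa bits, which is a simplifying assumption rather than a description of how the A100 hardware rounds.

```python
import struct


def to_tf32(x: float) -> float:
    """Approximate TF32 by keeping FP32's sign, 8 exponent bits and top 10 mantissa bits.

    Assumption: the 13 low mantissa bits are truncated (not rounded to nearest).
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits &= 0xFFFFE000  # clear the 13 least significant mantissa bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def to_fp16(x: float) -> float:
    """Round-trip a value through IEEE half precision (5 exponent bits, 10 mantissa bits)."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def fits_fp16(x: float) -> bool:
    """Return True if x is within FP16's finite range (largest finite value is 65504)."""
    try:
        to_fp16(x)
        return True
    except (OverflowError, struct.error):
        return False


# 1 + 2**-10 needs exactly 10 mantissa bits, so both reduced formats keep it.
fine_step = 1.0 + 2.0 ** -10
# 1 + 2**-11 needs an 11th mantissa bit, which TF32 drops.
finer_step = 1.0 + 2.0 ** -11
# 2**17 exceeds FP16's range but is well within the 8-bit exponent range of TF32.
large_value = 2.0 ** 17

tf32_keeps_fine_step = to_tf32(fine_step) == fine_step
tf32_drops_finer_step = to_tf32(finer_step) == 1.0
tf32_keeps_large_value = to_tf32(large_value) == large_value
fp16_holds_large_value = fits_fp16(large_value)
```

In this sketch TF32 shares FP16's mantissa width but keeps FP32's exponent range, which is why a value such as 2**17 survives the TF32 conversion while overflowing FP16.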

They also provide exceptional inference performance. NVIDIA A100 GPUs just last month swept the MLPerf Inference benchmarks — providing up to 237x faster performance than CPUs.

Each P4d instance features eight NVIDIA A100 GPUs and, with AWS UltraClusters, customers can get on-demand and scalable access to over 4,000 GPUs at a time using AWS’s Elastic Fabric Adapter (EFA) and scalable, high-performance storage with Amazon FSx. P4d offers 400 Gbps networking and uses NVIDIA technologies such as NVLink, NVSwitch, NCCL and GPUDirect RDMA to further accelerate deep learning training workloads. NVIDIA GPUDirect RDMA on EFA ensures low-latency networking by passing data from GPU to GPU between servers without having to pass through the CPU and system memory.
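As a rough back-of-the-envelope illustration of the figures in this paragraph, the short Python sketch below works out how many eight-GPU instances a 4,000-GPU cluster implies and what 400 gigabits per second means in gigabytes per second. The function names are hypothetical, 4,000 is used as a round lower bound for "over 4,000 GPUs", and the even per-GPU split of network bandwidth is an assumption made only for the arithmetic, not a statement about how traffic is actually distributed.

```python
GPUS_PER_INSTANCE = 8   # from the article: eight A100 GPUs per P4d instance
CLUSTER_GPUS = 4000     # from the article: "over 4,000 GPUs", taken as a lower bound
NETWORK_GBPS = 400      # from the article: 400 gigabits per second per instance


def instances_needed(total_gpus: int, gpus_per_instance: int = GPUS_PER_INSTANCE) -> int:
    """Number of instances required to supply total_gpus (ceiling division)."""
    return -(-total_gpus // gpus_per_instance)


def gbps_to_gigabytes_per_second(gbps: float) -> float:
    """Convert gigabits per second to gigabytes per second (8 bits per byte)."""
    return gbps / 8


instances = instances_needed(CLUSTER_GPUS)
instance_bandwidth = gbps_to_gigabytes_per_second(NETWORK_GBPS)
# Assumption: bandwidth is divided evenly across the eight GPUs in an instance.
per_gpu_bandwidth = instance_bandwidth / GPUS_PER_INSTANCE

print(f"{instances} instances for {CLUSTER_GPUS} GPUs")
print(f"{instance_bandwidth} GB/s per instance, {per_gpu_bandwidth} GB/s per GPU if split evenly")
```

Under these assumptions, a cluster of that size corresponds to at least 500 P4d instances, each with 50 GB/s of network bandwidth.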

In addition, the P4d instance is supported in many AWS services, including Amazon Elastic Container Service, Amazon Elastic Kubernetes Service, AWS ParallelCluster and Amazon SageMaker. P4d can also leverage all the optimized, containerized software available from NGC, including HPC applications, AI frameworks, pre-trained models, Helm charts and inference software like TensorRT and Triton Inference Server.

P4d instances are now available in US East and West, and are coming to additional regions soon. The instances can be purchased as On-Demand, with Savings Plans, with Reserved Instances, or as Spot Instances.

The first decade of GPU cloud computing has brought over 100 exaflops of AI compute to the market. With the arrival of the Amazon EC2 P4d instance powered by NVIDIA A100 GPUs, the next decade of GPU cloud computing is off to a great start.
