<p><strong>SAN JOSE</strong> <b>--</b> Amazon Web Services (AWS), an Amazon.com company, and NVIDIA announced that the new NVIDIA Blackwell GPU platform is coming to AWS.</p>
<p>AWS will offer the NVIDIA GB200 Grace Blackwell Superchip and B100 Tensor Core GPUs, extending the companies’ longstanding strategic collaboration to deliver the most secure and advanced infrastructure, software, and services to help customers unlock new generative artificial intelligence (AI) capabilities.</p>
<p>NVIDIA and AWS continue to bring together the best of their technologies, including NVIDIA’s newest multi-node systems featuring the next-generation NVIDIA Blackwell platform and AI software, AWS’s Nitro System and AWS Key Management Service (AWS KMS) advanced security, Elastic Fabric Adapter (EFA) petabit-scale networking, and Amazon Elastic Compute Cloud (Amazon EC2) UltraCluster hyper-scale clustering. Together, they deliver the infrastructure and tools that enable customers to build and run real-time inference on multi-trillion-parameter large language models (LLMs) faster, at massive scale, and at a lower cost than previous-generation NVIDIA GPUs on Amazon EC2.</p>
<p>“The deep collaboration between our two organizations goes back more than 13 years, when together we launched the world’s first GPU cloud instance on AWS, and today we offer the widest range of NVIDIA GPU solutions for customers,” said Adam Selipsky, CEO at AWS. “NVIDIA’s next-generation Grace Blackwell processor marks a significant step forward in generative AI and GPU computing. When combined with AWS’s powerful Elastic Fabric Adapter Networking, Amazon EC2 UltraClusters’ hyper-scale clustering, and our unique Nitro system’s advanced virtualization and security capabilities, we make it possible for customers to build and run multi-trillion parameter large language models faster, at massive scale, and more securely than anywhere else. Together, we continue to innovate to make AWS the best place to run NVIDIA GPUs in the cloud.”</p>
<p>“AI is driving breakthroughs at an unprecedented pace, leading to new applications, business models, and innovation across industries,” said Jensen Huang, founder and CEO of NVIDIA. “Our collaboration with AWS is accelerating new generative AI capabilities and providing customers with unprecedented computing power to push the boundaries of what’s possible.”</p>
<p><b>Latest innovations from AWS and NVIDIA accelerate training of cutting-edge LLMs that can reach beyond 1 trillion parameters</b></p>
<p>AWS will offer the NVIDIA Blackwell platform, featuring GB200 NVL72, with 72 Blackwell GPUs and 36 Grace CPUs interconnected by fifth-generation NVIDIA NVLink™. When connected with Amazon’s powerful networking (<span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&amp;url=https%3A%2F%2Faws.amazon.com%2Fhpc%2Fefa%2F&amp;esheet=53911679&amp;newsitemid=20240318794112&amp;lan=en-US&amp;anchor=EFA&amp;index=2&amp;md5=d3b15a92c63818a68669645013f74c81" target="_blank" rel="noopener" data-cms-ai="0">EFA</a></span></span>), and supported by advanced virtualization (<span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&amp;url=https%3A%2F%2Faws.amazon.com%2Fec2%2Fnitro%2F&amp;esheet=53911679&amp;newsitemid=20240318794112&amp;lan=en-US&amp;anchor=AWS+Nitro+System&amp;index=3&amp;md5=62cd1c54964f92d20318392854bd9d80" target="_blank" rel="noopener" data-cms-ai="0">AWS Nitro System</a></span></span>) and hyper-scale clustering (<span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&amp;url=https%3A%2F%2Faws.amazon.com%2Fec2%2Fultraclusters%2F&amp;esheet=53911679&amp;newsitemid=20240318794112&amp;lan=en-US&amp;anchor=Amazon+EC2+UltraClusters&amp;index=4&amp;md5=6eabc9e5235aeecff065048929a4a0a6" target="_blank" rel="noopener" data-cms-ai="0">Amazon EC2 UltraClusters</a></span></span>), customers can scale to thousands of GB200 Superchips. NVIDIA Blackwell on AWS delivers a massive leap forward in speeding up inference workloads for resource-intensive, multi-trillion-parameter language models.</p>
<p>Based on the success of the NVIDIA H100-powered EC2 P5 instances, which are available to customers for short durations through <span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&amp;url=https%3A%2F%2Faws.amazon.com%2Fec2%2Fcapacityblocks%2F&amp;esheet=53911679&amp;newsitemid=20240318794112&amp;lan=en-US&amp;anchor=Amazon+EC2+Capacity+Blocks+for+ML&amp;index=5&amp;md5=67a435a7bb309c42210a95b779659f32" target="_blank" rel="noopener" data-cms-ai="0">Amazon EC2 Capacity Blocks for ML</a></span></span>, AWS plans to offer EC2 instances featuring the new B100 GPUs deployed in EC2 UltraClusters for accelerating generative AI training and inference at massive scale. GB200s will also be available on <span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&amp;url=https%3A%2F%2Fwww.nvidia.com%2Fen-us%2Fdata-center%2Fdgx-cloud%2F&amp;esheet=53911679&amp;newsitemid=20240318794112&amp;lan=en-US&amp;anchor=NVIDIA+DGX%26%238482%3B+Cloud&amp;index=6&amp;md5=b4d764163d2ed6c90c2bb14fbe8ef09a" target="_blank" rel="noopener" data-cms-ai="0">NVIDIA DGX™ Cloud</a></span></span>, an AI platform co-engineered on AWS that gives enterprise developers dedicated access to the infrastructure and software needed to build and deploy advanced generative AI models. The Blackwell-powered DGX Cloud instances on AWS will accelerate development of cutting-edge generative AI and LLMs that can reach beyond 1 trillion parameters.</p>
