<p><strong>SAN JOSE</strong> <b>—</b>Amazon Web Services (AWS), an Amazon.com company, and NVIDIA announced that the new NVIDIA Blackwell GPU platform is coming to AWS.</p>
<p>AWS will offer the NVIDIA GB200 Grace Blackwell Superchip and B100 Tensor Core GPUs, extending the companies’ longstanding strategic collaboration to deliver the most secure and advanced infrastructure, software, and services to help customers unlock new generative artificial intelligence (AI) capabilities.</p>
<p>NVIDIA and AWS continue to bring together the best of their technologies, including NVIDIA’s newest multi-node systems featuring the next-generation NVIDIA Blackwell platform and AI software, AWS’s Nitro System and AWS Key Management Service (AWS KMS) advanced security, Elastic Fabric Adapter (EFA) petabit scale networking, and Amazon Elastic Compute Cloud (Amazon EC2) UltraCluster hyper-scale clustering. Together, they deliver the infrastructure and tools that enable customers to build and run real-time inference on multi-trillion parameter large language models (LLMs) faster, at massive scale, and at a lower cost than previous-generation NVIDIA GPUs on Amazon EC2.</p>
<p>“The deep collaboration between our two organizations goes back more than 13 years, when together we launched the world’s first GPU cloud instance on AWS, and today we offer the widest range of NVIDIA GPU solutions for customers,” said Adam Selipsky, CEO at AWS. “NVIDIA’s next-generation Grace Blackwell processor marks a significant step forward in generative AI and GPU computing. When combined with AWS’s powerful Elastic Fabric Adapter Networking, Amazon EC2 UltraClusters’ hyper-scale clustering, and our unique Nitro system’s advanced virtualization and security capabilities, we make it possible for customers to build and run multi-trillion parameter large language models faster, at massive scale, and more securely than anywhere else. Together, we continue to innovate to make AWS the best place to run NVIDIA GPUs in the cloud.”</p>
<p>&#8220;AI is driving breakthroughs at an unprecedented pace, leading to new applications, business models, and innovation across industries,” said Jensen Huang, founder and CEO of NVIDIA. “Our collaboration with AWS is accelerating new generative AI capabilities and providing customers with unprecedented computing power to push the boundaries of what&#8217;s possible.&#8221;</p>
<p><b>Latest innovations from AWS and NVIDIA accelerate training of cutting-edge LLMs that can reach beyond 1 trillion parameters</b></p>
<p>AWS will offer the NVIDIA Blackwell platform, featuring GB200 NVL72, with 72 Blackwell GPUs and 36 Grace CPUs interconnected by fifth-generation NVIDIA NVLink™. When connected with Amazon’s powerful networking (<span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&;url=https%3A%2F%2Faws.amazon.com%2Fhpc%2Fefa%2F&;esheet=53911679&;newsitemid=20240318794112&;lan=en-US&;anchor=EFA&;index=2&;md5=d3b15a92c63818a68669645013f74c81" target="_blank" rel="noopener" data-cms-ai="0">EFA</a></span></span>), and supported by advanced virtualization (<span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&;url=https%3A%2F%2Faws.amazon.com%2Fec2%2Fnitro%2F&;esheet=53911679&;newsitemid=20240318794112&;lan=en-US&;anchor=AWS+Nitro+System&;index=3&;md5=62cd1c54964f92d20318392854bd9d80" target="_blank" rel="noopener" data-cms-ai="0">AWS Nitro System</a></span></span>) and hyper-scale clustering (<span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&;url=https%3A%2F%2Faws.amazon.com%2Fec2%2Fultraclusters%2F&;esheet=53911679&;newsitemid=20240318794112&;lan=en-US&;anchor=Amazon+EC2+UltraClusters&;index=4&;md5=6eabc9e5235aeecff065048929a4a0a6" target="_blank" rel="noopener" data-cms-ai="0">Amazon EC2 UltraClusters</a></span></span>), customers can scale to thousands of GB200 Superchips. NVIDIA Blackwell on AWS delivers a massive leap forward in speeding up inference workloads for resource-intensive, multi-trillion parameter language models.</p>
<p>Based on the success of the NVIDIA H100-powered EC2 P5 instances, which are available to customers for short durations through <span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&;url=https%3A%2F%2Faws.amazon.com%2Fec2%2Fcapacityblocks%2F&;esheet=53911679&;newsitemid=20240318794112&;lan=en-US&;anchor=Amazon+EC2+Capacity+Blocks+for+ML&;index=5&;md5=67a435a7bb309c42210a95b779659f32" target="_blank" rel="noopener" data-cms-ai="0">Amazon EC2 Capacity Blocks for ML</a></span></span>, AWS plans to offer EC2 instances featuring the new B100 GPUs deployed in EC2 UltraClusters for accelerating generative AI training and inference at massive scale. GB200s will also be available on <span class="Enhancement-inline"><span class="Enhancement-item"><a class="Link" href="https://cts.businesswire.com/ct/CT?id=smartlink&;url=https%3A%2F%2Fwww.nvidia.com%2Fen-us%2Fdata-center%2Fdgx-cloud%2F&;esheet=53911679&;newsitemid=20240318794112&;lan=en-US&;anchor=NVIDIA+DGX%26%238482%3B+Cloud&;index=6&;md5=b4d764163d2ed6c90c2bb14fbe8ef09a" target="_blank" rel="noopener" data-cms-ai="0">NVIDIA DGX™ Cloud</a></span></span>, an AI platform co-engineered on AWS, that gives enterprise developers dedicated access to the infrastructure and software needed to build and deploy advanced generative AI models. The Blackwell-powered DGX Cloud instances on AWS will accelerate development of cutting-edge generative AI and LLMs that can reach beyond 1 trillion parameters.</p>

SAN FRANCISCO -- Wearlinq, a provider of wearable health monitoring and diagnosis solutions, has announced…
SOUTH SAN FRANCISCO -- Link Cell Therapies, an oncology cell therapy company, announced its official…
SAN JOSE — Roku invites viewers to pack their bags for “Broad Trip,” a road…
SAN FRANCISCO — Data and AI company Databricks has snagged a massive $4 billion-plus Series…
For 25 years, the NVIDIA Graduate Fellowship Program has supported graduate students doing outstanding work…
SAN FRANCISCO -- Fal, a real-time generative-media platform powering the next decade of AI-driven content,…