NVIDIA to Build Massive Cloud AI Computer With Microsoft

<p>NVIDIA is teaming up with Microsoft to build one of the most powerful AI supercomputers in the world&period; The machines will be powered by Microsoft Azure’s advanced supercomputing infrastructure combined with NVIDIA GPUs&comma; networking and full stack of AI software to help enterprises train&comma; deploy and scale AI&comma; including large&comma; state-of-the-art models&period;<&sol;p>&NewLine;<p align&equals;"left">Azure’s cloud-based AI supercomputer includes powerful and scalable ND- and NC-series virtual machines optimized for AI distributed training and inference&period; It is the first public cloud to incorporate NVIDIA’s advanced AI stack&comma; adding tens of thousands of <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;nvidia&period;com&sol;en-us&sol;data-center&sol;a100&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>NVIDIA A100<&sol;u><&sol;a> and <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;nvidia&period;com&sol;en-us&sol;data-center&sol;h100&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>H100<&sol;u><&sol;a> GPUs&comma; <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;nvidia&period;com&sol;en-us&sol;networking&sol;quantum2&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>NVIDIA Quantum-2<&sol;u><&sol;a> 400Gb&sol;s InfiniBand networking and the <a title&equals;"" href&equals;"https&colon;&sol;&sol;www&period;nvidia&period;com&sol;en-us&sol;data-center&sol;products&sol;ai-enterprise&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>NVIDIA AI Enterprise<&sol;u><&sol;a> software suite to its platform&period;<&sol;p>&NewLine;<p align&equals;"left">As part of the collaboration&comma; NVIDIA will utilize Azure’s scalable virtual machine instances to research and further accelerate advances in generative AI&comma; a rapidly emerging area of AI in which foundational models like <a title&equals;"" href&equals;"https&colon;&sol;&sol;developer&period;nvidia&period;com&sol;blog&sol;using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>Megatron Turing NLG 530B<&sol;u><&sol;a> are the basis for unsupervised&comma; self-learning algorithms to create new text&comma; code&comma; digital images&comma; video or audio&period;<&sol;p>&NewLine;<p align&equals;"left">The companies will also collaborate to optimize Microsoft’s <a title&equals;"" href&equals;"https&colon;&sol;&sol;developer&period;nvidia&period;com&sol;blog&sol;using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>DeepSpeed<&sol;u><&sol;a> deep learning optimization software&period; NVIDIA’s full stack of AI workflows and software development kits&comma; optimized for Azure&comma; will be made available to Azure enterprise customers&period;<&sol;p>&NewLine;<p align&equals;"left">&OpenCurlyDoubleQuote;AI technology advances as well as industry adoption are accelerating&period; The breakthrough of foundation models has triggered a tidal wave of research&comma; fostered new startups and enabled new enterprise applications&comma;” said Manuvir Das&comma; vice president of enterprise computing at NVIDIA&period; &OpenCurlyDoubleQuote;Our collaboration with Microsoft will provide researchers and companies with state-of-the-art AI infrastructure and software to capitalize on the transformative power of AI&period;”<&sol;p>&NewLine;<p align&equals;"left">&OpenCurlyDoubleQuote;AI is fueling the next wave of automation across enterprises and industrial computing&comma; enabling organizations to do more with less as they navigate economic uncertainties&comma;” said Scott Guthrie&comma; executive vice president of the Cloud &plus; AI Group at Microsoft&period; &OpenCurlyDoubleQuote;Our collaboration with NVIDIA unlocks the world’s most scalable supercomputer platform&comma; which delivers state-of-the-art AI capabilities for every enterprise on Microsoft Azure&period;”<&sol;p>&NewLine;<p align&equals;"left"><strong>Scalable Peak Performance With NVIDIA Compute and Quantum-2 InfiniBand on Azure<&sol;strong><br &sol;>&NewLine;Microsoft Azure’s AI-optimized virtual machine instances are architected with NVIDIA’s most advanced data center GPUs and are the first public cloud instances to incorporate NVIDIA Quantum-2 400Gb&sol;s InfiniBand networking&period; Customers can deploy thousands of GPUs in a single cluster to train even the most massive large language models&comma; build the most complex recommender systems at scale&comma; and enable generative AI at scale&period;<&sol;p>&NewLine;<p align&equals;"left">The current Azure instances feature <a title&equals;"NVIDIA Quantum 200Gb&sol;s InfiniBand networking" href&equals;"https&colon;&sol;&sol;www&period;nvidia&period;com&sol;en-us&sol;networking&sol;products&sol;infiniband&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener">NVIDIA Quantum 200Gb&sol;s InfiniBand networking<&sol;a> with NVIDIA A100 GPUs&period; Future ones will be integrated with NVIDIA Quantum-2 400Gb&sol;s InfiniBand networking and NVIDIA H100 GPUs&period; Combined with Azure’s advanced compute cloud infrastructure&comma; networking and storage&comma; these AI-optimized offerings will provide scalable peak performance for AI training and deep learning inference workloads of any size&period;<&sol;p>&NewLine;<p align&equals;"left"><strong>Accelerating AI Development and Deployment<&sol;strong><br &sol;>&NewLine;Additionally&comma; the platform will support a broad range of AI applications and services&comma; including Microsoft DeepSpeed and the NVIDIA AI Enterprise software suite&period;<&sol;p>&NewLine;<p align&equals;"left">Microsoft DeepSpeed will leverage the <a title&equals;"" href&equals;"https&colon;&sol;&sol;blogs&period;nvidia&period;com&sol;blog&sol;2022&sol;03&sol;22&sol;h100-transformer-engine&sol;" target&equals;"&lowbar;blank" rel&equals;"nofollow noopener"><u>NVIDIA H100 Transformer Engine<&sol;u><&sol;a> to accelerate transformer-based models used for large language models&comma; generative AI and writing computer code&comma; among other applications&period; This technology applies 8-bit floating point precision capabilities to DeepSpeed to dramatically accelerate AI calculations for transformers — at twice the throughput of 16-bit operations&period;<&sol;p>&NewLine;<p align&equals;"left">NVIDIA AI Enterprise — the globally adopted software of the NVIDIA AI platform — is certified and supported on Microsoft Azure instances with NVIDIA A100 GPUs&period; Support for Azure instances with NVIDIA H100 GPUs will be added in a future software release&period;<&sol;p>&NewLine;<p align&equals;"left">NVIDIA AI Enterprise&comma; which includes the NVIDIA Riva for speech AI and NVIDIA Morpheus cybersecurity application frameworks&comma; streamlines each step of the AI workflow&comma; from data processing and AI model training to simulation and large-scale deployment&period;<&sol;p>&NewLine;

Editor

Wispr Scores $25 Million Series A Extension

SAN FRANCISCO -- Wispr, the voice-to-text AI that turns speech into clear, polished writing in every…

1 day

Numeric Dials Up $51 Million Series B

SAN FRANCISCO -- Numeric, an AI accounting automation platform, has raised a $51 million Series…

1 day

Apple Names 45 Finalists for App Store of the Year Awards

Apple has announced 45 finalists for this year’s App Store Awards, recognizing the best apps…

2 days

UC Reaches Agreement With Nurses, Strike Canceled

The University of California (UC) and the California Nurses Association (CNA) have reached a tentative…

4 days

HouseRX Rakes In $55 Million Series B

SAN FRANCISCO -- House Rx, a health tech company focused on making specialty medications more accessible and…

4 days

King Charles Honors NVIDIA’s Jensen Huang

Britain's King has given an award to the King of NVIDIA! NVIDIA founder and CEO…

4 days