NVIDIA delivers AI-ready servers to accelerate enterprise generative AI applications - The EE

NVIDIA delivers AI-ready servers to accelerate enterprise generative AI applications

NVIDIA announced that system manufacturers will deliver AI-ready servers that support VMware Private AI Foundation with NVIDIA. The move will help companies customize and deploy generative AI applications using their proprietary business data. These servers feature NVIDIA L40S GPUs (graphics processing unit) and NVIDIA BlueField coming soon from Dell Technologies, Hewlett Packard Enterprise (HPE) and Lenovo to support VMware Private AI Foundation with NVIDIA.

“A new computing era has begun,” says Jensen Huang, founder and CEO of NVIDIA. “Companies in every industry are racing to adopt generative AI. With our ecosystem of world-leading software and system partners, we are bringing generative AI to the world’s enterprises.”

NVIDIA AI-ready servers are an ideal platform for businesses that will deploy VMware Private AI Foundation with NVIDIA.

“Generative AI is supercharging digital transformation, and enterprises need a fully integrated solution to more securely build applications that enable them to advance their business,” says Raghu Raghuram, CEO of VMware. “Through the combined expertise of VMware, NVIDIA and our server manufacturer partners, businesses will be able to develop and deploy AI with data privacy, security and control.”

Generative AI transformation in enterprise

NVIDIA AI-ready servers are designed to provide full-stack accelerated infrastructure and software for industries racing to adopt generative AI for a broad range of applications, including drug discovery, retail product descriptions, intelligent virtual assistants, manufacturing simulation and fraud detection.

The servers feature NVIDIA AI enterprise, the operating system of the NVIDIA AI platform. The software provides production-ready enterprise support and security for over 100 frameworks, pretrained models, toolkits and software, including NVIDIA NeMo for LLMs, NVIDIA modulus for simulations, NVIDIA RAPIDS for data science and NVIDIA triton inference server for production AI.

Built to handle complex AI workloads with billions of parameters, L40S GPUs include fourth-generation Tensor Cores and an FP8 Transformer Engine, delivering over 1.45 petaflops of tensor processing power and up to 1.7x training performance compared with the NVIDIA A100 Tensor Core GPU.

For generative AI applications such as intelligent chatbots, assistants, search and summarisation, the NVIDIA L40S enables up to 1.2x more generative AI inference performance than the NVIDIA A100 GPU.

Integrating NVIDIA BlueField DPUs drives further speedups by accelerating, offloading and isolating the tremendous compute load of virtualisation, networking, storage, security and other cloud-native AI services.

NVIDIA ConnectX-7 SmartNICs offer advanced hardware offloads and ultra-low latency, delivering best-in-class, scalable performance for data-intensive generative AI workloads.

Broad ecosystem to speed enterprise generative AI deployments

The computer makers are building NVIDIA AI-ready servers, including the Dell PowerEdge R760xa, HPE ProLiant Gen11 servers for VMware Private AI Foundation with NVIDIA, and Lenovo ThinkSystem SR675 V3.

“Generative AI is a catalyst for innovation, helping to solve some of the world’s most pressing challenges,” says Michael Dell, chairman and chief executive officer, Dell Technologies. “Dell Generative AI Solutions with NVIDIA AI-ready servers will play a critical role in advancing human progress by driving unprecedented levels of productivity and revolutionising the way industries operate.”

“Generative AI will usher in a new scale of productivity for enterprises, from powering chatbots and digital assistants to helping with the design and development of new solutions,” says Antonio Neri, president and CEO of HPE. “We are pleased to continue working closely with NVIDIA to feature its GPUs and software in a range of enterprise tuning and inference workload solutions that will accelerate deployments of generative AI.”

“Businesses are eager to adopt generative AI to power intelligent transformation,” says Yang Yuanqing, chairman and CEO of Lenovo. “In collaboration with NVIDIA and VMware, Lenovo is further extending our leadership in generative AI and solidifying our unique position in helping customers in their AI journey.”


NVIDIA AI-ready servers with L40S GPUs and BlueField DPUs (data processing units) will be available by year-end, with instances available from cloud service providers expected in the coming months.

Follow us and Comment on Twitter @TheEE_io

By continuing to use the site, you agree to the use of cookies. more information

The cookie settings on this website are set to "allow cookies" to give you the best browsing experience possible. If you continue to use this website without changing your cookie settings or you click "Accept" below then you are consenting to this.