Google Cloud and Nvidia unveil new AI infrastructure and software

During the Google Cloud Next 2023 conference, the parties disclosed details of new AI infrastructure and software, which are expected to help customers develop and deploy massive models for generative AI and speed up data science workloads

NEXT2023_0829_092703-3610_ALIVECOVERAGE_AAedit2 (1)-min (1) (1) (1)

Google Cloud expands partnership with Nvidia to advance AI computing, software, and services. (Credit: NVIDIA Corporation)

Google Cloud has expanded its partnership with American technology company Nvidia to advance artificial intelligence (AI) computing, software, and services.

During the Google Cloud Next 2023 conference, the parties unveiled details of new AI infrastructure and software. The new offerings are expected to help customers develop and deploy massive models for generative AI and accelerate data science workloads.

The companies aim to simplify the operation of AI supercomputers with Google Cloud offerings built on Nvidia technologies.

Nvidia said that the new hardware and software integrations leverage the company’s technologies that have been deployed by Google DeepMind and Google research teams over the last two years.

Nvidia founder and CEO Jensen Huang said: “We’re at an inflection point where accelerated computing and generative AI have come together to speed innovation at an unprecedented pace.

“Our expanded collaboration with Google Cloud will help developers accelerate their work with infrastructure, software and services that supercharge energy efficiency and reduce costs.”

PaxML, which is Google’s framework for building massive large language models (LLMs), has been now optimised for Nvidia’s accelerated computing, said the parties.

The framework will now allow developers to utilise NVIDIA H100 and A100 Tensor Core graphics processing units (GPUs) for advanced and fully configurable experimentation and scale.

The companies also revealed Google’s integration of serverless Spark with Nvidia GPUs through Google’s Dataproc service. This will enable data scientists to speed up Apache Spark workloads for preparing data for AI development.

Besides, Google Cloud expects its purpose-built Google Cloud A3 VMs powered by NVIDIA H100 GPUs to be generally available in September 2023. This is anticipated to make Nvidia’s AI platform more accessible for a wider set of workloads.

Google Cloud CEO Thomas Kurian said: “Google Cloud has a long history of innovating in AI to foster and speed innovation for our customers.

“Many of Google’s products are built and served on NVIDIA GPUs, and many of our customers are seeking out NVIDIA accelerated computing to power efficient development of LLMs to advance generative AI.”

Furthermore, Google Cloud will have access to the NVIDIA DGX GH200 AI supercomputer, which is driven by the NVIDIA Grace Hopper Superchip. This access will let Google Cloud explore its potential for handling generative AI workloads.