NVIDIA and Google Cloud Partner to Advance Secure Agentic AI Deployment

NVIDIA and Google Cloud are collaborating to bring secure, on-premises agentic AI to enterprises by integrating Google’s Gemini models with NVIDIA’s Blackwell platforms. Leveraging confidential computing and enhanced infrastructure like the GKE Inference Gateway and Triton Inference Server, the partnership ensures scalable AI deployment without compromising regulatory compliance or data sovereignty.
Image Credit: NVIDIA and Google Cloud

NVIDIA and Google Cloud are joining forces to enhance enterprise AI applications by integrating Google Gemini AI models with NVIDIA’s advanced computing platforms. This collaboration aims to facilitate the deployment of agentic AI locally while ensuring strict compliance with data privacy and regulatory standards.

Enhanced Data Security with NVIDIA and Google Cloud


The partnership centers on the use of NVIDIA’s Blackwell HGX and DGX platforms, which are now integrated with Google Cloud’s distributed infrastructure. This setup allows enterprises to operate Google’s powerful Gemini AI models directly within their data centers. A key feature of this integration is NVIDIA Confidential Computing, which provides an additional layer of security by safeguarding sensitive code in the Gemini models against unauthorized access and potential data breaches.

Sachin Gupta, Vice President and General Manager of Infrastructure and Solutions at Google Cloud, emphasized the security and operational benefits of this collaboration. “By deploying our Gemini models on-premises with NVIDIA Blackwell’s exceptional performance and confidential computing capabilities, we’re enabling enterprises to leverage the full capabilities of agentic AI in a secure and efficient manner,” Gupta stated.

The Advent of Agentic AI in Enterprise Technology

Agentic AI represents a significant evolution in artificial intelligence technology, offering enhanced problem-solving capabilities over traditional AI models. Unlike conventional AI, which operates based on pre-learned information, agentic AI can reason, adapt, and make autonomous decisions in dynamic settings. For instance, in IT support, an agentic AI system can not only retrieve troubleshooting guides but also diagnose and resolve issues autonomously, escalating complex problems as needed.
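The IT support scenario can be sketched as a small decision loop. This is a minimal illustration only, not how Gemini or any NVIDIA component works: the `Ticket` class, the `KNOWN_FIXES` playbook, and the keyword-based `diagnose` function are all hypothetical stand-ins for the model's reasoning.

```python
from dataclasses import dataclass


@dataclass
class Ticket:
    issue: str


# Hypothetical playbook: maps a diagnosed cause to an automated fix.
KNOWN_FIXES = {
    "disk_full": "clear temp files",
    "service_down": "restart service",
}


def diagnose(ticket):
    """Toy diagnosis: keyword matching stands in for model reasoning."""
    text = ticket.issue.lower()
    if "disk" in text:
        return "disk_full"
    if "not responding" in text:
        return "service_down"
    return None  # cause unknown


def handle(ticket):
    """Resolve the ticket when a fix is known, otherwise escalate."""
    cause = diagnose(ticket)
    if cause in KNOWN_FIXES:
        return f"resolved: {KNOWN_FIXES[cause]}"
    return "escalated to human operator"
```

The point of the sketch is the control flow: the system acts on its own diagnosis where it can and hands off only the cases it cannot resolve.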

In the financial sector, while traditional AI might identify potential fraud based on existing patterns, agentic AI goes a step further by proactively investigating anomalies and taking preemptive actions, such as blocking suspicious transactions or dynamically adjusting fraud detection mechanisms.

Addressing On-Premises Deployment Challenges

The ability to deploy agentic AI models on-premises addresses a critical need for organizations with stringent security or data sovereignty requirements. Until now, these organizations have faced significant challenges in utilizing advanced AI models, which often require integration of diverse data types such as text, images, and code, while still adhering to strict regulatory standards.

With Google Cloud now offering one of the first cloud services that enables confidential computing for agentic AI workloads in any environment, be it cloud, on-premises, or hybrid, enterprises no longer have to compromise between advanced AI capabilities and compliance with security regulations.

Future-Proofing AI Deployments

To further support the deployment of AI, Google Cloud has introduced the GKE Inference Gateway. This new service is designed to optimize AI inference workloads, featuring advanced routing, scalability, and integration with NVIDIA’s Triton Inference Server and NeMo Guardrails. It ensures efficient load balancing, enhanced performance, reduced operational costs, and centralized model security and governance.
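To give a sense of what load-aware routing for inference means in general, here is a minimal sketch of a least-loaded policy. It is an assumption-laden illustration and not the GKE Inference Gateway's actual algorithm or API: the `route` function and the replica names are hypothetical.

```python
def route(in_flight, n_requests):
    """Assign each request to the replica with the fewest in-flight requests.

    `in_flight` maps a (hypothetical) replica name to its current request
    count and is updated in place as requests are assigned.
    """
    assignments = []
    for _ in range(n_requests):
        # Pick the least-loaded replica; ties go to the first one listed.
        target = min(in_flight, key=in_flight.get)
        in_flight[target] += 1
        assignments.append(target)
    return assignments


load = {"replica-a": 2, "replica-b": 0, "replica-c": 1}
print(route(load, 3))  # ['replica-b', 'replica-b', 'replica-c']
```

After the three assignments every replica holds two requests, which is the balancing effect such a policy aims for. A production gateway would likely weigh further signals, but those are beyond what the announcement describes.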

Looking forward, Google Cloud plans to improve observability for agentic AI workloads by incorporating NVIDIA Dynamo, an open-source library designed to scale reasoning AI models efficiently across various deployment environments.

These advancements in AI deployment and management were highlighted at the Google Cloud Next conference, where NVIDIA held a special address and provided insights through sessions, demonstrations, and expert discussions.

Through this strategic collaboration, NVIDIA and Google Cloud are setting a new standard for secure, efficient, and scalable agentic AI applications, enabling enterprises to harness the full potential of AI while adhering to necessary security and compliance requirements.

