NVIDIA and Google Cloud Partner to Advance Secure Agentic AI Deployment

NVIDIA and Google Cloud are collaborating to bring secure, on-premises agentic AI to enterprises by integrating Googleโ€™s Gemini models with NVIDIAโ€™s Blackwell platforms. Leveraging confidential computing and enhanced infrastructure like the GKE Inference Gateway and Triton Inference Server, the partnership ensures scalable AI deployment without compromising regulatory compliance or data sovereignty.
NVIDIA and Google Cloud Partner to Advance Secure Agentic AI Deployment
Image Credit: NVIDIA and Google Cloud

NVIDIA and Google Cloud are joining forces to enhance enterprise AI applications by integrating Google Gemini AI models with NVIDIA‘s advanced computing platforms. This collaboration aims to facilitate the deployment of agentic AI locally while ensuring strict compliance with data privacy and regulatory standards.

Enhanced Data Security with NVIDIA and Google Cloud


The partnership centers on the use of NVIDIAs Blackwell HGX and DGX platforms, which are now integrated with Google Clouds distributed infrastructure. This setup allows enterprises to operate Googles powerful Gemini AI models directly within their data centers. A key feature of this integration is NVIDIA Confidential Computing, which provides an additional layer of security by safeguarding sensitive code in the Gemini models against unauthorized access and potential data breaches.

Sachin Gupta, Vice President and General Manager of Infrastructure and Solutions at Google Cloud, emphasized the security and operational benefits of this collaboration. “By deploying our Gemini models on-premises with NVIDIA Blackwells exceptional performance and confidential computing capabilities, were enabling enterprises to leverage the full capabilities of agentic AI in a secure and efficient manner,” Gupta stated.

The Advent of Agentic AI in Enterprise Technology

Agentic AI represents a significant evolution in artificial intelligence technology, offering enhanced problem-solving capabilities over traditional AI models. Unlike conventional AI, which operates based on pre-learned information, agentic AI can reason, adapt, and make autonomous decisions in dynamic settings. For instance, in IT support, an agentic AI system can not only retrieve troubleshooting guides but also diagnose and resolve issues autonomously, escalating complex problems as needed.

In the financial sector, while traditional AI might identify potential fraud based on existing patterns, agentic AI goes a step further by proactively investigating anomalies and taking preemptive actions, such as blocking suspicious transactions or dynamically adjusting fraud detection mechanisms.

Addressing On-Premises Deployment Challenges

The ability to deploy agentic AI models on-premises addresses a critical need for organizations with stringent security or data sovereignty requirements. Until now, these organizations have faced significant challenges in utilizing advanced AI models, which often require integration of diverse data types such as text, images, and code, while still adhering to strict regulatory standards.

With Google Cloud now offering one of the first cloud services that enables confidential computing for agentic AI workloads in any environment, be it cloud, on-premises, or hybrid enterprises, no longer have to compromise between advanced AI capabilities and compliance with security regulations.

Future-Proofing AI Deployments

To further support the deployment of AI, Google Cloud has introduced the GKE Inference Gateway. This new service is designed to optimize AI inference workloads, featuring advanced routing, scalability, and integration with NVIDIA’s Triton Inference Server and NeMo Guardrails. It ensures efficient load balancing, enhanced performance, reduced operational costs, and centralized model security and governance.

Looking forward, Google Cloud plans to improve observability for agentic AI workloads by incorporating NVIDIA Dynamo, an open-source library designed to scale reasoning AI models efficiently across various deployment environments.

These advancements in AI deployment and management were highlighted at the Google Cloud Next conference, where NVIDIA held a special address and provided insights through sessions, demonstrations, and expert discussions.

Through this strategic collaboration, NVIDIA and Google Cloud are setting a new standard for secure, efficient, and scalable agentic AI applications, enabling enterprises to harness the full potential of AI while adhering to necessary security and compliance requirements.


Recent Content

In Technology Game Changers, leaders from Agility Robotics, Lenovo, Databricks, Mistral AI, and Maven Clinic showcase how AI and robotics are moving from novelty to necessity. From Peggy Johnsonโ€™s Digit transforming warehouse labor, to Lenovoโ€™s hybrid AI ecosystem, Databricks’ frictionless AI UIs, Mistralโ€™s sovereignty-focused open-source models, and Mavenโ€™s virtual womenโ€™s health platform, this article explores the intelligent, personalized, and responsible future of tech. The next frontier of innovation isnโ€™t just smartโ€”itโ€™s human-centered.
Global Shifts explores how leaders like Keyu Jin and Gregory Allen are analyzing the breakdown of old globalization models and the rise of new strategic paradigms. Jin outlines the emergence of regional economic blocs, Chinaโ€™s shift toward technology self-reliance, and the decentralization of capital. Allen frames AI as a strategic battleground, discussing export controls, the rise of DeepSeek, and the risks of decoupling. The piece offers a critical look at how economic power and innovation are evolving in an era defined by urgency, sovereignty, and competition.
In Technology, Climate Change and Justice, top leaders from Arm, The B Team, Vattenfall, and Silo AI outline how technology can both fuel and fix the climate crisis. From Leah Seligmannโ€™s values-driven climate leadership to Anna Borgโ€™s clean-energy grids and Peter Sarlinโ€™s push for efficient, open-source AI, this piece highlights how innovation must align with inclusion, sustainability, and resilience. The message is clear: solving climate change isnโ€™t just about new techโ€”itโ€™s about how we deploy it, who benefits, and whether it truly serves a livable future.
In Innovation In Action, executives from Time, Sierra, and Axios share how they’re redefining business, media, and journalism with AI. Time is unlocking over a century of content for fair AI use, while Sierraโ€™s “agentic AI” elevates the customer experience across industries. Axios emphasizes human-first reporting with AI support. Across the board, these leaders show how strategic adaptation can embrace AI without compromising trust, transparency, or editorial integrity.
The future of manufacturing is intelligent, autonomous, and sustainable. Powered by private 5G networks, AI, and digital twins, smart factories are revolutionizing how goods are produced and maintained. From predictive maintenance to immersive virtual twins and AI-optimized energy systems, smart manufacturing is unlocking new levels of efficiency and innovation across industriesโ€”from ports and shipyards to agriculture and healthcare.
Smart mobility is reshaping how the world moves, powered by 5G, AI, and edge computing. From autonomous vehicles and real-time logistics to AI-driven drones and connected public transport, intelligent transportation systems are redefining urban mobility, logistics, and industrial automation. As global investment and collaboration grow, the transportation industry is transforming into a $11.1 trillion smart ecosystem focused on sustainability, efficiency, and connectivity.

Download Magazine

With Subscription
Whitepaper
As VoLTE becomes the standard for voice communication, its rapid deployment exposes telecom networks to new security risks, especially in roaming scenarios. SecurityGenโ€™s research uncovers key vulnerabilities like unauthorized access to IMS, SIP protocol threats, and lack of encryption. Learn how to strengthen VoLTE security with proactive measures such as...
Whitepaper
Dive into the comprehensive analysis of GTPu within 5G networks in our whitepaper, offering insights into its operational mechanics, strategic importance, and adaptation to the evolving landscape of cellular technologies....

It seems we can't find what you're looking for.

Subscribe To Our Newsletter

Scroll to Top