Private Network Check Readiness - TeckNexus Solutions

Bloomberg AI Researchers Mitigate Risks of “Unsafe” RAG LLMs and GenAI in Finance

There's immense pressure for companies in every industry to adopt AI, but not everyone has the in-house expertise, tools, or resources to understand where and how to deploy AI responsibly. Bloomberg hopes this taxonomy – when combined with red teaming and guardrail systems – helps to responsibly enable the financial industry to develop safe and reliable GenAI systems, be compliant with evolving regulatory standards and expectations, as well as strengthen trust among clients.
Bloomberg AI Researchers Mitigate Risks of "Unsafe" RAG LLMs and GenAI in Finance

Two new academic papers reflect Bloomberg’s commitment to transparent, trustworthy, and responsible AI


From discovering that retrieval augmented generation (RAG)-based large language models (LLMs) are less “safe” to introducing an AI content risk taxonomy meeting the unique needs of GenAI systems in financial services, researchers across Bloomberg’s AI Engineering group, Data AI group, and CTO Office aim to help organizations deploy more trustworthy solutions.

They have published two new academic papers that have significant implications for how organizations deploy GenAI systems more safely and responsibly, particularly in high-stakes domains like capital markets financial services.

In RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models,” Bloomberg researchers found that RAG, a widely-used technique that integrates context from external data sources to enhance the accuracy of LLMs, can actually make models less “safe” and their outputs less reliable.

To determine whether RAG-based LLMs are safer than their non-RAG counterparts, the authors used more than 5,000 harmful questions to assess the safety profiles of 11 popular LLMs, including Claude-3.5-Sonnet, Llama-3-8B, Gemma-7B, and GPT-4o. Comparing the resulting behaviors across 16 safety categories, the findings demonstrate that there were large increases in unsafe responses under the RAG setting. In particular, they discovered that even very “safe” models, which refused to answer nearly all harmful queries in the non-RAG setting, become more vulnerable in the RAG setting [see Figure 3 from the paper].

The change of risk profile from non-RAG to RAG is model dependent. (Figure 3, RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models, arXiv, 2025.)

This research clearly underscores the need for anyone using RAG LLMs to assess whether their models have any hidden layers of vulnerability and what additional safeguards they might need to add.

“This counterintuitive finding has far-reaching implications given how ubiquitously RAG is used in GenAI applications such as customer support agents and question-answering systems. The average Internet user interacts with RAG-based systems daily,” explained Dr. Amanda Stent, Bloomberg’s Head of AI Strategy & Research in the Office of the CTO. “AI practitioners need to be thoughtful about how to use RAG responsibly, and what guardrails are in place to ensure outputs are appropriate. Our research offers a framework for approaching that so others can evaluate their own solutions and identify any potential blind spots.”

In a related paper, “Understanding and Mitigating Risks of Generative AI in Financial Services,” Bloomberg’s researchers examined how GenAI is being used in capital markets financial services and found that existing general purpose safety taxonomies and guardrail systems fail to account for domain-specific risks.

To close this gap, they introduced a new AI content risk taxonomy that meets the needs of real-world GenAI systems for financial services. It goes beyond what may be addressed by general-purpose safety taxonomies and guardrail systems by addressing risks specific to the financial sector such as confidential disclosure, counterfactual narrative, financial services impartiality, and financial services misconduct.

“There have been strides in academic research addressing toxicity, bias, fairness, and related safety issues for GenAI applications for a broad consumer audience, but there has been significantly less focus on GenAI in industry applications, particularly in financial services,” said David Rabinowitz, Technical Product Manager for AI Guardrails at Bloomberg.

[See Table 1 from the paper]

The categories in Bloomberg’s AI content safety taxonomy for financial services. (Table 1, Understanding and Mitigating Risks of Generative AI in Financial Services, 2025.)

“There’s immense pressure for companies in every industry to adopt AI, but not everyone has the in-house expertise, tools, or resources to understand where and how to deploy AI responsibly,” said Dr. Sebastian Gehrmann, Bloomberg’s Head of Responsible AI. “Bloomberg hopes this taxonomy – when combined with red teaming and guardrail systems – helps to responsibly enable the financial industry to develop safe and reliable GenAI systems, be compliant with evolving regulatory standards and expectations, as well as strengthen trust among clients.”

The RAG safety paper will be presented at the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025) in Albuquerque, New Mexico later this week. The AI risk taxonomy paper will be presented at the ACM Conference on Fairness, Accountability, and Transparency (FAccT) in Athens, Greece in June. For more details, read the Tech At Bloomberg blog post and both papers:

About AI at Bloomberg
Since 2009, Bloomberg has been building and using artificial intelligence (AI) in the finance domain – including machine learning (ML), natural language processing (NLP), information retrieval (IR), time-series analysis, and generative models – to help process and organize the ever-increasing volume of structured and unstructured financial information. With this technology, Bloomberg is developing new ways for financial professionals and business leaders to derive valuable intelligence and actionable insights from high-quality financial information and make more informed business decisions. Learn more about Bloomberg’s AI solutions at www.bloomberg.com/AIatBloomberg.

About Bloomberg
Bloomberg is a global leader in business and financial information, delivering trusted data, news, and insights that bring transparency, efficiency, and fairness to markets. The company helps connect influential communities across the global financial ecosystem via reliable technology solutions that enable our customers to make more informed decisions and foster better collaboration. For more information, visit Bloomberg.com/company or request a demo.


Recent Content

A new Ciena and Heavy Reading study signals that AI will become a primary source of metro and long-haul traffic within three years while most optical networks remain only partially prepared. AI training and inference are shifting from contained data center domains to distributed, edge-to-core workflows that stress transport capacity, latency, and automation end-to-end. Expectations are even higher for long-haul: 52% see AI surpassing 30% of traffic and 29% expect AI to account for more than half. Yet only 16% of respondents rate their optical networks as very ready for AI workloads, underscoring an execution gap that will shape capex priorities, service roadmaps, and partnership models through 2027.
South Korea’s government and its three national carriers are aligning fresh capital to speed AI and semiconductor competitiveness and to anchor a private-led innovation flywheel. SK Telecom, KT, and LG Uplus will seed a new pool exceeding 300 billion won (about $219 million) via the Korea IT Fund (KIF) to back core and foundational AI, AI transformation (AX), and commercialization in ICT. KIF, formed in 2002 by the carriers, will receive 150 billion won in new commitments, matched by at least an equal amount from external fund managers. The platforms lifespan has been extended to 2040 to sustain long-cycle bets.
NTT DATA and Google Cloud expanded their global partnership to speed the adoption of agentic AI and cloud-native modernization across regulated and dataintensive industries. The push emphasizes sovereign cloud options using Google Distributed Cloud, with both airgapped and connected deployments to meet data residency and regulatory needs without stalling innovation. The partners plan to build industry-specific agentic AI solutions on Google Agent space and Gemini models, underpinned by secure data clean rooms and modernized data platforms. NTT DATA is standing up a dedicated Google Cloud Business Group with thousands of engineers and aims to certify 5,000 practitioners to accelerate delivery, migrations, and managed services.
Lumen surpassing 1,000 customers on its Network-as-a-Service platform is a clear marker for where enterprise networking is headed. AI adoption, multi-cloud architectures, and distributed applications are pushing organizations toward on-demand, software-driven connectivity. Lumens platform bundles three core service types under a single digital experience. The platform integrates with major hyperscalers, enabling direct paths to AWS, Microsoft Azure, and Google Cloud. All can be provisioned self-service, scaled up or down based on demand, and stitched to cloud regions and third-party data centers via cloud on-ramps.
Vietnam is entering the hyperscale AI data center map, with VNPT and LG CNS positioning to meet local and regional demand. For telecom operators and enterprises, now is the time to align AI roadmaps with data center strategy: plan for high-density racks and liquid cooling, secure GPU capacity, engineer diverse connectivity, and build energy resilience. As the regions AI infrastructure forms, those who co-design workload placement, interconnect, and power from the outset will gain durable cost and performance advantages.
NTT DATA has launched a Global Microsoft Cloud Business Unit to help enterprises worldwide accelerate AI-powered cloud transformation. Backed by 24,000 Microsoft-certified specialists in over 50 countries, the unit focuses on cloud-native modernization, cybersecurity, Agentic AI orchestration, and sovereign cloud adoption. With deep integration into Microsoft’s engineering and sales ecosystem, NTT DATA aims to deliver secure, scalable, and compliant digital transformation at global scale.
Whitepaper
Explore how Generative AI is transforming telecom infrastructure by solving critical industry challenges like massive data management, network optimization, and personalized customer experiences. This whitepaper offers in-depth insights into AI and Gen AI's role in boosting operational efficiency while ensuring security and regulatory compliance. Telecom operators can harness these AI-driven...
Supermicro and Nvidia Logo
Whitepaper
The whitepaper, "How Is Generative AI Optimizing Operational Efficiency and Assurance," provides an in-depth exploration of how Generative AI is transforming the telecom industry. It highlights how AI-driven solutions enhance customer support, optimize network performance, and drive personalized marketing strategies. Additionally, the whitepaper addresses the challenges of integrating AI into...
RADCOM Logo
Article & Insights
Non-terrestrial networks (NTNs) have evolved from experimental satellite systems to integral components of global connectivity. The transition from geostationary satellites to low Earth orbit constellations has significantly enhanced mobile broadband services. With the adoption of 3GPP standards, NTNs now seamlessly integrate with terrestrial networks, providing expanded coverage and new opportunities,...

Download Magazine

With Subscription

Subscribe To Our Newsletter

Private Network Awards 2025 - TeckNexus
Scroll to Top

Private Network Awards

Recognizing excellence in 5G, LTE, CBRS, and connected industries. Nominate your project and gain industry-wide recognition.
Early Bird Deadline: Sept 5, 2025 | Final Deadline: Sept 30, 2025