Oracle Launches HeatWave GenAI with In-Database LLMs and Vector Store

Oracle introduces HeatWave GenAI, featuring the industry's first in-database large language models and an automated vector store. This innovation enables enterprises to develop AI applications without data migration, AI expertise, or additional costs, outperforming competitors like Snowflake and Google BigQuery.
Oracle Launches HeatWave GenAI with In-Database LLMs and Vector Store
Image Credit: Oracle

Customers can now build generative AI applications seamlessly without AI expertise, data movement, or additional costs. HeatWave GenAI surpasses Snowflake, Google BigQuery, and Databricks in vector processing speeds, achieving 30x, 18x, and 15x faster performance, respectively.

Introduction of HeatWave GenAI

Oracle has announced the availability of HeatWave GenAI, featuring the industry’s first in-database large language models (LLMs) and an automated in-database vector store. This new offering also includes scale-out vector processing and contextual natural language interactions informed by unstructured content. HeatWave GenAI enables enterprises to leverage generative AI directly within their database, eliminating the need for AI expertise or data migration to separate vector databases. It is now available across all Oracle Cloud regions, Oracle Cloud Infrastructure (OCI) Dedicated Region, and multiple clouds at no extra cost to HeatWave customers.

Key Features of Oracle HeatWave GenAI

Developers can now create a vector store for unstructured enterprise content using a single SQL command, thanks to built-in embedding models. Users can perform natural language searches in one step, utilizing either in-database or external LLMs. With HeatWaveโ€™s high scalability and performance, there is no need to provision GPUs, simplifying application complexity, enhancing performance, and improving data security while lowering costs.

Enhancing Enterprise AI with HeatWave

โ€œHeatWaveโ€™s innovation continues with the integration of HeatWave GenAI,โ€ said Edward Screven, Oracleโ€™s Chief Corporate Architect. โ€œThese AI enhancements enable developers to quickly build rich generative AI applications without AI expertise or data movement. Users can interact intuitively with enterprise data to get accurate business insights swiftly.โ€

Industry Reactions to HeatWave GenAI

Vijay Sundhar, CEO of SmarterD, commented, โ€œHeatWave GenAI simplifies generative AI use, significantly reducing application complexity, inference latency, and costs. This democratization of AI will enhance our productivity and enrich our applications.โ€

Innovative In-Database LLMs and Vector Store by Oracle

Oracleโ€™s in-database LLMs reduce the complexity and cost of developing generative AI applications. Customers can perform data searches, content generation, and retrieval-augmented generation (RAG) within HeatWaveโ€™s vector store. Additionally, HeatWave GenAI integrates with OCI Generative AI services to access pre-trained models from leading LLM providers.

The automated in-database vector store allows businesses to utilize generative AI with their documents without transferring data to a separate database. The process, including document discovery, parsing, embedding generation, and insertion into the vector store, is fully automated within the database, making HeatWave Vector Store efficient and user-friendly.

Oracle HeatWaveโ€™s Advanced Vector Processing

HeatWaveโ€™s scale-out vector processing supports fast and accurate semantic search results. It introduces a native VECTOR data type and an optimized distance function, enabling semantic queries using standard SQL. HeatWaveโ€™s in-memory hybrid columnar representation and scale-out architecture ensure near-memory bandwidth execution and parallel processing across up to 512 nodes.

HeatWave Chat: Simplifying User Interactions

HeatWave Chat, a Visual Code plug-in for MySQL Shell, provides a graphical interface for HeatWave GenAI. It allows developers to ask questions in natural language or SQL, maintain context, and verify answer sources. The integrated Lakehouse Navigator facilitates the creation of vector stores from object storage, enhancing the user experience.

Impressive Benchmark Performance of HeatWave GenAI

HeatWave GenAI demonstrates impressive performance in creating vector stores and processing vector queries. It is 23x faster than Amazon Bedrock for creating vector stores and up to 80x faster than Amazon Aurora PostgreSQL for similarity searches, delivering accurate results with predictable response times.

HeatWave GenAI: Customer and Analyst Endorsements

Safarath Shafi, CEO of EatEasy, praised HeatWaveโ€™s in-database AutoML and LLMs for their differentiated capabilities, enabling new customer offerings and improving performance and quality of LLM results.

Eric Aguilar, founder of Aiwifi, highlighted HeatWaveโ€™s simplicity, security, and cost-effectiveness in leveraging generative AI for enterprise needs.

Holger Mueller, VP at Constellation Research, emphasized HeatWaveโ€™s integration of automated in-database vector stores and LLMs, which allows developers to create innovative applications without moving data, ensuring high performance and cost efficiency.

Oracle HeatWave: Integrated AI and Analytics Solution

HeatWave is the only cloud service offering integrated generative AI and machine learning for transactions and lakehouse-scale analytics. It is a key component of Oracleโ€™s distributed cloud strategy, available on OCI, AWS, Microsoft Azure via Oracle Interconnect for Azure, and in customer data centers with OCI Dedicated Region and Oracle Alloy.

Read theย HeatWave technical blog


Recent Content

Award Category: Private Network Excellence in Manufacturing

Winner: Ericsson


Ericsson has been recognized with the TeckNexus 2024 Award for “Private Network Excellence in Manufacturing” for its transformative work at the USA 5G Smart Factory in Lewisville, Texas, and global deployments such as the Smart Factory Innovation Centre in Wolverhampton, UK, Atlas Copco Tools, and Toyota Material Handlingโ€™s facility in Columbus, Indiana. By integrating private 5G connectivity with advanced Industry 4.0 technologies, Ericsson has set new benchmarks for optimizing manufacturing processes, enhancing supply chain resilience, and elevating operational efficiency. This award underscores Ericssonโ€™s leadership in leveraging private 5G to drive innovation in areas such as remote inspections, predictive maintenance, and sustainable production, redefining modern manufacturing standards through secure and scalable connectivity solutions.

Award Category: Private Network Excellence in Energy and Utilities

Winner: Nokia & Southern California Edison (SCE)


Nokia and Southern California Edison (SCE) have set a new benchmark for utility innovation with the launch of North Americaโ€™s first private 5G Field Area Network (FAN), earning the 2024 TeckNexus “Private Network Excellence in Energy and Utilities” award. This transformative initiative leverages Nokiaโ€™s advanced 5G technology and SCEโ€™s Citizens Broadband Radio Service (CBRS) spectrum to optimize utility grid operations, enhancing resilience, efficiency, and the integration of renewable energy sources. By modernizing utility grids through scalable, high-performance private 5G connectivity, Nokia and SCE demonstrate the potential of private networks to meet evolving energy demands, improve sustainability, and address critical environmental challenges in the energy and utilities sector.

Award Category: Private Network Excellence in Education

Winner: InfiniG

Partner: Parkside Elementary School, Intel, AT&T, and T-Mobile


InfiniGโ€™s Mobile Coverage-as-a-Service (MCaaS) solution has earned the 2024 TeckNexus “Private Network Excellence in Education” award for its transformative impact on student safety and connectivity at Parkside Elementary in Murray, Utah. This innovative deployment, completed in partnership with Intel, AT&T, and T-Mobile, provided comprehensive in-building mobile coverage to address critical safety and communication challenges for students, teachers, staff, and parents. By enhancing secure and connected educational environments, InfiniGโ€™s solution exemplifies the potential of private networks to improve campus security and foster more connected learning experiences.

Award Category: Private Network Excellence in Agriculture

Winner: Invences &

Partner: Trilogy Networks


Invences Inc., in collaboration with Trilogy Networks, has been recognized with the 2024 TeckNexus “Private Network Excellence in Agriculture” award for their pioneering deployment of a private 5G network tailored to transform farming operations. Implemented at a large-scale agricultural project in Fargo, North Dakota, this innovative collaboration drives digital transformation in agriculture through precision farming, real-time monitoring, AI-driven insights, and seamless data integration across rural and remote environments. Their efforts exemplify how 5G technology can revolutionize agricultural productivity and sustainability, setting new standards for efficiency and innovation in the sector.
SoftBank and Fujitsu are joining forces to advance the commercialization of AI-RAN, integrating AI with Radio Access Networks to enhance communication performance and efficiency. Targeted for deployment by 2026, this collaboration focuses on R&D, vRAN software development, and AI-driven optimization of mobile networks, with trials underway and a dedicated verification lab set to open in Dallas.
Whitepaper
As VoLTE becomes the standard for voice communication, its rapid deployment exposes telecom networks to new security risks, especially in roaming scenarios. SecurityGenโ€™s research uncovers key vulnerabilities like unauthorized access to IMS, SIP protocol threats, and lack of encryption. Learn how to strengthen VoLTE security with proactive measures such as...
Whitepaper
Dive into the comprehensive analysis of GTPu within 5G networks in our whitepaper, offering insights into its operational mechanics, strategic importance, and adaptation to the evolving landscape of cellular technologies....
Article & Insights
Non-terrestrial networks (NTNs) have evolved from experimental satellite systems to integral components of global connectivity. The transition from geostationary satellites to low Earth orbit constellations has significantly enhanced mobile broadband services. With the adoption of 3GPP standards, NTNs now seamlessly integrate with terrestrial networks, providing expanded coverage and new opportunities,...

Subscribe To Our Newsletter

Scroll to Top