Published Date : 05/06/2025
This month, IBM is taking a significant step forward in simplifying the enterprise data stack. The company has introduced new software that unifies, governs, and activates unstructured enterprise data, essential for powering AI agents and other advanced AI applications.
As organizations scale agents and other advanced AI applications, IBM is connecting them to essential unstructured data. This is achieved through two new products: IBM watsonx.data integration and IBM watsonx.data intelligence. These products complement IBM’s hybrid, open data lakehouse, watsonx.data, which manages the entire data-for-AI lifecycle in a single experience.
The new software is hybrid and open, connecting with third-party data stacks to provide flexibility, interoperability, and drive innovation from across the ecosystem. According to IBM internal testing, the new products can enable 40% more accurate AI than conventional RAG.
IBM client Lockheed Martin recently leveraged the transformed watsonx.data, enabling 70,000 engineers, scientists, and technicians to retrieve answers and information from millions of documents using natural language. John Clark, senior vice president of Technology and Strategic Innovation at Lockheed, commented, “We are rapidly accelerating our innovation and efficiency, to get solutions out of the lab and into the field, helping create a safer, more secure world.”
Businesses need generative AI and increasingly, agentic AI, to drive innovation, unlock productivity, and remain competitive. According to IBM’s new CEO study, 72% of business leaders view their organization’s proprietary data as key to unlocking the value of generative AI. However, this data is often unstructured and difficult to harness, trapped inside emails, PDFs, presentations, and videos.
Conventional RAG cannot handle the scale and complexity of unstructured data or properly combine it with structured data. As a result, enterprises' unstructured data, which can constitute up to 90% of their total data, is largely underutilized and not represented in their AI agents and other generative AI applications.
Watsonx.data integration introduces a new, unified data integration control plane designed to scale the delivery of AI-ready data. Data engineers can bridge across low-code, code-first, and agentic tools, allowing different authoring entry points. The software orchestrates data movement across diverse integration styles and features bulk and batch ETL/ELT, real-time streaming, data replication, and data observability capabilities across structured or unstructured data. With flexibility and adaptability at its core, data teams no longer need to navigate fragmented tooling or incur new technical debt with every new data storage paradigm shift – future-proofing their data infrastructure.
Watsonx.data integration is available as a standalone product beginning June 11. Its unstructured data integration and observability capabilities can also be leveraged through watsonx.data.
Watsonx.data intelligence transforms how organizations curate, manage, and utilize data, leveraging the power of AI to simplify data delivery across hybrid ecosystems. The software unifies data governance, quality, lineage, and sharing, empowering organizations to discover, trust, and access meaningful data. Watsonx.data intelligence is available as a standalone product beginning June 11. Its capabilities can also be leveraged through watsonx.data for data under management in the lakehouse.
Following IBM's acquisition of DataStax, IBM will continue to integrate its tools and technologies into watsonx.data. These include Astra DB and Hyper-Converged Database, which provide NoSQL and vector database capabilities powered by the open-source Apache Cassandra® and will be available June 11.
In June, IBM will debut watsonx BI, an AI analytics agent that rewires how teams engage with data and harnesses its power to deliver exceptional business intelligence using natural language. The agent can answer marketing, sales, operations, finance, and other domain questions in seconds and provides step-by-step explanations of its reasoning. Watsonx BI will be available as a standalone product and will also be available through watsonx.data.
IBM is integrating Gluten Accelerated Spark into watsonx.data to enhance performance of compute-intensive Spark SQL workloads. The technology can help accelerate query processing and enhance resource efficiency for large-scale data analytics, at a time when enterprise data volume is growing fast.
IBM also recently announced the addition of watsonx as an API provider within Meta’s Llama Stack, enhancing enterprises’ ability to deploy generative AI at scale and with openness at the core. Watsonx.data's Milvus database is already part of the Llama Stack framework, and this integration enables additional unstructured and structured data management and agentic retrieval.
IBM recently unveiled new agentic AI tools for the enterprise, including pre-build domain-specific agents, watsonx Orchestrate Agent Builder, and agentic AI governance capabilities. Paired with today's robust data offerings, organizations will have the data and tools necessary to successfully deploy agentic AI at scale.
Q: What is the primary purpose of IBM's new watsonx.data integration and watsonx.data intelligence products?
A: The primary purpose of IBM's new watsonx.data integration and watsonx.data intelligence products is to unify, govern, and activate unstructured enterprise data, essential for powering AI agents and other advanced AI applications.
Q: How does watsonx.data integration help data engineers?
A: Watsonx.data integration helps data engineers by providing a unified data integration control plane that bridges across low-code, code-first, and agentic tools, allowing different authoring entry points and orchestrating data movement across diverse integration styles.
Q: What is the significance of unstructured data in AI applications?
A: Unstructured data is significant in AI applications because it constitutes up to 90% of an enterprise's total data and is often underutilized. It is essential for AI agents and other generative AI applications to be accurate and performant.
Q: What new AI analytics agent is IBM introducing, and what can it do?
A: IBM is introducing watsonx BI, an AI analytics agent that can answer marketing, sales, operations, finance, and other domain questions in seconds using natural language and provides step-by-step explanations of its reasoning.
Q: How does IBM's integration with Meta’s Llama Stack benefit enterprises?
A: IBM's integration with Meta’s Llama Stack enhances enterprises’ ability to deploy generative AI at scale and with openness at the core, enabling additional unstructured and structured data management and agentic retrieval.