Part 4 of CRN’s Big Data 100 takes a look at the vendors solution providers should know in the data warehouse and data lake systems space.
As part of the CRN 2026 Big Data 100, we’ve put together the following list of data warehouse and data lake companies—from well-established vendors to those in startup mode—that solution providers should be familiar with.
These vendors offer data warehouse and data lake systems that provide data and related capabilities such as data transformation and data governance needed to support advanced data analysis tasks.
(Note: A number of other IT vendors that are prominent in the data lakehouse space, including Databricks and Starburst, and major players in the data warehouse space, including AWS and Google Cloud, are included in the CRN Big Data 100 in other categories.)
Given the wave of AI development that’s boosting the demand for data, many data warehouse and data lake technology developers are adapting their platforms to go beyond data analytics to provide the high volumes of high-quality data needed for AI, generative AI and AI agent tasks.
This week CRN is running the 2026 Big Data 100 list in a series of slide shows, organized by technology category, spotlighting vendors of business analytics software, database systems, data warehouse and data lake systems, data management and integration software, data observability tools, and big data systems and cloud platforms.
Some vendors have big data product portfolios that span multiple technology categories. They appear in the slideshow for the technology segment in which they are most prominent.

Dremio
Top Executive: Sendur Sellakumar, CEO
Dremio develops a high-performance data lakehouse platform for managing huge volumes of data that support business intelligence, data science and agentic AI workloads.
Dremio natively incorporates the open-source Apache Polaris data catalog technology for Apache Iceberg tables.
On May 4, software giant SAP announced a deal to acquire Dremio for an undisclosed sum. The Dremio technology will be used to transform the SAP Business Data Cloud into an agentic data lakehouse to power AI agents.

Ocient
Top Executive: John Morris, CEO
Ocient’s flagship OcientAIQ unified data platform manages petabyte-scale enterprise data for agents, analysts and applications. The company is focused on providing industry-specific solutions built on its core system.
In March, Ocient announced a strategic partnership with TekSnyap, a federal technology and systems integration services provider, to provide hyperscale data analytics solutions for U.S. government agencies.
Also in March, Ocient said it was teaming up with AI technology developer Accrete AI to develop and deliver an AI-driven intelligence system for national security organizations.

OneHouse
Top Executive: Vinoth Chandar, Co-Founder and CEO
OneHouse offers its Universal Data Lakehouse platform, a cloud-native, fully managed lakehouse service built on the Apache Hudi software. The company says its technology blends the ease-of-use of a data warehouse with the scalability of a data lake.
In 2025, OneHouse debuted OneHouse Compute Runtime, providing the ability to manage and optimize data lakehouse workloads across multiple cloud platforms, query engines and open table formats.
Also in 2025, the company released “Open Engines,” a capability that enables organizations to leverage the interoperability of its data lakehouse while supporting open-source engines such as Apache Flink, Trino and Ray.

Teradata
Top Executive: Steve McMillan, President and CEO
Teradata, founded in 1979, was the IT industry’s data warehouse pioneer.
On May 7, the company unveiled the Teradata Autonomous Knowledge Platform, a new enterprise data and AI system offering that unifies structured and unstructured data, analytics and autonomous AI agents.
Also in May, the company launched the Teradata Factory, built on Dell Technologies server and storage systems and incorporating Teradata’s analytics software. It provides a pre-integrated, on-premises extension for the Teradata Autonomous Knowledge Platform.
For 2025 Teradata reported revenue of $1.66 billion, down 5 percent from $1.75 billion in 2024.

Yellowbrick Data
Top Executive: Neil Carson, Co-Founder and CEO
Yellowbrick Data’s SQL data platform supports a range of big data tasks including data warehousing, application analytics, streaming analytics, data residency and cloud migration.
The Yellowbrick Data system, which boasts high-performance data load and query capabilities, operates using a private data cloud architecture that does not expose data to the public internet, ensuring data is fully secured and protected.







