Find Investable Startups and Competitors
Search thousands of startups using natural language—just describe what you're looking for
Top 50 Data Warehouse Service - Late Stage
Discover the top 50 Data Warehouse Service startups at Late Stage. Browse funding data, key metrics, and company insights. Average funding: $208.9M.
Sort by
The startup operates a data analytics platform that enables rapid analysis of large datasets, handling tens of terabytes to exabytes with trillions of rows. By ingesting billions of rows per second and providing filtered aggregate results, the platform simplifies complex data ecosystems for organizations.
Funding: $147.7M
Rough estimate of the amount of funding raised
GreycroftOCA Ventures
GreycroftOCA Ventures
Funding: $147.7M
Rough estimate of the amount of funding raised
Firebolt is a cloud data warehousing platform that utilizes specialized indexing and JOIN acceleration to deliver sub-second query performance on terabytes of data. It enables businesses to analyze large datasets efficiently, reducing query latency from days to seconds while minimizing storage costs.
Funding: $268.7M
Rough estimate of the amount of funding raised
Alkeon Capital
Alkeon Capital
Funding: $268.7M
Rough estimate of the amount of funding raised
MotherDuck provides a serverless cloud data warehouse built on DuckDB for fast, scalable analytics. It offers independent, per-user compute nodes ensuring low-latency performance without resource contention. The platform supports SQL and natural language querying for both internal insights and customer-facing embedded analytics.
Funding: $100.0M
Rough estimate of the amount of funding raised
Felicis
Felicis
Funding: $100.0M
Rough estimate of the amount of funding raised
Onehouse is a fully managed cloud-native lakehouse service that ingests data from various sources in near real-time, enabling organizations to maintain a single source of truth without the need for complex data replication. By leveraging Apache Hudi and supporting multiple query engines, it reduces operational costs by over 50% while providing scalable access to analytics-ready data.
Funding: $68.0M
Rough estimate of the amount of funding raised
Craft Ventures
Craft Ventures
Funding: $68.0M
Rough estimate of the amount of funding raised
Yellowbrick Data provides a high-performance SQL data platform that supports enterprise data warehousing and streaming analytics with continuous data ingestion and low-latency query execution. This technology enables organizations to efficiently handle large-scale, concurrent workloads while minimizing unpredictable query runtimes, facilitating faster decision-making.
Funding: $75.0M
Rough estimate of the amount of funding raised
Funding: $75.0M
Rough estimate of the amount of funding raised
Airbyte is an open-source data integration engine that enables organizations to sync data from various applications to data warehouses, facilitating seamless data movement across multi-cloud environments. By providing a platform for building custom connectors with low-code or no-code options, Airbyte addresses the challenge of managing diverse data sources while ensuring data privacy and governance.
Funding: $181.2M
Rough estimate of the amount of funding raised
8VCAccelRebel Fund
8VCAccelRebel Fund
Funding: $181.2M
Rough estimate of the amount of funding raised
Fivetran provides an automated ELT platform that extracts, loads, and optionally transforms data from over 700 SaaS, database, ERP, and file sources into data warehouses, lakes, or downstream applications. The service handles schema drift, change‑data‑capture, and real‑time replication without custom code, offering enterprise‑grade security, governance, and hybrid deployment options. Users configure pipelines via a web UI or API and are billed per million rows synced.
Funding: $125.0M
Rough estimate of the amount of funding raised
Vista Credit Partners
Vista Credit Partners
Funding: $125.0M
Rough estimate of the amount of funding raised
Dremio provides a unified lakehouse platform that combines the flexibility of data lakes with the performance of data warehouses, utilizing Apache Iceberg for efficient data management and optimization. This solution enables organizations to perform high-speed, self-service analytics on all their data without the complexities of traditional ETL processes, significantly reducing total cost of ownership and time to insight.
Funding: $407.0M
Rough estimate of the amount of funding raised
Adams Street Partners
Adams Street Partners
Funding: $407.0M
Rough estimate of the amount of funding raised
RudderStack provides a warehouse-native customer data platform that enables businesses to collect, unify, and activate customer data in real-time. By centralizing data collection and ensuring data quality, it eliminates the complexities of data integration and compliance, allowing teams to deliver actionable insights and improve customer engagement efficiently.
Funding: $82.0M
Rough estimate of the amount of funding raised
Insight Partners
Insight Partners
Funding: $82.0M
Rough estimate of the amount of funding raised
Provides a data analytics platform built on an enhanced Trino SQL engine, enabling businesses to query and analyze data across hybrid, on-premises, and multi-cloud environments without moving it. This approach reduces data processing time by 25% and supports complex queries over exabytes of data, streamlining insights for data teams while maintaining security and scalability.
Funding: $431.1M
Rough estimate of the amount of funding raised
Alkeon Capital
Alkeon Capital
Funding: $431.1M
Rough estimate of the amount of funding raised
This company deploys autonomous robots to scan warehouse environments, creating a live digital twin with real-time data. The platform uses AI-powered analytics to provide inventory visibility, flag discrepancies, and offer actionable intelligence for operational optimization. This solution significantly reduces manual stocktaking, improves inventory accuracy, and enhances overall site safety and efficiency.
100+
10K+Approximate amount of employees
Funding: $115.4M
Rough estimate of the amount of funding raised
DTCP
DTCP
Funding: $115.4M
Rough estimate of the amount of funding raised
Reveal is a cloud-based platform that enables organizations to capture and analyze critical business data efficiently. By streamlining data collection and reporting processes, it enhances decision-making and operational effectiveness for enterprises.
Funding: $200.0M
Rough estimate of the amount of funding raised
K1 Investment Management
K1 Investment Management
Funding: $200.0M
Rough estimate of the amount of funding raised
Cybersyn provides data-as-a-service (DaaS) by delivering analytics-ready data directly to Snowflake instances, enabling businesses to make informed decisions without the need for complex data engineering. The platform offers real-time insights into consumer behavior and market trends, allowing companies to enhance their competitive strategies and operational efficiency.
Funding: $63.0M
Rough estimate of the amount of funding raised
Snowflake
Snowflake
Funding: $63.0M
Rough estimate of the amount of funding raised
Databricks provides a unified Data Intelligence Platform that combines data lake and warehouse capabilities into a single, scalable environment for data storage, governance, ETL, analytics, and AI model development. The platform offers serverless PostgreSQL lakebases, integrated open‑source tools like Spark, Delta Lake, and MLflow, and multi‑cloud support to reduce operational complexity and total cost of ownership for enterprise data and AI teams.
Funding: $1.0B
Rough estimate of the amount of funding raised
+ 4 Other investorsAndreessen HorowitzWCM Investment Management
+ 4 Other investorsAndreessen HorowitzWCM Investment Management
Funding: $1.0B
Rough estimate of the amount of funding raised
Snowflake provides a fully managed, multi‑cloud data platform that consolidates storage, compute, and data services for analytics, AI, and transactional workloads. It automates data ingestion, transformation, governance, and offers built‑in AI tools and a marketplace, enabling enterprises to run secure, scalable workloads without managing infrastructure.
Funding: $621.5M
Rough estimate of the amount of funding raised
Funding: $621.5M
Rough estimate of the amount of funding raised
Census is a Data Activation and Reverse ETL platform that enables businesses to define and sync trusted data from their data warehouse to over 150 operational tools without the need for code or CSVs. This solution eliminates data silos, allowing marketing and data teams to collaborate effectively by providing real-time access to actionable insights and standardized datasets.
Funding: $80.3M
Rough estimate of the amount of funding raised
Insight PartnersTiger Global Management
Insight PartnersTiger Global Management
Funding: $80.3M
Rough estimate of the amount of funding raised
Incorta provides a unified data and analytics platform that connects directly to source systems such as ERP, CRM, and other core applications using its Direct Data Mapping engine, removing the need for traditional batch ETL pipelines. The platform streams transaction‑level data into an in‑memory engine for sub‑second queries, offers low‑code data modeling, AI‑driven natural‑language analytics, automated workflows, and role‑based security, with open APIs for integration into existing BI and cloud environments.
Funding: $120.0M
Rough estimate of the amount of funding raised
Prysm Capital
Prysm Capital
Funding: $120.0M
Rough estimate of the amount of funding raised
The startup offers a financial database platform that aggregates real estate data from public, private, and internal sources to provide detailed market evaluations, property valuations, and tax assessments. This platform enables clients to reduce manual analytics costs and enhance their strategic decision-making processes.
Funding: $130.1M
Rough estimate of the amount of funding raised
Funding: $130.1M
Rough estimate of the amount of funding raised
The startup operates a cloud-based corporate sales support database that utilizes artificial intelligence to aggregate and analyze global sales data. This platform enables clients to efficiently identify and target prospective customers, enhancing the effectiveness of their sales operations.
Funding: $82.2M
Rough estimate of the amount of funding raised
Z Venture Capital
Z Venture Capital
Funding: $82.2M
Rough estimate of the amount of funding raised
Dune provides a cloud‑native platform that aggregates and normalizes on‑chain data from over 100 public blockchains into a unified, queryable schema. Users can run SQL‑compatible queries, build visual dashboards, and access results via REST APIs or export connectors for data warehouses and machine‑learning pipelines, all with enterprise‑grade security and near‑real‑time freshness.
Funding: $69.4M
Rough estimate of the amount of funding raised
Coatue
Coatue
Funding: $69.4M
Rough estimate of the amount of funding raised
StarTree provides a platform-as-a-service built on Apache Pinot, enabling real-time analytics with sub-second query response times on petabyte-scale data. This solution allows businesses to efficiently handle high concurrency demands while minimizing costs associated with data processing and analysis.
Funding: $75.0M
Rough estimate of the amount of funding raised
Notable Capital
Notable Capital
Funding: $75.0M
Rough estimate of the amount of funding raised
The startup provides a straightforward tool for collecting and organizing data from API endpoints, enabling users to efficiently manage their data flow. This solution addresses the challenge of data fragmentation by simplifying the integration and accessibility of diverse API data sources.
Funding: $100.0M
Rough estimate of the amount of funding raised
Funding: $100.0M
Rough estimate of the amount of funding raised
Impetus Technologies provides data analytics and enterprise AI solutions that enhance decision-making processes for businesses. By leveraging advanced algorithms and machine learning techniques, the company enables organizations to extract actionable insights from their data, improving operational efficiency and strategic planning.
Funding: $350.0M
Rough estimate of the amount of funding raised
Kedaara Capital
Kedaara Capital
Funding: $350.0M
Rough estimate of the amount of funding raised
Unravel offers an AI‑driven data observability platform that continuously monitors performance, cost, and data quality across major cloud data warehouses and processing engines. The system automatically generates and applies optimization actions, delivering real‑time insights and FinOps analytics through a dashboard and API to help data engineering teams meet SLA targets and reduce cloud spend.
Funding: $50.0M
Rough estimate of the amount of funding raised
+ 4 Other investorsThird Point Ventures
+ 4 Other investorsThird Point Ventures
Funding: $50.0M
Rough estimate of the amount of funding raised
Treasure Data offers a cloud-based data analytics platform that enables organizations to manage and analyze large volumes of data efficiently. The platform addresses the challenges of data silos and integration, providing actionable insights to enhance decision-making and operational performance.
Funding: $234.0M
Rough estimate of the amount of funding raised
SoftBank
SoftBank
Funding: $234.0M
Rough estimate of the amount of funding raised
Anomalo provides automated AI-driven data quality monitoring for enterprise data warehouses, utilizing unsupervised machine learning to detect anomalies and validate data integrity without requiring code. This solution addresses the issue of unreliable data by enabling rapid identification and resolution of data quality problems, ensuring accurate and trustworthy insights for business operations.
Funding: $121.0M
Rough estimate of the amount of funding raised
Smith Point Capital
Smith Point Capital
Funding: $121.0M
Rough estimate of the amount of funding raised
This entity focuses on advancing AI through rigorous science, particularly in the areas of Large Language Models (LLMs) and generative AI. They develop and release open-source technologies like DBRX and MPT models, alongside tools for efficient deep learning training and evaluation. The work aims to provide high-quality, commercially usable models and performance optimizations for the AI community.
1000+
50K+Approximate amount of employees
Funding: $1.0B
Rough estimate of the amount of funding raised
Andreessen HorowitzWCM Investment Management
Andreessen HorowitzWCM Investment Management
Funding: $1.0B
Rough estimate of the amount of funding raised
Flexe provides a cloud‑based platform that links enterprises to a network of over 800 warehouse operators across the United States and Canada, allowing on‑demand scaling of storage and fulfillment capacity. The system integrates with WMS, OMS and IMS via API, EDI or XML and delivers real‑time order routing, inventory visibility, and analytics while using a pay‑as‑you‑go pricing model to avoid capital expenditures and long‑term contracts. A dedicated logistics analyst control‑tower monitors performance and ensures service‑level compliance across the flexible network.
Funding: $119.0M
Rough estimate of the amount of funding raised
BlackRock
BlackRock
Funding: $119.0M
Rough estimate of the amount of funding raised
Sigma provides a cloud analytics solution with a spreadsheet-like interface that allows users to analyze billions of records in real-time using SQL, Python, or AI. This platform enables teams to collaborate effectively and automate data workflows while maintaining security and performance, addressing the need for accessible and scalable data analysis in organizations of all sizes.
Funding: $200.0M
Rough estimate of the amount of funding raised
AvenirSpark Capital
AvenirSpark Capital
Funding: $200.0M
Rough estimate of the amount of funding raised
Voltron Data provides Theseus, a GPU-accelerated SQL engine designed for processing petabyte-scale data without the need for indexing or data movement. It enables enterprises to significantly reduce query times, server counts, and operational costs, making it ideal for large-scale ETL and machine learning preprocessing tasks.
Funding: $110.0M
Rough estimate of the amount of funding raised
Walden Catalyst
Walden Catalyst
Funding: $110.0M
Rough estimate of the amount of funding raised
The startup offers an artificial intelligence-native platform that integrates data analytics and machine learning to enhance product development and customer service. Its developer-centric customer relationship management software enables teams to efficiently build and support products, significantly increasing productivity for clients.
Funding: $185.8M
Rough estimate of the amount of funding raised
Khosla VenturesMayfield Fund
Khosla VenturesMayfield Fund
Funding: $185.8M
Rough estimate of the amount of funding raised
MindsDB provides an AI analytics solution that enables teams to generate complex analysis and actionable answers across diverse, petabyte-scale data sources using natural language. The platform eliminates ETL by connecting directly to structured and unstructured data repositories, allowing non-technical users to query data and build AI models conversationally. This approach delivers real-time, trustworthy business intelligence without requiring data movement or extensive data engineering expertise.
Funding: $55.6M
Rough estimate of the amount of funding raised
NVentures
NVentures
Funding: $55.6M
Rough estimate of the amount of funding raised
Hex Technologies is a collaborative data science platform that integrates SQL, Python, and R within a modular, notebook-based workspace, enabling teams to conduct data analysis and create interactive applications. It addresses the challenge of fragmented data workflows by providing a unified environment for exploration, visualization, and reporting, enhancing team collaboration and decision-making.
Funding: $101.5M
Rough estimate of the amount of funding raised
Sequoia Capital
Sequoia Capital
Funding: $101.5M
Rough estimate of the amount of funding raised
The startup offers a cloud-based research data management platform that automates workflows for biomedical research, enabling collaborative studies and machine learning applications. This platform enhances data scalability and analysis, facilitating multicenter clinical trials and accelerating the pace of scientific discoveries.
100+
5K+Approximate amount of employees
Funding: $131.1M
Rough estimate of the amount of funding raised
Funding: $131.1M
Rough estimate of the amount of funding raised
The startup offers an autonomous big data analytics platform that utilizes declarative configurations and automation to manage cloud infrastructure and optimize data pipelines. This technology reduces maintenance efforts throughout the data lifecycle, enabling business managers to initiate projects and make informed decisions with ease.
Funding: $54.0M
Rough estimate of the amount of funding raised
Tiger Global Management
Tiger Global Management
Funding: $54.0M
Rough estimate of the amount of funding raised
VAST Data provides a unified data platform that integrates storage, database, and compute capabilities, eliminating the need for data tiering and silos. This architecture enables organizations to manage unstructured data efficiently, enhancing accessibility and performance for AI-driven applications.
Funding: $398.0M
Rough estimate of the amount of funding raised
Fidelity
Fidelity
Funding: $398.0M
Rough estimate of the amount of funding raised
Materialize is a cloud operational data store that uses Differential Dataflow to provide strongly consistent, real-time views of operational data with sub-second latency. This technology enables businesses to quickly respond to changes by integrating and querying data from multiple sources without the complexity of traditional data processing methods.
Funding: $100.5M
Rough estimate of the amount of funding raised
Redpoint
Redpoint
Funding: $100.5M
Rough estimate of the amount of funding raised
VIMAAN provides AI-driven computer vision solutions that enhance inventory tracking in warehouses by automating cycle counting, order validation, and real-time inventory visibility. This technology significantly reduces labor costs and improves inventory accuracy, enabling warehouses to achieve nearly instant ROI and minimize mis-shipments.
Funding: $53.3M
Rough estimate of the amount of funding raised
Amazon
Amazon
Funding: $53.3M
Rough estimate of the amount of funding raised
Acceldata provides a unified data observability platform that enables businesses to monitor data pipelines, detect anomalies, and ensure data quality in real-time. This technology helps organizations prevent data failures and optimize costs, ultimately enhancing the reliability of their data infrastructure.
Funding: $105.6M
Rough estimate of the amount of funding raised
Prosperity7 Ventures
Prosperity7 Ventures
Funding: $105.6M
Rough estimate of the amount of funding raised
inVia Robotics provides a Robots-as-a-Service solution that integrates autonomous mobile robots and AI-powered Warehouse Execution System software to enhance warehouse productivity. Their technology enables e-commerce distribution centers to achieve up to 5x productivity increases while minimizing labor costs and utilizing existing infrastructure.
Funding: $60.2M
Rough estimate of the amount of funding raised
M12 - Microsoft's Venture FundQualcomm Ventures
M12 - Microsoft's Venture FundQualcomm Ventures
Funding: $60.2M
Rough estimate of the amount of funding raised
Striim provides a unified data integration and streaming platform that enables real-time data ingestion and processing from diverse sources, including transaction logs and IoT sensors, using a SQL-like streaming engine. This technology allows businesses to achieve immediate insights and automate decision-making processes, enhancing operational efficiency and responsiveness to customer needs.
Funding: $50.0M
Rough estimate of the amount of funding raised
GS Growth
GS Growth
Funding: $50.0M
Rough estimate of the amount of funding raised
WEKA provides a cloud-native, software-defined data platform that enables organizations to efficiently store, process, and manage large volumes of data across on-premises and cloud environments. By transforming stagnant data silos into streaming data pipelines, WEKA enhances performance for AI and high-performance computing workloads while reducing energy consumption and carbon emissions.
Funding: $140.0M
Rough estimate of the amount of funding raised
Valor Equity Partners
Valor Equity Partners
Funding: $140.0M
Rough estimate of the amount of funding raised
Cribl provides a unified data management platform that enables organizations to collect, process, and route logs, metrics, and traces to various destinations without the need for additional agents or infrastructure. This technology reduces data volume and storage costs while ensuring that relevant data is delivered in the appropriate format for IT and security operations.
Funding: $725.2M
Rough estimate of the amount of funding raised
Google Ventures
Google Ventures
Funding: $725.2M
Rough estimate of the amount of funding raised
Astronomer offers a fully-managed data orchestration platform built on Apache Airflow, enabling organizations to deploy, manage, and scale their data workflows efficiently. The platform addresses the challenges of data reliability and integration, ensuring seamless data delivery for AI applications and data-driven decision-making.
Funding: $282.1M
Rough estimate of the amount of funding raised
Insight Partners
Insight Partners
Funding: $282.1M
Rough estimate of the amount of funding raised
Instabase is a platform that automates the extraction and analysis of unstructured data from various document types, enabling businesses to generate actionable insights and streamline workflows. By connecting applications without moving data, it addresses inefficiencies in data processing and enhances operational productivity across industries such as finance, healthcare, and public services.
Funding: $291.9M
Rough estimate of the amount of funding raised
Qatar Investment Authority
Qatar Investment Authority
Funding: $291.9M
Rough estimate of the amount of funding raised
CData Software provides data integration solutions that enable real-time access to over 300 data sources, including databases, APIs, and cloud applications, through standardized connectors. This technology allows organizations to streamline data access and eliminate the complexities of integrating disparate data systems, enhancing operational efficiency and data-driven decision-making.
250+
10K+Approximate amount of employees
Funding: $350.0M
Rough estimate of the amount of funding raised
Warburg Pincus
Warburg Pincus
Funding: $350.0M
Rough estimate of the amount of funding raised
Lucidworks offers an AI‑powered enterprise search and discovery platform that unifies structured and unstructured data, applying neural‑hybrid and generative models to infer user intent and deliver personalized, high‑relevance results for commerce, knowledge management, and customer‑service applications. The solution includes a no‑code orchestration layer, real‑time behavior signals via a lightweight JavaScript beacon, and built‑in analytics, A/B testing, and KPI dashboards to automate relevance tuning and enable continuous optimization. It can be deployed as SaaS, on‑premises, or hybrid, allowing large enterprises to integrate the platform within existing infrastructure without custom code.
Funding: $100.0M
Rough estimate of the amount of funding raised
Francisco Partners
Francisco Partners
Funding: $100.0M
Rough estimate of the amount of funding raised
ClickHouse develops a real-time analytical processing database management system optimized for online analytical processing (OLAP) that enables organizations to perform fast queries on large datasets. It addresses the challenge of slow data retrieval and high costs associated with traditional databases, providing significant improvements in query speed and storage efficiency.
Funding: $350.0M
Rough estimate of the amount of funding raised
Khosla Ventures
Khosla Ventures
Funding: $350.0M
Rough estimate of the amount of funding raised
Hydrolix is a streaming data lake that utilizes decoupled storage, indexed search, and stream processing to manage terabyte-scale log data efficiently. The platform reduces log data retention costs by 75% while enabling real-time query performance and eliminating the need for data aggregation or sampling.
Funding: $68.9M
Rough estimate of the amount of funding raised
S3 Ventures
S3 Ventures
Funding: $68.9M
Rough estimate of the amount of funding raised
Prophecy offers a Data Transformation Copilot that enables users to build, deploy, and monitor data pipelines using an AI-powered visual interface that generates native Spark or SQL code. This platform addresses the challenge of inefficient data processing by allowing business users to self-serve, significantly reducing reliance on data engineers and accelerating analytics workflows.
Funding: $115.6M
Rough estimate of the amount of funding raised
Smith Point Capital
Smith Point Capital
Funding: $115.6M
Rough estimate of the amount of funding raised