Find Investable Startups and Competitors
Search thousands of startups using natural language—just describe what you're looking for
Top 50 Ai Compute Platform - Late Stage
Discover the top 50 Ai Compute Platform startups at Late Stage. Browse funding data, key metrics, and company insights. Average funding: $211M.
Sort by
Nscale
-London, United KingdomNscale provides a GPU cloud platform optimized for AI workloads, featuring on-demand compute and inference services, dedicated training clusters, and scalable GPU nodes. The platform addresses the high costs and inefficiencies associated with AI model training and deployment by offering a fully integrated infrastructure powered by renewable energy in Europe.
Funding: $100M+
Rough estimate of the amount of funding raised
CoreWeave
-Roseland, United StatesCoreWeave provides a Kubernetes-native cloud platform optimized for large-scale, GPU-accelerated workloads, offering on-demand access to a diverse range of NVIDIA GPUs and CPU instances. This infrastructure enables businesses to achieve up to 35 times faster performance and 80% cost savings compared to traditional cloud providers, addressing the need for efficient and scalable compute resources in AI and machine learning applications.
Funding: $500M+
Rough estimate of the amount of funding raised
Akeana
-San Jose, United StatesThe startup provides CPU cores designed for managing circuits and complex data center processors, focusing on flexibility and performance. Their technology enhances AI integration, enabling clients to efficiently deploy processors across diverse computing systems.
Funding: $100M+
Rough estimate of the amount of funding raised
Vultr
-West Palm Beach, United StatesVultr provides cloud infrastructure with dedicated clusters and on-demand virtual machines powered by AMD and NVIDIA GPUs, enabling efficient deployment of AI and high-performance computing workloads. The platform offers scalable solutions at competitive pricing, addressing the need for accessible and powerful computing resources for developers and businesses globally.
Funding: $200M+
Rough estimate of the amount of funding raised
Krutrim
-Alo, IndiaKrutrim provides an AI computing infrastructure and AI-powered applications tailored for the Indian market, enabling businesses to leverage machine learning and data analytics. This platform addresses the need for accessible and scalable AI solutions, enhancing operational efficiency and decision-making capabilities for local enterprises.
Funding: $50M+
Rough estimate of the amount of funding raised
Ampere Computing
-Santa Clara, CubaAmpere Computing designs high-performance data center processors, including the Ampere Altra and AmpereOne families, which feature up to 192 cores optimized for cloud-native applications. These processors deliver significant price-performance advantages, achieving up to 50% better efficiency compared to traditional x86 architectures, while addressing the growing demand for scalable and sustainable computing resources.
Funding: $100M+
Rough estimate of the amount of funding raised
Enflame
Enflame develops cloud-based deep learning chips specifically designed for AI training platforms, enhancing computational efficiency and speed. This technology addresses the high resource demands of AI model training, enabling faster iterations and reduced operational costs for businesses.
Funding: $200M+
Rough estimate of the amount of funding raised
Panmnesia
-Daejeon, South KoreaThe startup manufactures a chip that utilizes Compute Express Link technology to enable data center operators to efficiently pool and manage artificial intelligence accelerators, processors, and memory. This approach enhances system performance by providing adequate memory resources for diverse device integration, addressing the challenges of scalability and resource allocation in large-scale computing environments.
Funding: $50M+
Rough estimate of the amount of funding raised
Foundry
-Palo Alto, United StatesFoundry provides an orchestration platform that enables AI developers to access NVIDIA GPU clusters on-demand, facilitating training, fine-tuning, and inference without long-term contracts. The platform addresses the challenge of unpredictable compute needs by offering flexible pricing options, including reserved and spot instances, ensuring reliable performance for critical workloads.
Funding: $50M+
Rough estimate of the amount of funding raised
Nebius AI
-Amsterdam, The NetherlandsProvides a fully managed AI cloud platform powered by NVIDIA® H100 and H200 Tensor Core GPUs, offering scalable GPU clusters with InfiniBand networking for high-speed data processing. Enables efficient model training, fine-tuning, and inference with tools like MLflow, PostgreSQL, and Apache Spark, reducing the complexity and cost of deploying AI applications at scale.
Funding: $500M+
Rough estimate of the amount of funding raised
Taalas
-Toronto, CanadaTaalas provides a platform that transforms any AI model into custom silicon, creating Hardcore Models that are hardwired for optimal performance. This approach significantly enhances computational efficiency, achieving up to 1000 times the performance of traditional software implementations.
Funding: $50M+
Rough estimate of the amount of funding raised
Voltage Park
-Berkeley, United StatesProvides bare-metal access to high-performance AI compute infrastructure powered by NVIDIA HGX H100 GPUs and a 3200 Gbps InfiniBand network, enabling low-latency, scalable training and inference for large-scale machine learning models. Offers transparent pricing and flexible deployment options, including on-demand nodes and long-term contracts, to meet the needs of demanding workloads in AI, HPC, and real-time applications.
Funding: $500M+
Rough estimate of the amount of funding raised
Modular
-Palo Alto, United StatesModular provides a unified AI software development platform that enables developers to optimize and deploy AI models across various cloud environments without rewriting code. By leveraging high-performance Mojo programming and a secure inference stack, the platform significantly reduces cloud costs while enhancing computational efficiency for enterprise applications.
Funding: $100M+
Rough estimate of the amount of funding raised
Luminous Computing
-Mountain View, United StatesLuminous Computing develops photonics chips designed to provide the necessary compute, memory, and bandwidth for advanced artificial intelligence applications. This technology addresses the limitations of current hardware, enabling instant processing of complex queries and facilitating the development of next-generation AI solutions.
Funding: $100M+
Rough estimate of the amount of funding raised
Clarifai
-Wilmington, United StatesClarifai offers an end-to-end AI lifecycle platform that automates data labeling, model training, and deployment, enabling organizations to build and operationalize AI applications efficiently. By standardizing workflows and optimizing compute resources, the platform reduces development time and costs, allowing enterprises to scale AI solutions rapidly.
Funding: $50M+
Rough estimate of the amount of funding raised
Habana
-San Jose, United StatesHabana Labs develops Intel® Gaudi® AI accelerators designed for high-performance deep learning training and inference, providing enterprises and cloud providers with efficient compute solutions. Their technology delivers up to 40% better price/performance on cloud instances, addressing the need for cost-effective and scalable AI infrastructure.
Funding: $50M+
Rough estimate of the amount of funding raised
Exafunction
-San Jose, United StatesExafunction optimizes GPU workloads by relocating code execution to remote resources while maintaining core logic on cost-effective CPU instances. This approach reduces operational costs and enhances computational efficiency for businesses reliant on high-performance computing.
Funding: $100M+
Rough estimate of the amount of funding raised
Fireworks AI
-Redwood City, United StatesFireworks AI provides a serverless inference platform that enables the rapid deployment and fine-tuning of compound AI models, optimizing for speed and cost efficiency. The technology addresses the challenges of slow model inference and high operational costs, allowing businesses to scale AI applications effectively while maintaining low latency and high throughput.
Funding: $50M+
Rough estimate of the amount of funding raised
Untether AI
-Toronto, CanadaUntether AI develops high-density AI accelerators that utilize at-memory computing to enhance the speed and energy efficiency of AI inference tasks. Their technology enables real-world applications, such as autonomous vehicles and smart cities, to operate more effectively and affordably.
Funding: $100M+
Rough estimate of the amount of funding raised
Ceremorphic
-San Jose, United StatesDevelops a proprietary AI supercomputing architecture using a 3nm process and multi-threaded processing (ThreadArch®) to deliver energy-efficient, high-performance training for large AI models. This technology addresses the growing demands of AI applications and drug discovery by reducing compute costs and energy consumption while enabling faster model training and in silico drug design.
Funding: $50M+
Rough estimate of the amount of funding raised
G42
-Abu Dhabi, United Arab EmiratesG42 is an artificial intelligence and cloud computing company that delivers tailored AI solutions across various industries to enhance operational efficiency and data management. By leveraging advanced machine learning algorithms and scalable cloud infrastructure, G42 addresses the challenges of data processing and analytics in complex business environments.
Funding: $1B+
Rough estimate of the amount of funding raised
Rescale
-San Francisco, United StatesRescale provides a cloud-based high-performance computing platform that enables scientific and engineering simulations with customizable resources tailored to specific workloads. This platform reduces turnaround times and enhances data insights, allowing organizations to optimize their research and development processes efficiently.
Funding: $100M+
Rough estimate of the amount of funding raised
NeuReality
-Caesarea, IsraelNeuReality designs AI-centric infrastructure that integrates a network addressable processing unit (NAPU) with purpose-built software to streamline AI inference workflows. This solution reduces reliance on traditional CPUs and networking components, addressing the complexity and inefficiencies that hinder AI model deployment and scalability.
Funding: $100M+
Rough estimate of the amount of funding raised
d-Matrix
-Santa Clara, CubaD-Matrix has developed Corsair, an AI inference platform that achieves 60,000 tokens per second with 1 ms latency for Llama3 8B models, significantly enhancing throughput and energy efficiency in datacenters. This technology addresses the high computational costs and energy consumption associated with large-scale AI inference, enabling organizations to scale their AI capabilities sustainably.
Funding: $100M+
Rough estimate of the amount of funding raised
Anyscale
-San Francisco, United StatesAnyscale provides a configurable AI platform powered by RayTurbo, enabling developers to optimize and scale AI applications across any cloud and hardware configuration. The platform enhances GPU utilization and reduces cloud costs by up to 50%, facilitating faster model training and deployment for complex AI workloads.
Funding: $200M+
Rough estimate of the amount of funding raised
Celestial AI
-Santa Clara, CubaCelestial AI develops the Photonic Fabric™, an optical interconnect technology that enhances data transfer speeds and reduces latency in AI computing and memory infrastructure. This technology addresses the limitations of traditional interconnects by enabling high-bandwidth, low-latency connections essential for processing extensive AI workloads efficiently.
Funding: $100M+
Rough estimate of the amount of funding raised
Petuum
-Pittsburgh, United StatesThe startup offers a machine learning infrastructure platform that provides a flexible operating system and virtualization interface for building and deploying machine learning and deep learning applications at scale. This technology enables enterprises to manage applications and hardware from a single terminal, resulting in increased productivity, reduced operational costs, and faster delivery times.
Funding: $100M+
Rough estimate of the amount of funding raised
GMI Cloud
-San Jose, United StatesGMI Cloud provides instant access to NVIDIA H100 GPUs for training and deploying generative AI applications, utilizing a Kubernetes-based cluster engine for efficient workload orchestration. This platform addresses the need for rapid GPU provisioning and management, enabling developers to focus on building AI models without the complexities of infrastructure setup.
Funding: $100M+
Rough estimate of the amount of funding raised
Ori Industries
-London, United KingdomOri provides on-demand access to top-tier GPUs and serverless Kubernetes for training and deploying machine learning models at scale. The platform offers cost-optimized solutions that allow users to pay only for the resources they utilize, addressing the need for flexible and efficient AI infrastructure.
Funding: $100M+
Rough estimate of the amount of funding raised
Lightelligence
-Boston, United StatesDevelops photonic computing solutions that integrate optical and electronic components to accelerate AI workloads, addressing the limitations of traditional electronic systems, such as the "memory wall." Products like the HUMMINGBIRD optical network-on-chip and PACE photonic arithmetic engine enable exponential increases in processing speed and efficiency for domain-specific applications.
Funding: $200M+
Rough estimate of the amount of funding raised
Groq
-Mountain View, United StatesGroq accelerates AI inference with custom-designed Language Processing Units (LPUs) that deliver sub-millisecond latency and consistent performance. Their cloud platform and on-premise solutions enable developers to deploy AI models efficiently and cost-effectively.
Funding: $500M+
Rough estimate of the amount of funding raised
Volumez
-Boston, United StatesVolumez offers a Data Infrastructure as a Service (DIaaS) platform that dynamically orchestrates compute, network, and storage resources across cloud environments to create optimized data infrastructures for various workloads. This solution addresses the challenges of performance inconsistency and resource inefficiency in data-intensive applications by delivering guaranteed high throughput, low latency, and maximized GPU utilization.
Funding: $50M+
Rough estimate of the amount of funding raised
MatX
-Mountain View, United StatesMatX manufactures specialized hardware designed for training and inference of large AI models, delivering up to 10× more computing power for workloads with over 7 billion parameters. This enables researchers and startups to efficiently train advanced models, significantly reducing the time and cost associated with developing state-of-the-art AI systems.
Funding: $100M+
Rough estimate of the amount of funding raised
Recogni
-San Jose, United StatesRecogni develops a multimodal AI inference system utilizing its proprietary Pareto AI Math to enhance performance while significantly reducing power consumption. This technology addresses the high costs and energy demands of generative AI models, enabling efficient and accurate processing for data centers.
Funding: $100M+
Rough estimate of the amount of funding raised
NextSilicon
-Tel Aviv, IsraelNextSilicon's Maverick-2 Intelligent Compute Accelerator (ICA) utilizes software-defined hardware to dynamically optimize performance for high-performance computing (HPC) and artificial intelligence (AI) workloads. This technology eliminates the need for extensive code rewrites, significantly reducing development time and enabling faster insights across various applications.
Funding: $200M+
Rough estimate of the amount of funding raised
StackPath
-Dallas, United StatesThe startup operates a cloud computing edge platform that enables the deployment and management of virtual machines, server-less applications, and enterprise website security, including DDoS protection. This platform addresses the challenges of workload acceleration and infrastructure management, allowing developers and enterprises to enhance their operational efficiency and security.
Funding: $200M+
Rough estimate of the amount of funding raised
Axelera AI
10
Relative Traction Score based on online presence metrics compared to companies in the same age group.
Axelera AI develops and sells high-performance, energy-efficient AI inference hardware for edge devices. Their Metis AI Platform integrates a specialized in-memory computing architecture with a comprehensive software stack, enabling efficient deployment of deep learning models for computer vision and natural language processing applications.
Funding: $50M+
Rough estimate of the amount of funding raised
Esperanto Technologies
-Mountain View, United StatesEsperanto Technologies develops massively parallel, energy-efficient chips based on the RISC-V instruction set architecture, specifically designed for Generative AI and high-performance computing (HPC) applications. Their ET-SoC-1 chip features over a thousand low-power RISC-V cores, providing superior compute efficiency and significantly reducing total cost of ownership for AI inference and HPC workloads.
Funding: $50M+
Rough estimate of the amount of funding raised
DreamBig Semiconductor
-San Jose, United StatesDreamBig Semiconductor offers a Chiplet platform with SMARTNIC-DPU solutions designed for low latency and high throughput in AI, data centers, and storage acceleration. Their technology addresses the need for efficient data processing and inherent security in high-demand computing environments.
Funding: $50M+
Rough estimate of the amount of funding raised
Liquid AI
-Cambridge, United KingdomLiquid AI develops non-transformer generative AI models, known as Liquid Foundation Models (LFMs), which require less memory while maintaining high performance across various applications. These models address the inefficiencies and high computational costs associated with traditional large language models, enabling scalable AI solutions for diverse industries.
Funding: $200M+
Rough estimate of the amount of funding raised
Ventana Micro Systems
-San Jose, United StatesVentana develops high-performance RISC-V CPUs with extensible instruction sets, available as multi-core chiplets or core IP, tailored for demanding applications in cloud, data center, 5G, AI, and automotive sectors. Their technology enables rapid productization and cost-effective custom system designs, addressing the need for efficient and scalable processing solutions in modern computing environments.
Funding: $100M+
Rough estimate of the amount of funding raised
SiMa.ai
-San Jose, United StatesSiMa.ai develops a software-centric platform utilizing its proprietary Machine Learning System on Chip (MLSoC) technology to enable efficient deployment of multimodal AI applications at the edge. This platform addresses the need for high-performance, power-efficient solutions that can scale across various edge devices and applications, significantly improving processing speed and energy consumption.
Funding: $200M+
Rough estimate of the amount of funding raised
SiPearl
-Maisons-Laffitte, FranceSiPearl is developing a high-performance, low-power microprocessor specifically for supercomputing and artificial intelligence, designed to integrate with any third-party accelerator. This technology addresses the need for efficient processing of large volumes of data in critical fields such as medical research, energy management, and climate modeling, while minimizing carbon footprint.
Funding: $100M+
Rough estimate of the amount of funding raised
Together AI
-San Francisco, United StatesTogether AI provides a cloud-based platform for training, fine-tuning, and deploying open-source generative AI models using NVIDIA's latest GPUs, enabling users to achieve high performance at a lower cost. The platform addresses the need for scalable AI infrastructure by offering customizable solutions that support the entire generative AI lifecycle, from model development to production deployment.
Funding: $200M+
Rough estimate of the amount of funding raised
Gensyn
-London, United KingdomGensyn is a machine learning compute protocol that connects distributed resources to facilitate the training of deep learning models. This approach addresses the need for open, permissionless, and neutral frameworks that enable efficient scaling and collaboration in machine intelligence development.
Funding: $50M+
Rough estimate of the amount of funding raised
Axelera AI
-Eindhoven, The NetherlandsAxelera AI manufactures AI acceleration hardware, specifically the Metis AI Processing Unit (AIPU), designed for efficient edge computing with up to 214 TOPS performance and 15 TOPS per watt. The technology addresses the need for cost-effective and energy-efficient solutions in generative AI and computer vision applications across various industries, including retail and security.
Funding: $100M+
Rough estimate of the amount of funding raised
Abacus.AI
-San Francisco, United StatesAbacus.AI offers an AI-assisted data science and MLOps platform that enables enterprises to automate the development and deployment of machine learning models using state-of-the-art generative AI technology. The platform addresses the complexity of building and managing applied AI systems by allowing organizations to create custom AI agents and streamline workflows across various data sources.
Funding: $50M+
Rough estimate of the amount of funding raised
WEKA
-Campbell, United StatesWEKA provides a cloud-native, software-defined data platform that enables organizations to efficiently store, process, and manage large volumes of data across on-premises and cloud environments. By transforming stagnant data silos into streaming data pipelines, WEKA enhances performance for AI and high-performance computing workloads while reducing energy consumption and carbon emissions.
Funding: $100M+
Rough estimate of the amount of funding raised
FuriosaAI
-Seoul, South KoreaFuriosaAI develops the RNGD data center accelerator, utilizing a Tensor Contraction Processor architecture to enhance the efficiency of AI inference with a power profile of just 150W. This technology enables enterprises to deploy large language models and multimodal applications with low latency and high throughput, significantly reducing energy consumption and operational costs in data centers.
Funding: $100M+
Rough estimate of the amount of funding raised
webAI
-Grand Rapids, United StateswebAI provides an enterprise platform for on-device AI development and deployment, enabling organizations to build, train, and run custom models locally. This approach ensures full data sovereignty, reduces latency, and lowers operational costs by eliminating cloud dependencies.
Funding: $50M+
Rough estimate of the amount of funding raised