Find Investable Startups and Competitors
Search thousands of startups using natural language—just describe what you're looking for
Top 50 Ai Accelerator Hardware
Discover the top 50 Ai Accelerator Hardware startups. Browse funding data, key metrics, and company insights. Average funding: $101.1M.
Sort by
The startup develops a low-power system on a chip (SoC) with an integrated AI accelerator, designed for use in smartwatches, electronic shelf labels, and battery-powered IoT devices. This technology enables manufacturers to create energy-efficient smart devices that enhance user experience and functionality.
Rebellions develops AI accelerators that utilize HBM3e chiplet architecture and 5nm System-on-Chip technology to enhance energy efficiency and computational performance for deep learning applications. The company addresses the need for scalable and efficient AI inference solutions in the rapidly growing generative AI market.
Funding: $224.7M
Rough estimate of the amount of funding raised
KT CorpWa’ed Ventures
KT CorpWa’ed Ventures
Funding: $224.7M
Rough estimate of the amount of funding raised
Unaware provides a PCIe plug‑and‑play AI accelerator ASIC that runs neural network inference directly on hardware without a host operating system or runtime libraries. The chip’s dataflow architecture, on‑chip weight storage, and secure enclaves deliver over 10 TOPS/W efficiency while protecting model and data privacy, targeting privacy‑focused AI developers, edge‑computing startups, and small research labs.
Untether AI develops high-density AI accelerators that utilize at-memory computing to enhance the speed and energy efficiency of AI inference tasks. Their technology enables real-world applications, such as autonomous vehicles and smart cities, to operate more effectively and affordably.
Funding: $125.0M
Rough estimate of the amount of funding raised
Intel CapitalTracker Capital Management
Intel CapitalTracker Capital Management
Funding: $125.0M
Rough estimate of the amount of funding raised
Tiiny AI provides the Pocket Lab, a 200 g plug‑and‑play USB‑C edge compute module with an integrated GPU/TPU accelerator that runs full‑stack AI inference locally without subscription or token fees. The device includes a Python/ONNX SDK, secure boot, encrypted storage, and OTA model and firmware updates, targeting developers and small‑to‑mid‑size enterprises that require low‑latency, on‑device AI.
MatX manufactures specialized hardware designed for training and inference of large AI models, delivering up to 10× more computing power for workloads with over 7 billion parameters. This enables researchers and startups to efficiently train advanced models, significantly reducing the time and cost associated with developing state-of-the-art AI systems.
Funding: $119.9M
Rough estimate of the amount of funding raised
Spark Capital
Spark Capital
Funding: $119.9M
Rough estimate of the amount of funding raised
Axelera AI develops AI processing units (AIPUs) and associated software like the Voyager SDK to accelerate AI inference workloads. Their hardware solutions, including Metis AIPUs on M.2 and PCIe cards, deliver high performance and power efficiency for edge computing applications. This technology enables customers to deploy complex deep neural networks for computer vision and analytics at a lower cost and power consumption than traditional GPU solutions.
Funding: $135.6M
Rough estimate of the amount of funding raised
Funding: $135.6M
Rough estimate of the amount of funding raised
EdgeCortix develops the SAKURA-II Edge AI Platform, an energy-efficient AI accelerator that delivers up to 240 TOPS for real-time inferencing in compact, low-power modules. This technology addresses the need for high-performance AI processing at the edge, significantly reducing operational costs across various sectors, including defense, robotics, and smart manufacturing.
Funding: $37.0M
Rough estimate of the amount of funding raised
NEDO
NEDO
Funding: $37.0M
Rough estimate of the amount of funding raised
HyperAccel engineers specialized AI semiconductors, utilizing a novel LPU architecture designed for high-performance and energy-efficient Generative AI workloads. Their product line spans from edge devices to cloud datacenters, offering optimized solutions for LLM inference. The company supports leading AI frameworks through a dedicated software platform, ensuring seamless integration for developers.
30+
700+Approximate amount of employees
Funding: $38.4M
Rough estimate of the amount of funding raised
Korea Investment Partners
Korea Investment Partners
Funding: $38.4M
Rough estimate of the amount of funding raised
EnCharge AI develops high-efficiency analog in-memory computing GPUs and digital AI accelerators for edge-to-cloud deployment. Their validated hardware and flexible software offer significant improvements in performance, TCO, and sustainability compared to traditional solutions. The company provides versatile products from chiplets to PCIe cards, enabling seamless orchestration for on-device and cloud AI inference.
Funding: $44.3M
Rough estimate of the amount of funding raised
DARPA
DARPA
Funding: $44.3M
Rough estimate of the amount of funding raised
Lemurian Labs develops Tachyon, a software stack designed to eliminate hardware dependency in AI workloads. This platform enables AI applications to run on any hardware or cloud infrastructure while matching or exceeding the performance of hand-tuned kernels. Tachyon provides AI developers with enhanced productivity, superior performance, and complete portability across diverse computing environments.
Funding: $9.1M
Rough estimate of the amount of funding raised
Alumni VenturesPlug and PlayRaptor Group
Alumni VenturesPlug and PlayRaptor Group
Funding: $9.1M
Rough estimate of the amount of funding raised
Hailo develops AI processors optimized for deep learning applications on edge devices, enabling high-performance video processing and analytics with low power consumption. Their technology addresses the need for efficient AI inferencing in various industries, including automotive and industrial automation, by facilitating the deployment of complex neural networks in resource-constrained environments.
Funding: $341.2M
Rough estimate of the amount of funding raised
Funding: $341.2M
Rough estimate of the amount of funding raised
SiliconBee designs custom AI ASICs with dedicated tensor cores delivering up to 200 TOPS per watt, integrated HBM2e memory exceeding 1 TB/s bandwidth, and a low‑latency mesh interconnect for scalable multi‑chip configurations. Its hardware and accompanying software stack provide native support for TensorFlow, PyTorch, and ONNX, enabling high‑throughput training and inference across data‑center and edge form factors. The company serves data‑center operators, AI‑focused cloud providers, and OEMs building edge inference devices that require efficient, high‑performance machine‑learning compute.
NextSilicon's Maverick-2 Intelligent Compute Accelerator (ICA) utilizes software-defined hardware to dynamically optimize performance for high-performance computing (HPC) and artificial intelligence (AI) workloads. This technology eliminates the need for extensive code rewrites, significantly reducing development time and enabling faster insights across various applications.
Funding: $270.0M
Rough estimate of the amount of funding raised
Third Point Ventures
Third Point Ventures
Funding: $270.0M
Rough estimate of the amount of funding raised
The startup provides AI hardware-software solutions that enhance computational efficiency and performance for data-intensive applications. By integrating specialized AI accelerators with optimized software frameworks, the company addresses the high energy consumption and latency issues faced by traditional computing systems.
Founded 2022
NEUCHIPS develops AI ASIC solutions, including the Evo Gen 5 PCIe Card and Gen AI N3000 Accelerator, specifically designed for deep learning inference in data centers. Their technology addresses the need for energy-efficient hardware that minimizes total cost of ownership (TCO) while enhancing performance for machine learning applications.
Funding: $90.0M
Rough estimate of the amount of funding raised
Funding: $90.0M
Rough estimate of the amount of funding raised
Arago builds a silicon‑photonic inference accelerator that combines optical interconnects with standard CMOS manufacturing to deliver high‑bandwidth, low‑latency matrix‑multiply performance for large language models. The chip integrates PCIe Gen5/CXL interfaces and a compiler‑driven runtime that maps TensorFlow, PyTorch, and ONNX workloads with minimal code changes, offering over 2× throughput and >50 TOPS/W compared to GPUs. It targets hyperscale cloud providers and enterprises needing efficient AI inference at data‑center and edge scale.
Zettascale Computing Corporation designs energy-efficient, reconfigurable dataflow chips (XPUs) for AI training and inference. These chips adapt their architecture to specific AI models, optimizing dataflow and reducing memory movement for superior energy efficiency and throughput compared to traditional accelerators.
Habana Labs develops Intel® Gaudi® AI accelerators designed for high-performance deep learning training and inference, providing enterprises and cloud providers with efficient compute solutions. Their technology delivers up to 40% better price/performance on cloud instances, addressing the need for cost-effective and scalable AI infrastructure.
Funding: $75.0M
Rough estimate of the amount of funding raised
Intel Capital
Intel Capital
Funding: $75.0M
Rough estimate of the amount of funding raised
MithrilAI supplies silicon‑validated AI accelerator IP that integrates randomized multi‑party computation, hardware masking, and fault‑injection detection to protect inference workloads from side‑channel and physical attacks. The solution delivers high‑throughput inference with less than 10 % performance overhead and complies with DoD, FIPS, and ISO/IEC 15408 standards, enabling semiconductor manufacturers and system integrators to embed provably secure AI cores in defense, aerospace, autonomous transportation, and critical‑infrastructure devices.
Luminous Computing develops photonics chips designed to provide the necessary compute, memory, and bandwidth for advanced artificial intelligence applications. This technology addresses the limitations of current hardware, enabling instant processing of complex queries and facilitating the development of next-generation AI solutions.
Funding: $105.0M
Rough estimate of the amount of funding raised
Funding: $105.0M
Rough estimate of the amount of funding raised
Daidalos offers a parameterizable RTL IP library of AI accelerator cores that deliver high performance, low power, and deterministic latency for convolution, transformer, and matrix‑multiply workloads. The cores integrate with standard EDA toolchains and bus interfaces, include dynamic voltage/frequency scaling, fine‑grained power gating, and hardware security features, allowing fabless and ASIC designers to accelerate development and reduce silicon area and energy consumption across edge, automotive, and data‑center applications.
Exlords develops an AI and machine learning-integrated hardware platform for edge processors, enabling real-time AI inference on user devices through on-device neural processing units. This technology addresses the need for efficient, localized computing solutions that enhance performance and connectivity in heterogeneous computing environments.
Founded 2023
Mythic provides analog compute‑in‑memory AI inference accelerators that integrate compute and weight storage on a single silicon plane, eliminating off‑chip memory traffic. Delivered as standard M.2 cards, the APUs achieve up to 25 TOPS with 3‑4× lower power than comparable digital accelerators, and are compatible with TensorFlow and PyTorch for edge devices such as robots, drones, and smart‑city cameras.
Funding: $13.0M
Rough estimate of the amount of funding raised
Atreides ManagementLux Capital
Atreides ManagementLux Capital
Funding: $13.0M
Rough estimate of the amount of funding raised
This company develops compiler technology and hardware-software co-design methodologies to improve the performance and efficiency of AI chips. Their solutions aim to reduce power consumption and accelerate the development cycle for AI systems.
The startup manufactures a chip that utilizes Compute Express Link technology to enable data center operators to efficiently pool and manage artificial intelligence accelerators, processors, and memory. This approach enhances system performance by providing adequate memory resources for diverse device integration, addressing the challenges of scalability and resource allocation in large-scale computing environments.
Funding: $70.2M
Rough estimate of the amount of funding raised
InterVest Co.
InterVest Co.
Funding: $70.2M
Rough estimate of the amount of funding raised
Develops high-performance GPUs optimized for artificial intelligence workloads, leveraging advanced parallel processing architectures to accelerate training and inference tasks. These GPUs address the growing demand for scalable, efficient computing power in AI research and deployment, enabling faster model development and improved operational efficiency.
Founded 2024
Flux Computing builds silicon‑photonic accelerator modules that perform matrix multiplications with light, delivering over ten times the performance per watt of conventional GPUs. The system uses wavelength‑division multiplexed optical interconnects and exposes standard CUDA/OpenCL and TensorFlow/PyTorch APIs, allowing seamless integration into existing AI software stacks. Modular cards can be tiled in standard racks, giving hyperscale data centers and AI labs a scalable, energy‑efficient compute solution.
Artemis is an ultra‑low‑power ASIC accelerator that operates at near‑threshold voltage (≈0.275 V) and uses an asynchronous massively parallel architecture to deliver AI inference with about 35 % lower power consumption than comparable HPC accelerators. The platform integrates on‑chip ADCs, vector extensions, and a CXL‑compatible interconnect to enable near‑data processing and coherent memory access for hyperscale data‑center and edge AI workloads.
BrainChip licenses AI accelerator hardware designs and development tools for on-device intelligence. Their Akida processor IP utilizes sparsity and event-based neural networks to deliver unmatched efficiency for real-time AI applications. This technology reduces latency and power consumption, enabling devices to detect, analyze, and respond to events without cloud dependence.
Funding: $21.5M
Rough estimate of the amount of funding raised
Funding: $21.5M
Rough estimate of the amount of funding raised
SEMIQA provides an analog in‑memory accelerator that performs matrix‑vector operations directly within configurable synapse crossbars, reducing data movement and energy consumption. The platform delivers sub‑nanosecond latency with >10× lower energy per operation than digital ASICs, supports on‑chip learning, and includes a software stack that maps TensorFlow/PyTorch models to analog kernels for edge and data‑center deployments.
FuriosaAI develops the RNGD data center accelerator, utilizing a Tensor Contraction Processor architecture to enhance the efficiency of AI inference with a power profile of just 150W. This technology enables enterprises to deploy large language models and multimodal applications with low latency and high throughput, significantly reducing energy consumption and operational costs in data centers.
Funding: $194.3M
Rough estimate of the amount of funding raised
Funding: $194.3M
Rough estimate of the amount of funding raised
Lumai builds a data‑center‑grade AI processor that uses a 3‑dimensional free‑space optical matrix multiplier to execute parallel multiply‑accumulate operations at photon speed. The architecture provides up to 50× higher throughput and roughly 90 % lower power consumption than conventional silicon GPUs/TPUs while fitting into standard rack‑mount servers via PCIe and supporting major AI frameworks. This enables hyperscale cloud providers and enterprise AI clusters to scale training and inference workloads with reduced energy and hardware costs.
AnalogPort supplies semiconductor IP and design services that deliver power‑performance‑area‑optimized high‑speed interconnects for AI accelerators, including UCIe chiplet interfaces, multi‑protocol SerDes links, and advanced memory interfaces. Its silicon‑proven portfolio supports 16‑64 Gbps chiplet communication, up to 112 Gbps serial links, and GDDR7/HBM memory, enabling fabless AI chip designers and data‑center hardware vendors to integrate disaggregated compute with reduced risk and faster time‑to‑market.
MegaCool Technologies provides modular, closed-loop liquid cooling platforms designed for AI accelerator workloads. Their high‑conductivity heat exchangers and AI‑driven thermal management software dynamically adjust coolant flow to maintain target junction temperatures, delivering over 20 % higher thermal efficiency for hyperscale data‑center and enterprise GPU/ASIC clusters.
The startup provides CPU cores designed for managing circuits and complex data center processors, focusing on flexibility and performance. Their technology enhances AI integration, enabling clients to efficiently deploy processors across diverse computing systems.
Funding: $100.0M
Rough estimate of the amount of funding raised
Funding: $100.0M
Rough estimate of the amount of funding raised
GigaIO provides an open, composable infrastructure platform designed to dynamically pool and manage accelerators for AI and HPC workloads. Their AI fabric technology enables seamless edge-to-core scaling and supports heterogeneous compute resources like GPUs and inference accelerators. This solution delivers high-performance interconnectivity, leading to faster model training and reduced latency across diverse computing environments.
MultiCortex OS is an AI‑focused operating system that natively schedules and balances large‑language‑model inference across CPUs, GPUs, NPUs and other accelerators within a single node using a unified driver stack and dynamic workload allocator. It maximizes hardware throughput, lowers token‑per‑second costs, and provides enterprise‑grade security and AWS Marketplace integration for plug‑and‑play deployment of pre‑optimized models.
This startup develops AI hardware designed for real-time, ultra-low latency applications. Their platform enables adaptive applications previously considered unattainable due to performance limitations.
Founded 2021
AiReplyit Inc has developed a patent-pending PCB layout design that transforms standard gaming GPUs into efficient processors for large language models, significantly lowering the cost of AI computing. This technology enables researchers and small companies to access high-performance AI capabilities without the need for expensive, specialized hardware.
Founded 2024
Kalray offers high-performance processing acceleration solutions powered by its MPPA® architecture. These solutions efficiently handle data-intensive workloads in AI, automotive, and telecommunications, delivering superior performance and energy efficiency for demanding applications.
Funding: $16.1M
Rough estimate of the amount of funding raised
BNP Paribas
BNP Paribas
Funding: $16.1M
Rough estimate of the amount of funding raised
DashCrystal develops machine learning models and toolchains optimized for deployment on silicon and FPGA platforms, enabling high-performance inference directly on hardware accelerators. Their solutions target semiconductor manufacturers and hardware designers who need to integrate AI capabilities into chips and edge devices, reducing latency and power consumption. The company monetizes through licensing of proprietary models, software SDKs, and custom integration services for enterprise customers.
The startup provides a domain‑adaptive processor IP that can reconfigure accelerator pipelines at runtime, delivering ASIC‑level performance with the flexibility needed for diverse edge workloads. Integrated into a SoC, its on‑chip high‑bandwidth interconnect and unified compiler/runtime eliminate off‑chip data movement, reducing latency and power consumption. The IP is offered under a scalable licensing model for semiconductor OEMs and system integrators building IoT, autonomous vehicle, and wireless communication edge platforms.
Qingwei Intelligent Technology designs and manufactures AI chips optimized for high-performance computing tasks in artificial intelligence applications. Their technology addresses the demand for efficient processing power in AI systems, enabling faster data analysis and improved machine learning capabilities.
Founded 2018
DeGirum develops an AI Hub that integrates hardware and software infrastructure to streamline the development and deployment of edge AI applications. By providing a unified software toolchain and flexible hardware options, it reduces time to market and minimizes costs associated with multiple hardware and application investments.
Funding: $42.1M
Rough estimate of the amount of funding raised
Funding: $42.1M
Rough estimate of the amount of funding raised
The startup develops deep sub-threshold analog in-memory computation semiconductors that execute data-center-grade AI workloads with significantly lower power consumption and no active cooling. This technology processes data within memory, enabling industries to efficiently handle complex tasks while minimizing energy use.
20+
200+Approximate amount of employees
Funding: $50.5M
Rough estimate of the amount of funding raised
Funding: $50.5M
Rough estimate of the amount of funding raised
Hanxu Technology develops specialized computing chips, including the SpinPU-M01 FPGA and SpinPU-M02 ASIC, utilizing spintronics to enhance performance in complex optimization problems. Their technology operates at room temperature and offers significant improvements in speed and accuracy over traditional computing methods, addressing the limitations of quantum computing.
Founded 2023
ThinkForce is a Shanghai-based manufacturer of artificial intelligence chips designed to enhance processing efficiency in machine learning applications. Their technology addresses the demand for high-performance computing solutions in industries reliant on real-time data analysis.
Founded 2017
The startup provides a deep learning acceleration platform that enables developers to efficiently build, optimize, and deploy high-performance machine learning models across diverse hardware configurations. This technology reduces the time and resources required for model training and deployment, enhancing operational efficiency in machine learning applications.
Founded 1993
CoughAPP offers a wearable hardware module equipped with a high‑sensitivity microphone array and an on‑device AI accelerator that detects, classifies (dry vs. wet), and quantifies cough events in real time. All audio processing occurs locally for privacy, with encrypted logs and optional cloud sync via FHIR‑compatible APIs, allowing clinicians to monitor respiratory conditions continuously through integrated EHR and tele‑medicine platforms.