Find Investable Startups and Competitors
Search thousands of startups using natural language—just describe what you're looking for
Top 50 Ai Inference Engine in Europe
Discover the top 50 Ai Inference Engine startups in Europe. Browse funding data, key metrics, and company insights. Average funding: $37.5M.
Sort by
Pruna AI
Pruna AI provides an AI optimization engine that enhances machine learning model performance with just two lines of code, utilizing execution kernel and graph optimization techniques. This solution reduces runtime costs and carbon emissions by making AI models faster and more efficient, enabling scalable inference without extensive re-engineering.
Funding: $5M+
Rough estimate of the amount of funding raised
AICA
AICA provides a visual, node‑based software platform that lets system integrators and robotics engineers build sensor‑driven, adaptive robot applications without custom code. Its hardware abstraction layer and built‑in AI inference engine enable the same skill set to run across multiple robot and sensor vendors, with cloud‑edge deployment, version control, and remote diagnostics.
Funding: $2M+
Rough estimate of the amount of funding raised
SEMRON
SEMRON develops a 3D-scalable AI inference chip using its proprietary CapRAM™ technology, which integrates compute-in-memory architecture to enhance energy efficiency and parameter density for AI applications. This technology addresses the high costs and power consumption of traditional AI chips, enabling efficient deployment of generative AI models directly on edge devices like smartphones and wearables.
Funding: $5M+
Rough estimate of the amount of funding raised
XMOS
XMOS provides the XCORE® Generative System‑on‑Chip (GenSoC), a programmable silicon platform that compiles natural‑language system specifications into deterministic, parallel firmware with sub‑microsecond latency. The SoC integrates audio I/O, voice‑fusion DSP, motor‑control peripherals and an on‑chip AI inference engine, allowing OEMs to replace multiple discrete chips with a single component for audio, voice, robotics and industrial automation applications. This reduces hardware bill‑of‑materials, development time and timing‑error risk while delivering guaranteed real‑time performance.
Funding: $10M+
Rough estimate of the amount of funding raised
RaiderChip
RaiderChip designs semiconductor hardware accelerators that enhance AI performance by addressing memory bandwidth limitations. Their solutions enable efficient AI inference for both edge and cloud applications, allowing users to run complex large language models locally with full privacy and without ongoing subscriptions.
Funding: $1M+
Rough estimate of the amount of funding raised
Nebius AI
Provides a fully managed AI cloud platform powered by NVIDIA® H100 and H200 Tensor Core GPUs, offering scalable GPU clusters with InfiniBand networking for high-speed data processing. Enables efficient model training, fine-tuning, and inference with tools like MLflow, PostgreSQL, and Apache Spark, reducing the complexity and cost of deploying AI applications at scale.
Funding: $500M+
Rough estimate of the amount of funding raised
Mind Foundry
Mind Foundry has developed an AI platform that utilizes Bayesian inference and probabilistic numerics to create responsible AI solutions for high-stakes applications in sectors such as defense, infrastructure, and insurance. The platform addresses the challenges of risk and complexity by enabling organizations to design, test, and deploy AI systems that enhance decision-making and improve operational outcomes.
Funding: $20M+
Rough estimate of the amount of funding raised
Embedl
Provides a Model Optimization SDK that reduces deep learning model memory usage by up to 95% and energy consumption by up to 83%, enabling efficient AI deployment on resource-constrained embedded systems. This technology accelerates inference speeds by up to 18x, helping industries like automotive, aerospace, and IoT develop cost-effective, high-performance AI solutions.
Funding: $5M+
Rough estimate of the amount of funding raised
Vespa.ai
Provides a scalable platform that combines a distributed text search engine with a vector database, enabling real-time querying, ranking, and inference over billions of data items with sub-100ms latency. It supports applications like hybrid search, recommendation systems, and generative AI by integrating machine-learned relevance models and multi-vector representations for improved accuracy and performance.
Funding: $20M+
Rough estimate of the amount of funding raised
DataCrunch
DataCrunch provides on-demand access to high-performance GPU instances and custom-built clusters powered by NVIDIA H200 and H100 technology, enabling efficient model inference and training for machine learning applications. The platform utilizes 100% renewable energy, offering a scalable solution that reduces the infrastructure burden for businesses deploying AI models.
Funding: $10M+
Rough estimate of the amount of funding raised
FluidStack
FluidStack provides on-demand access to thousands of NVIDIA A100 and H100 GPUs, enabling AI engineers to rapidly scale their training and inference workloads without long-term contracts. The platform offers fully managed GPU clusters with 24/7 support, significantly reducing operational overhead and accelerating model deployment.
Funding: $3M+
Rough estimate of the amount of funding raised
Neurobus
Neurobus provides neuromorphic AI hardware and software that delivers sub‑millisecond inference at milliwatt power for edge‑deployed drones, ground stations, and space assets. Its radiation‑hardened processors and event‑driven sensor fusion enable autonomous perception, navigation, and swarm coordination without continuous human or ground‑station control. The platform includes a real‑time operating system, simulation tools, and open APIs for integration into aerospace, defense, and logistics systems.
Seldon
Seldon is a machine learning deployment platform that enables organizations to deploy and manage models at scale, reducing deployment time from months to minutes. By providing production-ready inference servers and advanced experimentation tools, Seldon enhances operational efficiency and reduces infrastructure costs, delivering an average productivity gain of 85%.
Nscale
Nscale provides a GPU cloud platform optimized for AI workloads, featuring on-demand compute and inference services, dedicated training clusters, and scalable GPU nodes. The platform addresses the high costs and inefficiencies associated with AI model training and deployment by offering a fully integrated infrastructure powered by renewable energy in Europe.
Funding: $100M+
Rough estimate of the amount of funding raised
Hopsworks
The startup offers a Python-centric feature store that enables data teams to efficiently build and manage feature, training, and inference pipelines at scale. This platform enhances the performance and availability of AI and machine learning applications across diverse data sources and environments, facilitating the development of data-driven products for business growth.
Funding: $10M+
Rough estimate of the amount of funding raised
Four/Four
The startup develops an AI engine that integrates with customer relationship management (CRM) data to analyze customer behavior and preferences. This technology generates actionable insights that enhance customer segmentation and improve revenue retention by predicting trends and behaviors.
Funding: $300K+
Rough estimate of the amount of funding raised
Conjecture
Develops a new AI architecture called Cognitive Emulation, which translates expert workflows into reusable AI pipelines that mimic human reasoning. This approach addresses issues of unpredictability, incoherence, and inefficiency in current AI systems by enabling reliable, interpretable, and task-specific AI deployment.
Funding: $20M+
Rough estimate of the amount of funding raised
Synthara
Synthara provides ComputeRAM™ in‑memory computing IP that integrates MAC operations directly into SRAM cells of standard ASIC/FPGA designs, eliminating external memory accesses. The drop‑in IP delivers up to 100× higher inference throughput and 100× lower energy consumption for edge AI workloads without increasing die area, enabling ultra‑low‑power devices such as wearables, drones and smart sensors. A cloud‑based validation suite models performance and power budgets to accelerate time‑to‑market for fabless semiconductor and OEM customers.
Funding: $5M+
Rough estimate of the amount of funding raised
roofline
Roofline provides a software solution that enables the deployment of AI models across diverse hardware platforms with a single Python call, optimizing and quantizing models for efficient edge computing. This approach addresses the challenges of traditional deployment methods, which often lack adaptability and performance, by significantly reducing memory usage and latency while maintaining accuracy.
Innatera
Innatera provides ultra-low-power neuromorphic processors for edge AI applications. Their spiking neural processors enable real-time pattern recognition with sub-milliwatt power consumption and significantly reduced latency for battery-powered devices.
Funding: $20M+
Rough estimate of the amount of funding raised
Outmind
The startup has developed an AI-powered search engine that quickly retrieves and synthesizes information from internal tools, documents, and conversations. This technology enables organizations to locate relevant data efficiently, reducing time spent searching and minimizing errors in decision-making.
Funding: $2M+
Rough estimate of the amount of funding raised
Hive Power
Hive Power's FLEXO AI engine optimizes energy management for connected devices on the grid, including electric vehicle (EV) charging and energy communities. By utilizing data-driven algorithms, it enhances demand response and local energy consumption, enabling significant cost savings and improved grid efficiency.
Funding: $5M+
Rough estimate of the amount of funding raised
Unlikely AI
Unlikely AI is developing neurosymbolic AI that combines large language models with symbolic reasoning to enhance the accuracy, trustworthiness, and explainability of automated systems. This approach addresses the opacity of traditional AI models, enabling users to understand and trust AI-generated outcomes.
Funding: $10M+
Rough estimate of the amount of funding raised
InstaDeep
InstaDeep develops AI-powered decision-making systems utilizing GPU-accelerated computing, deep learning, and reinforcement learning to tackle complex challenges in industries such as logistics, energy, and biology. Their technology enhances operational efficiency and precision, enabling enterprises to make data-driven decisions in an increasingly AI-centric landscape.
Funding: $100M+
Rough estimate of the amount of funding raised
Sprout.ai
Sprout.ai is an intelligent claims automation engine that utilizes AI-powered technology to deliver real-time claims decisions with a 96% accuracy rate. It addresses the issue of lengthy claims processing times by enabling insurers to settle over 60% of claims instantly, regardless of the line of business.
Funding: $20M+
Rough estimate of the amount of funding raised
GEMESYS
The startup develops a neuromorphic chip that mimics human brain information-processing mechanisms to enhance artificial intelligence hardware. This technology addresses computing bottlenecks by enabling more efficient training of neural networks for AI applications.
Tembi
The startup offers an AI-as-a-service platform that aggregates data from various open and publicly accessible sources and applies machine learning models to enhance this data. Businesses can access enriched data and algorithm results through a user-friendly interface or API, facilitating informed decision-making without the need for extensive data processing expertise.
Funding: $3M+
Rough estimate of the amount of funding raised
Ori Industries
Ori provides on-demand access to top-tier GPUs and serverless Kubernetes for training and deploying machine learning models at scale. The platform offers cost-optimized solutions that allow users to pay only for the resources they utilize, addressing the need for flexible and efficient AI infrastructure.
Funding: $100M+
Rough estimate of the amount of funding raised
Leeroo
Leeroo provides a platform for the rapid development of end-to-end trainable AI systems by automating the integration of pre-trained models, tools, and data into cohesive workflows. This eliminates the complexity of manual integration, enabling businesses to efficiently transition from concept to production while fine-tuning entire AI systems to meet specific operational needs.
SKY ENGINE AI
SKY ENGINE AI provides a Synthetic Data Cloud that generates multimodal synthetic data for training deep learning models in computer vision, significantly reducing the need for real-world image acquisition. This technology enhances model accuracy by up to 4150% and accelerates AI development cycles by up to 3340 times, addressing the challenges of data scarcity and high costs in various industries such as automotive, healthcare, and robotics.
Funding: $5M+
Rough estimate of the amount of funding raised
Mistral AI
Mistral AI provides open-weight generative AI models that developers and businesses can customize and deploy in various environments, including on-premise and cloud platforms. Their technology enhances AI application development by offering high-performance models with validated reasoning capabilities, ensuring independence from specific cloud providers.
Inven
Inven utilizes AI algorithms to analyze extensive data from various sources, enabling professionals in private equity, investment banking, and management consulting to identify high-potential companies and acquisition targets quickly. This technology significantly reduces the time spent on manual research, allowing users to source deals and gain market insights more efficiently.
TitanML
TitanML provides an enterprise-grade LLM cluster for high-performance language model inference, enabling organizations to deploy AI applications securely within their own infrastructure. This solution addresses the need for data privacy and control while optimizing operational costs and performance through advanced inference techniques.
Funding: $10M+
Rough estimate of the amount of funding raised
Vsim Technology
Vsim develops a multi-physics simulation engine that utilizes proprietary machine learning models to enable real-time, physics-based simulations for robotics and AI applications. This technology accelerates research and reduces costs by allowing AI agents to learn complex tasks in accurately simulated environments significantly faster than traditional methods.
Funding: $20M+
Rough estimate of the amount of funding raised
Deckard Technologies
Deckard is an AI engine that utilizes natural language processing to aggregate and organize project knowledge from various software tools, enabling teams to access relevant information efficiently. It addresses the challenge of outdated documentation by transforming chat and notes into dynamic code documentation, facilitating seamless onboarding and collaboration among developers.
Funding: $3M+
Rough estimate of the amount of funding raised
NetMind
NetMind offers a unified platform for accessing and deploying diverse AI models, including LLMs and multimodal capabilities, through standard APIs and the Model Context Protocol. The service simplifies AI infrastructure by providing on-demand GPU cluster rentals and managed inference endpoints, enabling developers to integrate AI without managing complex deployments.
Literal Labs
Literal Labs develops AI models based on Tsetlin Machine algorithms, which provide ultra-low power consumption and up to 250 times faster inference compared to traditional neural networks. Their technology enables on-device training and explainable AI, addressing the need for energy-efficient and transparent solutions in edge computing applications.
Stanhope AI
Stanhope AI develops intelligent decision-making systems using Active Inference, enabling machines and robots to autonomously navigate real-world scenarios without extensive training data. Their technology enhances energy efficiency and computational performance, allowing for on-device learning and providing explainable outputs that foster accountability in AI applications.
Funding: $2M+
Rough estimate of the amount of funding raised
Graphcore
Graphcore designs and manufactures Intelligence Processing Units (IPUs) and the Poplar software stack to accelerate machine learning workloads. Their technology enables faster training and inference for complex AI models across various industries. IPUs are optimized for the parallel processing demands of deep learning, offering a distinct advantage for AI innovation.
XYZ Robotics
The company provides modular robotic automation platforms that combine six‑degree‑of‑freedom arms with integrated stereo vision and AI inference for high‑precision assembly, inspection, and material handling. Its plug‑and‑play hardware and RESTful/OPC‑UA software enable real‑time motion planning, vision‑based closed‑loop control, and seamless integration with MES and ERP systems, while cloud dashboards deliver KPI monitoring and predictive maintenance. The solution scales from single‑cell units to coordinated workcells for mid‑size to large manufacturers in automotive, electronics, and consumer goods.
Stream Analyze
Stream Analyze offers a platform for developing and managing analytical AI models on autonomous edge devices, enabling real-time data processing with minimal operational footprint. This technology reduces data transmission by 10,000 times and accelerates decision-making, addressing the need for efficient analytics in industries reliant on connected devices.
Funding: $3M+
Rough estimate of the amount of funding raised
Dtechtive
The startup has developed a Conversational AI Engine that enables exhaustive data search and quick analysis of datasets, providing insights into their quality and usage. This technology allows businesses to discover relevant datasets that are often inaccessible through traditional search engines, enhancing data utilization for both users and providers.
Funding: $300K+
Rough estimate of the amount of funding raised
mindtrace.ai
The startup develops neuromorphic algorithms and low-energy machine vision systems that utilize asynchronous event-based processing to enhance predictive decision-making. This technology enables clients to minimize reliance on large datasets, significantly lowering costs associated with AI deployment.
Funding: $5M+
Rough estimate of the amount of funding raised
Weco AI
The startup offers an artificial intelligence agent platform that automates machine learning tasks by analyzing data and generating code for data preprocessing, model training, inference, and evaluation. This tool enables developers to efficiently produce customized code and research reports, significantly reducing the time and expertise required for machine learning projects.
Funding: $500K+
Rough estimate of the amount of funding raised
Raven
Raven integrates an AI engine into Grasshopper, enabling users to create or modify parametric node graphs through natural‑language or image prompts. The system automatically generates compatible definitions across 900+ plugins, supports team naming conventions, and offers an interactive chat for iterative refinement, with enterprise options for data isolation and one‑click deployment to ShapeDiver.
GENXT
GENXT.AI provides confidential AI solutions that allow enterprises to utilize large language models (LLMs) without exposing sensitive data, ensuring that all business and private information remains encrypted and inaccessible to third parties. Their technology enables secure model deployment, fine-tuning, and inference within isolated cloud environments, mitigating the risks of data leakage and ensuring compliance with data protection regulations.
Daslab
Daslab provides a no-code platform for building, training, and deploying custom AI models. Its visual interface empowers domain experts to create bespoke machine learning solutions and deploy them as scalable APIs, accelerating AI integration into business workflows.
Funding: $5M+
Rough estimate of the amount of funding raised
Sqwish
Sqwish offers a real-time input optimization layer via API to compress generative AI prompts and context by up to tenfold, significantly reducing token usage and inference costs. Its reinforcement learning engine adapts model selection and context based on live user interactions, optimizing AI performance directly against business outcomes like conversions.
Funding: $2M+
Rough estimate of the amount of funding raised
Outter
Outter provides a plug-and-play AI platform that enables businesses to rapidly integrate advanced AI features like co-pilots and recommendation engines into their existing products. Their proprietary AI Engine™ and data privacy protocol abstract AI development complexities, delivering measurable ROI and enhancing user experience.
Focoos AI
The startup develops AI-driven software that automates the design and training of neural networks for artificial vision applications. This platform enables companies to deploy optimized vision models that achieve high accuracy while minimizing power consumption.
Funding: $300K+
Rough estimate of the amount of funding raised