Find Investable Startups and Competitors
Search thousands of startups using natural language—just describe what you're looking for
Top 50 Deep Reinforcement Learning
Discover the top 50 Deep Reinforcement Learning startups. Browse funding data, key metrics, and company insights. Average funding: $56.8M.
Sort by
Phaidra
Phaidra develops AI-driven control systems that utilize deep reinforcement learning to optimize operations in mission-critical facilities, enhancing stability, energy efficiency, and sustainability. By replacing static, hard-coded control systems, Phaidra's technology continuously adapts and improves, significantly reducing energy consumption and CO2 emissions in industrial environments.
Funding: $50M+
Rough estimate of the amount of funding raised
Predictiva
Predictiva develops autonomous trading platforms that utilize deep reinforcement learning algorithms to analyze market data and execute trades without human intervention. This technology addresses the challenges of human error and emotional bias in financial trading, enabling users to achieve consistent, market-beating returns across various asset classes.
Funding: $3M+
Rough estimate of the amount of funding raised
Composabl
Composabl provides a platform for engineers to build intelligent autonomous agents by integrating technologies such as Deep Reinforcement Learning and Machine Learning using modular building blocks. This enables organizations to automate complex industrial processes without requiring extensive coding skills, enhancing operational efficiency and reducing reliance on software developers.
Funding: $5M+
Rough estimate of the amount of funding raised
StormForge
The startup offers a performance testing and application optimization platform that utilizes deep reinforcement learning and augmented intelligence to enhance data center energy efficiency. By leveraging advanced data science techniques, the platform helps businesses maintain operational uptime while reducing energy consumption.
AgileRL
AgileRL provides an open-source framework for reinforcement learning that enhances training speed by up to 10 times through RLOps, supporting both single-agent and multi-agent environments. The platform utilizes evolutionary hyperparameter optimization and distributed training to enable efficient convergence on optimal performance, addressing the challenges of slow training times and complex task management in AI development.
InstaDeep
InstaDeep develops AI-powered decision-making systems utilizing GPU-accelerated computing, deep learning, and reinforcement learning to tackle complex challenges in industries such as logistics, energy, and biology. Their technology enhances operational efficiency and precision, enabling enterprises to make data-driven decisions in an increasingly AI-centric landscape.
Funding: $100M+
Rough estimate of the amount of funding raised
May Mobility
May Mobility develops autonomous vehicles utilizing a Multi-Policy Decision Making (MPDM) system, a real-time reinforcement-learning AI that enables vehicles to learn and adapt to their environment every 200 milliseconds. This technology addresses the challenge of safely navigating unpredictable driving scenarios, allowing for efficient deployment of autonomous transportation solutions in diverse settings.
Funding: $100M+
Rough estimate of the amount of funding raised
Magic
Magic is developing AI models that automate software engineering and research by utilizing frontier-scale pre-training and domain-specific reinforcement learning. This technology addresses the inefficiencies in code generation and AI research, enabling developers to build applications more effectively and reliably than traditional methods.
Funding: $200M+
Rough estimate of the amount of funding raised
Augento (YC W25)
We offer a fine-tuning service for AI agents, allowing users to provide feedback on agent performance and improve the underlying model. Our platform uses reinforcement learning to optimize agent behavior based on user input, providing a turnkey solution for enhancing agent performance.
SynthLabs
Synth Labs is developing a hybrid approach that combines Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from AI Feedback (RLAIF) to enhance the alignment of large language models with human preferences. This method addresses the limitations of current alignment techniques by utilizing synthetic preferences to improve model performance and scalability.
Swaayatt Robots
Swaayatt Robots develops self-driving technology utilizing reinforcement learning to navigate complex and unpredictable traffic environments without the need for high-definition mapping. Their solutions aim to enhance the safety and efficiency of autonomous vehicles, making connected driving technology more accessible and cost-effective.
Funding: $5M+
Rough estimate of the amount of funding raised
poolside
Poolside is developing a foundation model specifically designed for software engineering, utilizing reinforcement learning from code execution feedback to enhance coding performance. The platform enables businesses to create custom AI models that continuously learn from their unique codebases and practices, improving developer efficiency and software quality.
Funding: $500M+
Rough estimate of the amount of funding raised
General Intuition
General Intuition provides a platform for training AI agents through rule‑based game simulations that combine perception, world‑model construction, and reinforcement learning. The system lets agents generate and test hypotheses in low‑cost simulated environments before transferring skills to real‑world robotics and autonomous systems via an open‑source API. It targets AI research labs and robotics manufacturers seeking embodied, adaptive intelligence.
Adaptive ML
Adaptive ML develops a platform that enables companies to privately tune and deploy language models using reinforcement learning from human feedback. This technology allows businesses to enhance model performance while maintaining data privacy and optimizing outputs based on specific user metrics.
Imitation Machines
Imitation Machines offers a robot learning platform that allows robots to acquire complex skills through human demonstrations. This approach uses imitation and reinforcement learning to create replicable behavioral models, eliminating the need for traditional robot programming and making advanced automation more accessible.
OfferFit
OfferFit is an AI-driven experimentation platform that utilizes reinforcement learning to autonomously test and optimize personalized marketing strategies for individual customers. By replacing traditional A/B testing, it enables rapid decision-making based on first-party data, significantly increasing conversion rates and customer engagement.
Funding: $50M+
Rough estimate of the amount of funding raised
Periodic Labs
Periodic Labs provides autonomous laboratory platforms that conduct high‑throughput, multi‑modal experiments and capture large, structured datasets for accelerated material discovery. Their reinforcement‑learning AI scientists analyze the data to generate and rank candidate materials, experimental protocols, and performance forecasts, accessible through an API or web dashboard for industrial R&D and academic labs.
EquiLibre Technologies
EquiLibre Technologies develops an algorithmic trading system that utilizes game theory and reinforcement learning to enhance trading strategies. This technology aims to improve decision-making in financial markets by optimizing trade execution and maximizing returns.
Funding: $10M+
Rough estimate of the amount of funding raised
Aampe
Aampe utilizes reinforcement learning and contextual bandit algorithms to create a personalized customer data platform that assigns virtual agents to each user, optimizing engagement based on individual behaviors. This approach addresses the inefficiencies of traditional data processing methods, enabling companies to leverage complex data for targeted messaging and improved conversion rates.
Funding: $20M+
Rough estimate of the amount of funding raised
Chai
CHAI is an AI platform that utilizes techniques such as Reinforcement Learning from Human Feedback (RLHF) and long-context modeling to create engaging conversational agents. The platform addresses the challenge of generating AI interactions that are both factually accurate and socially entertaining, empowering users to create and share their own content.
Funding: $10M+
Rough estimate of the amount of funding raised
Scale AI
Scale provides a full-stack Generative AI Platform that utilizes enterprise data to customize and fine-tune foundational models through supervised learning and reinforcement learning from human feedback (RLHF). The Scale Data Engine enhances model performance by delivering high-quality, curated data, addressing the critical bottleneck of data quality in AI development.
Diffblue
Diffblue Cover is an AI-powered solution that autonomously generates reliable Java unit and regression tests using reinforcement learning, significantly increasing testing speed and accuracy. This technology enables development teams to achieve high code coverage and maintain code quality while reducing the time spent on manual test writing by up to 95%.
Funding: $20M+
Rough estimate of the amount of funding raised
Weights & Biases
Weights & Biases provides an AI developer platform that centralizes experiment tracking, hyperparameter optimization, and model versioning across the machine‑learning lifecycle. Its SDKs and visual dashboards let teams log metrics, manage artifacts, and share insights, while hosted inference and serverless reinforcement‑learning services support model deployment and fine‑tuning. The platform includes enterprise‑grade security and compliance with optional dedicated or self‑hosted deployment options.
Warburg AI
Warburg AI develops modular, self-improving financial prediction models using machine learning and reinforcement learning techniques to enhance algorithmic trading for financial institutions. Their technology addresses the need for accurate market trend predictions, enabling clients to make informed investment decisions while adapting to changing market conditions.
Funding: $100K+
Rough estimate of the amount of funding raised
EnliteAI
The startup develops a geospatial data platform that utilizes artificial intelligence, specifically reinforcement learning and computer vision, for object detection in mobile mapping data. This technology enables infrastructure and asset management professionals to efficiently monitor and analyze assets throughout their entire life cycle.
Funding: $2M+
Rough estimate of the amount of funding raised
Bigyellowfish
Bigyellowfish is a behavioral risk management platform that utilizes reinforcement learning and data analytics to enhance workplace safety, productivity, and employee engagement in safety-critical industries. By monitoring job demands and resources, it helps organizations build resilience and improve operational efficiency, resulting in measurable reductions in insurance liabilities and operational costs.
Funding: $1M+
Rough estimate of the amount of funding raised
minds.ai
minds.ai's DeepSim platform utilizes supervised learning, reinforcement learning, and generative AI to optimize semiconductor manufacturing processes and enhance operational efficiency across all fabrication facilities. By automating software generation for hardware control and process design, it improves key performance indicators without disrupting existing workflows.
Funding: $5M+
Rough estimate of the amount of funding raised
Feedback Intelligence
Provides a platform for enterprises to implement reinforcement learning from human feedback (RLHF) in large language models (LLMs), centralizing data collection from both implicit and explicit user interactions. It identifies and prioritizes performance issues, generates synthetic data, and automates prompt tuning to optimize LLM reliability and reduce manual workload. This enables faster development cycles, improved user satisfaction, and reduced customer churn by aligning AI systems more closely with user needs.
Funding: $500K+
Rough estimate of the amount of funding raised
Sqwish
Sqwish offers a real-time input optimization layer via API to compress generative AI prompts and context by up to tenfold, significantly reducing token usage and inference costs. Its reinforcement learning engine adapts model selection and context based on live user interactions, optimizing AI performance directly against business outcomes like conversions.
Funding: $2M+
Rough estimate of the amount of funding raised
Exia Labs
Develops AI-driven wargaming and simulation tools that generate and evaluate both friendly and enemy courses of action using reinforcement learning. These tools enable military and defense organizations to identify optimal strategies at operational and strategic levels, improving decision-making in complex scenarios. Additionally, the company designs scalable data mesh systems for efficient data ingestion, storage, processing, and visualization.
VerifAI
VerifAI utilizes multiple large language models and reinforcement learning to automate the generation of tests, stimuli, and bug fixes for hardware and software verification. This approach accelerates verification processes by up to 100 times, significantly reducing regression times and enhancing overall testing efficiency.
Funding: $500K+
Rough estimate of the amount of funding raised
PhoenixAI
PhoenixAI develops a multi-modal AI navigation system for UAVs that utilizes reinforcement learning algorithms to optimize routes based on real-time factors such as signal quality, latency, and environmental conditions. This technology enables safe and efficient Beyond Visual Line of Sight (BVLOS) flights, addressing the challenges of maintaining reliable connectivity in diverse operational environments.
Denovo Sciences
Denovo Sciences utilizes a reinforcement learning-based platform to design and optimize chemical structures without relying on training datasets, enabling the discovery of novel therapeutics for targets with limited data. This technology allows for the rapid generation of multitarget small molecules, addressing the challenge of complex diseases that require modulation of multiple biological targets simultaneously.
Astrikos.Ai
Astrikos.ai offers a Smart Infra Platform that utilizes machine learning and deep reinforcement learning to analyze unstructured streaming data from operational systems in smart cities and industries. This platform provides real-time insights and predictive analytics, enabling data centers and other sectors to enhance operational efficiency and proactively manage emerging challenges.
Funding: $300K+
Rough estimate of the amount of funding raised
uno.ai
Uno provides an AI-driven Governance, Risk, and Compliance (GRC) co-pilot that automates the analysis and remediation of vulnerabilities across multi-cloud and hybrid environments. By leveraging large language models and deep reinforcement learning, Uno enhances incident response times by up to 30 times and scales operations tenfold, enabling organizations to manage IT and security risks more effectively.
Funding: $500K+
Rough estimate of the amount of funding raised
The Institute for Learning-enabled Optimization at Scale (TILOS)
TILOS is developing learning-enabled optimization techniques that bridge discrete and continuous optimization, enabling advancements in chip design, robotics, and communication networks. The initiative addresses the challenges of dynamic decision-making under uncertainty and nonconvex optimization in deep learning, enhancing efficiency in critical technology sectors.
Funding: $20M+
Rough estimate of the amount of funding raised
Plaif
This startup develops artificial intelligence robotics solutions that utilize deep learning, object recognition, reinforcement learning, and unsupervised learning to create fully autonomous robots. By enhancing robotic vision and sensory systems, the company enables industries to automate operations, reducing reliance on human intervention.
Funding: $3M+
Rough estimate of the amount of funding raised
Brick
Brick provides an integrated software and hardware solution that uses reinforcement learning and AI predictive analytics to optimize energy consumption in facilities and properties. This helps businesses reduce energy costs and decarbonize their operations through next-generation sustainable infrastructure.
DevLand
DevLand offers a collaborative coding platform with gamified AI learning experiences, allowing users to practice AI concepts like reinforcement learning and computer vision through interactive games. It also supports open-source project contributions, enabling developers to build practical skills and collaborate with a community.
GOAT.AI
GOAT.AI develops compact language models that utilize reinforcement learning and proximal policy optimization to mitigate issues such as hallucinations and context limitations in AI-generated content. The company focuses on enhancing human-AI interactions by training smaller models that outperform larger counterparts on domain-specific tasks.
Tessa
The startup is developing AI agents capable of autonomous internet interaction and self-improvement through machine learning techniques. These agents enhance operational efficiency by continuously adapting and learning from their environment, similar to human cognitive processes.
Funding: $500K+
Rough estimate of the amount of funding raised
Atman Labs
Atman Labs develops AI systems that utilize custom Reinforcement Learning, large-scale Knowledge Representation, and multi-modal Generative Models to emulate human expertise and facilitate proactive interactions. Their platform addresses the limitations of traditional AI by creating intelligent systems that enhance decision-making and user experience across various sectors, including e-commerce, travel, healthcare, and education.
VizLore LLC
This startup develops an artificial intelligence-powered cryptocurrency trading software that utilizes deep reinforcement learning and agent technology to generate hourly trading signals based on multiple market analysis indicators. By providing real-time dashboards for performance tracking, the software enables users to optimize trading strategies and achieve passive income with minimal execution errors.
Funding: $100K+
Rough estimate of the amount of funding raised
DeepVu
DeepVu provides Generative AI decisioning agents that utilize reinforcement learning and multi-scenario digital twins to enhance supply chain resilience, sustainability, and profitability. The platform enables real-time, data-driven planning decisions that optimize key performance indicators while addressing risks and inefficiencies in supply chain operations.
Funding: $500K+
Rough estimate of the amount of funding raised
Octane
Octane is a developer-focused platform that utilizes reinforcement learning to simulate over 80,000 potential attackers per second, enabling continuous scanning for vulnerabilities in smart contracts. By providing one-click bug fixes and proactive threat detection, Octane enhances security and accelerates deployment for blockchain developers.
eYARD
The startup develops a cloud-based AI optimization engine that utilizes deep reinforcement learning to enhance container terminal operations. By integrating with existing Terminal Operating Systems, it reduces costs, minimizes unproductive moves, and improves visibility in yard operations.
Funding: $500K+
Rough estimate of the amount of funding raised
Ludus Labs
Ludus Labs builds AI athletes trained with reinforcement learning to compete in novel sports and athletic challenges. These AI competitors, unbound by biological limitations, push the boundaries of performance and strategy in high-fidelity simulated environments.
Tagflow AI
Tagflow AI provides an automated platform for fine-tuning machine learning models using Reinforcement Learning from Human Feedback (RLHF), AI-assisted data labeling, and synthetic data generation. This technology enhances model accuracy and reduces training costs and turnaround times, making advanced AI accessible for businesses across various sectors.
ZeroAI
ZEROAI develops an autonomous driving system that utilizes a large language model (LLM) and a Stackelberg Reinforcement Learning Framework to enhance input filtering and ensure safe human intervention during operation. This technology addresses the risks of unsafe driving by providing real-time safety assessments and optimizing traffic flow, ultimately improving road safety and travel efficiency.
Deep Learning Robotics
The startup develops a vision-based robotic controller that utilizes deep learning algorithms to enable robots to learn tasks by observing human actions. This technology allows businesses to automate repetitive processes, reducing labor costs and increasing operational efficiency.
Funding: $500K+
Rough estimate of the amount of funding raised