Find Investable Startups and Competitors
Search thousands of startups using natural language—just describe what you're looking for
Top 50 Ai Inference Engine in Asia
Discover the top 50 Ai Inference Engine startups in Asia. Browse funding data, key metrics, and company insights. Average funding: $108.1M.
Sort by
Simplismart provides a high-performance inference engine that enables rapid deployment and fine-tuning of generative AI models on-premises or across various cloud platforms. This technology reduces model deployment time from months to days, significantly lowering operational costs while enhancing inference speed and scalability.
Funding: $8.3M
Rough estimate of the amount of funding raised
Google for Startups
Google for Startups
Funding: $8.3M
Rough estimate of the amount of funding raised
Rebellions develops AI accelerators that utilize HBM3e chiplet architecture and 5nm System-on-Chip technology to enhance energy efficiency and computational performance for deep learning applications. The company addresses the need for scalable and efficient AI inference solutions in the rapidly growing generative AI market.
Funding: $224.7M
Rough estimate of the amount of funding raised
KT CorpWa’ed Ventures
KT CorpWa’ed Ventures
Funding: $224.7M
Rough estimate of the amount of funding raised
NeuReality designs AI-centric infrastructure that integrates a network addressable processing unit (NAPU) with purpose-built software to streamline AI inference workflows. This solution reduces reliance on traditional CPUs and networking components, addressing the complexity and inefficiencies that hinder AI model deployment and scalability.
Funding: $114.6M
Rough estimate of the amount of funding raised
Alumni VenturesXT Venture Capital
Alumni VenturesXT Venture Capital
Funding: $114.6M
Rough estimate of the amount of funding raised
Cortica provides an autonomous AI platform that converts visual, audio, radar and time‑series sensor streams into compressed neural signatures using self‑learning, brain‑inspired networks. The system trains on unlabelled production data, runs inference on low‑power hardware, and adapts continuously to avoid bias, allowing partners in manufacturing, automotive, security, and healthcare to deploy domain‑specific perception and analytics without building foundational models.
Funding: $40.0M
Rough estimate of the amount of funding raised
CVS Health VenturesLRVHealth
CVS Health VenturesLRVHealth
Funding: $40.0M
Rough estimate of the amount of funding raised
The startup offers a visual recognition platform that autonomously processes diverse visual data, including infrared and X-ray images, while accurately tagging objects of interest. This technology enhances operational efficiency and ensures high-quality results for clients across various industries.
Funding: $24.7M
Rough estimate of the amount of funding raised
Funding: $24.7M
Rough estimate of the amount of funding raised
The startup operates an IoT platform that utilizes deep learning inference on edge devices to gather and analyze real-world data. This technology enables businesses to efficiently deploy and manage edge computing systems, reducing operational costs and time to market.
Funding: $32.7M
Rough estimate of the amount of funding raised
Global Brain Corporation
Global Brain Corporation
Funding: $32.7M
Rough estimate of the amount of funding raised
The startup has developed an AI dialogue engine that interprets ambiguous expressions in business communications by analyzing contextual cues. This technology enables clients to foster trust and empathy in customer interactions, enhancing engagement and satisfaction.
Exlords develops an AI and machine learning-integrated hardware platform for edge processors, enabling real-time AI inference on user devices through on-device neural processing units. This technology addresses the need for efficient, localized computing solutions that enhance performance and connectivity in heterogeneous computing environments.
Founded 2023
The startup provides AI infrastructure and services that enable businesses to access and implement machine learning models without requiring extensive technical expertise. By simplifying the deployment of AI technology, the company helps organizations leverage data-driven insights to enhance operational efficiency and decision-making.
Founded 2023
EdgeCortix develops the SAKURA-II Edge AI Platform, an energy-efficient AI accelerator that delivers up to 240 TOPS for real-time inferencing in compact, low-power modules. This technology addresses the need for high-performance AI processing at the edge, significantly reducing operational costs across various sectors, including defense, robotics, and smart manufacturing.
Funding: $37.0M
Rough estimate of the amount of funding raised
NEDO
NEDO
Funding: $37.0M
Rough estimate of the amount of funding raised
Hailo develops AI processors optimized for deep learning applications on edge devices, enabling high-performance video processing and analytics with low power consumption. Their technology addresses the need for efficient AI inferencing in various industries, including automotive and industrial automation, by facilitating the deployment of complex neural networks in resource-constrained environments.
Funding: $341.2M
Rough estimate of the amount of funding raised
Funding: $341.2M
Rough estimate of the amount of funding raised
Socra AI offers a modular AI platform that embeds custom machine‑learning pipelines into existing enterprise applications, automating data ingestion, preprocessing, model training, and real‑time inference through RESTful APIs and event‑driven microservices. The solution includes drag‑and‑drop pipeline building, automated hyper‑parameter optimization, enterprise‑grade security, and both SaaS and on‑premise containerized deployment options for data‑driven firms in finance, manufacturing, logistics, and retail.
75+
3K+Approximate amount of employees
Luchen Technology provides a platform for training and deploying large AI models, significantly reducing training and inference costs by up to 90% while enhancing model capacity and speed. Their solutions enable businesses to efficiently build high-quality AI applications with minimal resources, streamlining the development process across various hardware environments.
Founded 2021
The startup develops an AI-driven decentralized computing platform that integrates blockchain technology with edge computing protocols, enabling users to share computing resources and process data efficiently. By utilizing smart contracts and decentralized storage, the platform enhances transaction verification and provides reliable computing power for decentralized applications (dApps).
5+
100+Approximate amount of employees
Funding: $20.0M
Rough estimate of the amount of funding raised
Amber GroupPolygon Ventures
Amber GroupPolygon Ventures
Funding: $20.0M
Rough estimate of the amount of funding raised
Myelin Foundry develops edge AI algorithms that process complex unstructured data from video, voice, and sensors in real-time, optimizing performance on low-power devices. This technology enables enterprises to achieve immediate insights and automation, reducing operational costs and enhancing user experiences.
Funding: $9.7M
Rough estimate of the amount of funding raised
SIDBI Venture Capital
SIDBI Venture Capital
Funding: $9.7M
Rough estimate of the amount of funding raised
HyperAccel engineers specialized AI semiconductors, utilizing a novel LPU architecture designed for high-performance and energy-efficient Generative AI workloads. Their product line spans from edge devices to cloud datacenters, offering optimized solutions for LLM inference. The company supports leading AI frameworks through a dedicated software platform, ensuring seamless integration for developers.
30+
700+Approximate amount of employees
Funding: $38.4M
Rough estimate of the amount of funding raised
Korea Investment Partners
Korea Investment Partners
Funding: $38.4M
Rough estimate of the amount of funding raised
Sophie AI is a visual agentic AI platform that combines multimodal computer vision, large‑language models, and augmented‑reality overlays to diagnose and guide repair of hardware products in real time. It embeds AI‑driven visual workflows into self‑service portals, contact‑center interfaces, and field‑service apps, enabling customers and agents to follow step‑by‑step AR instructions, reducing on‑site dispatches and improving first‑time‑fix rates. The solution includes scalable cloud inference, secure data handling, and analytics dashboards for operational KPI tracking.
Funding: $30.0M
Rough estimate of the amount of funding raised
OurCrowdSalesforce Ventures
OurCrowdSalesforce Ventures
Funding: $30.0M
Rough estimate of the amount of funding raised
FuriosaAI develops the RNGD data center accelerator, utilizing a Tensor Contraction Processor architecture to enhance the efficiency of AI inference with a power profile of just 150W. This technology enables enterprises to deploy large language models and multimodal applications with low latency and high throughput, significantly reducing energy consumption and operational costs in data centers.
Funding: $194.3M
Rough estimate of the amount of funding raised
Funding: $194.3M
Rough estimate of the amount of funding raised
Nazar provides a platform for training, deploying, and managing vision AI models, including image segmentation and feature point detection. The platform simplifies the AI pipeline, allowing users to own their data and models while offering optimized inference speeds and model parallelization. Nazar aims to support various data types and neural network architectures in the future.
Founded 2025
NEUCHIPS develops AI ASIC solutions, including the Evo Gen 5 PCIe Card and Gen AI N3000 Accelerator, specifically designed for deep learning inference in data centers. Their technology addresses the need for energy-efficient hardware that minimizes total cost of ownership (TCO) while enhancing performance for machine learning applications.
Funding: $90.0M
Rough estimate of the amount of funding raised
Funding: $90.0M
Rough estimate of the amount of funding raised
Mianbi Smart develops lightweight, high-performance AI models that efficiently run on mainstream consumer electronics and various terminal devices, optimizing computational resources for real-time applications. Their technology addresses the need for scalable AI solutions in edge computing, enabling advanced functionalities in mobile devices, wearables, and smart environments while significantly reducing inference costs and memory usage.
Founded 2022
AI21 Labs develops generative AI systems that utilize advanced foundation models and a built-in Retrieval-Augmented Generation (RAG) engine to create conversational AI applications grounded in enterprise data. Their technology enhances enterprise workflows by providing accurate, reliable, and scalable AI solutions tailored to specific organizational needs.
Funding: $508.0M
Rough estimate of the amount of funding raised
Comcast VenturesIntel Capital
Comcast VenturesIntel Capital
Funding: $508.0M
Rough estimate of the amount of funding raised
Axonvertex AI offers a private, on‑premise AI engineering platform for healthcare and clinical‑trial organizations, enabling secure execution of large language and tiny models with built‑in NIST‑aligned responsible‑AI safeguards, human‑in‑the‑loop controls, and FHIR‑based data integration. The platform provides auditable AI agents and edge inference to support compliant decision‑support and workflow automation while keeping patient data local.
Founded 20235+
Homebrew develops local AI solutions, including the Jan AI Assistant and the Ichigo real-time voice AI, utilizing energy-efficient hardware to enhance performance. The company addresses the need for accessible, efficient AI tools that operate without reliance on cloud infrastructure, ensuring user privacy and reducing latency.
Cloudwalk offers an AIoT platform that integrates edge devices, a collaborative operating system (CWOS), and multimodal foundation models to provide on‑device inference and standardized APIs for vision, speech, and language processing. The solution includes privacy‑computing, data‑governance, and AI‑Agent tools, allowing large enterprises and public agencies in finance, manufacturing, energy, and smart city domains to deploy AI capabilities without extensive custom integration.
Funding: $253.7M
Rough estimate of the amount of funding raised
Funding: $253.7M
Rough estimate of the amount of funding raised
Baidu provides an integrated AI ecosystem comprising a cloud‑based AI Open Platform with over 270 pre‑trained model APIs for vision, speech, and language, the DuerOS voice‑assistant SDK for multimodal interaction, and the Apollo autonomous‑driving stack offering perception, planning, and safety‑critical tools. These services run on Baidu’s Kunlun AI chips and the PaddlePaddle deep‑learning framework, delivering scalable, production‑grade performance and pay‑as‑you‑go pricing for developers, enterprise IT teams, and automotive OEMs.
Funding: $617.1M
Rough estimate of the amount of funding raised
Funding: $617.1M
Rough estimate of the amount of funding raised
The startup offers a blockchain-based platform that enables global sharing and utilization of GPU resources. This approach addresses the inefficiency of underutilized computing power, allowing users to access and monetize excess GPU capacity effectively.
Funding: $2.6M
Rough estimate of the amount of funding raised
DePIN X
DePIN X
Funding: $2.6M
Rough estimate of the amount of funding raised
Provides an AI-driven platform for optimizing deep learning models through techniques like model compression, pruning, and heterogeneous scaling. This enables businesses to reduce GPU computing costs by up to 80%, achieve up to 6x faster inference speeds, and deploy resource-efficient models on edge devices while ensuring data privacy and security.
VESSL AI offers an end-to-end MLOps platform that enables machine learning teams to build, train, and deploy models efficiently across various infrastructures with a single command. The platform addresses the challenges of resource management and deployment speed by providing serverless deployment, real-time monitoring, and automated CI/CD workflows.
Funding: $16.4M
Rough estimate of the amount of funding raised
A Ventures
A Ventures
Funding: $16.4M
Rough estimate of the amount of funding raised
CogniFiber provides anomaly detection services for industrial IoT, cybersecurity, and fintech using photonics technology, achieving 100 times faster processing at half the cost. This solution enables organizations to identify and respond to irregularities in real-time, enhancing operational efficiency and security.
Funding: $13.5M
Rough estimate of the amount of funding raised
Chartered GroupEastern Epic Capitals
Chartered GroupEastern Epic Capitals
Funding: $13.5M
Rough estimate of the amount of funding raised
Krutrim provides an AI computing infrastructure and AI-powered applications tailored for the Indian market, enabling businesses to leverage machine learning and data analytics. This platform addresses the need for accessible and scalable AI solutions, enhancing operational efficiency and decision-making capabilities for local enterprises.
Funding: $75.6M
Rough estimate of the amount of funding raised
Z47
Z47
Funding: $75.6M
Rough estimate of the amount of funding raised
The startup offers a cloud-based platform that optimizes AI infrastructure and workloads through the integration of open-source tools and resource estimation. This platform enhances operational efficiency by improving the deployment process of artificial intelligence applications for businesses.
30+
7K+Approximate amount of employees
Funding: $8.0M
Rough estimate of the amount of funding raised
GMS Capital Partners
GMS Capital Partners
Funding: $8.0M
Rough estimate of the amount of funding raised
Whale offers an enterprise AI platform that combines edge AI cameras, IoT hubs, and a cloud‑native analytics engine to capture visual, audio, and contextual data from physical spaces. The system provides real‑time metrics such as foot‑traffic counts, demographic profiles, and safety incidents, and includes low‑code workflow automation, generative content creation, and integration with CRM/ERP systems, all secured with SOC 2‑compliant encryption.
Funding: $60.0M
Rough estimate of the amount of funding raised
Temasek Holdings
Temasek Holdings
Funding: $60.0M
Rough estimate of the amount of funding raised
4Paradigm provides an AI enablement platform that delivers industry‑specific large models built from multi‑modal data and a software‑defined compute layer that abstracts hardware for high‑throughput, low‑cost processing. The platform includes AutoML, transfer‑learning tools, and a generative‑AI development suite that automates model creation, code generation, review, and deployment, all delivered via secure, GDPR‑compliant cloud services.
Funding: $166.7M
Rough estimate of the amount of funding raised
Wuji Capital
Wuji Capital
Funding: $166.7M
Rough estimate of the amount of funding raised
Deepleaper develops a business semantics processing engine that utilizes artificial intelligence to curate relevant content and consumer services based on real-time contextual data. This technology enhances user experiences by presenting personalized recommendations, enabling individuals to discover and enjoy the best aspects of their daily lives.
SportsSeam is an AI-driven platform that utilizes computer vision and statistical machine learning to analyze sports videos, providing real-time insights on player performance and game strategies. By integrating high-quality data and continuous learning, it enables fantasy players and teams to make informed decisions and optimize their strategies effectively.
Founded 2017
Rens provides an API‑first model management platform that centralizes version control, containerized deployment, and real‑time monitoring for machine‑learning models. It automates generation of versioned containers, provisions Kubernetes‑native inference services with auto‑scaling and canary rollouts, and streams latency, error and drift metrics to customizable dashboards with alerting. The platform integrates with Kubeflow Pipelines, MLflow, and CI/CD tools while offering role‑based access control and immutable audit logs for governance.
20+
700+Approximate amount of employees
Funding: $34.0M
Rough estimate of the amount of funding raised
FBG CapitalPolychain
FBG CapitalPolychain
Funding: $34.0M
Rough estimate of the amount of funding raised
The startup develops on-device artificial intelligence systems that utilize large language models to convert diverse data types, including text, images, and videos, into searchable vectors. This technology enables businesses to efficiently process complex data, enhancing their analytical capabilities and competitive positioning in the market.
Funding: $25.6M
Rough estimate of the amount of funding raised
Funding: $25.6M
Rough estimate of the amount of funding raised
The startup develops a generative AI-based speech-to-text (STT) engine tailored for financial institutions, facilitating the integration of artificial intelligence contact center (AICC) technology into their operations. By providing specialized voice technology services, the company enhances communication efficiency and operational collaboration across the financial, corporate, and public sectors.
Founded 2022
Uses machine learning algorithms to analyze patient data and improve diagnostic accuracy, streamline healthcare operations, and enhance treatment outcomes. By integrating AI-driven solutions into clinical workflows, it addresses inefficiencies and variability in patient care, enabling healthcare providers to deliver more precise and timely interventions.
Founded 2023
Fetcherr utilizes AI-driven algorithms for real-time dynamic pricing and market simulation in the airline industry, enabling precise demand predictions and inventory management. This technology addresses revenue loss by optimizing pricing strategies and automating inventory workflows, ensuring airlines maximize their profitability in volatile markets.
Funding: $138.7M
Rough estimate of the amount of funding raised
Battery Ventures
Battery Ventures
Funding: $138.7M
Rough estimate of the amount of funding raised
Vastai Technologies develops high-performance AI chips specifically designed for computer vision and video processing, including the SV100 series for cloud-based inference and the VE series for edge computing. Their technology addresses the need for efficient, low-latency video analysis and processing in applications such as real-time video streaming and industrial automation.
Founded 2018
Gigablue utilizes nature-driven carbon capture and removal techniques, enhanced by an AI prediction engine, to optimize the efficiency of carbon sequestration in ocean environments. The company addresses the urgent need for effective and affordable solutions to help corporations and governments achieve their negative emission targets.
At One Ventures
Xinyaotu provides a full-stack integrated single-chip solution that combines intelligent sensors and computing with proprietary AI algorithms. This technology enables efficient data collection and processing for applications requiring real-time insights, enhancing operational performance in various industries.
Founded 2022
Dipeak provides a data virtualization engine, X-Engine, that integrates heterogeneous data sources and enables real-time analytics with a performance increase of 5-10 times over similar products. Their AI-driven tools, AskBI and AskDoc, deliver precise data insights and document analysis with over 95% accuracy, addressing the need for efficient decision-making in various industries.
Founded 2022
Micro-nano core designs and produces IoT and AIoT system-on-a-chip (SoC) solutions focused on ultra-low power consumption. Their chips integrate sensor acquisition and embedded AI to improve the energy efficiency, precision, and edge inference capabilities of IoT devices.
Edge Matrix Computing operates a decentralized AI infrastructure that utilizes a GPU-based distributed computing network to provide scalable computing resources for AI applications. This platform addresses the inefficiencies of centralized computing by enabling developers to deploy and manage AI dApps with reduced latency and improved resource allocation.
Founded 2023
PasPros Genius offers an AI-powered Customer Relationship Management (CRM) platform designed to help businesses manage customer data, automate workflows, and enhance interactions. The platform provides intelligent automation, 360-degree customer insights, and personalized communication to streamline operations and improve customer engagement.
Founded 2021
The startup develops an AI engine for voice-based digital engagement tailored specifically for young children, utilizing advanced speech recognition and generative conversation technology. This platform creates personalized, safe content that aligns with each child's developmental level, fostering creativity and interactive learning experiences.
Funding: $7.2M
Rough estimate of the amount of funding raised
Amiti VenturesMoreVC
Amiti VenturesMoreVC
Funding: $7.2M
Rough estimate of the amount of funding raised
Cerebrum Technologies develops AI-driven solutions utilizing machine learning algorithms for natural language processing and computer vision to enhance operational efficiency and customer experiences across various sectors. Their technology unlocks new growth opportunities for businesses in an increasingly digital landscape by optimizing processes and improving decision-making capabilities.
Funding: $3.5M
Rough estimate of the amount of funding raised
Boğaziçi Ventures
Boğaziçi Ventures
Funding: $3.5M
Rough estimate of the amount of funding raised