Top 50 Data Labeling Service - Series B
Discover the top 50 Data Labeling Service startups at Series B. Browse funding data, key metrics, and company insights. Average funding: $52M.
Surge AI provides a data labeling platform that utilizes human feedback to enhance the training of large language models (LLMs). By delivering high-quality labeled data, Surge AI enables organizations to improve the accuracy and performance of their NLP applications.
Funding: $25.0M
Rough estimate of the amount of funding raised
HumanSignal provides a data labeling platform that combines automation and human oversight to prepare training data, fine-tune large language models, and evaluate AI outputs. This solution enhances model accuracy and efficiency while ensuring compliance and data security across various use cases and data types.
Funding: $30.2M
Redpoint
Labelbox operates a data training platform that utilizes AI-assisted labeling and a global network of experts to provide high-quality data curation and evaluation for machine learning applications. This platform addresses the challenge of efficiently managing large-scale data labeling and evaluation, enabling businesses to accelerate model development and improve AI performance.
Funding: $188.9M
SoftBank Vision Fund
Centaur Labs provides a medical AI platform that utilizes a global network of expert annotators for precise data labeling across various modalities, including text, audio, and imaging. This approach addresses the challenge of slow and inconsistent data annotation by ensuring high-quality labels through automated quality checks and performance metrics.
Funding: $31.9M
Accel, Alumni Ventures, Hack VC
Klleon provides an AI-powered platform for automated data labeling and annotation services. The system accelerates the preparation of high-quality training datasets necessary for machine learning model development. This service streamlines the workflow for computer vision and NLP projects by ensuring data accuracy and consistency at scale.
Funding: $40.8M
LB Investment
Clarifai offers an end-to-end AI lifecycle platform that automates data labeling, model training, and deployment, enabling organizations to build and operationalize AI applications efficiently. By standardizing workflows and optimizing compute resources, the platform reduces development time and costs, allowing enterprises to scale AI solutions rapidly.
Funding: $60.0M
New Enterprise Associates
V7 is an AI training data platform that provides high-quality image and video annotations for computer vision models, utilizing AI-assisted labeling tools to enhance accuracy and efficiency. The platform addresses the challenge of slow and error-prone data labeling processes by streamlining workflows and enabling rapid deployment of training data.
Funding: $43.3M
Radical Ventures, Temasek Holdings
Snorkel Flow is an AI data development platform that enables data scientists to programmatically label and annotate large datasets, significantly reducing the time required for data preparation. By leveraging domain knowledge and automated techniques, the platform enhances the accuracy and efficiency of training data for specialized AI applications in fields like bioinformatics and natural language processing.
Funding: $138.3M
QBE Ventures
SuperAnnotate is an AI data platform that integrates dataset creation, curation, and model evaluation into a single workflow, enabling users to build and fine-tune high-quality models efficiently. The platform addresses the challenges of data annotation and model performance assessment by providing customizable tools and access to a global marketplace of trained annotation teams.
Funding: $53.5M
Base10 Partners, Databricks Ventures, NVIDIA
Superb AI offers an end-to-end training data platform that automates data preparation and curation, enabling rapid and systematic dataset creation for AI model development. This solution addresses the inefficiencies in data handling, allowing organizations to streamline their AI workflows and enhance model deployment speed.
Funding: $37.8M
Duke University, Hyundai Motor Group, Kakao Investment
Kili Technology provides tailored data annotation and evaluation services for large language models, utilizing expert-led project management to streamline the data pipeline. This approach eliminates data bottlenecks, enabling companies to enhance model performance and accelerate AI project deployment.
Funding: $31.9M
Balderton Capital
Latent Labs provides curated, version‑controlled datasets for computer vision, natural language processing, and speech applications, delivered via secure API or bulk download. Its platform combines automated preprocessing pipelines with expert‑validated annotations and integrated compliance checks (e.g., GDPR, HIPAA) to ensure data quality and legal safety. The service also offers on‑demand custom data collection for enterprise AI teams and research labs.
20+
7K+ (approximate number of employees)
Funding: $40.0M
Radical Ventures, Sofinnova Partners
Cleanlab automates data error detection and correction using AI-powered algorithms to enhance the quality of datasets for machine learning and analytics. This technology addresses issues such as label noise, outliers, and data drift, significantly reducing the time and cost associated with data management while improving model performance.
Funding: $30.0M
Menlo Ventures, TQ Ventures
Encord provides a multimodal data layer infrastructure for training and deploying physical AI systems across various modalities like video, LiDAR, and sensor fusion. The platform supports the entire AI lifecycle, from data collection and automated labeling to dataset curation and post-training model alignment. This unified solution enables AI teams to manage and scale complex data workflows for robotics, autonomous vehicles, and generative AI applications.
Funding: $50.0M
Crane Venture Partners, CRV, Harpoon
Kognic offers a data annotation platform specifically designed for sensor-fusion datasets, enabling efficient management and accurate labeling of complex multi-sensor data. By utilizing an auto-label co-pilot, Kognic reduces annotation time by up to 68%, addressing the high costs and complexities associated with generating and curating representative datasets.
Funding: $42.8M
This company provides a platform for building, deploying, and scaling computer vision models tailored to specific industry tasks, such as object detection and optical character recognition. By integrating with tools like Snowflake, it enables organizations to perform visual AI tasks directly on their data without moving it, reducing deployment time by 80% and supporting over 1 billion annual image inferences with 99.99% uptime.
Funding: $57.0M
Pure Storage
Pienso provides a no-code platform for training and deploying customized Large Language Models (LLMs) using both structured and unstructured data, enabling users to categorize, label, and analyze their data efficiently. The solution ensures data privacy by operating in the user's environment, allowing businesses to gain real-time insights while maintaining control over their sensitive information.
Funding: $29.2M
Latimer Ventures
Toloka provides specialized AI training data for complex models, including Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). They leverage a global network of AI tutors to generate high-quality, diverse datasets for applications like coding copilots and conversational agents.
Blackshark.ai provides a geospatial platform that generates real-time, photorealistic 3D digital twins of the Earth using satellite and aerial imagery processed through machine learning. This technology enables accurate visualization and analysis of global infrastructure, facilitating applications in urban planning, risk assessment, and simulation without the need for extensive coding expertise.
Funding: $35.0M
Nansen is a blockchain analytics platform that utilizes wallet labeling and on-chain data querying to provide crypto investors with actionable insights and real-time alerts on market movements. By enabling users to identify significant wallet activities and trends across multiple blockchains, Nansen helps investors make informed decisions and mitigate risks in their portfolios.
Funding: $89.3M
Accel
The startup offers a visual recognition platform that autonomously processes diverse visual data, including infrared and X-ray images, while accurately tagging objects of interest. This technology enhances operational efficiency and ensures high-quality results for clients across various industries.
Funding: $24.7M
Sahara AI provides an AI‑native blockchain platform that combines curated data services, on‑demand decentralized compute, and a marketplace for AI assets. It records immutable on‑chain provenance for datasets, models, and agents, uses the $SAHARA token for licensing, per‑inference payments and automatic royalty distribution, and offers SOC2‑certified security. The solution enables model developers, enterprise AI teams, and research labs to access trusted data, scalable compute, and a secure monetization layer while reducing intermediaries.
Funding: $37.0M
Pantera Capital, Polychain
Outlier AI connects domain experts with leading AI companies to provide human feedback for improving large language models (LLMs). Experts perform tasks such as writing challenging prompts, creating grading rubrics, and rating AI-generated answers to enhance model accuracy. The platform offers flexible, remote work opportunities for subject matter experts to earn income while gaining hands-on experience in AI training.
Funding: $22.1M
Emergence Capital
The startup develops data management software that utilizes artificial intelligence and machine learning to automate the identification, categorization, and storage of investment documents. This technology reduces manual data entry and latency, enhancing the efficiency of managing alternative investment data for businesses.
Funding: $82.0M
Goldman Sachs Asset Management
Ripcord utilizes robotics and AI to digitize and classify both paper and digital documents, extracting and enriching data for easy access and automation. This process addresses the inefficiency of managing unstructured data, enabling organizations to streamline operations and enhance decision-making with accurate, readily available information.
Funding: $151.5M
Google Ventures, Lux Capital
Voxel51 provides the FiftyOne platform, which enables machine learning and computer vision teams to efficiently curate, visualize, and manage large datasets while automating the identification of annotation errors. This technology enhances model performance by ensuring high-quality data is readily available for training and evaluation, streamlining the development of visual AI applications.
Funding: $45.5M
Bessemer Venture Partners
Accern provides a no-code natural language processing (NLP) platform that classifies content to enhance research workflows and improve model accuracy across various industries. By automating the classification of key information, the platform helps businesses reduce costs and increase revenue through more efficient data utilization.
Funding: $20.0M
Fusion Fund
super.AI offers Intelligent Document Processing (IDP) that automates data extraction from complex documents, utilizing a combination of AI, human, and software workers to ensure high accuracy and efficiency. This technology addresses the challenges of manual data handling by processing 100% of documents, significantly reducing turnaround time and improving operational productivity.
Funding: $33.6M
HV Capital
aiMotive provides an end‑to‑end platform that automates sensor data ingestion, AI‑assisted labeling, and photorealistic simulation while delivering modular, ISO‑26262‑aligned perception, planning, and control software for radar‑camera‑only ADAS and automated driving. The integrated cloud‑based NPU emulator enables faster‑than‑real‑time software‑in‑the‑loop testing within CI/CD pipelines, helping OEMs and Tier‑1 suppliers reduce development time and validation costs for L2‑L4 features.
Funding: $20.0M
The startup develops a data platform that utilizes algorithms and analytical tools to process large datasets, enabling investors to track portfolio companies, competitors, and market sectors. This platform provides actionable insights that help fund managers identify investment opportunities and manage risks, ultimately enhancing investment performance.
Funding: $29.3M
Valor Equity Partners
Coactive provides a Multimodal AI Platform designed to accelerate content workflows by processing visual assets. The platform automatically generates rich, contextual metadata for videos and images at scale, enabling powerful semantic search and content discovery. This capability allows enterprises to enhance personalization, streamline content moderation, and optimize content performance analysis.
Funding: $44.0M
Cherryrock Capital, Emerson Collective
Worlds provides an AI platform that utilizes real-time video and sensor data to create custom AI applications for enterprise operations. This technology enables companies to automate processes such as hazard detection, asset tracking, and environmental compliance, significantly reducing human effort and improving operational efficiency.
Funding: $40.5M
Moneta Ventures
Prolific provides a platform for researchers to access high-quality data from a global community of over 200,000 vetted participants, enabling rapid collection of detailed responses for surveys and AI training tasks. The service addresses the challenge of slow and unreliable data acquisition by allowing researchers to launch studies in 15 minutes and receive responses within 2 hours.
Funding: $33.5M
10x Value Partners, Oxford Science Enterprises, Partech
Defined.ai provides a marketplace for ethically sourced training data, specializing in diverse datasets for speech recognition, natural language processing, and medical image analysis. The company addresses the need for high-quality, bias-free data that complies with ethical and legal standards, enabling organizations to develop AI solutions responsibly and effectively.
Funding: $81.9M
Datagen Technologies develops simulated data technology that generates scalable, bias-free datasets with automatic annotation capabilities. This technology addresses the challenges of data scarcity and bias in machine learning, enabling more accurate and reliable model training.
Funding: $50.0M
Scale Venture Partners
The startup offers a technical recruiting and data analytics platform that standardizes and enhances the searchability of personnel profiles across various sectors, including sales, marketing, and talent acquisition. By providing data-driven insights, the platform enables organizations to efficiently identify and resolve identity discrepancies in recruiting and market research processes.
Funding: $54.5M
Craft Ventures
This company offers an AI-powered optical character recognition (OCR) technology that extracts data from images, including barcodes and QR codes, directly on devices. Their solution converts scanned text into editable data without requiring a server connection, enabling offline data extraction for various applications.
Funding: $20.0M
Yttrium
TrustLab provides AI trust and transparency solutions through continuous monitoring and quality evaluation to ensure AI decisions are objective-aligned and explainable. The company offers ModAI for efficient multi-modal content labeling, SuperviseAI for real-time LLM response monitoring, and DetectAI for intellectual property protection and content misuse detection. These no-code integrations help businesses boost AI deployment ROI while limiting operational and reputational risks across various sectors.
Funding: $22.9M
Foundation Capital, U.S. Venture Partners
Parsable.ai provides a RESTful API that extracts structured data from PDFs, scanned images, DOCX, HTML, and other office files using AI‑enhanced OCR and transformer‑based NLP. Users define extraction templates through a low‑code UI or programmatically via the API, receiving results in JSON, XML, or CSV and integrating with cloud storage, webhooks, or message queues. The service includes enterprise‑grade security, audit logging, and a monitoring dashboard to automate data entry for finance, real‑estate, and HR processes.
Funding: $60.0M
Activate Capital Partners, Glade Brook Capital Partners
Roboflow provides a platform for developers to manage image data and streamline the process of training and deploying computer vision models. By offering tools for dataset annotation, preprocessing, and one-click model training, it simplifies the complexities of computer vision projects, enabling faster development and deployment.
Funding: $99.7M
Craft Ventures, Google Ventures, Lachy Groom
Edge Impulse provides a platform for developing embedded machine learning models that run on various edge devices, including microcontrollers and gateways. This technology enables manufacturers to optimize sensor data processing, reduce bill of materials costs, and accelerate time to market for their products.
Funding: $54.4M
Coatue
Sentra provides AI-data governance and continuous compliance solutions for securing sensitive data at petabyte scale. The platform continuously discovers, classifies, and manages regulated data across cloud and on-premise environments to minimize exposure risks. It integrates with existing tools to enhance Data Loss Prevention (DLP) effectiveness and ensure adherence to over 40 compliance frameworks.
Funding: $53.0M
Indico Data offers an AI-powered Decision Automation Platform that processes unstructured data to enhance underwriting, claims evaluation, and policy management in the insurance industry. By automating data extraction and analysis, the platform enables insurers to make faster, more informed decisions while reducing operational risks and improving profitability.
Funding: $49.4M
.406 Ventures, General Catalyst, Guidewire Software
Stonal is a European platform that utilizes artificial intelligence to automate the collection, structuring, and analysis of real estate data from various sources, ensuring data reliability for optimal asset management. This technology enhances decision-making, preserves asset value, and improves liquidity by providing accurate and actionable insights tailored to specific business needs.
Funding: $28.1M
Aareon
DatologyAI: Train Better Models, Faster and Smaller (http://datologyai.com/)
Funding: $57.7M
Felicis
DataLoops provides a data management and annotation platform that automates the preprocessing and curation of unstructured visual data, enabling the rapid generation of machine-readable datasets. This solution enhances the efficiency of AI application development by streamlining data pipelines and integrating human feedback for improved accuracy.
Funding: $49.3M
Alpha Wave Global, NGP Capital
Convex provides software solutions and data analytics tailored for commercial contractors and service businesses, enhancing project management and operational efficiency. The platform addresses inefficiencies in workflow and data handling, enabling users to streamline processes and improve decision-making.
Funding: $59.6M
Emergence Capital, Fifth Wall, Notable Capital
The startup operates a cloud-based corporate sales support database that utilizes artificial intelligence to aggregate and analyze global sales data. This platform enables clients to efficiently identify and target prospective customers, enhancing the effectiveness of their sales operations.
Funding: $82.2M
Z Venture Capital
Synthesis AI offers a synthetic data generation platform specifically designed for computer vision applications, enabling the creation of privacy-compliant and unbiased datasets. This technology addresses the need for high-quality training data in areas such as biometric identification, autonomous vehicle behavior simulation, and augmented reality, facilitating faster model development and deployment.
Funding: $25.0M
DigitalOwl is an AI-powered platform that transforms unstructured medical records into structured data, enabling faster and more accurate reviews for insurance and legal professionals. By automating the medical review process, it reduces processing time by up to 72% while maintaining an accuracy rate of 97% or higher.
Funding: $40.8M
Reinsurance Group Of America