Find Investable Startups and Competitors
Search thousands of startups using natural language—just describe what you're looking for
Top 50 Data Labeling Service in Europe
Discover the top 50 Data Labeling Service startups in Europe. Browse funding data, key metrics, and company insights. Average funding: $10.3M.
Sort by
Rapidata
Rapidata is a data processing platform that utilizes crowd intelligence to provide human-verified data labeling and processing services, enabling businesses to efficiently transform large datasets into actionable insights. By leveraging a global network of annotators across 192 countries, the platform ensures accurate and unbiased labeling tailored to specific regional preferences, significantly reducing the time and cost associated with data preparation.
Funding: $10M+
Rough estimate of the amount of funding raised
V7
V7 is an AI training data platform that provides high-quality image and video annotations for computer vision models, utilizing AI-assisted labeling tools to enhance accuracy and efficiency. The platform addresses the challenge of slow and error-prone data labeling processes by streamlining workflows and enabling rapid deployment of training data.
Funding: $20M+
Rough estimate of the amount of funding raised
Kognic
Kognic offers a data annotation platform specifically designed for sensor-fusion datasets, enabling efficient management and accurate labeling of complex multi-sensor data. By utilizing an auto-label co-pilot, Kognic reduces annotation time by up to 68%, addressing the high costs and complexities associated with generating and curating representative datasets.
Funding: $20M+
Rough estimate of the amount of funding raised
Kili Technology
Kili Technology provides tailored data annotation and evaluation services for large language models, utilizing expert-led project management to streamline the data pipeline. This approach eliminates data bottlenecks, enabling companies to enhance model performance and accelerate AI project deployment.
Funding: $20M+
Rough estimate of the amount of funding raised
Picsellia
Picsellia provides an end-to-end MLOps platform specifically designed for Computer Vision, enabling users to manage, label, and deploy visual data efficiently. The platform addresses challenges in data organization, annotation accuracy, and model performance monitoring, facilitating the development of high-quality AI applications.
Funding: $3M+
Rough estimate of the amount of funding raised
David AI
David AI generates and labels proprietary audio datasets, including over 10,000 hours of speaker-separated, natural conversations at 24+ kHz, to enhance the training of advanced speech recognition models. This unique dataset addresses the need for high-quality, non-public audio data, enabling AI developers to improve model accuracy and performance.
Pienso
Pienso provides a no-code platform for training and deploying customized Large Language Models (LLMs) using both structured and unstructured data, enabling users to categorize, label, and analyze their data efficiently. The solution ensures data privacy by operating in the user's environment, allowing businesses to gain real-time insights while maintaining control over their sensitive information.
Funding: $20M+
Rough estimate of the amount of funding raised
DevisionX
Tuba.AI is a no-code platform that enables users to develop AI computer vision applications by providing tools for automatic image labeling, model training, and deployment without requiring coding skills. This solution addresses the challenge of accessibility in AI development, allowing businesses to efficiently implement computer vision technology tailored to their specific needs.
aiMotive
aiMotive provides an end‑to‑end platform that automates sensor data ingestion, AI‑assisted labeling, and photorealistic simulation while delivering modular, ISO‑26262‑aligned perception, planning, and control software for radar‑camera‑only ADAS and automated driving. The integrated cloud‑based NPU emulator enables faster‑than‑real‑time software‑in‑the‑loop testing within CI/CD pipelines, helping OEMs and Tier‑1 suppliers reduce development time and validation costs for L2‑L4 features.
Funding: $20M+
Rough estimate of the amount of funding raised
Rabbitt AI
Rabbitt.AI develops reliable generative AI solutions by leveraging enterprise data to create custom large language models and high-quality training datasets. The platform addresses the challenge of inconsistent AI performance by providing precise data annotation and AI-assisted quality checks, ensuring accurate and effective model outputs.
Funding: $2M+
Rough estimate of the amount of funding raised
Caplena
Caplena provides a text analysis platform that utilizes collaborative AI to automatically categorize and tag open-ended customer and employee feedback, enabling topic-level sentiment analysis. This technology significantly reduces the time required for data processing, allowing organizations to quickly extract actionable insights from large volumes of qualitative data.
Funding: $3M+
Rough estimate of the amount of funding raised
Peroptyx
Peroptyx provides location-based machine learning training data and model evaluation solutions, utilizing authenticated ground truth data to enhance the accuracy of AI applications. The platform addresses the need for reliable data to improve model performance and local relevance across diverse geographic areas.
Funding: $3M+
Rough estimate of the amount of funding raised
LetXbe
Letxbe provides a no-code platform for intelligent document processing that utilizes advanced algorithms to classify and extract information from documents with up to 98% accuracy. This technology significantly reduces processing time by tenfold and cuts document management costs by 80%, enabling non-technical business owners to efficiently manage their data.
Funding: $2M+
Rough estimate of the amount of funding raised
ScaleHub
The startup offers a crowdsourcing platform that leverages artificial intelligence for cloud-based data extraction and document processing. It connects businesses with global public and private crowd communities, enabling scalable document automation for shared service centers and business process outsourcers.
Funding: $5M+
Rough estimate of the amount of funding raised
Klimato
Klimato provides food businesses with carbon footprint calculators and sustainability reporting tools that enable precise measurement and labeling of the environmental impact of recipes. This technology helps companies reduce their carbon emissions and enhance transparency, ultimately driving profitability through climate-friendly menu options.
Funding: $5M+
Rough estimate of the amount of funding raised
Argilla
Argilla offers an open-source, AI-driven platform that enables collaboration between AI engineers and domain experts to create high-quality datasets for natural language processing. The platform automates data management tasks, facilitating efficient fine-tuning and evaluation of language models while ensuring data integrity and transparency.
Funding: $5M+
Rough estimate of the amount of funding raised
Defined.ai
Defined.ai provides a marketplace for ethically sourced training data, specializing in diverse datasets for speech recognition, natural language processing, and medical image analysis. The company addresses the need for high-quality, bias-free data that complies with ethical and legal standards, enabling organizations to develop AI solutions responsibly and effectively.
Funding: $50M+
Rough estimate of the amount of funding raised
Mindtech Global Limited
The startup develops a behavioral simulator that automates the collection and curation of training data for AI computer vision applications, significantly reducing the time required for model preparation. Its platform enables the deployment of production-ready AI systems across various sectors, including retail, healthcare, and smart cities, by enhancing the understanding of human interactions.
Funding: $10M+
Rough estimate of the amount of funding raised
Lemon AI
Lemon AI generates high-quality synthetic data to enhance the training and fine-tuning of large language models (LLMs), addressing the scarcity and quality issues of real-world datasets. By automating data curation and integrity analysis, Lemon AI enables organizations to build customized LLMs more efficiently, reducing time and costs associated with manual data preparation.
Funding: $500K+
Rough estimate of the amount of funding raised
Bummock AI
Automates legal due diligence by transforming unstructured data into categorized, metadata-rich datarooms in minutes, covering eight legal subjects such as incorporation, IP, and employment. Bummock’s platform provides real-time updates, issue spotting, and comprehensive legal health reports, reducing the time and cost associated with traditional due diligence processes while enabling ongoing legal monitoring and management.
Malted AI
Malted AI develops custom Small Language Models (SLMs) that are 10-100 times smaller and more efficient than traditional Large Language Models, enabling enterprises to deploy domain-specific AI solutions at a significantly reduced cost. Their distillation technology automates data generation for training SLMs, addressing the inefficiencies and high costs associated with manual data annotation.
Funding: $5M+
Rough estimate of the amount of funding raised
Mindee
Mindee provides an AI-driven platform for precise data extraction from various document types, significantly reducing manual data entry errors by up to 30%. The solution enables businesses to automate complex workflows, enhancing operational efficiency and cutting turnaround times by 57%.
Tembi
The startup offers an AI-as-a-service platform that aggregates data from various open and publicly accessible sources and applies machine learning models to enhance this data. Businesses can access enriched data and algorithm results through a user-friendly interface or API, facilitating informed decision-making without the need for extensive data processing expertise.
Funding: $3M+
Rough estimate of the amount of funding raised
info.link
info.link offers a platform for generating compliant, accessible, and brand-aligned Digital Labels linked via GS1 Digital Link QR codes. The system provides a library of over 50 module templates for use cases like Green Claims and Digital Product Passports, enabling rapid deployment without IT projects.
Relativity6
Provides an API that uses AI to classify private companies with 6-digit NAICS and SIC codes, offering confidence scores, business descriptions, and real-time activity data. This enables industries like insurance, lending, and payment processing to improve risk assessment, streamline underwriting, and enhance data accuracy in CRM systems.
Duco
Duco provides a no-code, AI-powered platform for automating the lifecycle management of both structured and unstructured data across financial services operations. By significantly reducing the time spent on data-related tasks by up to 90%, Duco enables firms to cut costs, enhance operational efficiency, and improve data accuracy.
Valyu
Provides a data licensing and provenance platform that connects rights holders with AI developers, streamlining the acquisition of high-quality, legally compliant datasets for model training. It addresses challenges like attribution concerns, legal complexities, and slow partnership development, enabling faster and more responsible AI innovation.
Funding: $500K+
Rough estimate of the amount of funding raised
Explosion AI
Explosion develops developer tools such as spaCy and Prodigy for natural language processing, machine learning, and data annotation, enabling efficient text analysis and model training. Their solutions address the challenges of data labeling and model deployment, facilitating the creation of robust AI applications across various industries.
Funding: $5M+
Rough estimate of the amount of funding raised
DATA SWEEP
DATA SWEEP is an AI-driven platform that automates the data collection process by providing ready-to-use datasets, dataset cleaning, and merging capabilities. It enables researchers to efficiently access reliable data, allowing them to focus on analysis and insights rather than time-consuming data processing tasks.
Segments.ai
Segments.ai provides a multi-sensor labeling platform that utilizes deep learning for instance and semantic segmentation of images and 3D point clouds, enabling simultaneous annotation across various data modalities. This technology reduces the time spent on quality checks and corrections, streamlining the data labeling process for machine learning teams in robotics and autonomous vehicles.
Funding: $1M+
Rough estimate of the amount of funding raised
Galeio
Galeio offers foundation models for processing complex visual data like satellite imagery and radar, enabling faster AI development with reduced manual labeling. Their locally deployable solutions integrate with existing infrastructure, providing secure and adaptable analytics for sectors such as energy and environmental monitoring.
Delpha
Delpha is an AI-driven DataOps platform that utilizes intelligent AI Agents to analyze and correct customer data across multiple dimensions, ensuring high accuracy and reliability. This solution addresses the issue of data decay and entry errors, enabling businesses to enhance operational efficiency and improve revenue performance.
Funding: $1M+
Rough estimate of the amount of funding raised
Lightly
Lightly provides a data curation platform that utilizes self-supervised learning and active learning techniques to optimize the selection of training data for machine learning models. By reducing data redundancy and bias, Lightly enables companies to achieve up to 92% lower labeling costs and improve model accuracy by 19%.
Funding: $3M+
Rough estimate of the amount of funding raised
Ohalo
Provides a data governance platform that automates the lifecycle management of unstructured data across hybrid, multi-cloud, and legacy systems. By using a proprietary classification engine, it uncovers hidden risks, ensures compliance with regulations like GDPR and HIPAA, and transforms scattered data into structured insights for improved security and efficiency.
Funding: $5M+
Rough estimate of the amount of funding raised
Legit
Legit offers a privacy automation platform, Data Privacy Manager, that utilizes AI to automate personal data discovery, classification, and compliance management. This technology addresses the challenges of regulatory compliance and data retention by streamlining privacy processes for businesses in a digital environment.
TetraKit Technologies
TetraKit Technologies has developed a click chemistry-based radiolabeling platform that enables the efficient labeling of cancer-targeting molecules with radionuclide pairs, specifically fluorine-18 and astatine-211, for theranostic applications. This platform addresses the need for a practical and universal solution in targeted radionuclide therapy, enhancing the production of radiopharmaceuticals for improved cancer diagnosis and treatment.
Funding: $500K+
Rough estimate of the amount of funding raised
Co-one
Co-one offers a data-centric platform that combines AI and human expertise to provide model evaluation solutions for generative AI, focusing on uncertainty assessment and continuous learning. Their customizable APIs and data annotation services enhance the performance and accuracy of AI models, enabling enterprises to effectively manage complex data.
Funding: $500K+
Rough estimate of the amount of funding raised
Lamin
Provides an open-source data infrastructure for biological research, enabling seamless management of large-scale datasets and metadata through a unified API. It tracks data lineage across notebooks, scripts, and pipelines while integrating workflow managers like Redun and Nextflow, ensuring reproducibility and transparency. The platform standardizes, validates, and annotates biological data with minimal code, facilitating collaboration between dry and wet labs and supporting scalable learning through API-first access.
Funding: $500K+
Rough estimate of the amount of funding raised
Pointly
Pointly is a cloud-based platform that utilizes AI techniques for the automatic and manual classification of large 3D point clouds, enabling efficient data vectorization and precise 3D modeling. This technology addresses the challenge of slow and inaccurate point cloud analysis, significantly reducing processing time and improving classification accuracy for various applications.
Peftrust®
The startup offers an environmental footprint analysis platform that enables fashion brands to monitor and score the environmental impact of their products throughout the supply chain. This tool facilitates compliance with mandatory environmental labeling requirements, allowing marketers to quantify and communicate the sustainability of their offerings.
Funding: $1M+
Rough estimate of the amount of funding raised
DeepMask
DeepMask provides a secure platform for companies to upload and utilize internal data to fine-tune industry-specific Large Language Models (LLMs) while ensuring data protection. This enables organizations to create tailored use cases that enhance operational efficiency and leverage their proprietary information without compromising security.
ScoutX
Scout provides a machine learning algorithm that captures the context of data flows and enforces usage rights, ensuring data reliability and security for organizations. By automating the extraction of key information from data usage agreements and tagging datasets with usage rights, Scout enables firms to monetize existing data while maintaining compliance and protecting data integrity.
Hasty | a CloudFactory Company
Hasty provides a computer vision annotation and model development platform integrated into CloudFactory’s AI Data Platform, enabling manufacturers and agricultural companies to enhance their products with vision AI capabilities. This integration streamlines AI-driven workflows, improving the efficiency and accuracy of data processing for high-value industries.
Zeenea
Zeenea provides a data catalog and governance system that enables organizations to manage their data assets effectively. By facilitating data discovery and compliance, Zeenea helps businesses maintain data integrity and streamline regulatory reporting processes.
TrustMyContent
TrustMyContent provides a labeling service for digital content, including images, videos, and audio, that securely displays the provenance of each piece to combat disinformation and enhance audience trust. The platform also offers investigation services for identifying fake news and integrates the C2PA standard to ensure transparency and authenticity in content management.
ekko
ekko provides a plug-and-play solution for warehouse digitization, utilizing technologies such as digital labeling, pick-by-light systems, and eKanban to automate routine tasks and enhance material flow transparency. By minimizing manual interventions, ekko significantly reduces errors and process times, leading to measurable efficiency gains in manufacturing and logistics operations.
Funding: $2M+
Rough estimate of the amount of funding raised
ActiveLabel
The startup develops smart label technology that utilizes optical methodologies and e-cloud data storage to monitor the temperature of perishable products during transport and storage. This technology ensures that food and pharmaceutical sectors maintain optimal storage conditions, thereby preserving the quality and safety of their packaged goods.
Funding: $500K+
Rough estimate of the amount of funding raised
1-eSim
1-eSim offers a white-label eSIM platform that allows travel service providers to embed global mobile data connectivity into their existing packages. This enables providers to create their own branded data plans, enhancing customer convenience and generating ancillary revenue through seamless integration and flexible pricing.
Exonar
Exonar provides a data‑discovery platform that continuously crawls on‑premise and cloud repositories, indexing structured and unstructured assets and enriching them with metadata and sensitive‑data tags. The solution uses pre‑trained and customizable machine‑learning classifiers to identify GDPR, HIPAA, PCI, IP and other high‑risk data, and presents risk insights through an interactive dashboard and sub‑second keyword search. An open API enables automated remediation actions such as quarantine, encryption, or deletion, while role‑based access controls and audit logs support compliance reporting.
Funding: $5M+
Rough estimate of the amount of funding raised
ramblr
Ramblr.ai develops a comprehensive egocentric data pipeline that captures, segments, and annotates first-person view data to enhance Augmented Reality applications. This technology addresses the challenges of processing large datasets from diverse use cases, enabling precise insights and actionable intelligence for various industries.