Foundry

About Foundry

Foundry provides an orchestration platform that gives AI developers on-demand access to NVIDIA GPU clusters for training, fine-tuning, and inference without long-term contracts. The platform addresses unpredictable compute needs with flexible pricing: reserved instances guarantee capacity for critical workloads, and spot instances offer lower-cost compute for flexible ones.

Where is Foundry located?

Foundry is based in Palo Alto, United States.

When was Foundry founded?

Foundry was founded in 2022.

How much funding has Foundry raised?

Foundry has raised an estimated $79.1 million.

Who founded Foundry?

Foundry was founded by Jared Davis.

  • Jared Davis - Founder/CEO
Location: Palo Alto, United States
Founded: 2022
Funding: $79.1M
Employees: 34
Major Investors: Lightspeed Venture Partners, Sequoia Capital

AI-Generated Company Overview (experimental) – could contain errors

Website: mlfoundry.com
Founded 2022, Palo Alto, United States

Funding

Estimated Funding

$79.1M+

Major Investors

Lightspeed Venture Partners, Sequoia Capital

Team (30+)

Jared Davis, Founder/CEO

Company Description

Problem

AI developers often face challenges in accessing and managing NVIDIA GPU clusters for training, fine-tuning, and inference, particularly with unpredictable compute demands and the limitations of long-term contracts. This can lead to inefficient resource utilization and increased costs due to idle capacity or difficulty in scaling resources on demand.

Solution

Foundry provides a flexible compute platform that gives AI developers on-demand access to NVIDIA GPUs, eliminating the need for long-term contracts and simplifying resource management. The platform offers both reserved and spot instances, so users can balance cost and performance for each workload. Reserved instances guarantee capacity for critical workloads and can be resold when not in use, while spot instances provide cost-efficient compute for flexible training and inference tasks. With programmatic scaling via an API and integration with Kubernetes, Foundry streamlines workload orchestration and lets developers focus on AI development rather than infrastructure management.
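
The reserved-versus-spot trade-off described above can be sketched as a simple selection rule. This is a minimal illustration and not Foundry's API: the `Workload` class, its fields, and `choose_instance_type` are hypothetical names introduced for this example. It also assumes, as is typical of spot capacity in general, that spot instances may be interrupted.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical description of a job; not part of any Foundry API."""
    name: str
    needs_guaranteed_capacity: bool  # e.g. a production inference service
    tolerates_interruption: bool     # e.g. training that checkpoints regularly


def choose_instance_type(workload: Workload) -> str:
    """Return "reserved" or "spot" following the trade-off in the text."""
    # Critical workloads need capacity that is guaranteed to be available.
    if workload.needs_guaranteed_capacity:
        return "reserved"
    # Flexible jobs that can restart are candidates for cheaper spot capacity.
    if workload.tolerates_interruption:
        return "spot"
    # Assumption: a job that cannot be interrupted falls back to reserved.
    return "reserved"


if __name__ == "__main__":
    jobs = [
        Workload("production-inference", True, False),
        Workload("checkpointed-fine-tune", False, True),
    ]
    for job in jobs:
        print(job.name, "->", choose_instance_type(job))
```

In practice the decision would also weigh price and availability, which are omitted here because the source gives no figures for them.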

Features

On-demand access to NVIDIA H100, A100, A40, and A5000 GPUs without contracts

Flexible pricing options with reserved and spot instances to optimize cost and performance

High-performance networking with up to 3200 Gbps InfiniBand for distributed training

Native Kubernetes integration for simplified workload orchestration and horizontal scaling

API for programmatic scaling and instance management

Co-located storage with no ingress or egress fees

Support for custom scripts on startup and SSH access

SOC 2 Type II certification and HIPAA compliance options for enterprise-grade security
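
To put the quoted peak InfiniBand figure of 3200 Gbps in context, the short calculation below converts it to gigabytes per second and gives a lower bound on the time needed to move a payload at that rate. It is a back-of-envelope sketch: the function names and the 140 GB example payload are illustrative assumptions, gigabytes are decimal (10^9 bytes), and real transfers are slower because of protocol overhead and communication patterns.

```python
def gbps_to_gigabytes_per_second(gbps: float) -> float:
    """Convert gigabits per second to gigabytes per second (8 bits per byte)."""
    return gbps / 8.0


def min_transfer_seconds(payload_gigabytes: float, link_gbps: float) -> float:
    """Lower bound on transfer time, assuming the full line rate is sustained."""
    return payload_gigabytes / gbps_to_gigabytes_per_second(link_gbps)


if __name__ == "__main__":
    peak_gbps = 3200.0  # peak figure quoted in the feature list
    print(gbps_to_gigabytes_per_second(peak_gbps))  # 400.0
    # Hypothetical 140 GB payload, chosen only to illustrate the arithmetic.
    print(min_transfer_seconds(140.0, peak_gbps))
```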

Target Audience

The primary target audience includes AI engineers, researchers, and scientists who require flexible and scalable access to GPU compute resources for training, fine-tuning, and inference tasks.