News: Koyeb startup program

High-performance Infrastructure
for

Deploy intensive applications across GPUs, CPUs, and Accelerators in minutes - scale in 50+ locations

Trusted by the most ambitious teams

Next-generation cloud experience

No ops, servers, or infrastructure management.

Extreme performanceExtreme performanceAccelerated infrastructure

Accelerated infrastructure

Run all your models and apps on high-performance CPUs, GPUs, and accelerators from AMD, Intel, and Nvidia.
Automatic scalingAutomatic scalingServerless containers

Serverless containers

Deploy production-grade containers with zero configuration — we scale to hundreds of servers and back to zero in seconds.
GlobalGlobalAvailable globally and locally

Available globally and locally

Improve availability and get sub-100ms latency worldwide with over 50 locations. Pick between one and all locations.
Any stackAny stackBuild and deploy anything

Build and deploy anything

Build APIs, distributed systems, or blazing-fast inference endpoints. Deploy your code, containers, or models with a Git push or CLI call.

The serverless runtime for

From dev to high-throughput inference in minutes
Deploy and scale ML models to production without managing infrastructure.
10x faster inference with dedicated performance
Scale to millions of requests with built-in autoscaling on dedicated GPUs.
80% savings compared to hyperscalers
Combine autoscaling with the most efficient GPUs and accelerators on the market.
Autoscaling with sub-200ms cold-start
Seamlessly scale up from zero to hundreds.
Get Started
AI Inference
Extreme performance
Deploy Now
100%
More performance
<250ms
CPU cold start
10
Datacenters
100k
Developers
Build with the languages and frameworks you love, from web apps to inference

Deploy any application seamlessly with native support for popular languages and Docker containers, without any modification.

Deploy now
Fearless development with your team and instant deployment
Fearless development with your team and instant deployment

Experience collaborative development on high-performance cloud infrastructure with a simple Git push. Build together and push to prod with confidence.

Start now
/start building

Deploy in production with one-click apps

From AI models to full-stack apps and databases, start in seconds.

Enterprise-Ready.

Koyeb is a powerful platform with security built-in at all layers.

Any CloudAny Cloud
10 Regions / 3 Continents10 Regions / 3 Continents
Any HardwareAny Hardware
Certifications
Get in touch
/What our customers say

We use GPUs for training and inference, and Koyeb streamlines our workload deployment, optimizing efficiency and simplifying infrastructure management. This allows us to focus on what really matters without worrying about ops and infra.

Samuel Bernard
Samuel Bernard
Founder
View customer stories

Everything you need for production

GPU, NPU, Accelerators, or just CPU
GPU, NPU, Accelerators, or just CPU

Access a wide range of optimized hardware for scale-out workloads, on demand, in seconds.

Deploy now
Instant API Endpoint
Instant API Endpoint

Just hit deploy to provision an API endpoint ready to handle requests in seconds. No waiting, no config.

Get Started
Smart and Fast Autoscaling
Smart and Fast Autoscaling

Get efficient autoscaling to adapt infrastructure to demand, with imperceptible cold start.

Learn more
Zero-Downtime Deployments
Zero-Downtime Deployments

Enjoy built-in continuous deployment with automatic health checks to prevent bad deployment and ensure you’re always up and running.

View docs
Native HTTP/2, WebSocket, and gRPC Support
Native HTTP/2, WebSocket, and gRPC Support

Stream large or partial responses to end-users and accelerate your connections through a global edge network for instant feedback and responsive applications.

Get started
/and much more
Postgres + pgvector

Store, index, and search embeddings with your data at scale using Koyeb's fully managed Serverless Postgres.

Get Started
Ultra-Fast NVME Storage

Store datasets, models, and fine-tune weights on blazing-fast NVME disks offering extremely high write and read throughput for exceptional performance.

Get started
Logs and Instance Access

Troubleshoot and investigate issues easily using real-time logs, or directly connect to your instances.

Get started
Compute costs
RTX-4000-SFF-ADA
VRAM 20GB
(CPU 6 / RAM 44GB)
$0.50 /hr
$0.00014 /sec
RTX-A6000
VRAM 48GB
(CPU 6 / RAM 100GB)
$0.73 /hr
$0.000202 /sec
V100 SXM2 16GB
VRAM 16GB
(CPU 8 / RAM 44GB)
$0.85 /hr
$0.000236 /sec
L4
VRAM 24GB
(CPU 15 / RAM 44GB)
$1.00 /hr
$0.000277 /sec
L40S
VRAM 48GB
(CPU 30 / RAM 180GB)
$1.55 /hr
$0.000430 /sec
View pricing details
Early stage startup?
Get up to $30k in credits to accelerate your go-to-market with high-performance cloud infrastructure.
Apply now
Pay for what you use
Scale as you grow with transparent pricing starting at $0.0022/h. No commitment, no contracts, no hidden costs. Upgrade anytime to unlock features. Get started for free.
View pricing
/Changelog

Last update from the team

Docker image download progress, new website, and more
Changelog
Dec 06, 2024
Docker image download progress, new website, and more

  • Docker image download progress
  • New website
  • Control panel: Discard pending service changes
  • Control panel: Easily duplicate Services
Read more
Volume Snapshots in Public Preview, enhanced experience in service scaling, and more
Changelog
Nov 29, 2024
Volume Snapshots in Public Preview, enhanced experience in service scaling, and more

  • Volume Snapshots in Public Preview
  • Enhanced experience in service scaling
  • Bulk import secret
  • New tutorial: Use FLUX, PyTorch, and Streamlit to Build an AI Image Generation App
Read more
Snapshots: Create a Point-in-Time Copy of your High-Performance Volumes
Blog
Nov 26, 2024
Snapshots: Create a Point-in-Time Copy of your High-Performance Volumes
Backup your high-performance Volumes, simplify data management, and enable reproducibility!
Read more
Gemma 2 9b
One Click
Nov 21, 2024
Gemma 2 9b
Deploy Gemma 2 9BB on Koyeb high-performance GPU.
Read more

Deploy AI apps to production in minutes

Koyeb is a developer-friendly serverless platform to deploy apps globally. No-ops, servers, or infrastructure management.
All systems operational
© Koyeb