Trusted by 50,000+ developers

Run machine learning models
with a few lines of code.

No infrastructure to manage. No GPUs to provision. Just call the API and get predictions back from thousands of open-source and research models.

Start building for free → View documentation

predict.py

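import replicate  # pip install replicate; reads REPLICATE_API_TOKEN from the environment

# Run the model and block until the prediction finishes.
output = replicate.run(
    "stability-ai/sdxl",
    input={"prompt": "A serene mountain lake at sunset"}
)

# Assumes the model returns a list of generated images (not a dict); print the first.
print(output[0])
# https://replicate.delivery/...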

Popular Models

Thousands of models.
One API.

Llama 3 70B

Stable Diffusion XL

Whisper Large v3

ControlNet

MusicGen

Video Diffusion

CLIP Interrogator

Depth Anything

Segment Anything

+ 5,000 more

Why Replicate

ML infrastructure that
scales with you.

From prototype to production, we handle the heavy lifting so you can focus on building great products.

⚡

Instant Deployment

Every model runs in its own container with automatic scaling. No servers to manage, no cold starts to worry about.

🔌

Predictions API

A simple REST API for every model. Standardized inputs and outputs mean you can switch models without changing your code.
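As a rough illustration of what "same API, different model" could look like over plain HTTP, here is a minimal sketch that builds a prediction request without sending it. The base URL, the `/models/{owner}/{name}/predictions` path, and the Bearer-token header are assumptions made for this example, and `build_prediction_request` is a hypothetical helper, not part of any client library. Check the documentation for the exact endpoint and fields.

```python
import json
import urllib.request

# Assumed base URL for the REST API; confirm against the documentation.
API_ROOT = "https://api.replicate.com/v1"


def build_prediction_request(model: str, model_input: dict, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request asking `model` for a prediction.

    Hypothetical helper: the URL layout and body shape are assumptions.
    """
    body = json.dumps({"input": model_input}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_ROOT}/models/{model}/predictions",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Only the model name and its input fields change between models;
# the request shape stays the same.
request = build_prediction_request(
    "stability-ai/sdxl",
    {"prompt": "A serene mountain lake at sunset"},
    token="YOUR_API_TOKEN",
)
print(request.get_method(), request.full_url)
```

Passing the result to `urllib.request.urlopen` would send it. The sketch stops short of that so it runs without a token or network access.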

📊

Pay Per Second

Only pay for compute time you actually use. No monthly minimums, no idle costs. Start free, scale infinitely.
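Per-second billing makes a back-of-the-envelope estimate simple: billed seconds times the per-second price. The sketch below shows the arithmetic only. `estimate_cost` is a hypothetical helper, and the rate in the usage line is a made-up figure for illustration, not a published price.

```python
def estimate_cost(seconds_per_run: float, runs: int, price_per_second: float) -> float:
    """Estimated spend = seconds per run x number of runs x per-second price."""
    if seconds_per_run < 0 or runs < 0 or price_per_second < 0:
        raise ValueError("inputs must be non-negative")
    return seconds_per_run * runs * price_per_second


# Illustrative numbers only: 1,000 runs of 4 seconds each
# at a hypothetical $0.001 per second.
print(f"${estimate_cost(4.0, 1000, 0.001):.2f}")
# $4.00
```

With no idle costs, zero runs means zero spend, which the function reflects directly.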

🔓

Open Source First

We host and optimize open-source models so anyone can use them. Bring your own model or use ours — same API.

Pricing

Simple, usage-based pricing.

Start for free. Pay only for what you use as you grow.

Hobbyist

$0/mo

Perfect for experimentation

$1 free credit on signup
All public models
Community support
Standard API access

Pro

Pay as you go

For growing applications

Bulk pricing discounts
Priority inference queue
Private model hosting
Email & Discord support
Detailed analytics

Enterprise

Custom

For large-scale deployments

Reserved GPU capacity
SLA guarantees (99.99%)
SSO & audit logs
Dedicated support engineer
On-premise options
