Trusted by 50,000+ developers

Run machine learning models
with a few lines of code.

No infrastructure to manage. No GPUs to provision. Just push a URL and get predictions back from thousands of open-source and research models.

predict.py
import replicate

output = replicate.run(
    "stability-ai/sdxl",
    input={"prompt": "A serene mountain lake at sunset"}
)

print(output["url"])
# https://replicate.delivery/...
Popular Models

Thousands of models.
One API.

Llama 3 70B
Stable Diffusion XL
Whisper Large v3
ControlNet
MusicGen
Video Diffusion
CLIP Interrogator
Depth Anything
Segment Anything
+ 5,000 more
Why Replicate

ML infrastructure that
scales with you.

From prototype to production, we handle the heavy lifting so you can focus on building great products.

โšก

Instant Deployment

Every model runs in its own container with automatic scaling. No servers to manage, no cold starts to worry about.

๐Ÿ”Œ

Predictions API

A simple REST API for every model. Standardized inputs and outputs mean you switch models without changing your code.

๐Ÿ“Š

Pay Per Second

Only pay for compute time you actually use. No monthly minimums, no idle costs. Start free, scale infinitely.

๐Ÿ”“

Open Source First

We host and optimize open-source models so anyone can use them. Bring your own model or use ours โ€” same API.

Pricing

Simple, usage-based pricing.

Start for free. Pay only for what you use as you grow.

Hobbyist

$0/mo

Perfect for experimentation

  • $1 free credit on signup
  • All public models
  • Community support
  • Standard API access

Enterprise

Custom

For large-scale deployments

  • Reserved GPU capacity
  • SLA guarantees (99.99%)
  • SSO & audit logs
  • Dedicated support engineer
  • On-premise options