Replicate

Run open-source models at scale

Model marketplace | US | inference, models, marketplace

Replicate is a platform for running machine learning models in the cloud, offering thousands of open-source models with simple API access and pay-per-use pricing.

GPU models: 4
From $0.81 / hour

Available GPUs

Hourly on-demand pricing.

Prices last updated: January 28, 2026

GPU Model   Memory  GPUs  Price / hr
A100 SXM    80GB    1x    $5.04
A100 SXM    80GB    2x    $5.04
A100 SXM    80GB    4x    $5.04
A100 SXM    80GB    8x    $5.04
H100        80GB    1x    $5.49
H100        80GB    2x    $5.49
H100        80GB    4x    $5.49
H100        80GB    8x    $5.49
L40S        48GB    1x    $3.51
L40S        48GB    2x    $3.51
L40S        48GB    4x    $3.51
L40S        48GB    8x    $3.51
Tesla T4    16GB    1x    $0.81
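
As a rough illustration of how the table can be used, the sketch below picks the cheapest listed GPU that meets a memory requirement. The prices are copied from the table as of the update date above; the `GPU_PRICES` list and the `cheapest_gpu` helper are hypothetical names introduced for this example only.

```python
# (model, memory in GB, hourly price in USD), copied from the table above.
GPU_PRICES = [
    ("A100 SXM", 80, 5.04),
    ("H100", 80, 5.49),
    ("L40S", 48, 3.51),
    ("Tesla T4", 16, 0.81),
]


def cheapest_gpu(min_memory_gb):
    """Return the cheapest listed GPU with at least min_memory_gb, or None."""
    candidates = [gpu for gpu in GPU_PRICES if gpu[1] >= min_memory_gb]
    if not candidates:
        return None
    # Compare candidates by hourly price only.
    return min(candidates, key=lambda gpu: gpu[2])
```

For example, `cheapest_gpu(24)` selects the L40S row, because the Tesla T4 has too little memory and both 80GB cards cost more per hour.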

Pros & Cons

Advantages

  • Largest selection of open-source models on one platform
  • Simple pay-per-prediction pricing with no minimum
  • Easy deployment of custom models via Cog
  • Active community contributing new models daily

Limitations

  • Cold start latency for less popular models
  • Pricing can be unpredictable for high-volume use
  • Less optimized than specialized inference providers

Key Features

Vast Model Library

Access thousands of open-source models including LLMs, image generators, and more

Simple API

Consistent REST API across all models with webhooks for async processing
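
A minimal sketch of what a prediction request with a webhook might look like, using only the Python standard library. The endpoint URL, the `Bearer` authorization scheme, and the `version`, `input`, `webhook`, and `webhook_events_filter` fields are assumptions based on Replicate's public HTTP API and should be checked against the current API reference. The function only builds the request and does not send it.

```python
import json
import urllib.request

# Assumed endpoint for creating predictions; verify against the API reference.
API_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(token, version, model_input, webhook_url=None):
    """Build (but do not send) a POST request that creates a prediction."""
    body = {"version": version, "input": model_input}
    if webhook_url is not None:
        # Ask for a callback when the prediction finishes instead of polling.
        body["webhook"] = webhook_url
        body["webhook_events_filter"] = ["completed"]
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. The version identifier and webhook URL are placeholders the caller has to supply.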

Custom Model Hosting

Deploy your own models using Cog containerization

Serverless Scaling

Automatic scaling with cold-start optimization

Pricing Options

Option              Details
Pay-per-prediction  Charged per model run based on compute time and hardware
Free tier           Limited free predictions for new users
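
Since pay-per-prediction charges depend on compute time and hardware, a back-of-the-envelope estimate can be derived from an hourly rate. The sketch below assumes billing is linear in seconds with no rounding, minimum charge, or cold-start surcharge, which may not match actual invoices. `prediction_cost` is a hypothetical helper, not part of any Replicate library.

```python
def prediction_cost(price_per_hour, seconds):
    """Estimate the cost in USD of one run, assuming linear per-second billing."""
    if price_per_hour < 0 or seconds < 0:
        raise ValueError("price and duration must be non-negative")
    return price_per_hour / 3600 * seconds


def monthly_cost(price_per_hour, seconds_per_run, runs_per_month):
    """Scale the per-run estimate to a monthly volume."""
    return prediction_cost(price_per_hour, seconds_per_run) * runs_per_month
```

At the listed Tesla T4 rate of $0.81/hr, a 10-second run works out to roughly $0.00225 under these assumptions, so 100,000 such runs would come to about $225.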

Availability & Support

Regions

US-based infrastructure with global CDN

Support

Documentation, Discord community, email support

Getting Started

  1. Create an account

    Sign up at replicate.com with GitHub or email

  2. Get API token

    Copy your API token from account settings

  3. Run a prediction

    Use the API or Python client to run any model
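
The steps above can be sketched with the `replicate` Python client (installed separately with `pip install replicate`). This assumes the client reads the token from the `REPLICATE_API_TOKEN` environment variable and exposes `replicate.run(ref, input=...)`. `read_api_token` and `run_prediction` are hypothetical helper names, and the model reference is a placeholder to be replaced with a real model from the site.

```python
import os


def read_api_token(env=os.environ):
    """Return the API token from the environment, or raise if it is missing."""
    token = env.get("REPLICATE_API_TOKEN", "").strip()
    if not token:
        raise RuntimeError(
            "Set REPLICATE_API_TOKEN to the token copied from account settings"
        )
    return token


def run_prediction(model_ref, model_input):
    """Run one prediction; model_ref looks like 'owner/model-name:version'."""
    read_api_token()  # fail early with a clear message if the token is absent
    import replicate  # third-party client, imported lazily

    return replicate.run(model_ref, input=model_input)
```

A call might look like `run_prediction("owner/model-name:version", {"prompt": "a photo of a fox"})`. The accepted input keys differ per model, so the model's own page is the place to check them.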

Compare Providers

Find the best prices for the same GPUs from other providers

Vast.ai

4 shared GPUs with Replicate

CoreWeave

3 shared GPUs with Replicate

RunPod

3 shared GPUs with Replicate