Replicate runs open-source models in the cloud. It exposes a simple API for model inference, hosts a library of community-published models, and supports deploying custom models.
Using Models
Browse the model library to find a model with the capabilities you need. Run predictions with simple API calls. For slow models, handle predictions asynchronously instead of blocking on the result. Reference a specific model version for reproducibility.
- Find models in the Replicate library
- Run predictions through the REST API or an SDK
- Receive async results through webhooks
- Pin model versions for stability
- Deploy custom models with Cog
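
The prediction steps above can be sketched with the standard library alone. This is a minimal illustration, not the official client: it assumes the REST endpoint `https://api.replicate.com/v1/predictions`, a Bearer token in the `Authorization` header, and the `version`, `input`, `webhook`, and `webhook_events_filter` request fields. The helper names `build_prediction_request` and `is_finished` are hypothetical, and any version ID or webhook URL you pass in is your own.

```python
import json
import urllib.request

# Assumed endpoint for creating predictions over REST.
API_URL = "https://api.replicate.com/v1/predictions"

# Statuses after which a prediction will not change any further.
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def build_prediction_request(version, model_input, token, webhook=None):
    """Build (but do not send) a request that creates a prediction.

    `version` pins an exact model version, so repeated runs use the same code
    and weights. `webhook` is optional; when set, results are delivered to
    that URL instead of being polled for.
    """
    body = {"version": version, "input": model_input}
    if webhook is not None:
        body["webhook"] = webhook
        # Only notify once the prediction has finished.
        body["webhook_events_filter"] = ["completed"]
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def is_finished(prediction):
    """Return True when a prediction payload has reached a terminal status."""
    return prediction.get("status") in TERMINAL_STATUSES
```

Passing the returned request to `urllib.request.urlopen` would send it. The token is typically read from the `REPLICATE_API_TOKEN` environment variable. A webhook handler, or a polling loop, can call `is_finished` on the decoded JSON payload before reading its `output` field.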
Custom Models
Package a model with Cog, then deploy it to Replicate's infrastructure. Deployed models scale automatically with demand and can be shared publicly or kept private.
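
As a rough sketch of what packaging with Cog looks like, the file below defines a predictor class in the shape Cog expects: `setup` runs once at startup and `predict` runs once per request. The toy "model" (upper-casing text) is purely illustrative, and the `try`/`except` stand-ins are not part of Cog. They are an assumption added here so the example still runs on a machine where Cog is not installed.

```python
try:
    from cog import BasePredictor, Input
except ImportError:
    # Stand-ins (NOT part of Cog) so this sketch runs without Cog installed.
    class BasePredictor:
        pass

    def Input(default=None, **kwargs):
        return default


class Predictor(BasePredictor):
    def setup(self):
        """Runs once when the container starts; load weights here."""
        # Hypothetical toy "model": a plain function instead of real weights.
        self.model = str.upper

    def predict(
        self,
        text: str = Input(description="Text to transform"),
        repeat: int = Input(
            description="How many times to repeat the output",
            default=1,
            ge=1,
            le=10,
        ),
    ) -> str:
        """Runs once per prediction request."""
        return self.model(text) * repeat
```

A Cog project would normally pair a file like this with a `cog.yaml` that names the predictor class and lists the Python dependencies. Keeping expensive loading in `setup` means each request only pays for inference.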