Artificial Intelligence • October 27, 2023 • 8 min read

Replicate: Run ML Models via API

Replicate hosts open-source ML models accessible via simple API calls.


Replicate runs open-source models in the cloud and exposes them through a simple inference API. You can pick a model from the community library or deploy a custom model of your own.

Using Models

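Browse the model library to find a model with the capability you need, then run predictions against it with an API call. Slow models return their results asynchronously, so plan to receive a webhook or poll for completion instead of blocking on the request. Models are versioned, and referencing a specific version keeps results reproducible.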

  • Find models in the Replicate library
  • Run predictions with REST or SDK
  • Handle webhooks for async results
  • Pin model versions for stability
  • Deploy custom models with Cog
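
The prediction steps above can be sketched with nothing but the Python standard library. This is a minimal sketch, not the official client: it assumes the REST endpoint `https://api.replicate.com/v1/predictions`, a bearer-token `Authorization` header, the `version`, `input`, `webhook`, and `webhook_events_filter` request fields, and a response carrying `status` and `urls.get`. Check these against the current API reference before relying on them. The helper names (`build_prediction_request`, `send`, `wait_for_prediction`) are hypothetical.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict, Optional

# Assumed endpoint and terminal states; verify against the API reference.
API_URL = "https://api.replicate.com/v1/predictions"
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def build_prediction_request(
    token: str,
    version: str,
    model_input: Dict[str, Any],
    webhook: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a request that creates a prediction.

    `version` pins an exact model version, so repeated runs use the same code.
    """
    payload: Dict[str, Any] = {"version": version, "input": model_input}
    if webhook is not None:
        # Ask for a single callback when the prediction finishes.
        payload["webhook"] = webhook
        payload["webhook_events_filter"] = ["completed"]
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> Dict[str, Any]:
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def wait_for_prediction(
    fetch: Callable[[str], Dict[str, Any]],
    prediction: Dict[str, Any],
    interval: float = 1.0,
    max_polls: int = 60,
) -> Dict[str, Any]:
    """Poll the prediction's `urls.get` address until it reaches a terminal state.

    `fetch` takes a URL and returns the decoded prediction; it is injected so
    the loop can be exercised without network access.
    """
    polls = 0
    while prediction["status"] not in TERMINAL_STATUSES:
        if polls >= max_polls:
            raise TimeoutError("prediction did not finish in time")
        time.sleep(interval)
        prediction = fetch(prediction["urls"]["get"])
        polls += 1
    return prediction
```

To create a prediction for real, pass the built request to `send` with a token read from an environment variable. Use the webhook when you have a public endpoint to receive results, and fall back to `wait_for_prediction` when you do not.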

Custom Models

Package your model with Cog, Replicate's open-source packaging tool, then deploy it to Replicate's infrastructure. Deployed models scale automatically with demand, and you can share them publicly or keep them private.
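
As a rough illustration of the packaging step, a Cog model is typically a predictor class with a one-time `setup` method and a typed `predict` method. The sketch below assumes Cog's `BasePredictor` and `Input` names and uses a trivial echo in place of a real model. The `try`/`except` stand-ins are an addition of this sketch, there only so the file can be run without Cog installed, and `Predictor`, `text`, and `repeat` are hypothetical names.

```python
try:
    from cog import BasePredictor, Input
except ImportError:
    # Stand-ins (not part of Cog) so the sketch runs without Cog installed.
    class BasePredictor:
        pass

    def Input(default=None, **_kwargs):
        return default


class Predictor(BasePredictor):
    def setup(self) -> None:
        """Runs once at startup; a real model would load its weights here."""
        self.prefix = "echo: "  # placeholder for real model state

    def predict(
        self,
        text: str = Input(description="Text to echo back"),
        repeat: int = Input(description="Number of repetitions", default=1, ge=1),
    ) -> str:
        """Runs once per prediction; inputs and the return type are declared."""
        return self.prefix + " ".join([text] * repeat)
```

In a real project this class would usually live in `predict.py` beside a `cog.yaml` that names it as the predictor, and the packaged model would then be pushed to Replicate with the Cog command-line tool. Treat those details as assumptions to confirm in the Cog documentation.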

Tags

replicate, ml-models, api, inference, deployment