Skip to main content

Model Serving Pricing

Pricing

Model Serving

Make live predictions in your apps and websites.

Select plan

help me choose

Select cloud

Select region

Select
Loading...

Notes:

1. Additional Launch charge of $0.07 a maximum of 2x per hour if scale to zero enabled

2. DBU rates per GPU type are shown in the FAQ

Model Serving pricing examples

pricing

FAQ

Our regional prices are based on the regional cost of infrastructure supporting our serverless products