Model Serving Pricing

Model Serving
Make live predictions in your apps and websites.
Model Serving pricing examples

Notes:
1. The simple examples above are baseline configurations. Example 2 incurs a launch charge because it starts from a cold start.
2. Users may also select the “scale to zero” or “very latency sensitive” options, which change the number of DBUs emitted.
FAQ
4 GB per concurrent request