Model Serving Pricing
Make live predictions in your apps and websites.
Select planhelp me choose
Model Serving pricing examples
1. The simple examples above are baseline configurations. Example 2 requires launch charge from cold start.
2. Users may also select “scale to zero” or “very latency sensitive” options, which would change DBU emissions.
4 GB per concurrent request