Model Serving Pricing

Model Serving

Make live predictions in your apps and websites

GPU Model Serving DBU Rate

Instance Size | GPU configuration | DBUs / hour
Small | T4 or equivalent | 10.48
Medium | A10G x 1 GPU or equivalent | 20.00
Medium 4X | A10G x 4 GPU or equivalent | 112.00
Medium 8x | A10G x 8 GPU or equivalent | 290.80
XLarge | A100 40GB x 8 GPU or equivalent | 538.40
XLarge | A100 80GB x 8 GPU or equivalent | 628.00
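
As a rough illustration of how the rates above turn into a bill, the sketch below multiplies an instance's DBUs per hour by the hours it runs and by a price per DBU. The DBU rates are copied from the table. The dictionary keys, the function names, and the price per DBU are assumptions made for this example: the page does not state a price per DBU, which depends on the plan, cloud, and region selected.

```python
# DBU consumption rates (DBUs per hour), copied from the table above.
# The key labels are made up for this sketch so the two XLarge rows stay distinct.
GPU_DBUS_PER_HOUR = {
    "Small": 10.48,
    "Medium": 20.00,
    "Medium 4X": 112.00,
    "Medium 8x": 290.80,
    "XLarge (A100 40GB)": 538.40,
    "XLarge (A100 80GB)": 628.00,
}


def dbus_consumed(instance_size: str, hours: float) -> float:
    """Return the DBUs consumed by one instance running for `hours` hours."""
    if hours < 0:
        raise ValueError("hours must be non-negative")
    return GPU_DBUS_PER_HOUR[instance_size] * hours


def estimated_cost(instance_size: str, hours: float, price_per_dbu: float) -> float:
    """Return the estimated cost: DBUs consumed times the price per DBU.

    `price_per_dbu` is a hypothetical input; look up the real value for
    your plan, cloud, and region.
    """
    return dbus_consumed(instance_size, hours) * price_per_dbu


if __name__ == "__main__":
    # PLACEHOLDER price per DBU, not taken from the pricing page.
    example_price_per_dbu = 0.10
    cost = estimated_cost("Medium", hours=24, price_per_dbu=example_price_per_dbu)
    print(f"Medium for 24 hours: {dbus_consumed('Medium', 24):.2f} DBUs, "
          f"estimated cost {cost:.2f} at the placeholder price")
```

The calculation assumes a single instance running continuously at the listed rate; it does not model scaling to zero, multiple concurrent instances, or committed-use discounts.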

Pay as you go with a 14-day free trial or contact us for committed-use discounts or custom requirements.

FAQ

Our regional prices are based on the regional cost of the infrastructure supporting our serverless products.