Proprietary Foundation Model Serving

Serve state-of-the-art proprietary foundation models for real-time and batch inference workloads. You can build applications on high-quality proprietary generative AI models from multiple vendors directly on the Databricks platform, without engaging separately with each vendor.

* For Azure customers: if you have an Azure Commit with Databricks, Databricks may make this service available as an ADI Service that integrates with Azure Databricks. The ADI Service is sold and invoiced by Databricks. Contact Sales to get access.

Proprietary Foundation Model Serving DBU rates

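Pay Per Token rates are in DBU / 1M tokens. Batch Inference rates are in DBU / hour.

| Provider | Model | Endpoint type | Context length | Input (Pay Per Token) | Output (Pay Per Token) | Cache writes (Pay Per Token) | Cache reads (Pay Per Token) | Batch Inference (DBU / hour) |
|---|---|---|---|---|---|---|---|---|
| OpenAI | GPT 5 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| OpenAI | GPT 5 | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| OpenAI | GPT 5 mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| OpenAI | GPT 5 mini | In-geo | All Lengths | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 |
| OpenAI | GPT 5 nano | Global | All Lengths | 0.714 | 5.714 | 0.714 | 0.071 | 53.571 |
| OpenAI | GPT 5 nano | In-geo | All Lengths | 0.786 | 6.286 | 0.786 | 0.078 | 58.929 |
| Anthropic | Claude Opus 4 / 4.1 | Global/In-geo | All Lengths | 214.286 | 1,071.429 | 267.857 | 21.429 | 514.286 |
| Anthropic | Claude Sonnet 4.5 | Global | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Anthropic | Claude Sonnet 4.5 | In-geo | Short Context | 47.143 | 235.715 | 58.928 | 4.715 | 235.715 |
| Anthropic | Claude Sonnet 4.5 | Global | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 |
| Anthropic | Claude Sonnet 4.5 | In-geo | Long Context (>200k tokens) | 94.285 | 353.572 | 117.857 | 9.428 | 235.715 |
| Anthropic | Claude Sonnet 3.7 / 4 / 4.1 | Global/In-geo | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Anthropic | Claude Sonnet 3.7 / 4 / 4.1 | Global/In-geo | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 |
| Anthropic | Claude Haiku 4.5 | Global | All Lengths | 14.286 | 71.429 | 17.857 | 1.429 | n/a |
| Anthropic | Claude Haiku 4.5 | In-geo | All Lengths | 15.715 | 78.572 | 19.643 | 1.572 | n/a |
| Google | Gemini 2.5 Pro | Global/In-geo | Short Context | 17.857 | 142.857 | n/a | n/a | n/a |
| Google | Gemini 2.5 Pro | Global/In-geo | Long Context (>200k tokens) | 35.714 | 214.286 | n/a | n/a | n/a |
| Google | Gemini 2.5 Flash | Global/In-geo | Short Context | 4.286 | 35.714 | n/a | n/a | n/a |
| Google | Gemini 2.5 Flash | Global/In-geo | Long Context (>200k tokens) | 4.286 | 35.714 | n/a | n/a | n/a |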
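
To illustrate how the pay-per-token rates in the table above translate into DBU consumption, here is a minimal Python sketch. The `RATES` dictionary and the `estimate_dbus` function are hypothetical names introduced for this example, not part of any Databricks API. The sketch assumes each token category (input, output, cache writes, cache reads) is metered independently and the charges are simply added. Actual billing may differ, and converting DBUs to currency depends on your DBU price, which is not listed here.

```python
# Illustrative only: rates copied from the table above (DBU per 1M tokens).
# RATES and estimate_dbus are hypothetical helpers, not a Databricks API.
RATES = {
    ("GPT 5", "Global"): {
        "input": 17.857, "output": 142.857,
        "cache_write": 17.857, "cache_read": 1.786,
    },
    ("GPT 5", "In-geo"): {
        "input": 19.643, "output": 157.143,
        "cache_write": 19.643, "cache_read": 1.965,
    },
}


def estimate_dbus(model, endpoint, input_tokens=0, output_tokens=0,
                  cache_write_tokens=0, cache_read_tokens=0):
    """Estimate pay-per-token DBUs, assuming token categories add linearly."""
    rate = RATES[(model, endpoint)]
    per_million = 1_000_000
    return (
        input_tokens / per_million * rate["input"]
        + output_tokens / per_million * rate["output"]
        + cache_write_tokens / per_million * rate["cache_write"]
        + cache_read_tokens / per_million * rate["cache_read"]
    )


# 2M input tokens and 500k output tokens on a Global GPT 5 endpoint.
dbus = estimate_dbus("GPT 5", "Global",
                     input_tokens=2_000_000, output_tokens=500_000)
print(f"{dbus:.2f} DBUs")  # → 107.14 DBUs
```

The same workload on an In-geo endpoint uses the In-geo row of the table, so the estimate changes only through the rates looked up in `RATES`.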

Pay as you go with a 14-day free trial, or contact us for committed-use discounts or custom requirements.

Proprietary Foundation Model Serving FAQ