Proprietary Foundation Model Serving
Serve state-of-the-art proprietary foundation models for real-time and batch inference workload needs. This enables you to quickly and easily build applications that leverage high-quality proprietary generative AI models from various vendors directly on the Databricks platform without the need to additionally and separately engage with other vendors.
Loading...
* For Azure customers, if you have an Azure Commit with Databricks, Databricks may make available this service as an ADI Service that integrates with Azure Databricks. The ADI Service is sold and invoiced by Databricks. Contact Sales to get access.
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context Length | Pay Per Token | Batch Inference | |||
|---|---|---|---|---|---|---|---|
| Input | Output | Cache writes | Cache reads | ||||
| DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / hour | |||
| OpenAI | |||||||
| GPT 5 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| In-geo | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 | ||
| GPT 5 mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| In-geo | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 | ||
| GPT 5 nano | Global | All Lengths | 0.714 | 5.714 | 0.714 | 0.071 | 53.571 |
| In-geo | 0.786 | 6.286 | 0.786 | 0.078 | 58.929 | ||
| Anthropic | |||||||
| Claude Opus 4 / 4.1 | Global/In-geo | All Lengths | 214.286 | 1,071.429 | 267.857 | 21.429 | 514.286 |
| Claude Sonnet 4.5 | Global | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| In-geo | 47.143 | 235.715 | 58.928 | 4.715 | 235.715 | ||
| Global | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 | |
| In-geo | 94.285 | 353.572 | 117.857 | 9.428 | 235.715 | ||
| Claude Sonnet 3.7 / 4 / 4.1 | Global/In-geo | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 | ||
| Claude Haiku 4.5 | Global | All Lengths | 14.286 | 71.429 | 17.857 | 1.429 | n/a |
| In-geo | 15.715 | 78.572 | 19.643 | 1.572 | n/a | ||
| Gemini 2.5 Pro | Global/In-geo | Short Context | 17.857 | 142.857 | n/a | n/a | n/a |
| Long Context (>200k tokens) | 35.714 | 214.286 | n/a | n/a | n/a | ||
| Gemini 2.5 Flash | Global/In-geo | Short Context | 4.286 | 35.714 | n/a | n/a | n/a |
| Long Context (>200k tokens) | 4.286 | 35.714 | n/a | n/a | n/a | ||
Pay as you go with a 14-day free trial or contact us for committed-use discounts or custom requirements.