Servicio de modelos fundacionales propietarios
Utilice modelos fundacionales propietarios de última generación para las necesidades de cargas de trabajo de inferencia en tiempo real y por lotes. Esto le permite crear aplicaciones de forma rápida y sencilla que aprovechan modelos de IA generativa propietarios de alta calidad de varios proveedores directamente en la plataforma Databricks, sin la necesidad de interactuar adicional y separadamente con otros proveedores.
* Para los clientes de Azure, si tienes un compromiso de Azure con Databricks, Databricks puede poner a disposición este servicio como un servicio ADI que se integra con Azure Databricks. El servicio ADI es vendido y facturado por Databricks. Contactar a Ventas para obtener acceso.
1. Azure Databricks, como un servicio de origen en Microsoft Azure, ofrece facturación y soporte unificados por parte de Microsoft.
1. El nivel Premium en Azure Databricks corresponde al nivel Enterprise en AWS y GCP
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context Length | Pay Per Token | Batch Inference | |||
|---|---|---|---|---|---|---|---|
| Input | Output | Cache writes | Cache reads | ||||
| DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / hour | |||
| OpenAI | |||||||
| GPT 5.2 | Global | All Lengths | 25.000 | 200.000 | 25.000 | 2.500 | 184.286 |
| In-geo | 27.500 | 220.000 | 27.500 | 2.750 | 202.714 | ||
| GPT 5.1 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| In-geo | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 | ||
| GPT 5.1 Codex Max | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| In-geo | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 | ||
| GPT 5 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| In-geo | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 | ||
| GPT 5 mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| In-geo | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 | ||
| GPT 5.1 Codex Mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| In-geo | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 | ||
| GPT 5 nano | Global | All Lengths | 0.714 | 5.714 | 0.714 | 0.071 | 53.571 |
| In-geo | 0.786 | 6.286 | 0.786 | 0.078 | 58.929 | ||
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context Length | Pay Per Token | Batch Inference | |||
|---|---|---|---|---|---|---|---|
| Input | Output | Cache writes | Cache reads | ||||
| DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / hour | |||
| Anthropic | |||||||
| Claude Opus 4.6 | Global | Short Context | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| In-geo | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 | ||
| Global | Long Context (>200k tokens) | 142.858 | 535.715 | 178.572 | 14.286 | 178.571 | |
| In-geo | 157.142 | 589.286 | 196.428 | 15.714 | 196.429 | ||
| Claude Opus 4.5 | Global | Short Context | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| In-geo | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 | ||
| Claude Opus 4 / 4.1 | Global/In-geo | All Lengths | 214.286 | 1,071.429 | 267.857 | 21.429 | 514.286 |
| Claude Sonnet 4.5 / 4.6 | Global | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| In-geo | 47.143 | 235.715 | 58.928 | 4.715 | 235.715 | ||
| Global | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 | |
| In-geo | 94.285 | 353.572 | 117.857 | 9.428 | 235.715 | ||
| Claude Sonnet 3.7 / 4 / 4.1 Claude 3.7 Sonnet will be deprecated on April 12, 2026 | Global/In-geo | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 | ||
| Claude Haiku 4.5 | Global | All Lengths | 14.286 | 71.429 | 17.857 | 1.429 | 114.286 |
| In-geo | 15.715 | 78.572 | 19.643 | 1.572 | 125.714 | ||
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context Length | Pay Per Token | Batch Inference | |||
|---|---|---|---|---|---|---|---|
| Input | Output | Cache writes | Cache reads | ||||
| DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / hour | |||
| Gemini 3.0/3.1 Pro | Global/In-geo | Short Context | 35.714 | 214.286 | 35.714 | 3.571 | 230.357 |
| Long Context (>200k tokens) | 71.429 | 321.429 | 71.429 | 7.143 | 230.357 | ||
| Gemini 3.0 Flash | Global/In-geo | Short Context | 8.929 | 53.571 | 8.929 | 0.893 | 125.000 |
| Long Context (>200k tokens) | 8.929 | 53.571 | 8.929 | 0.893 | 125.000 | ||
| Gemini 2.5 Pro | Global/In-geo | Short Context | 17.857 | 142.857 | n/a | n/a | 164.286 |
| Long Context (>200k tokens) | 35.714 | 214.286 | n/a | n/a | 164.286 | ||
| Gemini 2.5 Flash | Global/In-geo | Short Context | 4.286 | 35.714 | n/a | n/a | 107.143 |
| Long Context (>200k tokens) | 4.286 | 35.714 | n/a | n/a | 107.143 | ||
Paga por uso con una prueba gratuita de 14 días o contáctanos para obtener descuentos por compromiso de uso o para requisitos personalizados.