Proprietary Foundation Model Serving
Ofereça modelos básicos abertos de última geração para atender às necessidades de carga de trabalho de inferência em tempo real e em lote. Isto permite que você crie, de forma rápida e simples, aplicações que utilizam modelos proprietários de AI generativa de alta qualidade, de vários fornecedores, diretamente na plataforma Databricks, sem precisar se envolver adicionalmente e separadamente com outros fornecedores.
Loading...
Tarifas DBU para fornecimento do Proprietary Foundation Model Serving
| Model | Pay-Per-Token | ||
|---|---|---|---|
| DBU / 1M INPUT tokens (Global) | DBU / 1M OUTPUT tokens (Global) | ||
| OpenAI | |||
| GPT 5 | Global | 17.857 | 142.857 |
| In-geo | 19.643 | 157.143 | |
| GPT 5 Mini | Global | 3.571 | 28.571 |
| In-geo | 3.929 | 31.429 | |
| GPT 5 Nano | Global | 0.714 | 5.714 |
| In-geo | 0.786 | 6.286 | |
| Anthropic | |||
| Claude Opus 4.1 | Global | 214.286 | 1,071.43 |
| Claude Sonnet 4.5 | Global | 42.857 | 214.286 |
| In-geo | 47.143 | 235.715 | |
| Claude Sonnet 4 | Global | 42.857 | 214.286 |
| Claude Sonnet 3.7 | Global | 42.857 | 214.286 |
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context Length | Pay Per Token | Batch Inference | |||
|---|---|---|---|---|---|---|---|
| Input | Output | Cache writes | Cache reads | ||||
| DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / hour | |||
| OpenAI | |||||||
| GPT 5.2 | Global | All Lengths | 25.000 | 200.000 | 25.000 | 2.500 | 184.286 |
| In-geo | 27.500 | 220.000 | 27.500 | 2.750 | 202.714 | ||
| GPT 5.1 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| In-geo | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 | ||
| GPT 5.1 Codex Max | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| In-geo | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 | ||
| GPT 5 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| In-geo | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 | ||
| GPT 5 mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| In-geo | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 | ||
| GPT 5.1 Codex Mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| In-geo | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 | ||
| GPT 5 nano | Global | All Lengths | 0.714 | 5.714 | 0.714 | 0.071 | 53.571 |
| In-geo | 0.786 | 6.286 | 0.786 | 0.078 | 58.929 | ||
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context Length | Pay Per Token | Batch Inference | |||
|---|---|---|---|---|---|---|---|
| Input | Output | Cache writes | Cache reads | ||||
| DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / hour | |||
| Anthropic | |||||||
| Claude Opus 4.6 | Global | Short Context | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| In-geo | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 | ||
| Global | Long Context (>200k tokens) | 142.858 | 535.715 | 178.572 | 14.286 | 178.571 | |
| In-geo | 157.142 | 589.286 | 196.428 | 15.714 | 196.429 | ||
| Claude Opus 4.5 | Global | Short Context | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| In-geo | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 | ||
| Claude Opus 4 / 4.1 | Global/In-geo | All Lengths | 214.286 | 1,071.429 | 267.857 | 21.429 | 514.286 |
| Claude Sonnet 4.5 / 4.6 | Global | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| In-geo | 47.143 | 235.715 | 58.928 | 4.715 | 235.715 | ||
| Global | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 | |
| In-geo | 94.285 | 353.572 | 117.857 | 9.428 | 235.715 | ||
| Claude Sonnet 3.7 / 4 / 4.1 Claude 3.7 Sonnet will be deprecated on April 12, 2026 | Global/In-geo | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 | ||
| Claude Haiku 4.5 | Global | All Lengths | 14.286 | 71.429 | 17.857 | 1.429 | 114.286 |
| In-geo | 15.715 | 78.572 | 19.643 | 1.572 | 125.714 | ||
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context Length | Pay Per Token | Batch Inference | |||
|---|---|---|---|---|---|---|---|
| Input | Output | Cache writes | Cache reads | ||||
| DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / 1M Tokens | DBU / hour | |||
| Gemini 3.0/3.1 Pro | Global/In-geo | Short Context | 35.714 | 214.286 | 35.714 | 3.571 | 230.357 |
| Long Context (>200k tokens) | 71.429 | 321.429 | 71.429 | 7.143 | 230.357 | ||
| Gemini 3.0 Flash | Global/In-geo | Short Context | 8.929 | 53.571 | 8.929 | 0.893 | 125.000 |
| Long Context (>200k tokens) | 8.929 | 53.571 | 8.929 | 0.893 | 125.000 | ||
| Gemini 2.5 Pro | Global/In-geo | Short Context | 17.857 | 142.857 | n/a | n/a | 164.286 |
| Long Context (>200k tokens) | 35.714 | 214.286 | n/a | n/a | 164.286 | ||
| Gemini 2.5 Flash | Global/In-geo | Short Context | 4.286 | 35.714 | n/a | n/a | 107.143 |
| Long Context (>200k tokens) | 4.286 | 35.714 | n/a | n/a | 107.143 | ||
Pague conforme o uso com um teste gratuito de 14 dias ou entre em contato conosco para obter descontos de uso contínuo ou requisitos personalizados.