Proprietary Foundation Model Serving
Serve premium proprietary foundation models for your batch and real-time inference workloads. This lets you quickly and easily build applications that use high-quality proprietary generative AI models from a range of providers directly on the Databricks platform, without extra steps or separate dealings with other vendors.
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Pay-per-token input (DBU / 1M tokens) | Pay-per-token output (DBU / 1M tokens) |
|---|---|---|---|
| OpenAI | | | |
| GPT 5 | Global | 17.857 | 142.857 |
| GPT 5 | In-geo | 19.643 | 157.143 |
| GPT 5 Mini | Global | 3.571 | 28.571 |
| GPT 5 Mini | In-geo | 3.929 | 31.429 |
| GPT 5 Nano | Global | 0.714 | 5.714 |
| GPT 5 Nano | In-geo | 0.786 | 6.286 |
| Anthropic | | | |
| Claude Opus 4.1 | Global | 214.286 | 1,071.43 |
| Claude Sonnet 4.5 | Global | 42.857 | 214.286 |
| Claude Sonnet 4.5 | In-geo | 47.143 | 235.715 |
| Claude Sonnet 4 | Global | 42.857 | 214.286 |
| Claude Sonnet 3.7 | Global | 42.857 | 214.286 |
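
As a rough illustration of how the pay-per-token rates above translate into DBU consumption, the sketch below multiplies token counts by the per-million rates from the table. The function name `pay_per_token_dbus` and the `RATES` dictionary are hypothetical helpers written for this example, not part of any Databricks API, and the token counts are made up. Converting DBUs to a currency amount requires a price per DBU, which this page does not state, so the sketch stops at DBUs.

```python
# Rates copied from the table above: (input, output) in DBU per 1M tokens.
# Only a few rows are included; extend the dictionary as needed.
RATES = {
    ("GPT 5", "Global"): (17.857, 142.857),
    ("GPT 5", "In-geo"): (19.643, 157.143),
    ("Claude Sonnet 4.5", "Global"): (42.857, 214.286),
}


def pay_per_token_dbus(model, endpoint, input_tokens, output_tokens):
    """Estimate DBUs as tokens / 1M * rate, summed over input and output."""
    input_rate, output_rate = RATES[(model, endpoint)]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# Hypothetical workload: 2M input tokens and 500k output tokens on GPT 5 (Global).
dbus = pay_per_token_dbus("GPT 5", "Global", 2_000_000, 500_000)
print(f"Estimated consumption: {dbus:.2f} DBU")
```

For this hypothetical workload the estimate is roughly 107 DBU, about two thirds of it from output tokens, which are billed at eight times the input rate for this model.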
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context length | Pay-per-token input (DBU / 1M tokens) | Pay-per-token output (DBU / 1M tokens) | Pay-per-token cache writes (DBU / 1M tokens) | Pay-per-token cache reads (DBU / 1M tokens) | Batch inference (DBU / hour) |
|---|---|---|---|---|---|---|---|
| OpenAI | | | | | | | |
| GPT 5.2 | Global | All Lengths | 25.000 | 200.000 | 25.000 | 2.500 | 184.286 |
| GPT 5.2 | In-geo | All Lengths | 27.500 | 220.000 | 27.500 | 2.750 | 202.714 |
| GPT 5.1 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5.1 | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5.1 Codex Max | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5.1 Codex Max | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5 | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5 mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| GPT 5 mini | In-geo | All Lengths | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 |
| GPT 5.1 Codex Mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| GPT 5.1 Codex Mini | In-geo | All Lengths | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 |
| GPT 5 nano | Global | All Lengths | 0.714 | 5.714 | 0.714 | 0.071 | 53.571 |
| GPT 5 nano | In-geo | All Lengths | 0.786 | 6.286 | 0.786 | 0.078 | 58.929 |
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context length | Pay-per-token input (DBU / 1M tokens) | Pay-per-token output (DBU / 1M tokens) | Pay-per-token cache writes (DBU / 1M tokens) | Pay-per-token cache reads (DBU / 1M tokens) | Batch inference (DBU / hour) |
|---|---|---|---|---|---|---|---|
| Anthropic | | | | | | | |
| Claude Opus 4.6 | Global | Short Context | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| Claude Opus 4.6 | In-geo | Short Context | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 |
| Claude Opus 4.6 | Global | Long Context (>200k tokens) | 142.858 | 535.715 | 178.572 | 14.286 | 178.571 |
| Claude Opus 4.6 | In-geo | Long Context (>200k tokens) | 157.142 | 589.286 | 196.428 | 15.714 | 196.429 |
| Claude Opus 4.5 | Global | Short Context | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| Claude Opus 4.5 | In-geo | Short Context | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 |
| Claude Opus 4 / 4.1 | Global/In-geo | All Lengths | 214.286 | 1,071.429 | 267.857 | 21.429 | 514.286 |
| Claude Sonnet 4.5 / 4.6 | Global | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Claude Sonnet 4.5 / 4.6 | In-geo | Short Context | 47.143 | 235.715 | 58.928 | 4.715 | 235.715 |
| Claude Sonnet 4.5 / 4.6 | Global | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 |
| Claude Sonnet 4.5 / 4.6 | In-geo | Long Context (>200k tokens) | 94.285 | 353.572 | 117.857 | 9.428 | 235.715 |
| Claude Sonnet 3.7 / 4 / 4.1 (see note) | Global/In-geo | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Claude Sonnet 3.7 / 4 / 4.1 (see note) | Global/In-geo | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 |
| Claude Haiku 4.5 | Global | All Lengths | 14.286 | 71.429 | 17.857 | 1.429 | 114.286 |
| Claude Haiku 4.5 | In-geo | All Lengths | 15.715 | 78.572 | 19.643 | 1.572 | 125.714 |

Note: Claude 3.7 Sonnet will be deprecated on April 12, 2026.
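
The table above prices short-context and long-context requests differently and lists separate rates for cache writes and cache reads. The sketch below shows one way to estimate DBUs under those tiers, using the Claude Sonnet 4.5 / 4.6 Global rates. It rests on an assumption this page does not confirm: that the long-context rates apply to the whole request once the total prompt (uncached input plus cached tokens) exceeds 200k tokens. The names `TokenRates`, `SONNET_GLOBAL`, and `estimate_dbus` are hypothetical and exist only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenRates:
    """Pay-per-token rates in DBU per 1M tokens."""
    input_rate: float
    output_rate: float
    cache_write_rate: float
    cache_read_rate: float


# Threshold taken from the "Long Context (>200k tokens)" label in the table.
LONG_CONTEXT_THRESHOLD = 200_000

# Claude Sonnet 4.5 / 4.6, Global endpoint, copied from the table above.
SONNET_GLOBAL = {
    "short": TokenRates(42.857, 214.286, 53.571, 4.286),
    "long": TokenRates(85.714, 321.429, 107.143, 8.571),
}


def estimate_dbus(tiers, input_tokens, output_tokens,
                  cache_write_tokens=0, cache_read_tokens=0):
    """Return (tier, estimated DBUs) for a single request.

    ASSUMPTION (not stated on the pricing page): the tier is selected by
    total prompt size and then applied to every token in the request.
    """
    prompt_tokens = input_tokens + cache_write_tokens + cache_read_tokens
    tier = "long" if prompt_tokens > LONG_CONTEXT_THRESHOLD else "short"
    rates = tiers[tier]
    total = (
        input_tokens * rates.input_rate
        + output_tokens * rates.output_rate
        + cache_write_tokens * rates.cache_write_rate
        + cache_read_tokens * rates.cache_read_rate
    ) / 1_000_000
    return tier, total


# Hypothetical requests: one below and one above the 200k-token threshold.
for prompt_size in (100_000, 250_000):
    tier, dbus = estimate_dbus(SONNET_GLOBAL, prompt_size, 10_000)
    print(f"{prompt_size} input tokens -> {tier} context, {dbus:.3f} DBU")
```

Under this assumption, crossing the threshold doubles the input rate and raises the output rate by half, so a 250k-token prompt costs far more than 2.5 times a 100k-token one. Check the Databricks billing documentation for how the tier is actually determined before relying on such an estimate.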
Proprietary Foundation Model Serving DBU rates
| Model | Endpoint type | Context length | Pay-per-token input (DBU / 1M tokens) | Pay-per-token output (DBU / 1M tokens) | Pay-per-token cache writes (DBU / 1M tokens) | Pay-per-token cache reads (DBU / 1M tokens) | Batch inference (DBU / hour) |
|---|---|---|---|---|---|---|---|
| Google | | | | | | | |
| Gemini 3.0/3.1 Pro | Global/In-geo | Short Context | 35.714 | 214.286 | 35.714 | 3.571 | 230.357 |
| Gemini 3.0/3.1 Pro | Global/In-geo | Long Context (>200k tokens) | 71.429 | 321.429 | 71.429 | 7.143 | 230.357 |
| Gemini 3.0 Flash | Global/In-geo | Short Context | 8.929 | 53.571 | 8.929 | 0.893 | 125.000 |
| Gemini 3.0 Flash | Global/In-geo | Long Context (>200k tokens) | 8.929 | 53.571 | 8.929 | 0.893 | 125.000 |
| Gemini 2.5 Pro | Global/In-geo | Short Context | 17.857 | 142.857 | n/a | n/a | 164.286 |
| Gemini 2.5 Pro | Global/In-geo | Long Context (>200k tokens) | 35.714 | 214.286 | n/a | n/a | 164.286 |
| Gemini 2.5 Flash | Global/In-geo | Short Context | 4.286 | 35.714 | n/a | n/a | 107.143 |
| Gemini 2.5 Flash | Global/In-geo | Long Context (>200k tokens) | 4.286 | 35.714 | n/a | n/a | 107.143 |
Pay as you go with a 14-day free trial. Or contact us to learn about committed-use discounts and to tell us about your specific needs.