
Proprietary Foundation Model Serving

Serve top-tier foundation models for your batch and real-time inference workloads. This lets you quickly and easily build applications that use high-quality proprietary generative AI models from various providers directly on the Databricks platform, without additional steps or separate dealings with other providers.


Proprietary Foundation Model Serving DBU rates

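Pay-Per-Token

| Provider | Model | Endpoint type | DBU / 1M input tokens | DBU / 1M output tokens |
|---|---|---|---|---|
| OpenAI | GPT 5 | Global | 17.857 | 142.857 |
| OpenAI | GPT 5 | In-geo | 19.643 | 157.143 |
| OpenAI | GPT 5 Mini | Global | 3.571 | 28.571 |
| OpenAI | GPT 5 Mini | In-geo | 3.929 | 31.429 |
| OpenAI | GPT 5 Nano | Global | 0.714 | 5.714 |
| OpenAI | GPT 5 Nano | In-geo | 0.786 | 6.286 |
| Anthropic | Claude Opus 4.1 | Global | 214.286 | 1,071.43 |
| Anthropic | Claude Sonnet 4.5 | Global | 42.857 | 214.286 |
| Anthropic | Claude Sonnet 4.5 | In-geo | 47.143 | 235.715 |
| Anthropic | Claude Sonnet 4 | Global | 42.857 | 214.286 |
| Anthropic | Claude Sonnet 3.7 | Global | 42.857 | 214.286 |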

Proprietary Foundation Model Serving DBU rates

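OpenAI

The Input, Output, Cache writes and Cache reads columns are Pay Per Token rates in DBU / 1M tokens. The Batch Inference column is in DBU / hour.

| Model | Endpoint type | Context Length | Input | Output | Cache writes | Cache reads | Batch Inference |
|---|---|---|---|---|---|---|---|
| GPT 5.5 | Global | Short | 71.429 | 428.571 | 71.429 | 7.143 | 214.286 |
| GPT 5.5 | In-geo | Short | 78.572 | 471.428 | 78.572 | 7.857 | 235.715 |
| GPT 5.5 | Global | Long | 142.857 | 642.857 | 142.857 | 14.286 | 214.286 |
| GPT 5.5 | In-geo | Long | 157.143 | 707.143 | 157.143 | 15.714 | 235.715 |
| GPT 5.4 Pro | Global | Short | 428.571 | 2,571.429 | 428.571 | 42.857 | 1,142.857 |
| GPT 5.4 Pro | In-geo | Short | 471.428 | 2,828.572 | 471.428 | 47.143 | 1,257.143 |
| GPT 5.4 Pro | Global | Long | 857.142 | 3,857.144 | 857.142 | 85.714 | 1,142.857 |
| GPT 5.4 Pro | In-geo | Long | 942.856 | 4,242.858 | 942.856 | 94.286 | 1,257.143 |
| GPT 5.4 | Global | Short | 35.714 | 214.286 | 35.714 | 3.571 | 192.857 |
| GPT 5.4 | In-geo | Short | 39.285 | 235.715 | 39.285 | 3.929 | 212.143 |
| GPT 5.4 | Global | Long | 71.428 | 321.429 | 71.428 | 7.143 | 192.857 |
| GPT 5.4 | In-geo | Long | 78.571 | 353.572 | 78.571 | 7.857 | 212.143 |
| GPT 5.4 mini | Global | All Lengths | 10.714 | 64.286 | 10.714 | 1.071 | 107.143 |
| GPT 5.4 mini | In-geo | All Lengths | 11.786 | 70.714 | 11.786 | 1.179 | 117.857 |
| GPT 5.4 nano | Global | All Lengths | 2.857 | 17.857 | 2.857 | 0.286 | 71.429 |
| GPT 5.4 nano | In-geo | All Lengths | 3.143 | 19.643 | 3.143 | 0.314 | 78.571 |
| GPT 5.2/5.3 Codex | Global | All Lengths | 25.000 | 200.000 | 25.000 | 2.500 | n/a |
| GPT 5.2/5.3 Codex | In-geo | All Lengths | 27.500 | 220.000 | 27.500 | 2.750 | n/a |
| GPT 5.2 | Global | All Lengths | 25.000 | 200.000 | 25.000 | 2.500 | 184.286 |
| GPT 5.2 | In-geo | All Lengths | 27.500 | 220.000 | 27.500 | 2.750 | 202.714 |
| GPT 5.1 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5.1 | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5.1 Codex Max | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | n/a |
| GPT 5.1 Codex Max | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | n/a |
| GPT 5 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5 | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5 mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| GPT 5 mini | In-geo | All Lengths | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 |
| GPT 5.1 Codex Mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | n/a |
| GPT 5.1 Codex Mini | In-geo | All Lengths | 3.929 | 31.429 | 3.929 | 0.393 | n/a |
| GPT 5 nano | Global | All Lengths | 0.714 | 5.714 | 0.714 | 0.071 | 53.571 |
| GPT 5 nano | In-geo | All Lengths | 0.786 | 6.286 | 0.786 | 0.078 | 58.929 |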
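
As a rough illustration of how the pay-per-token columns might combine, the sketch below estimates DBU consumption for a single workload. The `TokenRates` class and `estimate_dbus` function are hypothetical names introduced here. The rates are copied from the GPT 5 Global row. The code assumes that the four token categories are billed additively and that cached tokens are not also counted as regular input, which the table does not state. Converting DBUs to currency requires the DBU price for your plan, which is not shown on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenRates:
    """Pay-per-token rates in DBU per 1M tokens (hypothetical container)."""
    input: float
    output: float
    cache_write: float
    cache_read: float


# Rates copied from the GPT 5 / Global / All Lengths row of the OpenAI table.
GPT_5_GLOBAL = TokenRates(input=17.857, output=142.857,
                          cache_write=17.857, cache_read=1.786)


def estimate_dbus(rates, input_tokens=0, output_tokens=0,
                  cache_write_tokens=0, cache_read_tokens=0):
    """Estimate DBUs, assuming each token category is billed additively."""
    weighted_tokens = (
        input_tokens * rates.input
        + output_tokens * rates.output
        + cache_write_tokens * rates.cache_write
        + cache_read_tokens * rates.cache_read
    )
    # Rates are quoted per one million tokens.
    return weighted_tokens / 1_000_000


if __name__ == "__main__":
    # Example: 2M input tokens and 0.5M output tokens, no caching.
    print(estimate_dbus(GPT_5_GLOBAL, input_tokens=2_000_000,
                        output_tokens=500_000))
```

Swapping in another row only requires a new `TokenRates` instance, so Global and In-geo estimates for the same workload can be compared side by side.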

Proprietary Foundation Model Serving DBU rates

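Anthropic

The Input, Output, Cache writes and Cache reads columns are Pay Per Token rates in DBU / 1M tokens. The Batch Inference column is in DBU / hour.

| Model | Endpoint type | Context Length | Input | Output | Cache writes | Cache reads | Batch Inference |
|---|---|---|---|---|---|---|---|
| Claude Opus 4.5 / 4.6 / 4.7 / 4.8 | Global | All Lengths | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| Claude Opus 4.5 / 4.6 / 4.7 / 4.8 | In-geo | All Lengths | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 |
| Claude Opus 4 / 4.1 | Global/In-geo | All Lengths | 214.286 | 1,071.429 | 267.857 | 21.429 | 514.286 |
| Claude Sonnet 4.5 / 4.6 | Global | All Lengths | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Claude Sonnet 4.5 / 4.6 | In-geo | All Lengths | 47.143 | 235.715 | 58.928 | 4.715 | 235.715 |
| Claude Sonnet 4 | Global/In-geo | All Lengths | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Claude Haiku 4.5 | Global | All Lengths | 14.286 | 71.429 | 17.857 | 1.429 | 114.286 |
| Claude Haiku 4.5 | In-geo | All Lengths | 15.715 | 78.572 | 19.643 | 1.572 | 125.714 |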

Proprietary Foundation Model Serving DBU rates

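Google

The Input, Output, Cache writes and Cache reads columns are Pay Per Token rates in DBU / 1M tokens. The Batch Inference column is in DBU / hour.

| Model | Endpoint type | Context Length | Input | Output | Cache writes | Cache reads | Batch Inference |
|---|---|---|---|---|---|---|---|
| Gemini 3.5 Flash | Global | All Lengths | 26.786 | 160.714 | 26.786 | 2.679 | 196.429 |
| Gemini 3.5 Flash | In-geo | All Lengths | 29.464 | 176.786 | 29.464 | 2.946 | 216.071 |
| Gemini 3.1 Flash Lite | Global | All Lengths | 4.464 | 26.786 | 4.464 | 0.446 | 89.286 |
| Gemini 3.1 Flash Lite | In-geo | All Lengths | 4.911 | 29.464 | 4.911 | 0.491 | 98.214 |
| Gemini 3.0 / 3.1 Pro | Global/In-geo | Short Context | 35.714 | 214.286 | 35.714 | 3.571 | 230.429 |
| Gemini 3.0 / 3.1 Pro | Global/In-geo | Long Context (>200k tokens) | 71.429 | 321.429 | 71.429 | 7.143 | 230.429 |
| Gemini 3.0 Flash | Global/In-geo | All Lengths | 8.929 | 53.571 | 8.929 | 0.893 | 125.000 |
| Gemini 2.5 Pro | Global/In-geo | Short Context | 22.321 | 178.571 | 22.321 | 2.232 | 164.286 |
| Gemini 2.5 Pro | Global/In-geo | Long Context (>200k tokens) | 44.643 | 267.857 | 44.643 | 4.464 | 164.286 |
| Gemini 2.5 Flash | Global/In-geo | All Lengths | 5.357 | 44.643 | 5.357 | 0.536 | 107.143 |
| Gemini 2.5 Flash Lite | Global/In-geo | All Lengths | 1.786 | 7.143 | 1.786 | 0.179 | n/a |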
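
For models with separate short and long context rates, a request has to be matched to a tier before it can be priced. The sketch below shows one way to do that for Gemini 2.5 Pro, using the ">200k tokens" threshold from the table. The names `context_tier` and `request_dbus` are hypothetical. The code assumes that the threshold is measured on the prompt (input) tokens and that the selected tier's rates apply to the whole request. The table does not confirm either assumption, so check the provider's billing documentation before relying on it.

```python
# Rates copied from the Gemini 2.5 Pro rows of the Google table (DBU per 1M tokens).
GEMINI_2_5_PRO = {
    "short": {"input": 22.321, "output": 178.571},
    "long": {"input": 44.643, "output": 267.857},
}

# Threshold taken from the table label "Long Context (>200k tokens)".
LONG_CONTEXT_THRESHOLD = 200_000


def context_tier(prompt_tokens):
    """Return "long" if the prompt exceeds the threshold, else "short".

    Assumption: the threshold is applied to prompt tokens only.
    """
    return "long" if prompt_tokens > LONG_CONTEXT_THRESHOLD else "short"


def request_dbus(prompt_tokens, output_tokens, rates=GEMINI_2_5_PRO):
    """Estimate DBUs for one request.

    Assumption: the selected tier's rates apply to the entire request.
    """
    tier = rates[context_tier(prompt_tokens)]
    weighted = prompt_tokens * tier["input"] + output_tokens * tier["output"]
    return weighted / 1_000_000


if __name__ == "__main__":
    print(context_tier(150_000), request_dbus(150_000, 2_000))
    print(context_tier(250_000), request_dbus(250_000, 2_000))
```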

NOTE: The Gemini model DBU rates shown here do not include the 20% promotional discount; promotional pricing is 20% lower than shown. The promotion runs until June 30, 2026, after which all prices revert to the DBU rates shown in this table.
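
The arithmetic behind the note can be sketched as follows. The function name `gemini_effective_rate` is hypothetical. The code assumes that June 30, 2026 is itself the last discounted day and that every date up to then is covered. The note gives no start date and does not say whether the end date is inclusive.

```python
from datetime import date

PROMO_DISCOUNT = 0.20          # 20% off the listed rate, per the note
PROMO_END = date(2026, 6, 30)  # assumed to be the last discounted day


def gemini_effective_rate(list_rate, on=None):
    """Return the DBU rate assumed to be in effect on a date (default: today)."""
    on = on or date.today()
    if on <= PROMO_END:
        return list_rate * (1 - PROMO_DISCOUNT)
    return list_rate


if __name__ == "__main__":
    # Gemini 3.5 Flash, Global, input rate from the table: 26.786 DBU / 1M tokens.
    print(gemini_effective_rate(26.786, date(2026, 6, 1)))
    print(gemini_effective_rate(26.786, date(2026, 7, 1)))
```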

Pay as you go with a 14-day free trial, or contact us to learn about committed-spend discounts and to discuss your specific requirements.

Partner Foundation Model Serving FAQ