
Proprietary Foundation Model Serving

Serve state-of-the-art foundation models from Anthropic for both real-time and batch inference workloads. This lets you quickly and easily build applications that use our partners' high-quality generative AI models directly on the Databricks platform, without having to work with other vendors separately.


DBU rates for Proprietary Foundation Model Serving

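Model Pay-Per-Token

OpenAI

| Model | Endpoint type | DBU / 1M input tokens | DBU / 1M output tokens |
| --- | --- | --- | --- |
| GPT 5 | Global | 17.857 | 142.857 |
| GPT 5 | In-geo | 19.643 | 157.143 |
| GPT 5 Mini | Global | 3.571 | 28.571 |
| GPT 5 Mini | In-geo | 3.929 | 31.429 |
| GPT 5 Nano | Global | 0.714 | 5.714 |
| GPT 5 Nano | In-geo | 0.786 | 6.286 |

Anthropic

| Model | Endpoint type | DBU / 1M input tokens | DBU / 1M output tokens |
| --- | --- | --- | --- |
| Claude Opus 4.1 | Global | 214.286 | 1,071.43 |
| Claude Sonnet 4.5 | Global | 42.857 | 214.286 |
| Claude Sonnet 4.5 | In-geo | 47.143 | 235.715 |
| Claude Sonnet 4 | Global | 42.857 | 214.286 |
| Claude Sonnet 3.7 | Global | 42.857 | 214.286 |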

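Proprietary Foundation Model Serving DBU rates

OpenAI

| Model | Endpoint type | Context length | Pay Per Token: Input (DBU / 1M tokens) | Pay Per Token: Output (DBU / 1M tokens) | Pay Per Token: Cache writes (DBU / 1M tokens) | Pay Per Token: Cache reads (DBU / 1M tokens) | Batch Inference (DBU / hour) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| GPT 5.2 | Global | All Lengths | 25.000 | 200.000 | 25.000 | 2.500 | 184.286 |
| GPT 5.2 | In-geo | All Lengths | 27.500 | 220.000 | 27.500 | 2.750 | 202.714 |
| GPT 5.1 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5.1 | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5.1 Codex Max | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5.1 Codex Max | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5 | Global | All Lengths | 17.857 | 142.857 | 17.857 | 1.786 | 131.429 |
| GPT 5 | In-geo | All Lengths | 19.643 | 157.143 | 19.643 | 1.965 | 144.571 |
| GPT 5 mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| GPT 5 mini | In-geo | All Lengths | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 |
| GPT 5.1 Codex Mini | Global | All Lengths | 3.571 | 28.571 | 3.571 | 0.357 | 71.429 |
| GPT 5.1 Codex Mini | In-geo | All Lengths | 3.929 | 31.429 | 3.929 | 0.393 | 78.571 |
| GPT 5 nano | Global | All Lengths | 0.714 | 5.714 | 0.714 | 0.071 | 53.571 |
| GPT 5 nano | In-geo | All Lengths | 0.786 | 6.286 | 0.786 | 0.078 | 58.929 |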
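
As a rough illustration of how the pay-per-token rates translate into DBUs, the sketch below multiplies token counts by the GPT 5 Global rates listed above. The function name `pay_per_token_dbus`, the dictionary layout, and the example usage figures are hypothetical, and the sketch assumes that input, output, cache-write, and cache-read tokens are metered as separate categories. Converting DBUs to currency requires a per-DBU price that this page does not state.

```python
# Rates in DBU per 1M tokens, copied from the GPT 5 "Global" row above.
GPT5_GLOBAL_RATES = {
    "input": 17.857,
    "output": 142.857,
    "cache_write": 17.857,
    "cache_read": 1.786,
}


def pay_per_token_dbus(rates, tokens):
    """Return total DBUs for a dict of token counts keyed like `rates`.

    Assumption: each token category is billed independently at its own rate.
    """
    return sum(rates[kind] * count / 1_000_000 for kind, count in tokens.items())


# Hypothetical usage: 2M input tokens, 1M output tokens, 1M cache-read tokens.
usage = {"input": 2_000_000, "output": 1_000_000, "cache_read": 1_000_000}
total = pay_per_token_dbus(GPT5_GLOBAL_RATES, usage)
print(f"{total:.3f} DBU")  # 2 * 17.857 + 142.857 + 1.786 = 180.357 DBU
```

Swapping in another row's rates (for example an In-geo row) only requires changing the dictionary values.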

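Proprietary Foundation Model Serving DBU rates

Anthropic

| Model | Endpoint type | Context length | Pay Per Token: Input (DBU / 1M tokens) | Pay Per Token: Output (DBU / 1M tokens) | Pay Per Token: Cache writes (DBU / 1M tokens) | Pay Per Token: Cache reads (DBU / 1M tokens) | Batch Inference (DBU / hour) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Claude Opus 4.5 / 4.6 | Global | Short Context | 71.429 | 357.143 | 89.286 | 7.143 | 178.571 |
| Claude Opus 4.5 / 4.6 | In-geo | Short Context | 78.571 | 392.857 | 98.214 | 7.857 | 196.429 |
| Claude Opus 4 / 4.1 | Global/In-geo | All Lengths | 214.286 | 1,071.429 | 267.857 | 21.429 | 514.286 |
| Claude Sonnet 4.5 / 4.6 | Global | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Claude Sonnet 4.5 / 4.6 | In-geo | Short Context | 47.143 | 235.715 | 58.928 | 4.715 | 235.715 |
| Claude Sonnet 4.5 / 4.6 | Global | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 |
| Claude Sonnet 4.5 / 4.6 | In-geo | Long Context (>200k tokens) | 94.285 | 353.572 | 117.857 | 9.428 | 235.715 |
| Claude Sonnet 3.7 / 4 / 4.1 | Global/In-geo | Short Context | 42.857 | 214.286 | 53.571 | 4.286 | 214.286 |
| Claude Sonnet 3.7 / 4 / 4.1 | Global/In-geo | Long Context (>200k tokens) | 85.714 | 321.429 | 107.143 | 8.571 | 214.286 |
| Claude Haiku 4.5 | Global | All Lengths | 14.286 | 71.429 | 17.857 | 1.429 | 114.286 |
| Claude Haiku 4.5 | In-geo | All Lengths | 15.715 | 78.572 | 19.643 | 1.572 | 125.714 |

Claude 3.7 Sonnet will be deprecated on April 12, 2026.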
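
The short-context and long-context rates above can be compared with a small calculation. The sketch below uses the Claude Sonnet 4.5 / 4.6 Global input and output rates and picks a tier from the request's input token count. This page does not say how the ">200k tokens" threshold is applied, so the rule used here (the tier is chosen per request from its input token count, and the whole request is billed at that tier) is an assumption, and `request_dbus` is a hypothetical helper name.

```python
LONG_CONTEXT_THRESHOLD = 200_000  # tokens, from the "(>200k tokens)" label

# DBU per 1M tokens for Claude Sonnet 4.5 / 4.6 on the Global endpoint,
# copied from the table above.
SONNET_GLOBAL = {
    "short": {"input": 42.857, "output": 214.286},
    "long": {"input": 85.714, "output": 321.429},
}


def request_dbus(input_tokens, output_tokens, rates=SONNET_GLOBAL):
    """Return (tier, DBUs) for one request.

    Assumption (not stated on the pricing page): the tier is selected from
    the request's input token count and applies to the entire request.
    """
    tier = "long" if input_tokens > LONG_CONTEXT_THRESHOLD else "short"
    r = rates[tier]
    dbus = (input_tokens * r["input"] + output_tokens * r["output"]) / 1_000_000
    return tier, dbus


# Hypothetical requests on either side of the threshold.
for inp, out in [(100_000, 1_000), (300_000, 1_000)]:
    tier, dbus = request_dbus(inp, out)
    print(f"{inp} input / {out} output tokens -> {tier} context, {dbus:.3f} DBU")
```

Under this assumption, a request just over the threshold costs roughly twice as much per input token as one just under it, so it may be worth checking prompt sizes near 200k tokens.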

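Proprietary Foundation Model Serving DBU rates

Google

| Model | Endpoint type | Context length | Pay Per Token: Input (DBU / 1M tokens) | Pay Per Token: Output (DBU / 1M tokens) | Pay Per Token: Cache writes (DBU / 1M tokens) | Pay Per Token: Cache reads (DBU / 1M tokens) | Batch Inference (DBU / hour) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3.0 Pro | Global/In-geo | Short Context | 35.714 | 214.286 | 35.714 | 3.571 | n/a |
| Gemini 3.0 Pro | Global/In-geo | Long Context (>200k tokens) | 71.429 | 321.429 | 71.429 | 7.143 | n/a |
| Gemini 3.0 Flash | Global/In-geo | Short Context | 8.929 | 53.571 | 8.929 | 0.893 | n/a |
| Gemini 3.0 Flash | Global/In-geo | Long Context (>200k tokens) | 8.929 | 53.571 | 8.929 | 0.893 | n/a |
| Gemini 2.5 Pro | Global/In-geo | Short Context | 17.857 | 142.857 | n/a | n/a | n/a |
| Gemini 2.5 Pro | Global/In-geo | Long Context (>200k tokens) | 35.714 | 214.286 | n/a | n/a | n/a |
| Gemini 2.5 Flash | Global/In-geo | Short Context | 4.286 | 35.714 | n/a | n/a | n/a |
| Gemini 2.5 Flash | Global/In-geo | Long Context (>200k tokens) | 4.286 | 35.714 | n/a | n/a | n/a |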

Pay as you go with a 14-day free trial, or contact us for committed-use discounts or custom requirements.

Proprietary Foundation Model Serving FAQ