Deploy at Scale. Infer in Real Time. Pay for What You Use.
Model APIs
Spin up and run inference in seconds — only pay for what you use.
Price Per 1M Tokens :
Model
Input
Output

Mistral: Mistral Small 3.2 24B
$0.075
$0.2

Mistral: Mistral Nemo
$0.025
$0.05

TheDrummer: UnslopNemo 12B
$0.37
$0.37

TheDrummer: Rocinante 12B
$0.19
$0.40

NeverSleep: Lumimaid v0.2 8B
$0.16
$0.9
Talk to Engineering