CompactifAI Inference API

The fastest and most affordable way to access leading AI models

Original and Slim by CompactifAI, directly through AWS Marketplace

Why our API?

Lowest Cost for Open-Source Models

Strongest Throughput and Time-to-First-Token (TTFT) to Price Ratio

Plug & Play, No Infrastructure Needed

Scalable Enterprise Deployment, Billed per Usage

Private Endpoints Available on Private Offer

Model Catalog

| Model | Label | Input Cost (USD per 1M tokens) | Output Cost (USD per 1M tokens) |
|---|---|---|---|
| OpenAI gpt-oss-20b | | $0.03 | $0.10 |
| OpenAI gpt-oss-120b | | $0.05 | $0.23 |
| Llama 3.3 70B Slim | CompactifAI | $0.10 | $0.21 |
| Llama 3.3 70B | | $0.15 | $0.31 |
| Mistral Small 3.1 Slim | CompactifAI | $0.05 | $0.08 |
| Mistral Small 3.1 | | $0.11 | $0.17 |
| Llama 3.1 8B Slim | CompactifAI | $0.01 | $0.07 |
| Llama 3.1 8B | | $0.02 | $0.09 |
| Llama 4 Scout Slim | CompactifAI | $0.07 | $0.10 |
| Llama 4 Scout | | $0.10 | $0.14 |
| DeepSeek R1 Slim | CompactifAI | $0.28 | $0.44 |

Whisper Large V3 (New): Transcription Cost $0.00034 per minute of audio
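
To make the catalog prices easier to compare, here is a minimal Python sketch that estimates the cost of a workload. It assumes the listed prices are USD per million tokens and that billing scales linearly with token count and audio minutes. The names `PRICES`, `estimate_cost`, and `estimate_transcription_cost` are hypothetical helpers for illustration and are not part of any CompactifAI API.

```python
from decimal import Decimal

# Prices copied from the model catalog: (input, output) in USD.
# Assumption: each price applies per 1,000,000 tokens.
PRICES = {
    "OpenAI gpt-oss-20b": (Decimal("0.03"), Decimal("0.10")),
    "OpenAI gpt-oss-120b": (Decimal("0.05"), Decimal("0.23")),
    "Llama 3.3 70B Slim": (Decimal("0.10"), Decimal("0.21")),
    "Llama 3.3 70B": (Decimal("0.15"), Decimal("0.31")),
    "Mistral Small 3.1 Slim": (Decimal("0.05"), Decimal("0.08")),
    "Mistral Small 3.1": (Decimal("0.11"), Decimal("0.17")),
    "Llama 3.1 8B Slim": (Decimal("0.01"), Decimal("0.07")),
    "Llama 3.1 8B": (Decimal("0.02"), Decimal("0.09")),
    "Llama 4 Scout Slim": (Decimal("0.07"), Decimal("0.10")),
    "Llama 4 Scout": (Decimal("0.10"), Decimal("0.14")),
    "DeepSeek R1 Slim": (Decimal("0.28"), Decimal("0.44")),
}

# Whisper Large V3 is priced per minute of audio rather than per token.
WHISPER_PRICE_PER_MINUTE = Decimal("0.00034")

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a text workload, assuming linear per-token billing."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


def estimate_transcription_cost(audio_minutes: int) -> Decimal:
    """Estimate the USD cost of transcribing audio, assuming linear per-minute billing."""
    return WHISPER_PRICE_PER_MINUTE * audio_minutes


if __name__ == "__main__":
    # Compare an original model with its Slim variant on the same workload.
    for name in ("Llama 3.3 70B", "Llama 3.3 70B Slim"):
        print(name, estimate_cost(name, 2_000_000, 1_000_000))
    print("Whisper, 60 min:", estimate_transcription_cost(60))
```

Under these assumptions, a workload of 2M input tokens and 1M output tokens comes to $0.61 on Llama 3.3 70B and $0.41 on Llama 3.3 70B Slim. `Decimal` is used instead of floats so that sub-cent prices add up without rounding drift.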

Need a Private Endpoint or Have Questions?

Our team is ready to help you with custom deployments, private offers, and any technical questions you may have.