Skip to content
Inference Exchange
How it works
Menu
🔥
Trending
New
Chat
Code
Reasoning
Vision
Agents
Open Source
Cheapest
Fastest
chat · Meta
Llama 3.3 70B
$0.87
/1M output
â–¼
decreased by
2.4%
groq
$0.87
together
$0.97
Output /1M
$0.87
Input /1M
$0.65
Latency
~42ms
Providers
2
Hot Models
›
1
Kimi K2.5
$3.30
â–¼
decreased by
11.8%
2
DeepSeek R1 70B
$0.87
â–¼
decreased by
8.1%
3
Llama 3.1 8B
$0.09
â–¼
decreased by
5.0%
4
GPT-4o Mini
$0.66
â–¼
decreased by
3.5%
5
Llama 3.3 70B
$0.87
â–¼
decreased by
2.4%
Inference Insights
›
2h ago
DeepSeek R1 output price dropped 8.1% this week
5h ago
Kimi K2 now available via Together AI
1d ago
Groq inference uptime: 99.99% this month
2d ago
Llama 3.3 70B crosses 500K daily requests
Explore all
All models
9 models
chat
Llama 3.3 70B
Meta · 2 providers
$0.87
/1M
â–¼
decreased by
2.4%
groq · together
reasoning
DeepSeek R1 70B
DeepSeek · 2 providers
$0.87
/1M
â–¼
decreased by
8.1%
groq · together
reasoning
GPT-4o
OpenAI · 1 provider
$11.00
/1M
—
unchanged at
0.0%
openai
code
Kimi K2.5
Moonshot · 1 provider
$3.30
/1M
â–¼
decreased by
11.8%
moonshot
reasoning
Claude Sonnet
Anthropic · 1 provider
$16.50
/1M
—
unchanged at
0.0%
anthropic
code
Mixtral 8x7B
Mistral · 2 providers
$0.26
/1M
â–¼
decreased by
1.2%
groq · together
chat
Claude Haiku 3.5
Anthropic · 1 provider
$4.40
/1M
—
unchanged at
0.0%
anthropic
chat
GPT-4o Mini
OpenAI · 1 provider
$0.66
/1M
â–¼
decreased by
3.5%
openai
chat
Llama 3.1 8B
Meta · 2 providers
$0.09
/1M
â–¼
decreased by
5.0%
groq · together
Ask IX
Shift + /