AI INFERENCE | COMPETITIVE INTELLIGENCE
Market Overview
16 companies15 reports393 sourcesFeb 16, 2026
Threat Matrix
| # | Company | Category | Threat | Valuation | Revenue | Cadence |
|---|---|---|---|---|---|---|
| 1 | Fireworks AIDIRECT | Inference Platform | CRITICAL | $4B | ~$280M ARR | Bi-weekly |
| 2 | NebiusDIRECT | GPU AI Cloud | CRITICAL | $9.6B (public) | $117.7M (Q3 2024) | Weekly |
| 3 | Cerebras | Custom Silicon | HIGH | $22B (pre-IPO) | Undisclosed | Weekly |
| 4 | Groq | Custom Silicon | HIGH | ~$20B (Nvidia acq.) | $500M target 2025 | Monthly |
| 5 | Baseten | Inference Platform | HIGH | $5B | Undisclosed | Monthly |
| 6 | CrusoeDIRECT | GPU AI Cloud | HIGH | $10B+ (Oct 2025) | ~$1B (projected 2025) | Weekly |
| 7 | DeepInfra | Inference Platform | HIGH | ~$100M (est.) | ~$3.8M | Bi-weekly |
| 8 | CoreWeave | GPU AI Cloud | MEDIUM | $49B (public) | $3.6B (9-mo 2024) | Weekly |
| 9 | Together AI | GPU AI Cloud | MEDIUM | $3.3B | ~$300M ARR | Monthly |
| 10 | OpenRouter | Aggregator / Marketplace | MEDIUM | Undisclosed | Undisclosed | Quarterly |
| 11 | Replicate | Inference Platform | MEDIUM | $350M (pre-acq.) | ~$5.3M | Quarterly |
| 12 | Lepton AI | Inference Platform | MEDIUM | Undisclosed | Undisclosed | Quarterly |
| 13 | Modal | Inference Platform | MEDIUM | $1.1B | ~$50M ARR | Monthly |
| 14 | Lambda | GPU AI Cloud | LOW | $4B+ | $425M (2024) | Quarterly |
| 15 | SambaNova | Custom Silicon | LOW | $1.6B (Intel offer) | Undisclosed | Quarterly |
| 16 | Inference.net | Aggregator / Marketplace | LOW | Undisclosed | Undisclosed | Quarterly |
Pricing per 1M Tokens
ProviderModelInputOutput
CerebrasLlama 3 70B$0.60/M$0.60/M
Fireworks AILlama 3.1 8B$0.20/M$0.20/M
GroqLlama 3 70B$0.59/M$0.79/M
Together AILlama 3.1 8B$0.20/M$0.20/M
NebiusLlama 3 70B$0.13/M$0.40/M
DeepInfraLlama 3.1 8B$0.03/M$0.05/M
RECENT REPORTS
View allBy Category
Custom Silicon3
GPU AI Cloud5
Inference Platform6
Aggregator / Marketplace2
By Threat
CRITICAL
2HIGH
5MEDIUM
6LOW
3