AI INFERENCE | COMPETITIVE INTELLIGENCE
Market Overview
21 companies27 reports1156 sourcesFeb 22, 2026
Threat Matrix
| # | Company | Category | Threat | Valuation | Revenue | Cadence |
|---|---|---|---|---|---|---|
| 1 | Fireworks AIDIRECT | Inference Platform | CRITICAL | $4B | ~$280M ARR | Bi-weekly |
| 2 | NebiusDIRECT | GPU AI Cloud | CRITICAL | $9.6B (public) | $117.7M (Q3 2024) | Weekly |
| 3 | Cerebras | Custom Silicon | HIGH | $22B (pre-IPO) | Undisclosed | Weekly |
| 4 | Groq | Custom Silicon | HIGH | ~$20B (Nvidia acq.) | $500M target 2025 | Monthly |
| 5 | Baseten | Inference Platform | HIGH | $5B | Undisclosed | Monthly |
| 6 | CrusoeDIRECT | GPU AI Cloud | HIGH | $10B+ (Oct 2025) | ~$1B (projected 2025) | Weekly |
| 7 | DeepInfra | Inference Platform | HIGH | ~$100M (est.) | ~$3.8M | Bi-weekly |
| 8 | Inferact | Inference Platform | HIGH | $800M | Undisclosed | Weekly |
| 9 | Cloudflare Workers AI | Inference Platform | HIGH | $68.8B (public) | $614.5M (Q4 2025) | Weekly |
| 10 | Taalas | Custom Silicon | HIGH | ~$500M (est.) | Pre-revenue | Bi-weekly |
| 11 | CoreWeave | GPU AI Cloud | MEDIUM | $49B (public) | $3.6B (9-mo 2024) | Weekly |
| 12 | Together AI | GPU AI Cloud | MEDIUM | $3.3B | ~$300M ARR | Monthly |
| 13 | OpenRouter | Aggregator / Marketplace | MEDIUM | Undisclosed | Undisclosed | Quarterly |
| 14 | Replicate | Inference Platform | MEDIUM | $350M (pre-acq.) | ~$5.3M | Quarterly |
| 15 | Lepton AI | Inference Platform | MEDIUM | Undisclosed | Undisclosed | Quarterly |
| 16 | Modal | Inference Platform | MEDIUM | $1.1B | ~$50M ARR | Monthly |
| 17 | fal.ai | Inference Platform | MEDIUM | $4.5B | ~$200M ARR (est.) | Monthly |
| 18 | Nscale | GPU AI Cloud | MEDIUM | $2B+ | Pre-platform revenue | Monthly |
| 19 | Lambda | GPU AI Cloud | LOW | $4B+ | $425M (2024) | Quarterly |
| 20 | SambaNova | Custom Silicon | LOW | $1.6B (Intel offer) | Undisclosed | Quarterly |
| 21 | Inference.net | Aggregator / Marketplace | LOW | Undisclosed | Undisclosed | Quarterly |
Pricing per 1M Tokens
ProviderModelInputOutput
CerebrasLlama 3 70B$0.60/M$0.60/M
Fireworks AILlama 3.1 8B$0.20/M$0.20/M
GroqLlama 3 70B$0.59/M$0.79/M
Together AILlama 3.1 8B$0.20/M$0.20/M
NebiusLlama 3 70B$0.13/M$0.40/M
DeepInfraLlama 3.1 8B$0.03/M$0.05/M
Cloudflare Workers AILlama 3.2 1B$0.01/M$0.01/M
RECENT REPORTS
View allGPU & AI Accelerator Roadmap 2026-2028
Feb 22, 2026 | 97 sources
AI Inference Economics: The Race to Zero and Where Margin Survives
Feb 22, 2026 | 84 sources
Enterprise AI Inference Buyers: The $37B Demand-Side Landscape
Feb 21, 2026 | 85 sources
Bitcoin Miners' HPC/AI Transition: The $65B Infrastructure Pivot
Feb 21, 2026 | 87 sources
AI Inference Engines & Frameworks: The Technology Layer Powering the $126B Market
Feb 20, 2026 | 81 sources
Threat Distribution
CRITICAL (2)
HIGH (8)
MEDIUM (8)
LOW (3)
Category Breakdown