AI INFERENCE | COMPETITIVE INTELLIGENCE

Market Overview

21 companies27 reports1156 sourcesFeb 22, 2026
Threat Matrix
#CompanyCategoryThreatValuationRevenueCadence
1Fireworks AIDIRECTInference Platform
CRITICAL
$4B~$280M ARRBi-weekly
2NebiusDIRECTGPU AI Cloud
CRITICAL
$9.6B (public)$117.7M (Q3 2024)Weekly
3CerebrasCustom Silicon
HIGH
$22B (pre-IPO)UndisclosedWeekly
4GroqCustom Silicon
HIGH
~$20B (Nvidia acq.)$500M target 2025Monthly
5BasetenInference Platform
HIGH
$5BUndisclosedMonthly
6CrusoeDIRECTGPU AI Cloud
HIGH
$10B+ (Oct 2025)~$1B (projected 2025)Weekly
7DeepInfraInference Platform
HIGH
~$100M (est.)~$3.8MBi-weekly
8InferactInference Platform
HIGH
$800MUndisclosedWeekly
9Cloudflare Workers AIInference Platform
HIGH
$68.8B (public)$614.5M (Q4 2025)Weekly
10TaalasCustom Silicon
HIGH
~$500M (est.)Pre-revenueBi-weekly
11CoreWeaveGPU AI Cloud
MEDIUM
$49B (public)$3.6B (9-mo 2024)Weekly
12Together AIGPU AI Cloud
MEDIUM
$3.3B~$300M ARRMonthly
13OpenRouterAggregator / Marketplace
MEDIUM
UndisclosedUndisclosedQuarterly
14ReplicateInference Platform
MEDIUM
$350M (pre-acq.)~$5.3MQuarterly
15Lepton AIInference Platform
MEDIUM
UndisclosedUndisclosedQuarterly
16ModalInference Platform
MEDIUM
$1.1B~$50M ARRMonthly
17fal.aiInference Platform
MEDIUM
$4.5B~$200M ARR (est.)Monthly
18NscaleGPU AI Cloud
MEDIUM
$2B+Pre-platform revenueMonthly
19LambdaGPU AI Cloud
LOW
$4B+$425M (2024)Quarterly
20SambaNovaCustom Silicon
LOW
$1.6B (Intel offer)UndisclosedQuarterly
21Inference.netAggregator / Marketplace
LOW
UndisclosedUndisclosedQuarterly
Pricing per 1M Tokens
ProviderModelInputOutput
CerebrasLlama 3 70B$0.60/M$0.60/M
Fireworks AILlama 3.1 8B$0.20/M$0.20/M
GroqLlama 3 70B$0.59/M$0.79/M
Together AILlama 3.1 8B$0.20/M$0.20/M
NebiusLlama 3 70B$0.13/M$0.40/M
DeepInfraLlama 3.1 8B$0.03/M$0.05/M
Cloudflare Workers AILlama 3.2 1B$0.01/M$0.01/M