Sample data: Phase-1 sample snapshot. Official crawling and weekly benchmark jobs are not connected yet. All price, latency and score values validate the product structure only and must be replaced by traceable production data before launch.
Best LLMs for Low Latency
For interactive, user-facing flows where time to first token (TTFT) matters most.
| Model (provider · weights) | Input price | Output price | TTFT | Context | Score | Updated |
|---|---|---|---|---|---|---|
| GPT-4.1 Nano (OpenAI · closed) | $0.10/1M | $0.40/1M | 68 ms | 1M | 82 | 2026-06-09 |
| GPT-4.1 Mini (OpenAI · closed) | $0.40/1M | $1.60/1M | 92 ms | 1M | 79 | 2026-06-09 |
| Claude 3 Haiku (Anthropic · closed) | $0.25/1M | $1.25/1M | 86 ms | 200K | 79 | 2026-06-09 |
| Gemini 2.0 Flash (Google · closed) | $0.10/1M | $0.40/1M | 112 ms | 1M | 78 | 2026-06-09 |
| Llama 3.1 8B (Meta · open) | $0.05/1M | $0.05/1M | 74 ms | 128K | 78 | 2026-06-09 |
| DeepSeek V3 (DeepSeek · closed) | $0.14/1M | $0.28/1M | 124 ms | 128K | 77 | 2026-06-09 |
| Doubao Lite (Volcano Engine · closed) | $0.03/1M | $0.06/1M | 109 ms | 32K | 77 | 2026-06-09 |
| Qwen Turbo (Alibaba Cloud · closed) | $0.05/1M | $0.20/1M | 118 ms | 1M | 76 | 2026-06-09 |
| Mistral Small (Mistral AI · closed) | $0.20/1M | $0.60/1M | 104 ms | 32K | 76 | 2026-06-09 |
| Gemma 2 27B (Google · open) | $0.20/1M | $0.20/1M | 98 ms | 8K | 75 | 2026-06-09 |
| Doubao Pro (Volcano Engine · closed) | $0.11/1M | $0.22/1M | 141 ms | 128K | 74 | 2026-06-09 |
| Qwen Plus (Alibaba Cloud · closed) | $0.20/1M | $0.60/1M | 136 ms | 128K | 74 | 2026-06-09 |
| GLM-4 Air (Zhipu AI · closed) | $0.10/1M | $0.10/1M | 132 ms | 128K | 74 | 2026-06-09 |
| Qwen 2.5 72B (Alibaba Cloud · open) | $0.35/1M | $0.70/1M | 156 ms | 128K | 72 | 2026-06-09 |
| Moonshot v1 32K (Moonshot AI · closed) | $0.18/1M | $0.18/1M | 148 ms | 32K | 72 | 2026-06-09 |
| Kimi K2 (Moonshot AI · closed) | $0.18/1M | $0.72/1M | 168 ms | 200K | 70 | 2026-06-09 |
| MiniMax abab 6.5 (Volcano Engine · closed) | $0.30/1M | $0.30/1M | 158 ms | 245K | 70 | 2026-06-09 |
| Command R (Cohere · closed) | $0.50/1M | $1.50/1M | 134 ms | 128K | 69 | 2026-06-09 |
| GPT-4o (OpenAI · closed) | $2.50/1M | $10.00/1M | 89 ms | 128K | 68 | 2026-06-09 |
| Yi Large (Zhipu AI · closed) | $0.50/1M | $0.50/1M | 162 ms | 32K | 68 | 2026-06-09 |
| Mixtral 8x22B (Mistral AI · open) | $0.90/1M | $0.90/1M | 142 ms | 64K | 68 | 2026-06-09 |
| GLM-4 Plus (Zhipu AI · closed) | $0.80/1M | $0.80/1M | 184 ms | 128K | 66 | 2026-06-09 |
| Mistral Large (Mistral AI · closed) | $2.00/1M | $6.00/1M | 132 ms | 128K | 65 | 2026-06-09 |
| Llama 3.1 70B (Meta · open) | $0.88/1M | $0.88/1M | 172 ms | 128K | 65 | 2026-06-09 |
| Claude 3.5 Sonnet (Anthropic · closed) | $3.00/1M | $15.00/1M | 95 ms | 200K | 64 | 2026-06-09 |
| DeepSeek R1 (DeepSeek · closed) | $0.55/1M | $2.19/1M | 224 ms | 128K | 62 | 2026-06-09 |
| Grok 3 (xAI · closed) | $3.00/1M | $15.00/1M | 102 ms | 128K | 61 | 2026-06-09 |
| Gemini 1.5 Pro (Google · closed) | $1.25/1M | $5.00/1M | 178 ms | 2M | 61 | 2026-06-09 |
| Llama 3.1 405B (Meta · open) | $1.79/1M | $1.79/1M | 210 ms | 128K | 59 | 2026-06-09 |
| Command R+ (Cohere · closed) | $2.50/1M | $10.00/1M | 146 ms | 128K | 56 | 2026-06-09 |
| Claude 3 Opus (Anthropic · closed) | $15.00/1M | $75.00/1M | 156 ms | 200K | 54 | 2026-06-09 |
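
The table is ordered by score, so a reader with a fixed latency budget may want to re-sort it by TTFT. Below is a minimal sketch of that filter in Python. The `ModelRow` class, the `within_ttft_budget` helper, and the 90 ms budget are hypothetical names and values chosen for illustration; the five rows are copied from the sample table and carry the same placeholder status as the rest of the snapshot.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelRow:
    # Hypothetical record type; field names are illustrative only.
    name: str
    input_usd_per_1m: float
    output_usd_per_1m: float
    ttft_ms: int
    score: int


# A handful of rows copied from the sample table (placeholder values,
# not production measurements).
ROWS = [
    ModelRow("GPT-4.1 Nano", 0.10, 0.40, 68, 82),
    ModelRow("Llama 3.1 8B", 0.05, 0.05, 74, 78),
    ModelRow("Claude 3 Haiku", 0.25, 1.25, 86, 79),
    ModelRow("GPT-4o", 2.50, 10.00, 89, 68),
    ModelRow("DeepSeek R1", 0.55, 2.19, 224, 62),
]


def within_ttft_budget(rows, max_ttft_ms):
    """Return rows with TTFT at or under the budget, fastest first.

    Ties on TTFT are broken by higher score.
    """
    eligible = [r for r in rows if r.ttft_ms <= max_ttft_ms]
    return sorted(eligible, key=lambda r: (r.ttft_ms, -r.score))


if __name__ == "__main__":
    for row in within_ttft_budget(ROWS, max_ttft_ms=90):
        print(f"{row.name}: {row.ttft_ms} ms, score {row.score}")
```

Sorting on the tuple `(ttft_ms, -score)` keeps latency as the primary key while still preferring the higher-scoring model when two rows report the same TTFT.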