Model comparison

Sample data: Phase-1 sample snapshot. Official crawling and weekly benchmark jobs are not connected yet. All price, latency and score values validate the product structure only and must be replaced by traceable production data before launch.

The `models=a,b,c` URL parameter already drives the comparison page; selectors and saved comparisons come next.

Model	Input	Output	TTFT	Context	Value	Updated
DSDeepSeek V3DeepSeek · closed	$0.14/1M	$0.28/1M	124ms	128K	96	2026-06-09	Compare
G4GPT-4oOpenAI · closed	$2.50/1M	$10.00/1M	89ms	128K	73	2026-06-09	Compare
C3Claude 3.5 SonnetAnthropic · closed	$3.00/1M	$15.00/1M	95ms	200K	70	2026-06-09	Compare

DeepSeek V3

DeepSeek · Strong value baseline for coding and Chinese tasks in the sample set.

Quality

Chinese

GPT-4o

OpenAI · High quality and low TTFT in this sample, with higher output token cost.

Quality

Chinese

Claude 3.5 Sonnet

Anthropic · Excellent coding and reasoning sample scores; expensive output tier.

Quality

Chinese