Current latency table

Provider	Model	Region	P50	P95	P99	TTFT	Tokens/sec	Collected
Anthropic	claude-3-haiku	Europe	536ms	1280ms	1984ms	610ms	94	May 12, 2026, 01:57 PM UTC

Provider-by-provider breakdown

Best provider for Europe by use case

Use case	Best provider/model	Reason
Real-time chat	Anthropic Claude 3 Haiku	Lowest latency among the European rows in the current sample.
Compliance-sensitive apps	Provider with explicit EU routing	Regulatory constraints can outweigh a 100 ms latency difference.
Bulk summarization	Google Gemini Flash	Use a high-throughput model when the job is not interactive.
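
The selection logic in the table above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to build the table: the `Candidate` type, the `pick_candidate` function, the use-case labels, and the choice of P95 as the "fastest" metric are all assumptions, and the sample values are placeholders rather than measurements.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One provider/model row. All field names here are hypothetical."""
    name: str
    p95_ms: float
    tokens_per_sec: float
    explicit_eu_routing: bool


def pick_candidate(candidates, use_case):
    """Pick a row for a use case, mirroring the three rows of the table."""
    if use_case == "compliance":
        # Filter on EU routing first; latency only breaks ties among compliant rows.
        pool = [c for c in candidates if c.explicit_eu_routing]
        if not pool:
            raise ValueError("no candidate offers explicit EU routing")
        return min(pool, key=lambda c: c.p95_ms)
    if use_case == "bulk":
        # Non-interactive jobs care about throughput, not tail latency.
        return max(candidates, key=lambda c: c.tokens_per_sec)
    if use_case == "realtime":
        # Assumption: "fastest" means lowest P95 latency.
        return min(candidates, key=lambda c: c.p95_ms)
    raise ValueError(f"unknown use case: {use_case}")


if __name__ == "__main__":
    # Placeholder values for illustration only, not benchmark results.
    rows = [
        Candidate("provider-a", 900.0, 90.0, False),
        Candidate("provider-b", 1000.0, 80.0, True),
        Candidate("provider-c", 1400.0, 150.0, False),
    ]
    for case in ("realtime", "compliance", "bulk"):
        print(case, "->", pick_candidate(rows, case).name)
```

With these placeholder rows, the compliance case selects the EU-routed candidate even though it is 100 ms slower at P95, which is the trade-off the table describes.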

Europe shows a larger latency spread than US regions. A provider's choice of point of presence (POP) and its data residency controls can matter as much as raw model speed.

Run a Europe-specific leaderboard instead of assuming US latency applies to London, Frankfurt, and Paris.
Label data residency mode in benchmark metadata because it changes routing and tail latency.
Measure from the same cloud region that your production API uses, not from a laptop speed test.
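
One way to apply the three recommendations above is to attach the client region and data residency mode to every benchmark run before computing percentiles. The sketch below assumes a nearest-rank percentile definition and a hypothetical `BenchmarkRun` record; the field names and the example residency labels are illustrative and do not describe how this page's numbers were collected.

```python
import math
from dataclasses import dataclass, field


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample with at least pct% of values at or below it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


@dataclass
class BenchmarkRun:
    """Latency samples plus the metadata needed to compare runs fairly (hypothetical schema)."""
    provider: str
    model: str
    client_region: str        # cloud region the requests were sent from
    data_residency_mode: str  # hypothetical label, e.g. "eu-only" or "global"
    latencies_ms: list = field(default_factory=list)

    def summary(self):
        """Return the percentiles together with the routing metadata."""
        return {
            "provider": self.provider,
            "model": self.model,
            "client_region": self.client_region,
            "data_residency_mode": self.data_residency_mode,
            "p50_ms": percentile(self.latencies_ms, 50),
            "p95_ms": percentile(self.latencies_ms, 95),
            "p99_ms": percentile(self.latencies_ms, 99),
        }
```

Keeping the residency mode and client region in the same record as the percentiles makes it harder to compare a run routed within the EU against one routed globally by accident.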

Compare this page with the global leaderboard and the benchmark method in "How to Measure LLM Latency Correctly".