Speech-to-Speech Benchmark

3 models · Speko-measured · coffee-order-happy scenario · N=20 per region · April 2026

| Provider | Architecture | P50 latency (tool-call turn) | Task Success | N | Source |
|---|---|---|---|---|---|
| OpenAI gpt-realtime | Native S2S | 3485 ms | 85% | 20 | OpenAI gpt-realtime |
| xAI grok-voice-think-fast-1.0 | Native S2S | 1319 ms | 95% | 20 | xAI Grok Voice Agent |
| Google gemini-live-2.5-flash-native-audio | Native S2S | 2655 ms | 80% | 17 | Vertex AI Gemini Live |
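
For readers who want to reproduce this kind of summary from their own runs, here is a minimal sketch of how a P50 latency and a task success rate could be derived from per-run results. It is not Speko's measurement code. The `summarize` function, the `(latency_ms, succeeded)` tuple layout, and the sample values are all hypothetical, and the sketch assumes the median is taken over every run, successful or not.

```python
from statistics import median


def summarize(runs):
    """Summarize benchmark runs given as (latency_ms, succeeded) tuples.

    Assumption: P50 is the median over all runs, including failed ones.
    """
    if not runs:
        raise ValueError("no runs to summarize")
    latencies = [latency for latency, _ in runs]
    successes = sum(1 for _, ok in runs if ok)
    return {
        "n": len(runs),
        "p50_latency_ms": median(latencies),
        "task_success_pct": 100.0 * successes / len(runs),
    }


# Made-up illustrative runs, not data from the benchmark above.
example_runs = [(1200, True), (1350, True), (1500, False), (1280, True)]
summary = summarize(example_runs)
print(summary)
```

With a small N such as 20, a single failed run moves the success rate by 5 percentage points, so differences of that size between providers may not be meaningful.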

Stop guessing. Start benchmarking.

Independent, data-driven comparisons to help you pick the right voice AI stack.

Get Started