Talk to Speko's own agent. It answers anything about the platform in real time, retrieved live from our docs. No script, no decision tree. Ask something easy, then try to break it.
Try asking
Tap to talk. It answers from our docs, live
How it works
From a sentence to a shipped agent.
No provider sprawl, no glue code. Describe the agent you want, and Speko
assembles and routes the proven-best stack, then you ship.
01
Describe it
Tell Speko what your agent should do, in plain English.
02
Speko routes it
We continuously benchmark every STT, LLM and TTS provider and assemble the proven-best stack for your use case and language.
03
Test it live
Talk to it in the browser or get a real phone call. Tune voice, latency and accuracy in real time.
A voice agent has ~200 ms to think. Our voice-native retrieval engine
answers in sub-millisecond time, so memory never eats your latency budget.
95.8–100% recall vs
an exact search, at 0.76 ms across 100K documents.
Retrieval p50100K docs · lower is better
Speko0.76ms
ChromaDB352ms
Pinecone433ms
Qdrant598ms
Speko measured in-process at 100K docs. ChromaDB, Pinecone and
Qdrant are their published figures for network-hosted deployments, which include network
transport, so it isn't like-for-like.
Ship it anywhere
One API. Every surface.
However your users reach you, Speko answers with one integration.
Phone numberInbound and outbound calls over SIP. Bring a number or get one.
Web & mobile widgetDrop-in voice widget, or wire the SDK into your app.
API & webhooksFull programmatic control with @spekoai/sdk and event webhooks.
A voice-AI gateway. One API gives you speech-to-text, an LLM and text-to-speech, provider-agnostic. Speko benchmarks every provider and routes each call to the proven-best stack.
Do I need my own provider API keys?
No. One Speko key covers it. We manage provider credentials and routing, and bring-your-own keys are optional if you prefer.
How does routing choose a provider?
Continuous benchmarks on accuracy (WER), latency and cost, per language and use case — each call gets the proven-best stack for that language and use case.
What latency can I expect?
It depends on the stack you run — STT, LLM and TTS each add latency. Rather than quote one number, we publish measured per-provider latency in the open benchmark, and you can optimize any agent for latency, accuracy or cost.
Who owns the data and the agent?
You do. Your audio and transcripts aren't used to train models, and you can export your configuration at any time.
How fast can I launch?
Under five minutes from a plain-English description. Talk to your agent in the browser before you ship it to a phone number or the web.
Start now
Your first agent is waiting.
Describe it in a sentence. Hear it in a minute. Ship it today.
> One API for the whole voice stack — STT, LLM, and TTS — benchmarked in real time and routed to the best provider on every call, with automatic failover, in 10+ languages.
## Connect your agent
MCP-first. Speko runs a hosted, OAuth-protected MCP server — no local install required. Point any MCP client at it and your agent gets authenticated, operational access to the Speko voice gateway.
# Guided setup for any supported client
npx @spekoai/mcp@latest init
# Hosted endpoint
https://mcp.speko.ai/mcp
# Claude Code — add with OAuth, then run /mcp to sign in
claude mcp add --transport http speko https://mcp.speko.ai/mcp
# Or with an API key instead of OAuth
claude mcp add --transport http speko https://mcp.speko.ai/mcp \
--header "Authorization: Bearer sk_live_xxx"
// Cursor — Settings > MCP > Add new global MCP server
{
"mcpServers": {
"speko": {
"url": "https://mcp.speko.ai/mcp"
}
}
}
Authenticate with OAuth when your client supports it, or send a Speko API key as `Authorization: Bearer sk_live_xxx`. Once connected, the server exposes operational tools for: organizations (usage + credit balance), agents (create / update / deploy / roll back), sessions, calls (transcripts + recordings), phone numbers, knowledge bases, evals, deployment, a docs.search self-serve tool, and migration helpers to convert external voice-agent configs.
## SDKs
Full programmatic control of the gateway from your language of choice.
Speko is an OpenRouter-style gateway for voice AI: one API for every voice provider. It continuously benchmarks STT, LLM, and TTS providers on accuracy, latency, and cost, then routes each call to the proven-best stack for your language and target — with automatic failover across the full voice stack (STT, TTS, and speech-to-speech) in 10+ languages. No per-provider keys, no glue code. Around 340 ms median for a conversational turn. Describe an agent in plain English and talk to it in the browser before you ship it to a phone number, web widget, or API. Your keys, your data, your agent.
# Speko
> One API for the whole voice stack — STT, LLM, and TTS — benchmarked in real time and routed to the best provider on every call, with automatic failover, in 10+ languages.
## Connect your agent
MCP-first. Speko runs a hosted, OAuth-protected MCP server — no local install required. Point any MCP client at it and your agent gets authenticated, operational access to the Speko voice gateway.
- [Hosted MCP endpoint](https://mcp.speko.ai/mcp) — OAuth-protected Streamable HTTP; no local install
- [MCP quickstart](https://docs.speko.dev/quickstart/mcp) — Claude Code, Codex, OpenCode, Cursor, and other tools
- [@spekoai/mcp on npm](https://www.npmjs.com/package/@spekoai/mcp) — init wizard + local stdio bridge for the hosted MCP
```bash
# Guided setup for any supported client
npx @spekoai/mcp@latest init
# Hosted endpoint
https://mcp.speko.ai/mcp
```
```bash
# Claude Code — add with OAuth, then run /mcp to sign in
claude mcp add --transport http speko https://mcp.speko.ai/mcp
# Or with an API key instead of OAuth
claude mcp add --transport http speko https://mcp.speko.ai/mcp \
--header "Authorization: Bearer sk_live_xxx"
```
```json
// Cursor — Settings > MCP > Add new global MCP server
{
"mcpServers": {
"speko": {
"url": "https://mcp.speko.ai/mcp"
}
}
}
```
Authenticate with OAuth when your client supports it, or send a Speko API key as `Authorization: Bearer sk_live_xxx`. Once connected, the server exposes operational tools for: organizations (usage + credit balance), agents (create / update / deploy / roll back), sessions, calls (transcripts + recordings), phone numbers, knowledge bases, evals, deployment, a docs.search self-serve tool, and migration helpers to convert external voice-agent configs.
## SDKs
Full programmatic control of the gateway from your language of choice.
- [@spekoai/sdk on npm](https://www.npmjs.com/package/@spekoai/sdk) — official TypeScript SDK
- [spekoai on PyPI](https://pypi.org/project/spekoai/) — official Python SDK
- [@spekoai/adapter-livekit on npm](https://www.npmjs.com/package/@spekoai/adapter-livekit) — STT, LLM, and TTS components for LiveKit Agents
- [SpekoAI on GitHub](https://github.com/SpekoAI) — source repositories and SDKs
## Read
Machine-readable indexes first, then concepts and the live benchmarks that drive routing.
- [Documentation index for LLMs (llms.txt)](https://docs.speko.dev/llms.txt) — one-line descriptions of every doc page
- [Full documentation corpus (llms-full.txt)](https://docs.speko.dev/llms-full.txt) — all docs concatenated for LLM context
- [Site index (speko.ai/llms.txt)](https://speko.ai/llms.txt) — one-line index of every page on speko.ai
- [API reference](https://docs.speko.dev/api-reference/introduction) — public REST API for the voice gateway
- [How routing works](https://docs.speko.dev/concepts/routing) — intent + benchmark scores → ranked provider candidates
- [Failover](https://docs.speko.dev/concepts/failover) — transparent retry against runner-up providers
- [Provider benchmarks](https://benchmarks.speko.ai) — live STT, TTS, LLM, and speech-to-speech (accuracy, latency, cost)
## What it is
Speko is an OpenRouter-style gateway for voice AI: one API for every voice provider. It continuously benchmarks STT, LLM, and TTS providers on accuracy, latency, and cost, then routes each call to the proven-best stack for your language and target — with automatic failover across the full voice stack (STT, TTS, and speech-to-speech) in 10+ languages. No per-provider keys, no glue code. Around 340 ms median for a conversational turn. Describe an agent in plain English and talk to it in the browser before you ship it to a phone number, web widget, or API. Your keys, your data, your agent.
## Build
- [Build your agent](https://speko.ai/build) — describe it in a sentence, hear it in a minute
- [Platform dashboard](https://platform.speko.dev) — API keys, agents, usage
- [Get your API key](https://platform.speko.dev/api-keys)