unpod logo
PlaygroundAboutBlogDocs
Github
The stack · one platform, three layers
Speech Infra
Developer Platform
Telephony + Speech

The metered speech & telephony your agents run on. STT, TTS, numbers and routing through one webhook.

unpod.dev ↗
Open source
Unpod
Open-source CPaaS

Self-host the full agents & contact-center platform. Embeds SuperDialog, bills against Unpod infra.

unpod.ai/open-source-cpaas ↗
Open source
SuperDialog
Conversation framework

Write realtime conversation logic. Standalone, lower-level, free - adopt it before you ever need the cloud.

superdialog.unpod.ai ↗
→ How it works→ Compare→ Quickstart
Open source · self-host · or run on Unpod
unpod logo

Realtime voice + messaging infrastructure. An open-source conversation framework and CPaaS, on metered telephony + speech - so any agent can call and text the real world.

GitHubDocumentation

Products

  • Developer Platform
  • Unpod (OSS CPaaS)
  • SuperDialog

Developers

  • Documentation
  • GitHub
  • API reference

Company

  • About
  • Privacy
  • Terms
© 2026 Unpod.ai · All rights reservedMade in India · Built for the world

Telephony & Speech Infra for AI Agents.

Integrate voice agents into complex workflows and legacy systems with speech infrastructure and an IDE that give developer-grade control over building rich conversational flows.

Try the Playground →Read the docs
Developer PlatformTelephony + Speech→UnpodOpen-source CPaaS→SuperDialogConversation framework→
What teams ship on Unpod
AI ReceptionistCoding-agent phoneOn-call pager2FA inboxOutbound follow-upAutonomous workerClaude CodeCursorSupport lineAppointment bookingAI ReceptionistCoding-agent phoneOn-call pager2FA inboxOutbound follow-upAutonomous workerClaude CodeCursorSupport lineAppointment bookingAI ReceptionistCoding-agent phoneOn-call pager2FA inboxOutbound follow-upAutonomous workerClaude CodeCursorSupport lineAppointment booking
The flow · end-to-end

Phone → Speech → your endpoint.

Unpod handles the phone and the speech. You receive text at your endpoint and reply with text - the prompt, tools and customer data stay on your side.

inbound audiostreamed textoutbound audioreply textCaller+91 9XXXX XXXXXVoiceWhatsAppSMSOne number · inbound & outboundUnpodTelecom + Speech+91 80 4555 0142STTstreaming · voice → text~120msRouteunified webhook · threads~30msTTSneural · text → voice~180msAll-in60% less cost vs legacy stacks{ }Your agentPOST /unpod✓Your prompt + system✓Your LLM + tools✓Your customer data
Caller
+91 9XXXX XXXXX
VoiceWhatsAppSMS
↓inbound audio↑outbound audio
Unpod
Telecom + Speech
+91 80 4555 0142
STTstreaming · voice → text~120ms
Routeunified webhook · threads~30ms
TTSneural · text → voice~180ms
All-in60% less cost vs legacy stacks
↓streamed text↑reply text
{ }
Your agent
POST /unpod
  • ✓Your prompt + system
  • ✓Your LLM + tools
  • ✓Your customer data
Total end-to-end < 330ms·STT 120 + Route 30 + TTS 180
How it works

Build the flow, then plug in speech.

Author the conversation in the IDE with developer-grade control, attach speech and a real number, wire it into your workflows and legacy systems, then ship.

01
Build

Author the flow

SuperDialog IDE
02
Speech infra

Attach speech

Voice · SMS · WhatsApp
03
Integrate

Wire your stack

Workflows & legacy
04
Ship

Ship it

Carrier-grade · metered
SDKs · open source

Drop it into any backend.

Python, Node, REST or MCP - all open source. Receive text, reply with text. Wire it to any LLM, framework or your own loop.

pip install unpod
Python
Node.js
REST
MCP
from unpod import Unpod

client = Unpod(api_key="up_...")

# Provision a number for your agent
number = client.numbers.buy(country="IN", agent_id="agt_123")

# Place an outbound call - your webhook gets every turn
call = client.calls.create(
    agent_id="agt_123",
    to_number="+9197XXXXXXXX",
    webhook="https://api.yourapp.com/unpod",
)
Read the docs →View on GitHub →
Ready-to-deploy playbooks

What you can build with it

Real things you ship with Unpod - from coding agents that take calls to on-call pagers and 2FA inboxes.

Phone-call your codebase

Call a number, speak a command, and your coding agent runs it over MCP.

Native MCP

Autonomous AI workers

Give a long-running agent a real number to dial vendors and follow up, hands-free.

< 10 lines

AI receptionist

Answer inbound 24/7, qualify intent, book meetings, and warm-transfer with context.

1 webhook

On-call & incidents

Pager-style escalation — dial the on-call engineer, read the incident, escalate on no answer.

WSS streamed

Outbound follow-up

Trigger an outbound dial with one API call; the transcript streams back in real time.

1 API call

2FA & verification

A dedicated number for codes from Stripe, GitHub, or banks — SMS lands in the same webhook.

Same schema

Customer support

Handle inbound voice and SMS in one handler; resolve routine queries, escalate with context.

Voice + SMS

Appointment booking

Outbound dials confirm bookings; reschedule links go over SMS through the same SDK.

No lock-in
Where we draw the line

We handle voice. You keep your agent.

We handle the phone calls and the speech. You keep your prompt, your model, and your customer data.

CompareTwilioLiveKit / PipeCatUnpodbest
You send / receive⊘TwiML + audio⊘Audio I/O✓Text in, text out
STT + TTS–DIY–DIY✓Included
Speech quality + voices–DIY–DIY✓Curated profiles
Voice + SMS + WhatsApp⊘Separate APIs–Voice only✓One endpoint
Self-host the platform–No⊘Partial✓Yes · Apache 2.0
By the numbers

Proven in production.

0M+
calls handled
across the network
+0
numbers, instant
all major circles + WhatsApp
< 0ms
first-token latency
streaming STT + TTS
0.00%
uptime, regulated-grade
carrier failover built-in
Security & compliance

Enterprise-grade by default.

Audio is streamed, not stored. Data stays in your region, encrypted with your keys - and the open-source layers let you self-host the whole thing.

Data residency

Pin storage to US, EU or India regions - your choice, per project.

Encrypted end-to-end

TLS in transit, AES-256 at rest. Bring your own KMS keys.

Zero-retention audio

Streams aren’t stored by default. Opt in only if you need recordings.

Never trained on

Transcripts and recordings are never used to train any model.

ISO
27001
ISO 27001
Information security
ISO
42001
ISO 42001
AI management
DPDP
India
DPDP India
Data protection
SOC 2
Type II
SOC 2 Type II
In progress
FAQ

Questions before you start.

The products, the model, integration and security - the things developers ask before shipping.

SuperDialog is an open-source framework for writing realtime conversation logic - standalone and free. Unpod is the open-source CPaaS that embeds SuperDialog and adds the contact-center layer; you self-host it. The Developer Platform (unpod.dev) is the metered telephony + speech infrastructure those layers bill against - the only paid surface.
Twilio gives you telecom plumbing - raw audio, TwiML, separate APIs for voice and messaging. You wire STT and TTS yourself. Unpod gives you text: one unified event for voice, WhatsApp and SMS. We own the speech layer; you stay inside your agent stack.
LiveKit and PipeCat hand you audio in, audio out - speech quality and provider choice live on your side. Unpod inverts that: text in, text out, with the whole speech layer ours. The open-source layers are LiveKit-shaped; what unpod.dev sells underneath is the metered infra.
No. SuperDialog and the Unpod CPaaS are open source and self-hostable. The Developer Platform is the metered telephony + speech the self-hosted stack calls - that is what you pay for, by the minute.
Unpod runs as a Model Context Protocol server, so Claude Code, Cursor, Windsurf, Zed and any MCP client can provision numbers, send messages and handle calls through native tool use. One config block is all it takes.
Python SDK, Node.js SDK, REST API, or MCP server - all open source. Works in any MCP client and inside any backend that can reach an HTTPS endpoint.
Data residency in US, EU and India. ISO 27001 and ISO 42001 certified, DPDP-compliant for India, SOC 2 Type II in progress. Audio is streamed, not stored by default; transcripts are encrypted at rest and never used for training.

Build the agent. Ship the voice. Own the stack.

Start free with the open-source framework and CPaaS. Meter against Unpod infra only when you go live. No credit card to start.

Start building →View on GitHub ↗
✓ Free OSS framework✓ Self-host the CPaaS✓ Metered infra only✓ Native MCP
Telephony & Speech Infra for AI Agents - Unpod AI