docs(s2s): Speech-to-Speech API reference page by dan-ince-aai · Pull Request #760 · AssemblyAI/assemblyai-api-spec

dan-ince-aai · 2026-03-13T12:37:41Z

Summary

Rewrites speechtospeech.mdx (previously a placeholder for a different product) with full documentation for the native S2S WebSocket API
The page is marked hidden: true, so it stays out of the nav until it is ready to publish

What's covered

Quickstart (Python + JavaScript) — connect, configure, stream audio, receive audio
Audio format spec (PCM16, 24kHz, mono)
Full client→server event reference: audio.append, session.configure, response.create, response.cancel, function.result
Full server→client event reference: session.ready, speech.started/stopped, transcript.user.*, response.started/audio/transcript/done/interrupted, function.call, error
Session configuration and system prompt guidance
Function calling: tool schema, handle function.call, send function.result
Interruption / barge-in handling
Browser integration with server-side proxy pattern
Framework integrations (Pipecat + LiveKit) with code examples
Complete end-to-end customer support agent example (Python + JS)
Event flow diagram
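
The audio format listed above (PCM16, 24kHz, mono) fixes the byte rate of the stream, which is useful when sizing audio chunks. The arithmetic can be sketched as below. The 20 ms default chunk duration and the helper names `chunk_size_bytes` and `split_into_chunks` are illustrative assumptions, not values or functions taken from the documented API.

```python
SAMPLE_RATE_HZ = 24_000   # 24 kHz, from the audio format spec
BYTES_PER_SAMPLE = 2      # PCM16 means 16-bit (2-byte) samples
CHANNELS = 1              # mono


def chunk_size_bytes(duration_ms: int) -> int:
    """Bytes of PCM16 / 24 kHz / mono audio contained in duration_ms milliseconds."""
    samples = SAMPLE_RATE_HZ * duration_ms // 1000
    return samples * BYTES_PER_SAMPLE * CHANNELS


def split_into_chunks(pcm: bytes, duration_ms: int = 20) -> list[bytes]:
    """Split raw PCM bytes into fixed-duration chunks (the last one may be shorter).

    The 20 ms default is an assumption chosen for illustration only.
    """
    size = chunk_size_bytes(duration_ms)
    return [pcm[i:i + size] for i in range(0, len(pcm), size)]
```

One second of audio in this format is 24,000 samples × 2 bytes × 1 channel = 48,000 bytes, so a 20 ms chunk is 960 bytes.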

Notes

session.configure is sent immediately on WebSocket connect (not on session.ready) — corrected throughout all examples and the event flow diagram
Uses current production event names

🤖 Generated with Claude Code

Complete rewrite of the Speech-to-Speech docs page (hidden):
- Quickstart with Python + JavaScript WebSocket examples
- Full client→server and server→client event reference with schemas
- Audio format spec (PCM16, 24kHz, mono)
- Session configuration and system prompt guidance
- Function calling: tool schema, handle function.call, send function.result
- Interruption / barge-in handling
- Browser integration with proxy pattern
- Framework integrations (Pipecat + LiveKit) with code examples
- Complete end-to-end customer support agent example
- Event flow diagram

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

….ready Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-03-13T12:38:46Z

🌿 Preview your docs: https://assemblyai-preview-a8725a4d-c6ec-4034-9d66-26ab867a220d.docs.buildwithfern.com/docs

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-03-13T13:05:27Z

🌿 Preview your docs: https://assemblyai-preview-7c088642-0b01-48b1-9064-27768405af17.docs.buildwithfern.com/docs

… error codes

New from realtime_model.py review:
- greeting field in session.configure
- session.resume client event (reconnection with session_id)
- session_id field in session.ready response
- session.updated server event
- error codes: session_not_found, session_forbidden

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-03-13T13:13:05Z

🌿 Preview your docs: https://assemblyai-preview-e867cffd-2ee5-41f4-9b5b-2a6f776c2bc8.docs.buildwithfern.com/docs

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-03-13T13:19:47Z

🌿 Preview your docs: https://assemblyai-preview-ab4bf459-6fd0-43c8-ac86-41b11da9abd9.docs.buildwithfern.com/docs

…ction

The mabudu/renamed-events changes are deployed. Updated all event names:
- audio.append → input.audio
- session.configure → session.update
- speech.started → input.speech.started
- speech.stopped → input.speech.stopped
- response.audio → reply.audio
- response.transcript → transcript.agent
- response.done → reply.done
- function.call → tool.call (arguments is JSON string, not args dict)
- function.result → tool.result

Also:
- Tool schema uses nested {"type":"function","function":{...}} format
- Quickstart replaced with confirmed-working script
- Added warning: audio must be gated behind session_ready
- reply.done has optional status:"interrupted" field
- invalid_format error code added

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
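
The event renames in this commit can be captured as a lookup table, which may help when migrating client code written against the earlier names. This is a minimal sketch: the old and new names are copied from the commit message, while the assumption that each event is a JSON object carrying its name under a `"type"` key, and the helper name `migrate_event`, are illustrative and not confirmed by this PR.

```python
# Old -> new event names, as listed in the commit message above.
EVENT_RENAMES = {
    "audio.append": "input.audio",
    "session.configure": "session.update",
    "speech.started": "input.speech.started",
    "speech.stopped": "input.speech.stopped",
    "response.audio": "reply.audio",
    "response.transcript": "transcript.agent",
    "response.done": "reply.done",
    "function.call": "tool.call",
    "function.result": "tool.result",
}


def migrate_event(event: dict) -> dict:
    """Return a copy of `event` with an old-style name replaced by the new one.

    Assumes (hypothetically) that the event name lives under the "type" key.
    Events that already use a new name, or have no "type", are copied unchanged.
    """
    old = event.get("type")
    if old not in EVENT_RENAMES:
        return dict(event)
    return {**event, "type": EVENT_RENAMES[old]}
```

The table only renames events; it does not convert payload shapes such as the tool call arguments format.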

github-actions · 2026-03-13T16:05:00Z

🌿 Preview your docs: https://assemblyai-preview-3100f2b7-b6f3-4f1a-9c4d-5b44829f30ad.docs.buildwithfern.com/docs

Key corrections:
- Tool schema is flat (no nested "function" key)
- tool.call args field is a dict, not a JSON string
- tool.result must be sent in reply.done handler, NOT in tool.call
- Accumulate pending_tools, send all on reply.done
- On interrupted reply, discard pending_tools
- Handle both "error" and "session.error" event types
- Replaced quickstart with complete verified-working example

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
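
The tool handling described in these corrections (accumulate calls, answer them only on reply.done, discard them if the reply was interrupted) can be sketched as a small buffer. This is an illustration under stated assumptions, not the code from the docs page: the event names, the `args` dict, and the `status: "interrupted"` value come from the commit messages in this PR, while the `"type"`, `"call_id"`, `"name"`, and `"result"` keys and the `ToolCallBuffer` class are hypothetical.

```python
from typing import Any, Callable


class ToolCallBuffer:
    """Collects tool.call events and answers them when reply.done arrives."""

    def __init__(self, tools: dict[str, Callable[..., Any]]):
        self.tools = tools                    # tool name -> local Python function
        self.pending_tools: list[dict] = []   # tool.call events awaiting reply.done

    def handle(self, event: dict) -> list[dict]:
        """Process one server event and return the client messages to send back."""
        kind = event.get("type")              # assumed key for the event name

        if kind == "tool.call":
            # Do not answer yet: results are sent from the reply.done handler.
            self.pending_tools.append(event)
            return []

        if kind == "reply.done":
            calls, self.pending_tools = self.pending_tools, []
            if event.get("status") == "interrupted":
                return []                     # interrupted reply: discard pending calls
            return [
                {
                    "type": "tool.result",
                    "call_id": call.get("call_id"),   # hypothetical field name
                    # args is a dict, so it can be unpacked as keyword arguments
                    "result": self.tools[call["name"]](**call["args"]),
                }
                for call in calls
            ]

        return []                             # other events need no tool handling
```

Returning the outgoing messages rather than sending them keeps the sketch independent of any particular WebSocket library; a caller would serialize each returned dict and send it over its own connection.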

github-actions · 2026-03-13T16:12:00Z

🌿 Preview your docs: https://assemblyai-preview-224cc722-256c-4341-9128-cb80ec4d99ab.docs.buildwithfern.com/docs

- Remove Warning about audio before session.ready
- Remove reply.create (not commonly needed)
- Remove reply.cancel (not commonly needed)
- Remove reply.interrupted; interruptions come via reply.done status:interrupted
- Add status field table to reply.done

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-03-13T16:22:23Z

🌿 Preview your docs: https://assemblyai-preview-8a5e80f3-ea6e-400d-9176-18847b431bd2.docs.buildwithfern.com/docs

…l plugin usage

Removed cards linking to U3 Pro docs (wrong product). Added tabbed code blocks with real Pipecat and LiveKit S2S integration examples.

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-03-13T22:01:54Z

🌿 Preview your docs: https://assemblyai-preview-bb1567b2-2188-4e7b-84e8-423c7972f287.docs.buildwithfern.com/docs

… and LiveKit Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

github-actions · 2026-03-13T22:05:08Z

🌿 Preview your docs: https://assemblyai-preview-13107787-ab1b-47a9-9e23-3a77aa9a8259.docs.buildwithfern.com/docs

github-actions · 2026-03-13T22:08:14Z

🌿 Preview your docs: https://assemblyai-preview-c49c77fd-c190-44d5-a832-c61b240b3066.docs.buildwithfern.com/docs

dan-ince-aai and others added 2 commits March 13, 2026 12:30

docs(s2s): fix session.configure timing — send on connect not session…

acbb87b

….ready Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

docs(s2s): fix slug to /voice-agents/speech-to-speech

6913ce7

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

docs(s2s): add greeting to quickstart example

c89e357

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

dan-ince-aai assigned dan-ince-aai and unassigned dan-ince-aai Mar 13, 2026

docs(s2s): add full plugin source code in CodeBlocks tabs for Pipecat…

d0499da

… and LiveKit Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

Merge branch 'main' into dan/s2s-docs

cc77949

dan-ince-aai marked this pull request as ready for review March 13, 2026 22:07

LeeVaughn approved these changes Mar 13, 2026

LeeVaughn merged commit 0e95df7 into main Mar 13, 2026
4 checks passed

LeeVaughn deleted the dan/s2s-docs branch March 13, 2026 22:18

LeeVaughn merged 11 commits into main from dan/s2s-docs

2 participants
