Configuration¶

Server Settings¶

Configuration is set via constants in mcp_server.py and environment variables:

The port can be overridden via the VOICE_CHAT_PORT environment variable.

The server forwards audio to Whisper's OpenAI-compatible endpoint:

POST {WHISPER_URL}/v1/audio/transcriptions

Whisper accepts WebM, WAV, MP3, and other common audio formats. The browser records in WebM/Opus by default.

The server requests speech from Kokoro's OpenAI-compatible endpoint:

POST {KOKORO_URL}/v1/audio/speech

Change the voice by passing the voice parameter to converse().

sudo tailscale serve --bg --https=3456 http://127.0.0.1:3456

This creates an HTTPS endpoint with automatic TLS certificates. Tailscale also upgrades WebSocket connections to WSS automatically.

tailscale serve status

sudo tailscale serve --https=3456 off

Server logs are written to /tmp/voice-hub-mcp.log and stderr. Watch in real time:

tail -f /tmp/voice-hub-mcp.log

Port	Service	Protocol
3456	ClawMux (MCP server)	HTTP + WebSocket (localhost) / HTTPS + WSS (Tailscale)
2022	Whisper STT	HTTP (localhost only)
8880	Kokoro TTS	HTTP (localhost only)