API Playground

Test the VoiceMode API directly from your browser.

API Key

POST /api/voices/clone

Upload an audio sample to create a cloned voice profile.

WAV, MP3, M4A, OGG, FLAC, WebM. Max 25MB. Best results with 15+ seconds of clear speech.

The underlying OpenAI voice profile. Auto-detect assigns one based on your audio.

POST /api/voices/:id/speak

Generate speech from text using a cloned voice. Returns audio stream.

From /api/voices/clone response. Or select from your voices.

Max 4096 characters.

WS /ws/conversation/:session_id

Real-time voice conversation. Speak into your microphone and hear the AI respond with your cloned voice.

Select from your voices or clone a new one.

Optional. Customize the AI's personality and behavior.

Click the microphone to start a conversation
GET /api/voices

List all your cloned voices.