Skip to main content
POST
/
v1
/
audio
/
speech
curl https://gateway.bud.studio/v1/audio/speech \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "Hello, this is a test of text-to-speech.",
    "voice": "alloy"
  }' \
  --output speech.mp3
Binary audio file returned with appropriate Content-Type header (e.g., audio/mpeg for MP3)
curl https://gateway.bud.studio/v1/audio/speech \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "Hello, this is a test of text-to-speech.",
    "voice": "alloy"
  }' \
  --output speech.mp3
Binary audio file returned with appropriate Content-Type header (e.g., audio/mpeg for MP3)

Headers

ParameterTypeRequiredDescription
AuthorizationstringYesBearer authentication header

Body

ParameterTypeRequiredDescription
modelstringYesModel identifier (e.g., tts-1, tts-1-hd)
inputstringYesText to convert to speech (max 4096 characters)
voicestringYesVoice to use: alloy, echo, fable, onyx, nova, shimmer, ash, ballad, coral, sage, verse
response_formatstringNoAudio format: mp3, opus, aac, flac, wav, pcm. Default: mp3
speedfloatNoSpeed of generated audio (0.25 to 4.0). Default: 1.0

Supported Providers

OpenAI

TTS-1 and TTS-1-HD with multiple voices

Azure OpenAI

Enterprise text-to-speech with global deployment

Together AI

Cartesia Sonic with 100+ voice options