POST /v1/audio/transcriptions
curl https://gateway.bud.studio/v1/audio/transcriptions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "file=@audio.mp3" \
  -F "model=whisper-1" \
  -F "language=en" \
  -F "response_format=json"
{
  "text": "Hello, this is a transcribed audio message."
}
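
As a rough sketch of how a client might read the response above, the Python helper below pulls the transcript out of the body. The function name `extract_transcript` is hypothetical. It assumes that `verbose_json` also carries a top-level `text` field and that the `text`, `srt`, and `vtt` formats come back as plain text; neither assumption is confirmed by this page.

```python
import json


def extract_transcript(body, response_format="json"):
    """Return the transcript from a response body (hypothetical helper)."""
    # Assumption: both JSON formats expose the transcript under "text".
    if response_format in ("json", "verbose_json"):
        return json.loads(body)["text"]
    # Assumption: text, srt, and vtt bodies are already plain text.
    return body


sample = '{"text": "Hello, this is a transcribed audio message."}'
print(extract_transcript(sample))
```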

Headers

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| Authorization | string | Yes | Bearer authentication header |
| Content-Type | string | Yes | Must be multipart/form-data |
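
For clients that do not use curl, the two headers and the multipart body can be assembled by hand. The following is a minimal sketch using only the Python standard library. The helper name `build_transcription_request` and the `application/octet-stream` part type are illustrative assumptions, not something this page prescribes. The function builds the request but does not send it.

```python
import uuid
import urllib.request

API_URL = "https://gateway.bud.studio/v1/audio/transcriptions"


def build_transcription_request(api_key, filename, audio_bytes, fields):
    """Build (but do not send) a multipart/form-data POST request."""
    boundary = uuid.uuid4().hex
    parts = []
    # Plain form fields such as model, language, response_format.
    for name, value in fields.items():
        parts.append(
            (f"--{boundary}\r\n"
             f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
             f"{value}\r\n").encode("utf-8")
        )
    # The audio file part; the part content type is an assumption.
    parts.append(
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
         "Content-Type: application/octet-stream\r\n\r\n").encode("utf-8")
        + audio_bytes
        + b"\r\n"
    )
    # Closing boundary marks the end of the body.
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(
        API_URL, data=b"".join(parts), headers=headers, method="POST"
    )
```

The returned object can be passed to `urllib.request.urlopen` to actually send it. Note that the boundary has to be part of the `Content-Type` value, which curl's `-F` flag handles automatically.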

Form Data

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| file | file | Yes | Audio file to transcribe (max 25MB). Supported formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm |
| model | string | Yes | Model identifier (e.g., whisper-1) |
| language | string | No | ISO-639-1 language code of the input audio |
| prompt | string | No | Optional text to guide the model’s style or to continue a previous audio segment |
| response_format | string | No | Format of the transcript output: json, text, srt, verbose_json, vtt. Default: json |
| temperature | float | No | Sampling temperature (0.0 to 1.0). Default: 0.0 |
| timestamp_granularities | array | No | Timestamp detail level: word, segment. Default: segment |
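
The constraints listed above can be checked on the client before uploading. The sketch below is one way to do that in Python. The helper name `validate_transcription_params` is hypothetical, and it assumes "25MB" means 25 × 1024 × 1024 bytes and that the file format can be inferred from the file extension; the gateway may apply different rules.

```python
import os

MAX_FILE_BYTES = 25 * 1024 * 1024  # assumption: "25MB" read as 25 MiB
SUPPORTED_FORMATS = {"flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}
RESPONSE_FORMATS = {"json", "text", "srt", "verbose_json", "vtt"}
GRANULARITIES = {"word", "segment"}


def validate_transcription_params(filename, file_size, model,
                                  response_format="json", temperature=0.0,
                                  timestamp_granularities=("segment",)):
    """Return a list of problems; an empty list means the inputs look valid."""
    errors = []
    # Assumption: the format is inferred from the extension.
    ext = os.path.splitext(filename)[1].lstrip(".").lower()
    if ext not in SUPPORTED_FORMATS:
        errors.append(f"unsupported file format: {ext or '(none)'}")
    if file_size > MAX_FILE_BYTES:
        errors.append("file exceeds the 25MB limit")
    if not model:
        errors.append("model is required")
    if response_format not in RESPONSE_FORMATS:
        errors.append(f"unknown response_format: {response_format}")
    if not 0.0 <= temperature <= 1.0:
        errors.append("temperature must be between 0.0 and 1.0")
    unknown = set(timestamp_granularities) - GRANULARITIES
    if unknown:
        errors.append(f"unknown timestamp granularities: {sorted(unknown)}")
    return errors


print(validate_transcription_params("audio.mp3", 1024, "whisper-1"))
```

Returning a list of messages rather than raising on the first problem lets a caller report every invalid field at once.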

Supported Providers

OpenAI

Industry-leading Whisper models for high-accuracy transcription

Azure OpenAI

Enterprise-grade Whisper model support with global availability

Fireworks

Fast and efficient Whisper v3 and v3-turbo models