Convert audio files to text in the original language.
| Parameter | Type | Required | Description |
|---|---|---|---|
| Authorization | string | Yes | Bearer authentication header |
| Content-Type | string | Yes | Must be multipart/form-data |
| Parameter | Type | Required | Description |
|---|---|---|---|
| file | file | Yes | Audio file to transcribe (max 25MB). Supported formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm |
| model | string | Yes | Model identifier (e.g., whisper-1) |
| language | string | No | ISO-639-1 language code of the input audio |
| prompt | string | No | Optional text to guide the model’s style or continue previous audio segment |
| response_format | string | No | Format of the transcript output: json, text, srt, verbose_json, vtt. Default: json |
| temperature | float | No | Sampling temperature (0.0 to 1.0). Default: 0.0 |
| timestamp_granularities | array | No | Timestamp detail level: word, segment. Default: segment |