Skip to main content
POST
/
v1
/
audio
/
transcriptions
Create Transcription
curl --request POST \
  --url https://api.dedaluslabs.ai/v1/audio/transcriptions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form file='@example-file' \
  --form 'model=<string>' \
  --form 'language=<string>' \
  --form 'prompt=<string>' \
  --form 'response_format=<string>' \
  --form temperature=123
{
  "language": "<string>",
  "duration": 123,
  "text": "<string>",
  "words": [
    {
      "word": "<string>",
      "start": 123,
      "end": 123
    }
  ],
  "segments": [
    {
      "id": 123,
      "seek": 123,
      "start": 123,
      "end": 123,
      "text": "<string>",
      "tokens": [
        123
      ],
      "temperature": 123,
      "avg_logprob": 123,
      "compression_ratio": 123,
      "no_speech_prob": 123
    }
  ],
  "usage": {
    "type": "<string>",
    "seconds": 123
  }
}

Authorizations

Authorization
string
header
required

API key authentication using Bearer token

Body

multipart/form-data
file
file
required
model
string
required
language
string | null
prompt
string | null
response_format
string | null
temperature
number | null

Response

Successful Response

Represents a verbose json transcription response returned by model, based on the provided input.

Fields:

  • language (required): str
  • duration (required): float
  • text (required): str
  • words (optional): list[TranscriptionWord]
  • segments (optional): list[TranscriptionSegment]
  • usage (optional): TranscriptTextUsageDuration
language
string
required

The language of the input audio.

duration
number
required

The duration of the input audio.

text
string
required

The transcribed text.

words
TranscriptionWord · object[]

Extracted words and their corresponding timestamps.

segments
TranscriptionSegment · object[]

Segments of the transcribed text and their corresponding details.

usage
TranscriptTextUsageDuration · object

Usage statistics for models billed by audio input duration.