Create Transcription - Dedalus Docs

{ "language": "<string>", "duration": 123, "text": "<string>", "words": [ { "word": "<string>", "start": 123, "end": 123 } ], "segments": [ { "id": 123, "seek": 123, "start": 123, "end": 123, "text": "<string>", "tokens": [ 123 ], "temperature": 123, "avg_logprob": 123, "compression_ratio": 123, "no_speech_prob": 123 } ], "usage": { "type": "<string>", "seconds": 123 } }

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

multipart/form-data

file

required

model

string

required

language

string | null

prompt

string | null

response_format

string | null

temperature

number | null

Response

Successful Response

CreateTranscriptionResponseVerboseJson
CreateTranscriptionResponseJson

Represents a verbose json transcription response returned by model, based on the provided input.

Fields:

language (required): str
duration (required): float
text (required): str
words (optional): list[TranscriptionWord]
segments (optional): list[TranscriptionSegment]
usage (optional): TranscriptTextUsageDuration

language

string

required

The language of the input audio.

duration

number

required

The duration of the input audio.

text

string

required

The transcribed text.

words

TranscriptionWord · object[]

Extracted words and their corresponding timestamps.

Show child attributes

segments

TranscriptionSegment · object[]

Segments of the transcribed text and their corresponding details.

Show child attributes

usage

TranscriptTextUsageDuration · object

Usage statistics for models billed by audio input duration.

Show child attributes