You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# ID of the model to use. The options are `gpt-4o-transcribe`,
120
-
# `gpt-4o-mini-transcribe`, and `whisper-1` (which is powered by our open source
121
-
# Whisper V2 model).
149
+
# `gpt-4o-mini-transcribe`, `whisper-1` (which is powered by our open source
150
+
# Whisper V2 model), and `gpt-4o-transcribe-diarize`.
122
151
moduleModel
123
152
extendOpenAI::Internal::Type::Union
124
153
125
154
variantString
126
155
127
-
# ID of the model to use. The options are `gpt-4o-transcribe`, `gpt-4o-mini-transcribe`, and `whisper-1` (which is powered by our open source Whisper V2 model).
156
+
# ID of the model to use. The options are `gpt-4o-transcribe`, `gpt-4o-mini-transcribe`, `whisper-1` (which is powered by our open source Whisper V2 model), and `gpt-4o-transcribe-diarize`.
128
157
variantenum: ->{OpenAI::AudioModel}
129
158
130
159
# @!method self.variants
@@ -135,6 +164,8 @@ module Model
135
164
# first normalizes loudness and then uses voice activity detection (VAD) to choose
136
165
# boundaries. `server_vad` object can be provided to tweak VAD detection
137
166
# parameters manually. If unset, the audio is transcribed as a single block.
167
+
# Required when using `gpt-4o-transcribe-diarize` for inputs longer than 30
# Represents a diarized transcription response returned by the model, including
39
+
# the combined transcript and speaker-segment annotations.
40
+
#
41
+
# @param duration [Float] Duration of the input audio in seconds.
42
+
#
43
+
# @param segments [Array<OpenAI::Models::Audio::TranscriptionDiarizedSegment>] Segments of the transcript annotated with timestamps and speaker labels.
44
+
#
45
+
# @param text [String] The concatenated transcript text for the entire audio input.
46
+
#
47
+
# @param usage [OpenAI::Models::Audio::TranscriptionDiarized::Usage::Tokens, OpenAI::Models::Audio::TranscriptionDiarized::Usage::Duration] Token or duration usage statistics for the request.
48
+
#
49
+
# @param task [Symbol, :transcribe] The type of task that was run. Always `transcribe`.
50
+
51
+
# Token or duration usage statistics for the request.
# Usage statistics for models billed by token usage.
99
+
#
100
+
# @param input_tokens [Integer] Number of input tokens billed for this request.
101
+
#
102
+
# @param output_tokens [Integer] Number of output tokens generated.
103
+
#
104
+
# @param total_tokens [Integer] Total number of tokens used (input + output).
105
+
#
106
+
# @param input_token_details [OpenAI::Models::Audio::TranscriptionDiarized::Usage::Tokens::InputTokenDetails] Details about the input tokens billed for this request.
107
+
#
108
+
# @param type [Symbol, :tokens] The type of the usage object. Always `tokens` for this variant.
0 commit comments