Skip to main content

Perform speech to text using OpenAI

Provided an audio file, transcribe or translate the contents into text. This Wrk Action only supports files up to 25MB.

Application

  • OpenAI

Inputs (what you have)

NameDescriptionData TypeRequired?Example
FileAn audio file that will be converted into text. Supported file types are: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webmFileYesaudio.mp3
Conversion typeTranscription will record the speech in the same language it was spoken. Translation will record provided speech as EnglishPre-defined choice listYesTranscription
PromptYou can use a prompt to improve the quality of the transcripts generated by the Whisper API. The model will try to match the style of the prompt, so it will be more likely to use capitalization and punctuation if the prompt does too.Text (Long)NoEnsure correct grammar is used
LanguageIf transcription is selected, the language of the input audioPre-defined choice listNoEnglish
Output filenameOptional custom filename for the generated audio fileText(Long)Nosample.mp3

Note: the value of inputs can either be a set value in the configuration of the Wrk Action within the Wrkflow, or a variable from the Data library section. These variables in the Data library section are the outputs of previous Wrk Actions in the Wrkflow.

Outputs (what you get)

NameDescriptionData TypeRequired?Example
Recorded textThe transcribed or translated textText(Long)Yes

Outcomes

NameDescription
SuccessThis status is selected if the speech is recorded successfully to text
Impossible to completeThis status is selected in the event of the following scenarios:External system is not availableProvided file is not in the correct typeThe provided file is too large

Requirements

  • N/A