Google Speech Start Transcription Workflow Operation

ID: google-speech-start-transcription


Google speech Start Transcription invokes the Google Speech-to-Text service by passing an audio file to be translated to text.

Parameter Table

configuration keys description default value example
source-flavor The flavor of the audio file to be sent for translation. EMPTY presenter/delivery
source-tag The flavor of the audio file to be sent for translation. EMPTY transcript
skip-if-flavor-exists If this flavor already exists in the media package, skip this operation.
To be used when the media package already has a transcript file. Optional
false captions/timedtext
language-code The language code to use for the transcription. Optional. If set, it will override the configuration language code EMPTY en-US, supported language:

One of source-flavor or source-tag must be specified.


<!--  Encode audio to flac -->
    description="Extract audio for transcript generation">
    <configuration key="source-flavor">*/source</configuration>
    <configuration key="target-flavor">audio/flac</configuration>
    <configuration key="target-tags">transcript</configuration>
    <configuration key="encoding-profile">audio-flac</configuration>
    <configuration key="process-first-match-only">true</configuration>

<!-- Start Google Speech transcription job -->
    description="Start Google Speech transcription job">
    <!--  Skip this operation if flavor already exists. Used for cases when mp already has captions. -->
    <configuration key="skip-if-flavor-exists">captions/timedtext</configuration>
    <configuration key="language-code">en-US</configuration>
    <!-- Audio to be translated, produced in the previous compose operation -->
    <configuration key="source-tag">transcript</configuration>

Encoding profile used in example above = audio-flac = stream = audio = -audio.flac = audio/flac = -i /#{} -ac 1 #{out.dir}/#{}#{out.suffix}