Start Watson Transcription Workflow Operation

ID: start-watson-transcription

Description

The start watson transcription operation invokes the IBM Watson Speech-to-Text service, passing an audio file to be translated to text.

Parameter Table

configuration keys	description	default value	example
source-flavor	The flavor of the audio file to be sent for translation.	EMPTY	presenter/delivery
source-tag	The flavor of the audio file to be sent for translation.	EMPTY	transcript-audio
skip-if-flavor-exists	If this flavor already exists in the media package, skip this operation. To be used when the media package already has a transcript file.	false	captions/vtt+en

One of source-flavor or source-tag must be specified.

Example

  # Extract audio from video in ogg/opus format
  - id: encode
    fail-on-error: true
    exception-handler-workflow: partial-error
    description: Extract audio for transcript generation
    configurations:
      - source-tags: engage-download
      - target-flavor: audio/ogg
      - target-tags: transcript
      - encoding-profile: audio-opus
      # If there is more than one file that match the source-tags, use only the first one
      - process-first-match-only: true

  # Start IBM Watson recognitions job
  - id: start-watson-transcription
    fail-on-error: true
    exception-handler-workflow: partial-error
    description: Start IBM Watson transcription job
    configurations:
      # Skip this operation if flavor already exists. Used for cases when mp already has captions.
      - skip-if-flavor-exists: captions/vtt+en
      # Audio to be translated, produced in the previous compose operation
      - source-tag: transcript

Encoding profile used in example above

profile.audio-opus.name = audio-opus
profile.audio-opus.input = stream
profile.audio-opus.output = audio
profile.audio-opus.suffix = -audio.opus
profile.audio-opus.ffmpeg.command = -i /#{in.video.path} -c:a libvorbis -ac 1 -ar 16k -b:a 64k #{out.dir}/#{out.name}#{out.suffix}