16

Click here to load reader

Best Practices for Automatic Transcript Alignment

Embed Size (px)

DESCRIPTION

If you already have transcripts for your video or audio files, automatic transcript alignment is the fastest and least expensive way to create captions and use interactive video plugins. This webinar covers tips and best practices for using our automatic transcript alignment service. Watch this webinar to learn about: - How the technology works - Uploading video/audio files - Preparing a transcript for upload - Transcript synchronization accuracy - Using captions & interactive transcripts - Editing transcripts & captions post processing - Working with lecture capture and video platforms Presenters: Josh Miller (Moderator) Co-Founder | 3Play Media David Zylber Manager of Customer Happiness | 3Play Media Roger Zimmerman VP of Research & Development | 3Play Media

Citation preview

Page 1: Best Practices for Automatic Transcript Alignment

Transcript Alignment Service Webinar

March 12, 2013

Moderator: Josh Miller

Speakers: Roger Zimmerman David Zylber

Page 2: Best Practices for Automatic Transcript Alignment

Agenda

• Automatic Alignment vs. Transcription and Captioning

• Alignment Service Overview

• Best Practices

• Submitting Transcripts & Media Files

• Formatting your Transcripts

• Q&A

Page 3: Best Practices for Automatic Transcript Alignment

Transcript Alignment Service vs. Transcription and Captioning

• Use the Alignment service when you already have a transcript

• Both services ultimately give you access to the same 3Play Media account features and tools.

• Alignment is 100% automated where as the standard service involves human clean up.

• Turnaround Service Levels

Page 4: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process

1)Re-encode text as ASCII • MS-Word exports still contain non-ASCII characters

• Direct upload users can see the results

Page 5: Best Practices for Automatic Transcript Alignment

DEMO

Page 6: Best Practices for Automatic Transcript Alignment

FTP Overview • Create a folder named for_alignment

• Add the media file first to the for_alignment folder - e.g. Casablanco.mp4

• Then add the plain .TXT transcript to the for_alignment folder - e.g. Casablanco.txt

• The .TXT file MUST HAVE THE SAME NAME as the media file

• Batch uploads: first submit all media files and then the corresponding transcripts.

Page 7: Best Practices for Automatic Transcript Alignment

Alignment Best Practices

• THE KEY: Text corresponds to audio!

• Common Problems: -Non-conforming speaker labels (not all caps, hyphens instead of colons -Wrapped text becomes paragraphs -Including instructions, screen directions, scene settings/headers -Interpretation -Overlapping speakers -Audio quality • Duration: No more than 2 hours per file

• Drag and Drop your transcripts when you can

• Transcripts should be unformatted plain text file (.TXT)

• Short duration reduces the likelihood of misalignment

Page 8: Best Practices for Automatic Transcript Alignment

DEMO

Page 9: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process continued…

2) Infer verbalization from text

• Speaker labels used for adaptation (and replaced with

optional pause) • Punctuation removed (sentences replaced with pause)

• Numerics expanded:

10/10/2013 => “ten ten thirteen” OR “October tenth” … 107 => “one hundred and seven” OR “one oh seven” … 5’3” => “five foot three” or “five three” …

• Acronyms/abbreviations expanded: “St.”, “ABC”, “NASDAQ”

Page 10: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process

3) Build a “biased” language model (with options): CEO: “On 10/10/2013, we will be listed on NASDAQ as ABC”

<SPEAKER> on { NULL / this } { ten ten / october tenth } <COMMA> { NULL / twenty thirteen / thirteen } { we will / we’ll } be listed on the nasdaq as a b c <SENTENCE> …

Page 11: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process

4) Run ASR with biased LM: ON 1.02 1.05 OCTOBER 1.05 1.32 TENTH 1.32 1.51 WE’LL 1.63 1.76 BE 1.76 1.82

Page 12: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process

5) Re-Align with original text:

CEO: 0.0 1.02 ON On 1.02 1.05 OCTOBER TENTH 10/10/2013, 1.05 1.51 WE’LL we 1.63 1.695 will 1.695 1.76 BE be 1.76 1.82

Page 13: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process

6) Fill in gaps in ASR output with all of the original transcript text in that region.

Page 14: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process

7) Compute confidence from ASR process plus number/length of gaps. “Audio Quality” bars

Page 15: Best Practices for Automatic Transcript Alignment

Automatic Alignment Process

8) Create all output assets from the aligned transcript, as if it had been edited.

Page 16: Best Practices for Automatic Transcript Alignment

RESOURCES

Knowledge Base

support.3playmedia.com/forums Contact 3Play Media Support

[email protected]

NEED HELP?