Upload
colleen-obrien
View
214
Download
0
Tags:
Embed Size (px)
Citation preview
Week 8 - TutorialInteractive Digital Moving Image Production | CU3003NI | - Pratik Man Singh Pradhan
Media Encoding
Media Encoding Overview
Why and how audio and video are encoded.
Encoding Media
Encoding refers to the conversion of media files from one form to another (Compression)
Encoding is performed for the following purposes Compressing a file to a smaller size (data/frame size) Making it usable on a particular device / software player
Practically all audio and video is encoded and compressed for distribution.
Uncompressed audio and video are retained for archiving and re-use / re-encoding.
Encoding > Decoding Flow
DataFile
Stream Stream
WebcamMicrophoneOB Unit / Studio Control room
Uncompressed VideoUncompressed audio
Compressed data file
Compressed stream
Local Storage
TransportNetwork (www)
DataFile
En
cod
ing
En
gin
e
Decoding Engine
Transcoding
The techniques used for transcoding are the same as for encoding.
The goal of transcoding is not to get a file down to a small size (compression)
Transcoding can be seen as ‘translating’ from one form to another maintaining maximum quality.
Example: some editing systems may not be capable of processing a particular type of video – footage is transcoded to a form that can be used.
Digital Media Files
Containers (Wrappers) Encoded media is stored within container formats Containers ‘store’ encoded audio and / or audio ‘streams’ Containers also contain metadata needed for the player to make ‘sense’ of the
enclosed media formats. Container formats include QuickTime (MOV), RealMedia (RM), MPEG and OGG
(open source format)
IMPORTANT: Container formats do not describe the manner in which a file has been encoded.
- QT file might not play in QuickTime on a particular machine- The software requires the appropriate Codec to be installed
Digital Media Files - Codecs
Whether or not a file will play depends on its codec
Codec refers to the particular encoding method (algorithm) used to compress and decompress a piece of media(COmpress - DECompress)
Codecs specifically describe the type of video or audio compression used
Certain codecs play almost universally (MPEG4)
Some codecs may require plugins to be installed for playback (Vorbis(OGG), VP3 (Theora))
Encoding Applications
Encoding is don at the following points
A/V production applications (from the timeline) Final Cut Pro (native & via compressor) Protools
Within bespoke compression applications Adobe Media Encoder (PC/MAC) Compressor(Apple) MediaCoder (Open Source)
As import/export options on media players iTunes (import) QuickTime Pro (export options)
On websites such as YouTube (FFMPEG server side encoder)
Some encoding applications offer more control than others
Lossless and Lossy Compression
Lossless
Refers to any file type that is a true (verbatim) copy of the original
No quality has been lost is saving a file in the following formats Lossless Audio – Flac, WavPac, Monkey’s Audio, ALAC Lossless Video – Animation Codec, Huffyuv, Uncompressed Lossless Graphics – Gif, PNG, Tiff
A basic example of lossless compression methods include RLE (Rule Length Encoding)
Using the following as an abstraction of the data used to store a segment of audio – [AAAAABBCCCCCDEEEEEEE] = 20bytes
RLE would look at the ‘run lengths’ or repeated adjacent runs of data and summarise them as A5B2C5D1E7 = 10bytes
Lossless and Lossy Compression
Lossless
File formats and codecs where a file may look or sound acceptable or as good as the original but is in fact a degraded copy
Lossy file formats include Lossy audio – AAC, MP3, Vorbis Lossy video – M2V, H.264 Lossy Graphics - JPEG
Lossy compression approximates data in order to make easily represented sequences of data
A (very) basic example is to use a similar scenario as before
AAAAABAAAAA represents a signal or series of pixels (11 bytes) The compression could represent it as A5B1A5 (6 bytes lossless) Lossy compression decides that the discrepancy is not significant enough to record so instead
approximates it back to A (A11 = 3 bytes lossy)
Redundancy
File compression uses systems based around redundancy
Redundancy elements are parts of the sound or image that are not required to be recorded (written) as data in the compressed file
Audio uses psychoacoustic principles to determine which sound can be omitted without adversely affecting the overall quality (low/high frequencies, hiss, overlapping sounds)
Video uses pixel colour data to determine redundancies
Different codecs and encoders view and process these redundancies in different ways (algorithms) with different results
Redundancy can be broken into two categories Objective Redundancy Subjective Redundancy
Objective Redundancy in Imagery
An area of pure black is detected (area spans 15,300 pixels all black)
The area is mapped between 4 points (corners of green rectangle)
15,300 pieces of information can be reduced to 5 pieces of information
That information can then be decoded in the player and rendered exactly as it was.
Subjective Redundancy in Imagery
An area is detected where pixels are similar in colour (a;; black / dark grey)
The encoder decides that the difference is negligible (won’t be noticed)
The area is mapped similarly to before using 1 colour value
Information has been discarded and the quality of the compresses file is less than the original.
Compressing
The goal of compression is to get the smallest file size while retaining maximum ‘meaningful’ information (fidelity/clarity)
Compression is always a trade-off between quality and file size
The same principle applies to audio/video as to graphics Always work from a high quality source Never compress already compressed media (generation loss) Always retain (archive) a high quality original for future work
THE END