25
The Secret Lives of MP3 Files Doug Kaye The Conversations Network and GigaVox Media

The Secret Lives of MP3 Files

  • Upload
    dkaye

  • View
    4.011

  • Download
    0

Embed Size (px)

DESCRIPTION

Things you didn't know (or thought you did) about MP3 files.

Citation preview

Page 1: The Secret Lives of MP3 Files

The Secret Livesof MP3 Files

Doug KayeThe Conversations Network

and GigaVox Media

Page 2: The Secret Lives of MP3 Files

Formats & Encoders

• Lossless (WAV, AIFF)

• Lossy

- MPEG 1, Layer 3 (MP3)

- AAC (AAC, M4A, M4B)

- MPEG I, Layer 2 (MP2)

Page 3: The Secret Lives of MP3 Files

MPEG Confusion

• Lossy Perceptual/Psychoacoustical Codecs

• MP3 = MPEG-I Layer 3

• MP2 = MPEG-I Layer 2 (not MPEG-II)

Page 4: The Secret Lives of MP3 Files

Motion Picture Experts Group

• MPEG-1: Video CDs, MP3 Audio

• MPEG-2: Digital TV, Set-Top Boxes

• MPEG-4: Online Multimedia (Video)

• MPEG-7: Audio and Video Search

• MPEG-21: Multimedia Framework

Page 5: The Secret Lives of MP3 Files

MPEG-1 for Geeks

• Layer 1

• Simple 32-Band Algorithm

• Philips DCC (Digital Compact Cassette)

• Layer 2 (a.k.a. MUSICAM)

• Also 32 Bands

• International Standard for Broadcasting

Page 6: The Secret Lives of MP3 Files

MPEG-1 Layer 3 (MP3)for Geeks

• Psychoacoustic Masking

• 32 Bands Divided into 576 Subbands

• More Accurate Masking Thresholds

• Redundancy Reduction

• Lossless Huffman Encoding

• Bit-Reservoir Buffering

• Joint Stereo

Page 7: The Secret Lives of MP3 Files

Sample Rate for Geeks

• The Nyquist Theorem

• Sample at 2x the Highest Frequency

• 22.05kHz Sample Rate for 11kHz Audio

• Sample Rate Is a Property of Uncompressed Source (WAV or AIFF)

Page 8: The Secret Lives of MP3 Files

Sample Rate in Practice

• Standardize on 44.1kHz Sample Rate

• Flash & Other Players Require n*11.025kHz

• Resample if Source is 48kHz from DVDs

Page 9: The Secret Lives of MP3 Files

Bit Rate for Geeks

• Independent of Sample Rate

• Specifies Encoder Output File Size (CBR)

• @64kbps, 1 hour ≈ 27MB

• Variable Bit Rate (VBR)

• For Higher Bit Rates Only

• Not Universally Supported (Avoid It)

Page 10: The Secret Lives of MP3 Files

Bit Rate in Practice

• “Use Higher Bit Rates for Music?”

• It’s a Myth!

• Human Voices Are Complex

• Music Masks Its Own Artifacts

• 64kbps is Most Common Today

• 96kbps is Gaining

Page 11: The Secret Lives of MP3 Files

Podcasting Bit-Rate History

• June 2003: 32kbps. “Files too large”

• April 2004: 48kbps. “No problem”

• September 2004: 64kbps. “Quality is low”

• Today: Still 64kbps.

• Tomorrow??

Page 12: The Secret Lives of MP3 Files

Stereo Encoding

• “Stereo MP3s are twice as large as mono.”

• It’s a Myth!

• Only Bit Rate Specifies Output File Size

• You May Want to Use Higher Bit Rates for Stereo

Page 13: The Secret Lives of MP3 Files

Stereo Encoding for Geeks

• Dual Channel or Independent Channel (IC)

- Entirely Separate Left and Right

• But Most L/R Information is Redundant

• Intensity Stereo (IS)

• Mid/Side Stereo (MS)

• Joint Stereo (JS) Allows IS/MS Combination

Page 14: The Secret Lives of MP3 Files

Stereo Encoding(Even Geekier)

• JS Encodes L+R and L-R

• If L=R then L-R=0

• Since Bit Rate is ConstantL=R Uses Fewer Bits for Stereo Information

Page 15: The Secret Lives of MP3 Files

Stereo Encoding in Practice

• Stereo vs. Mono (not Music vs. Voice) is a Good Reason to Use Higher Bit Rates

• Greater Separation Suggests Higher Rates

• If Mostly Speech, Consider 100% Mono

• If Mono, Make L&R Digitally Identical

• Always Encode in Stereo for Compatibility

Page 16: The Secret Lives of MP3 Files

Mastering for MP3

• Help the Encoder: Eliminate Unnecessary Data

- High-Pass Filter at 80Hz

- Low-Pass Filter at 11kHz (@64kbps encoding)

- Normalize

Page 17: The Secret Lives of MP3 Files

Which is Louder?

• It’s Not the Height of the Peaks (voltage)

• It’s the Area Under the Curve (power)

Page 18: The Secret Lives of MP3 Files

Loudness

• What’s the Standard?

• We Asked:

- Podcasters

- Audio Engineers

- Radio Engineers

• Answer: There Isn’t One

• It’s a Hard Problem to Solve

Page 19: The Secret Lives of MP3 Files

Normalization

• Peak Normalization (common)

- Maximizes Voltage, not Power

• RMS Normalization

- Maximizes Power (=Loudness)

• Determine a Standard Loudness Level

Page 20: The Secret Lives of MP3 Files

Avoid Recording to MP3!

• MP3 is a final/release format.

• Not designed to be decoded and re-encoded.

• Use MP2 Instead...

• or the highest MP3 bit rate possible.

Page 21: The Secret Lives of MP3 Files

AAC/M4B Files?

• Yes, AAC is Better Than MP3

• We Added AAC to Support iPod Bookmarks

• Painful: Only iTunes Could Encode M4B

• Doubled Much of Our Workflow

• Can’t Be Easily Assembled

Page 22: The Secret Lives of MP3 Files

MP2: Why and When?

• MPEG-1 Layer 2

• Designed as an Intermediate Format

• The Standard in Broadcast Radio

• 128kbps per Track

• 44.1kHz Sample Rate Preferred

Page 23: The Secret Lives of MP3 Files

Audio Lessons Learned

• MP3 Options

• Audio-File Myths

• RMS Normalization (Loudness)

• AAC/M4B Files (iTunes & iPods)

• MP2 Files

Page 24: The Secret Lives of MP3 Files

To Summarize

• Record at 44.1kHz Sample Rate (not in MP3!)

• Mastering

- RMS Normalization (Pick a Standard Level)

- 80Hz Hi-Pass, 11kHz Low Pass (for voice)

- If Mono, Make L&R Digitally Identical

• Encoding

- 64kbps when L=R

- Consider ≥96kbps for L≠R

- Always Use Joint Stereo

Page 25: The Secret Lives of MP3 Files