17
7/29/2019 02 Audio http://slidepdf.com/reader/full/02-audio 1/17 1 2: Audio Basics Mark Handley  Audio Basics  Analog to Digital Conversion Sampling Quantization  Aliasing effects Filtering Companding PCM encoding Digital to Analog Conversion

02 Audio

Embed Size (px)

Citation preview

Page 1: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 1/17

1

2: Audio Basics

Mark Handley

 Audio Basics

 Analog to Digital Conversion

Sampling

Quantization

 Aliasing effects

Filtering

Companding

PCM encoding

Digital to Analog Conversion

Page 2: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 2/17

2

 Analog Audio

Larynx

(vocal cords)

Vocal tract

(resonance)

Sound Waves

(compression and rarefaction)

diaphragm

 Analog voltage

(proportional to

air pressure)

Simple Analog-to-Digital Converter 

Signal is sampled at sampling frequency f.

Sampled signal is quantized into discrete values.

Sample

 And

Hold

Quantizer 

 Analog

Signal

Sample

Clock

 A

Digitized

Codewords

Page 3: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 3/17

3

Sample and Hold

Voltage sampled and held to allow quantization.

Sampling Period(1/sampling rate)

time

voltage

(amplitude)

Sample Rate

Sample Rate: the number of samples per second.

 Also known as sampling frequency .

Telephone: 8000 samples/sec.

CD: 44100 samples/sec

DVD: 48000 samples/sec

Page 4: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 4/17

4

Sample and Hold

time

voltage

(amplitude)

Sampling Rate

Page 5: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 5/17

5

Sampling RateLow sample rate

High sample rate

High frequency information

lost due to subsampling

Real music example

Page 6: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 6/17

6

How fast to sample?

Nyquist-Shannon sampling theorem

Formulated by Harry Nyquist in 1928 (“Certain topics in telegraph

transmission theory”)

Proved by Claude Shannon in 1949 (“Communication in the

presence of noise”).

For no loss of information:

Sampling frequency   ≥ 2 * maximum signal frequency 

How fast to sample?

For no loss of information:

Sampling frequency ≥ 2 * maximum signal frequency

For a particular sampling frequency:

Nyquist frequency (or rate) is the highest frequency that can be

accurately represented.

Example:

Limit of human hearing: ~20KHz By Nyquist, sample rate must be ≥ 40,000 samples/sec.

CD sample rate: 44,100 samples/sec.

Page 7: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 7/17

7

Telephony

8KHz sampling used⇒4KHz maximum frequency.

Human ear can hear up to ~20KHz.

Human voice still intelligible when higher frequenciesare lost.

Higher frequencies convey some emotions, and areuseful for identification of the speaker.

4KHz pretty useless for music.

 Aliasing

What happens to all those higher frequencies you can’t

sample?

They add noise to the sampled data at lower 

frequencies.

Page 8: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 8/17

8

 Aliasing Noise (Real Music)

 Aliasing Noise

original

signal

samples resulting

aliasing

noise

Page 9: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 9/17

9

 Aliasing Noiseoriginal

signal

samples resulting

aliasing

noise

 Aliasing Noise

 Alias frequency = abs(signal freq. - closest integer multiple of 

sampling freq.)

Folding frequency: the frequency above which aliasing occurs.

Half the sampling rate.

Example: 7KHz input signal, 8KHz sampling: 1KHz alias

Folding frequency is 4KHz.

Example: 17KHz input signal, 8KHz sampling: 1KHz alias

17KHz - (2 * 8KHz) = 1KHz

Page 10: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 10/17

10

 Analog-to-Digital Converter 

Low-pass anti-aliasing filter (cutoff at f /2) on input.

Signal is sampled at sampling frequency f .

Sampled signal is quantized into discrete values.

Sample

 And

Hold

Quantizer 

 AnalogSignal

Sample

Clock

 A

Digitized

Codewords

Low-pass

filter 

Filtered

 Analog

Signal

Quantization

Sampled analog signal needs to be quantized (digitized).

Two questions:

How many discrete digital values?

What analog value does each digital value correspond

to?

Simplest quantization: linear.8-bit linear, 16-bit linear.

Page 11: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 11/17

11

Quantization Noise

Quantization Noise

Low amplitude

components lost

Page 12: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 12/17

12

Quantization Noise

Quantization Noise

High frequency noise introduced

Page 13: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 13/17

13

How many levels?

8 bits (256 levels) linear encoding would probably be

enough if the signal always used the full range.

But signal varies in loudness.

If full range is used for loud parts, quiet parts will suffer 

from bad quantization noise (only a few levels used).

If full range is used for quiet parts, loud parts will clip,

resulting in really bad noise.

CD uses 16-bit linear encoding (65536 levels).

Pretty good match to dynamic range of human ear.

Telephony

16 bit linear would be rather expensive for telephony.

8 bit linear poor quality.

Solution: use 8 bits with an “logarithmic” encoding.

Known as companding (compressing/expanding)

Goal is that quantization noise is a fixed proportion of 

the signal, irrespective of whether the signal is quiet or 

loud.

Page 14: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 14/17

14

Linear Encoding

Logarithmic Encoding

Page 15: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 15/17

15

µlaw encoding

µlaw encoding

Input Signal

(linear encoding)

Output Signal(8-bit µlaw encoding)

Small amplitude

input signal coded

with more code

points than it would

be with 8-bit linear 

Page 16: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 16/17

16

µlaw decoding

Output Signal

 

Input Signal(8-bit µlaw encoding)

µ-law vs A-law

8-bit µ-law used in US for telephony

ITU Recommendation G.711

8-bit A-law used in Europe for telephony

Similar, but a slightly different curve.

Both give similar quality to 12-bit linear encoding.

 A-law used for International circuits.  Both are linear approximations to a log curve.

8000 samples/sec * 8bits per sample = 64Kb/s data rate

Page 17: 02 Audio

7/29/2019 02 Audio

http://slidepdf.com/reader/full/02-audio 17/17

17

µ-law

 A-law

 A = 87.7 in Europe

µ  is 255 in US/Japan