Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
Digital Audio Media Type
10/19/2010 1
Multimedia Applications
Multimedia Requirements
Basics of Digital Audio Media
Sound Categories
Sampling
Quantization
Audio Representation Standards◦ Wav
3/19/2010 2
Definitions:◦ Data: is information that may be valuable for an
end-user
◦ Media: tools used to store and deliver information or data
◦ Multimedia: communications that include multiple forms of information
3
Point of Sale
Point of Information
Training
Learning
Entertainment
4
Using multimedia data, the user is presented the various models available from the vendor
This can be a graphical demonstration of how the product actually works.
5
2009 TOYOTA LANDCRUISER UZJ200R SAHARA (4X4)
Price: $91,990
Kilometers: 34993
Body: Wagon
Color: Pearl White
Transmission: Automatic
Engine Size: 4.7 Litres
No of Seats: 8
Fuel Type: Petrol - Unlead
Extras/Features:
17 Inch Alloy Wheels Air Con + Climate Control Multi Zone Alarm System/Remote Anti Theft Anti-Lock Braking Brake Assist CD With 6 CD Stacker Central
Interactive multimedia information systems are being used in museums, hotels, university campuses, companies, etc.
6
Visual training with the help of images, animation, video and other type of multimedia is proven to be more effective.
Multiple choice exams after short video clips can lead the user through the training system.
7
A successful learning system should have features for the user to explore the information.
8
Entertainment is one of the important areas for multimedia applications.
Computer games are becoming widespread by simulating the real game environments.
9
Multimedia data is used from a distance using computer networks.
The Internet is a public network that can be used for multimedia transmission.
10
The Internet is a large network interconnecting local networks.
The Internet has a heterogeneous structure.
The Internet is a best-effort network.
The Internet does not guarantee a fixed data rate over a connection.
Multimedia data should adapt itself with network data rate changes.
11
A Multimedia System is a system capable of: ◦ Processing multimedia data,
◦ Storing multimedia data,
◦ Generation and manipulation of multimedia data.
A Multimedia system has four basic features:
◦ Multimedia systems must be computer controlled.
◦ Multimedia systems are integrated.
◦ The information they handle must be represented digitally.
◦ The interface to the final presentation of media is usually interactive.
The multimedia systems should:◦ Process large amount of data
◦ The should be real time in many cases
◦ The should be able to combine different media
Multimedia data can be used as:◦ Streaming data
Data (audio, video, etc) is presented to the end-user only once (without being saved to a file).
◦ Downloadable
Data is downloaded to a file and presented to the end-user repeatedly.
15
Assume a video with:◦ 25 frames per second
◦ 600 rows
◦ 800 columns
◦ 3 bytes for each pixel
The video needs 25*600*800*3 bytes per second (36 MB )
One hour video will be 129 GB
Conclusion: Compression is necessary
Audio Media Type
What is Sound?Sound is the brain's interpretation of electrical impulses being sent by the inner ear through the nervous system.
Audio is sound within the acoustic range available to humans.
An audio frequency (AF) is an alternating signal within the 20 to 20,000 hertz (cycles per second) range.
3/19/2010 18
Content Sound
Content sound provides information to audiences, for example, dialogs in movies or
theater.
Ambient Sound
Ambient sound consists of an array of background and sound effects.
10/22/2010 19
Narration: Narration provides information about an animation that is playing on the screen.
Testimonials: These could be auditory or video sound tracks used in presentations or movies.
Voice-overs: These are used for short instructions, for example, to navigate the multimedia application.
Music: Music may be used to communicate (as in a song).
10/22/2010 20
Message reinforcement: The background sounds you hear in real life, such as the crowds at a ball game, can be used to reinforce the message that you wish to communicate.
Background music: Set the mood for the audience to receive and process information by starting and ending a presentation with music.
Sound effects: Sound effects are used in presentations to liven up the mood and add effects to your presentations, such as sound attached to bulleted lists.
10/22/2010 21
Sound waves are disturbances in the air (or other mediums of transmission). The sound waves consist of compressions and rarefactions.
The amplitude measures the relative loudness of the sound, which is the distance between a valley and a crest.
The amplitude determines the volume of the sound.
The unit of measurement of volume is a decibel.
10/22/2010 22
Source Intensity Level
Threshold of Hearing (TOH) 0 dB
Rustling Leaves 10 dB
Whisper 20 dB
Normal Conversation 60 dB
Busy Street Traffic 70 dB
Vacuum Cleaner 80 dB
Large Orchestra 98 dB
Walkman at Maximum Level 100 dB
Front Rows of Rock Concert 110 dB
Threshold of Pain 130 dB
Military Jet Takeoff 140 dB
Instant Perforation of Eardrum 160 dB
10/22/2010 23
Frequency : The difference in time between the formation of two crests is termed period.
The number of peaks that occur in one second is the frequency.
Another term associated with frequency is pitch. If an object oscillates rapidly, it creates a "high-pitched" sound (treble).
A low-frequency sound on the other hand is produced by an object that vibrates slowly, such as the thicker strings of a piano or guitar (bass).
10/22/2010 24
Wavelength: Wavelength is the distance from the midpoint of one crest to the midpoint of the next crest.
Wavelength is represented by the symbol λ
Bandwidth: Bandwidth is defined as the difference between the highest and the lowest frequency contained in a signal.
A signal which spans the range of 200-3200 Hz has a bandwidth (BW) of:
BW= 3200-200 = 3000 Hz
10/22/2010 25
Few objects produce sound of a single frequency. Most musical instruments, for example, generate multiple frequencies for each note.
The combinations of frequencies generated by an instrument are known as the timbre.
A timbre consists of a fundamental or main frequency and other minor frequencies known as overtones or harmonics.
10/22/2010 26
Explain why a "note" (say, Middle C) played on a piano sounds different from the same "note“ played on a saxophone.
10/22/2010 27
The velocity of sound may be found directly by measuring the time required for the waves to travel a measured distance.
The velocity varies greatly with the medium through which it travels.
10/22/2010
CENG 460 By: Roya Choupani
28
Audio signal is an analog signal
Computers store digital signal
To convert analog signal into digital signal we have to do sampling
3/19/2010 29
3/19/2010
CENG 460 By: Roya Choupani
30
Sample values can be in a large range and include real numbers.
To store sample values with less number of bits quantization is used.
In quantization the values in a range are mapped to a single number
3/19/2010 31
3/19/2010
CENG 460 By: Roya Choupani
32
20,000 Hz sound, sampled at 40,000 sample/second
Quantization level: 256 (8 bits)
40,000 * 1 bytes /sec data
With 65,536 (16 bits) quantization levels, data size will be 40,000 * 2 = 80,000 bytes/second
3/19/2010 33
Fidelity is defined as the closeness of the recorded version to the original sound. In the case of digital speech, it depends upon the number of bits per sample and the sampling rate.
A really high-fidelity (hi-fi) recording takes up a lot of memory space (176.4 KB for every second of audio of stereo quality sampled at 16 bits, 44.1 kHz per channel).
Fortunately for most computer multimedia applications, it is not necessary to have very high fidelity sound.
10/22/2010 34
Mono Recording:
File size = Sampling rate x duration of recording in seconds x (bits per sample/8) x 1
Stereo Recording:
File size = Sampling rate x duration of recording in seconds x (bits per sample/8) x 2
10/22/2010 35
10/22/2010
CENG 460 By: Roya Choupani
36
wav - standard audio file format used mainly in Windows PCs. Commonly used for storing uncompressed, CD-quality sound files.
mp3 - the MPEG Layer-3 format is the most popular format for downloading and storing music.
au - the standard audio file format used by Sun, Unix and Java. The audio in au files can be uncompressed or compressed.
3/19/2010 37
The WAV file format is a Microsoft specification for the storage of multimedia files.
A WAV file starts out with a file header followed by a sequence of data chunks.
A WAV file consists of two sub-chunks ◦ a "fmt " chunk specifying the data format
◦ a "data" chunk containing the actual sample data.
3/19/2010 38
3/19/2010
CENG 460 By: Roya Choupani
39
Length Name Description
4 ChunkID Contains the
letters "RIFF" in
ASCII form
4 ChunkSize This is the size of
the
entire file in
bytes minus 8 bytes
for the
two fields not
included in this
count:
ChunkID and
ChunkSize.
4 Format Contains the
letters "WAVE"
10/22/2010 40
The "WAVE" format consists of two subchunks: "fmt " and "data":
The "fmt " subchunk describes the sound data's format:
The "data" subchunk contains the size of the data and the actual sound:
10/22/2010 41
Length Name Description
4 Subchunk1ID Contains the letters "fmt "
4 Subchunk1Size 16 for uncompressed case. This is the size of the
rest of the Sub-chunk which follows this number.
2 AudioFormat Uncompressed = 1 , Values other than 1 indicate
some form of compression.
2 NumChannels Mono = 1, Stereo = 2, etc.
4 SampleRate 8000, 44100, etc.
4 ByteRate == SampleRate * NumChannels * BitsPerSample/8
2 BlockAlign == NumChannels * BitsPerSample/8
2 BitsPerSample 8 bits = 8, 16 bits = 16, etc.
2 ExtraParamSize if uncompressed, then doesn't exist
X ExtraParams space for extra parameters
4 Subchunk2ID Contains the letters "data"
4 Subchunk2Size == NumSamples * NumChannels * BitsPerSample/8
This is the number of bytes in the data.
* Data The actual sound data.
10/22/2010 42
Audio data when sent over the Internet with a low bit rate connection, should be compressed. Discuss the importance of the audio data compression.
10/22/2010 43
3/19/2010 44