32
CS 414 - Spring 2011 CS 414 – Multimedia Systems Design Lecture 4 – Visual Perception and Digital Image Representation Klara Nahrstedt Spring 2011

CS 414 - Spring 2011 CS 414 – Multimedia Systems Design Lecture 4 – Visual Perception and Digital Image Representation Klara Nahrstedt Spring 2011

Embed Size (px)

Citation preview

CS 414 - Spring 2011

CS 414 – Multimedia Systems Design Lecture 4 – Visual Perception and Digital Image Representation

Klara Nahrstedt

Spring 2011

CS 414 - Spring 2011

Administrative

If you did not send group configuration to TA, please, do so today 1/26/2011

MP1 will be posted on 1/28 (Friday) Start by reading the MP1 and organizing

yourself as a group this week, start to read documentation, search for multimedia files, try the tutorial.

Administrative Leasing Process from TSG Helpdesk

Lease one Logitech camera for each student or at least two cameras within one group to start MP1, and then for MP2/MP3.

Leasing process starts on January 26 Pick up the camera from the TSG Helpdesk Bring your student ID to sign for the camera Each cs414 student is responsible for his/her own camera

if you loose it (or badly damage) and you don’t have police report, you pay for it (charged to your student account at the end of the semester)

If nobody is at the TSG Helpdesk, then go to Barb Leisner office, room 2312 SC

Helpdesk hours are Monday –Friday 8am-8pm No camera pickup on Saturday and Sunday

CS 414 - Spring 2011

Human Visual System

Eyes, optic nerve, parts of the brain

Transforms electromagnetic energy

Human Visual System

Image Formation cornea, sclera, pupil,

iris, lens, retina, fovea Transduction

retina, rods, and cones Retina has photosensitive

receptors at back of eye Processing

optic nerve, brain

Rods vs Cones (Responsible for us seeing brightness and color)

Contain photo-pigment Respond to low energy Enhance sensitivity Concentrated in retina,

but outside of fovea One type, sensitive to

grayscale changes

Contain photo-pigment Respond to high energy Enhance perception Concentrated in fovea,

exist sparsely in retina Three types, sensitive to

different wavelengths

Cones Rods

CS 414 - Spring 2011

Tri-stimulus Theory 3 types of cones (6/7 Mil. of them)

Red = L cones, Green = M cones, Blue = S cones Ratio differentiates for each person E.g., Red (64%), Green (32%), rest S cones E.g., L(50.6%), M(44.2%), rest S cones

Each type most responsive to a narrow band electro-magnetic waves red and green absorb most energy, blue the least

Light stimulates each set of cones differently, and the ratios produce sensation of color

Color and Visual System Color refers to how we

perceive a narrow band of electromagnetic energy source, object, observer

Visual system transforms light energy into sensory experience of sight

Color Perception (Color Theory) Hue

Refers to pure colors dominant wavelength of the

light Saturation

Perceived intensity of a specific color

how far color is from a gray of equal intensity

Brightness (lightness) perceived intensity

CS 414 - Spring 2011

Hue Scale

Saturation

Original

lightness

Source: Wikipedia

Digitalization of Images – Capturing and Processing

CS 414 - Spring 2011

Capturing Real-World Images

Picture – two dimensional image captured from a real-world scene that represents a momentary event from the 3D spatial world

CS 414 - Spring 2011

W3

W1W2

r

s

F r= function of (W1/W3); s=function of (W2/W3)

Image Concepts

An image is a function of intensity values over a 2D plane I(r,s)

Sample function at discrete intervals to represent an image in digital formmatrix of intensity values for each color plane intensity typically represented with 8 bits

Sample points are called pixels

CS 414 - Spring 2011

Digital Images

Samples = pixels Quantization = number of bits per pixel Example: if we would sample and quantize

standard TV picture (525 lines) by using VGA (Video Graphics Array), video controller creates matrix 640x480pixels, and each pixel is represented by 8 bit integer (256 discrete gray levels)

CS 414 - Spring 2011

Image Representations Black and white image

single color plane with 2 bits Grey scale image

single color plane with 8 bits Color image

three color planes each with 8 bits

RGB, CMY, YIQ, etc. Indexed color image

single plane that indexes a color table

Compressed images TIFF, JPEG, BMP, etc. 2gray levels4 gray levels

Digital Image Representation (3 Bit Quantization)

CS 414 - Spring 2011

Color QuantizationExample of 24 bit RGB Image

CS 414 - Spring 2011

24-bit Color Monitor

Image Representation Example

128 135 166 138 190 132

129 255 105 189 167 190

229 213 134 111 138 187

135 190

255 167

213 138

128 138

129 189

229 111

166 132

105 190

134 187

24 bit RGB Representation (uncompressed)

Color Planes

Graphical Representation

CS 414 - Spring 2011

Image Properties (Color)

CS 414 - Spring 2011

Color Histogram

CS 414 - Spring 2011

Spatial and Frequency Domains Spatial domain

refers to planar region of intensity values at time t

Frequency domain think of each color plane

as a sinusoidal function of changing intensity values

refers to organizing pixels according to their changing intensity (frequency)

CS 414 - Spring 2011

Image Processing Function: 1. Filtering

Filter an image by replacing each pixel in the source with a weighted sum of its neighbors

Define the filter using a convolution mask non-zero values in small neighborhood, typically

centered around a central pixel generally have odd number of rows/columns

CS 414 - Spring 2011

Example:Convolution Filter

CS 414 - Spring 2008

100 100 100 100 100

100 100 50 50 100

100 100 100 100 100

100 100 100 100 100

100 100 100 100 100

0 1 0

0 0 0

0 0 0

100 100 100 100 100

100 100 50 50 100

100 100 50 100 100

100 100 100 100 100

100 100 100 100 100

X =

Mean Filter

Convolution filterSubset of image

9549648

22813455

33191545

23141220

111

111

111

9

1

Mean Filter

Convolution filterSubset of image

9549648

22813455

33191545

23141220

111

111

111

9

1

Common 3x3 Filters

Low/High pass filter

Blur operator

H/V Edge detector

121

212

121

13

1

111

191

111

121

000

121

101

202

101

111

111

111

9

1

Example

CS 414 - Spring 2011

Image Function: 2. Edge Detection

Identify areas of strong intensity contrast filter useless data; preserve

important properties

Fundamental technique e.g., use gestures as input identify shapes, match to

templates, invoke commands

Edge Detection

CS 414 - Spring 2011

Simple Edge Detection Example: Let assume single line of pixels

Calculate 1st derivative (gradient) of the intensity of the original data Using gradient, we can find peak pixels in image I(x) represents intensity of pixel x and I’(x) represents gradient (in 1D), Then the gradient can be calculated by convolving the

original data with a mask (-1/2 0 +1/2) I’(x) = -1/2 *I(x-1) + 0*I(x) + ½*I(x+1)

CS 414 - Spring 2011

5 7 6 4 152 148 149

Basic Method of Edge Detection

Step 1: filter noise using mean filter

Step 2: compute spatial gradient

Step 3: mark points > threshold as edges

CS 414 - Spring 2011

Summary Other Important Image Processing Functions

Image segmentation Image recognition

Formatting Conditioning Marking Grouping Extraction Matching

Image synthesis

CS 414 - Spring 2011