19
CS 436 Computer Vision Fundamentals Sohaib A Khan Room 215, [email protected] Office Hrs: M 10am-11am W 10am-11:am Th 3pm-4pm Course Status Elective for Senior/Graduate students Prerequisites: Strong programming background Mathematics Background Required Matrix Manipulation, Calculus Will be helpful, but not assumed Probability, Random Processes, Signal Processing…

Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

  • Upload
    others

  • View
    7

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

1

CS 436Computer Vision Fundamentals

Sohaib A KhanRoom 215, [email protected]

Office Hrs: M 10am-11amW 10am-11:amTh 3pm-4pm

Course Status

Elective for Senior/Graduate studentsPrerequisites:

Strong programming backgroundMathematics Background

RequiredMatrix Manipulation, Calculus

Will be helpful, but not assumedProbability, Random Processes, Signal Processing…

Page 2: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

2

Goals

Basic Goal: Generate excitement about CV

Introductory theory & applications of CVDemonstration of simple exciting applicationsAbility to write programs to solve CV problemsExamples of CV SystemsIntro to some current research topics

Text

Computer VisionLinda G. Shapiro, George C. Stockman ISBN 0-13-030796-3 Prentice Hall, 2001Additional course notes and reading material will available through the course website.Programming Environment:

C/C++ and MATLAB

Page 3: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

3

Lectures

20 sessions, 100 minutes each1 in-class midterm, 1 final exam50 minutes each week for programming aspects of the courseNo grade for attendance

Grading

40% projects and programming assignments10% assignments/homework5% reading assignments

Graded through class participation/quizzes

20% mid-term25% final exam (non-comprehensive)

Page 4: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

4

Cheating/Plagiarism

Will not be tolerated…

Course Website

Tool to keep all reading links, additional information in one placeLecture notes and slides will be availablehttp://web.lums.edu.pk/~sohaib/cvf-fall02.htm

Page 5: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

5

What is Computer Vision

“The goal of Computer Vision is to make useful decisions about real physical objects and scenes based on sensed images”

ImageProcessing

Image IN Image OUT Computer Vision

Image IN

Symbolic Decision OUT

Computer Vision Area

Integration of several areasImage processingStatistical inferenceMachine intelligenceDecision theory…

Teaching methodologyCover some basic areasBreadth vs DepthShow lots of examples

Page 6: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

6

Module 1 (4 Sessions)

Introduction (1)Transformations and Imaging Geometry (3)

2D Transformations

Module 1 (4 Sessions)

Global Transformation ModelsAffine, Projective, Bilinear

From http://wearcam.org (Professor Steve Mann)

Page 7: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

7

Module 1 (4 Sessions)

3D Transformations

Camera ModelsCamera Calibration

Module 1 (4 Sessions)

WarpingApplying transformation to an imageCan be done for multiple images

Page 8: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

8

Module 2 (6 Sessions)

Basic Binary OperationsThresholding, Morphology, Region Properties, Moments, Connected Component Labeling

Page 9: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

9

Module 2Edge Detection

Shape Representation, Hough Transform

Module 3 (6 Sessions)

Pattern Recognition, Decision TheoryClustering

K-MeansBayes ClassifiersMAP EstimationEM AlgorithmPrinciple Component AnalysisFace Recognition

Page 10: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

10

Module 3

Background Subtraction

Segmentation

Page 11: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

11

Page 12: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

12

Module 4 (4 Sessions)

MotionBrightness Constancy EquationOptical Flow

Lucas-Kanade MethodHorn and Shunck Method

Page 13: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

13

Module 4

Global Motion Estimation / Image Stablization

Affine Least-Squares Method

Page 14: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

14

Programming Assignments1. Image Warping2. Edge Detection3. Hough Transform for Lines4. Clustering5. Face Recognition6. Segmentation7. Lucas-Kanade Optical Flow8. Global Motion Estimation & Image Stabilization

All assignments may not be compulsory, depending on the ability of the students

Outline.doc

Page 15: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

15

Reading Assignment 1

Chapter 1Section 2.5http://www.cs.ucf.edu/~vision/projects/projects.htmlhttp://wearcam.orghttp://wearcam.org/research.html

Digital Images

Images are essentially large matricesGray scale images are 2D matricesColor images have 3 layers (R, G, B)Each cell of the matrix represents a pixelEach pixel is quantized to a set of values, e.g. from 0-255 (1 Byte per pixel)

Page 16: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

16

PBM, PGM, PPM Format

Portable Bit Map FormatsSimple Image header in ASCIIImage data in either ASCII or Binary.PGM for Grayscale images.PPM for Color (RGB) images

Programming Assignment 0

Write Program to Read/Write BINARY .ppm & .pgm filesData will be available through course website

Page 17: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

17

Header

P2

# Created by IrfanView

128 128

255

47

55 58 58 58 58 58 56 56 55 54 53 52

Magic Numbers for format identificationP2 - ASCII PGM P3 – ASCII PPMP5 – Binary PGMP6 – Binary PPM

Page 18: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

18

Tasks

Implement both in MATLAB and C/C++MATLAB Introduction

http://math.ucsd.edu/~driver/21d-s99/matlab-primer.html

Tasks

Write functionsReadBinPGM, WriteBinPGM, ReadBinPPM, WriteBinPPM

Input: FileNameOutput: Matrix array containing image data

Page 19: Course Status - Lahore University of Management Sciencessuraj.lums.edu.pk/~cs436a02/Lecture 1 handout.pdf · 2 Goals Basic Goal: Generate excitement about CV Introductory theory &

19

Tasks

1. Read in .pgm image file. Create negative of the image. Save it as another .pgm file

2. Read in a .ppm file. Switch color planes R↔G. Save it as another .ppmfile

3. Read in a .ppm file. Save its R, G, B components as separate .pgm files

Due Date

Tuesday 10th SeptemberCode printout of functionsSample output

MATLAB help?