Signals and Systems (18-396), Image and Video Processing (18

Signals and Systems (18-396), Image and Video Processing (18-798),

and Life Beyond…

Prof. Tsuhan Chentsuhan@cmu.edu

Signals and SystemsSignals and Systems

Estimation, Detection and Estimation, Detection and IdentIdent..

Digital Signal Processing IIDigital Signal Processing II

Image and Video ProcessingImage and Video Processing

Multimedia CommunicationMultimedia Communication

Applied Stochastic Proc.Applied Stochastic Proc.

Optical Image and Radar Proc.Optical Image and Radar Proc.

Pattern RecognitionPattern Recognition

Sample Courses in Signal Processing and Communication

Error Control CodingError Control Coding

Digital Signal Processing IDigital Signal Processing I

Dig. Comm. and Sig. Proc. Dig. Comm. and Sig. Proc. SystSyst. Design. Design

Fund. Comm. Sys.Fund. Comm. Sys.

Tsuhan Chen

– CD: 44.1 kHz × 16 bits × 2 channels = 1.411 Mbits/s

FrequencyBand (Hz)

SamplingRate (kHz)

Bits perSample

Raw Bitrate(kbits/s)

TelephoneSpeech

300~3400 8 8 64

WidebandSpeech

50~7000 16 8 128

MediumbandAudio

10~11000 24 16 384

WidebandAudio

10~22000 48 16 768

Digital Audio

Tsuhan Chen

MPEG-1 Audio• ISO/IEC 11172-3 (1988~1991)

– First high quality audio compression standard– Sampling rates: 32, 44.1, 48 kHz– CD quality two-channel audio at ~256 kbits/s

• CD: 44.1 kHz × 16 bits × 2 = 1.411 Mbits/s– YES, this is MP3!!!

• Quality demonstration– Stereo 44.1 kHz at 64 kbits/s– Stereo 44.1 kHz at 128 kbits/s– Stereo 44.1 kHz at 192 kbits/s– Stereo 44.1 kHz at 256 kbits/s

Tsuhan Chen

ImageRGB Color

R = 255G = 200B = 200

R = 150G = 170B = 253

R = 251G = 200B = 190

R = 124G = 110B = 123

R = 204G = 203B = 202

R = 151G = 140B = 139

R = 248G = 220B = 242

R = 190G = 170B = 90

R = 151G = 148B = 149

R = 244G = 222B = 214

R = 253G = 100B = 120

R= 230G=120B=234

R = 149G = 244B = 130

R = 159G = 149B = 150

R = 254G = 133B = 200

R = 0G = 0B = 0

Tsuhan Chen

Sample Images

Lena Pepper Baboon

512 × 512 × 3 bytes = 768KBWith JPEG, ~32KB

Tsuhan Chen

SamplingSpatial Subsampling

MSE = 2058MAE = 24CR = 16:1

MSE = 3924MAE = 36CR = 64:1

Original (256×256) (64×64) (32×32)Aliasing!!!

Tsuhan Chen

SamplingSpatial Subsampling w/Averaging

MSE =1010MAE =18CR=16:1

MSE =1643MAE =26CR=64:1

Original (256×256) (64×64) (32×32)

Tsuhan Chen

Quantization

MSE = 9670MAE = 78CR = 2:1

MSE = 10381MAE = 82CR = 4:1

Original (24bit) (12-bit) (6-bit)

Tsuhan Chen

Pixel or Pel

Sequence

…...

Frame or Picture

Tsuhan Chen

Video Data

• Video

Pels/line Lines Frames/s Bytes/pel Bit rate

Video Telephony(CIF)

352 288 10 1.5 12.2 Mbits/s

Broadcast TV(ITU-R 601 4:2:2)

720 480 30 2 166 Mbits/s

HDTV ~1280 ~720 60 2 885 Mbits/s

Computer Graphics

Tsuhan Chen

Face Animation

• Wire-frame mesh model with texture mapping

Computer Vision

Face Tracking

Use color information to segment target vs. non-target pixels

Use deformable template to track the target

Tsuhan Chen

Lip Tracking

• Use a Gaussian mixture with three Gaussians to model the color distribution of the mouth

• Template: two parabolas defined by λ = (a,b,c,d,e)

Hand Tracking

Tsuhan Chen

Eye Tracking

• Find the center of the darkest region in the search window normalization

thresholding

Finding the center

Tsuhan Chen

CMU-GM LabFace/Eye/Hand Tracking:. Driver-Vehicle Interfaces. Cognitive Overflow Study

Airbag Deployment ControlMirror/wheel/panel/seat adjustment

Driver ID and Encryption:Security, Safety, User Preference

GestureCam

Interview Video

Higher Dimensions?

Tsuhan Chen

(Vx,Vy,Vz)(Vx,Vy,Vz)

(θ,ψ)(θ,ψ)

7D Plenoptic Function

[Adelson’91]

),,,,,,( tVVVf zyx λψθ

Image-Based Rendering• Plenoptic Function [Adelson’91]

[McMillan’95]

• Lumigraph/Lightfield[Gortler/Grzeszczuk’96] [Levoy’96]

• Concentric Mosaics [Shum99]

Tsuhan Chen

Image-Based Rendering

Tsuhan Chen

EyeVision

[Kanade’01]

After CorrectionBefore Correction

4D IBR(incl. time)

Super Bowl XXXV

Tsuhan Chen

Self-Reconfigurable Camera Array

[Stanford] [MIT][Zhang and Chen, CMU]

Tsuhan Chen

Details– 48 webcams

• Embedded processors• Sensor network

– 2 step-motors each (translation and pan)– Real-time capturing/calibration/rendering

Tsuhan Chen

Ongoing: Mirror/Lens Array

This is lightfield/lumigraph!

Tsuhan Chen

Future: “Transparent Material”

Many applications…

Camera Array

3D Display

Information Retrieval(Pattern Recognition)

Tsuhan Chen

Hand-Drawn Sketch Retrieval

User sketches a query

QuerySketch

SimilarSketch

Page stored in Database

Tsuhan Chen

Retrieved Trademarks

Trademark Retrieval

Tsuhan Chen

Hand-Drawn Query

Retrieved Trademarks

Trademark Retrieval

Tsuhan Chen

3D Object Retrieval

Tsuhan Chen

Sketched 3D Query too…

Tsuhan Chen

3D Protein Structures too…

Tsuhan Chen

Summary

• Signals and Systems• Image and Video Processing• Computer Vision• Computer Graphics• Pattern Recognition• Information Retrieval

Signals and Systems (18-396), Image and Video Processing (18-798),

and Life Beyond…

Prof. Tsuhan Chentsuhan@cmu.edu

Signals and Systems (18-396), Image and Video Processing (18

Documents

ABN 68 124 425 396 Ground Floor, Ann Place 895 Ann Street Fortitude Valley … · 2008. 6. 18. · WHITEHAVEN COAL LIMITED ABN 68 124 425 396 Ground Floor, Ann Place 895 Ann Street

THE DELTA ACADEMY · 2016-10-18 · 1 THE DELTA ACADEMY Student Handbook 818 West Brooks Avenue North Las Vegas, NV 89030 P: (702) 396-2252 F:(702) 396-0848 2016-2017

396 395 396 - Waterbury, CT

The Prodigy Micromouse 296/396. Team Members/Assignments Dale Balsis (396) – Web Designer/Hardware Tyson Seto-Mook (396)– Project Supervisor Calvin Umeda

396 - smithbrothersfurniture.com

396 - sima.gov.sb

Superman 396 1963

Representation of Aperiodic Signals: the Continuous-Time ...homepage.ntu.edu.tw/.../103-2_ss04_CTFT_Problem_3... · NTUEE-SS4-CTFT-18 Outline Representation of Aperiodic Signals:

Q2 2014 IMPORT HAULAGE TARIFF - VALID FROM … 2014 Import... · ashby-de-la-zouch leicestershire 396 396 396 396 396 396 520 520 520 572 572 572 ashford kent 871 871 871 735 735

El 396: Simplicity

Lecture 18 Photonic Signals and Systems An Introduction By

396 3790Y1 - support.surefireag.com

4. Digital Floor Tiles (396 x 396 mm)

Catalogue Pr Me5700dthdthc 396

Part 396 - MoDOT

ITEM NO: 5.2 REPORT NUMBER: 396/18 COUNCIL ASSESSMENT …

396 TC December Newsletter

Chapter 18: Solutions - Berkeley · PDF fileNotes on the Thermodynamics of Solids J.W. Morris, Jr.; Fall, 2008 Page 396 Chapter 18: Solutions Chapter 18: Solutions

SouthBostonTODAY · 2018/1/18 · 2 SUTHBOSTONTDAY January 18 2018 PO Box 491 • South Boston, MA 02127 info@southbostontoday.com • ads@southbostontoday.com 396 West Broadway

Dragon #396