Computational Imaging

Embed Size (px)

Citation preview

  • 7/27/2019 Computational Imaging

    1/51

    MIT Media Lab

    Camera Culture

    Ramesh Raskar

    Jack Tumblin

    Computational Photography:

    Advanced Topics

    Paul Debevec

  • 7/27/2019 Computational Imaging

    2/51

    Speaker: Jack Tumblin

    Associate Professor of Computer Science at Northwestern Univ.

    His LookLab group pursues research on new methods to capture

    and manipulate the appearance of objects and surroundings, in the

    hope that hybrid optical/computer methods may give us new ways

    to see, explore, and interact with objects and people anywhere in

    the world. During his doctoral studies at Georgia Tech and post-docat Cornell, he investigated tone-mapping methods to depict high-

    contrast scenes. His MS in Electrical Engineering (December 1990)

    and BSEE (1978), also from Georgia Tech, bracketed his work as

    co-founder of IVEX Corp., (>45 people as of 1990) where his flight

    simulator design work was granted 5 US Patents. He was an

    Associate Editor of ACM Transactions on Graphics (2000-2006), a

    member of the SIGGRAPH Papers Committee (2003, 2004), and in2001 was a Guest Editor of IEEE Computer Graphics and

    Applications.

    http://www.cs.northwestern.edu/~jet

  • 7/27/2019 Computational Imaging

    3/51

    Speaker: Paul Debevec

    Research Associate Professor ,

    University of Southern California and the

    Associate director of Graphics Research,

    USC's Institute for Creative Technologies.

    Debevec's Ph.D. thesis (UC Berkeley, 1996) presented Faade,

    an image-based modeling and rendering system for creating

    photoreal architectural models from photographs. Pioneer in

    high dynamic range photography, he demonstrated new image-

    based lighting techniques in his films Rendering with Natural

    Light (1998), Fiat Lux (1999), and The Parthenon (2004); he also

    led the design of HDR Shop, the first high dynamic range image

    editing program. At USC ICT, Debevec has led the developmentof a series of Light Stage devices used in Spider Man 2 and

    Superman Returns. He is the recipient of ACM SIGGRAPH's first

    Significant New Researcher Award and a co-author of the 2005

    book High Dynamic Range Imaging from Morgan Kaufmann.

    http://www.debevec.org

  • 7/27/2019 Computational Imaging

    4/51

    Speaker: Ramesh Raskar

    Associate Professor, MIT Media Lab.

    Previously at MERL as a Senior Research Scientist.

    His research interests include projector-based graphics,

    computational photography and non-photorealistic rendering.

    He has published several articles on imaging and photographyincluding multi-flash photography for depth edge detection,

    image fusion, gradient-domain imaging and projector-camera

    systems. His papers have appeared in SIGGRAPH,

    EuroGraphics, IEEE Visualization, CVPR and many other

    graphics and vision conferences. He was a course organizer at

    Siggraph 2002 through 2005. He was the panel organizer at

    the Symposium on Computational Photography and Video inCambridge, MA in May 2005 and taught a graduate level class

    on Computational Photography at Northeastern University, Fall

    2005. He is a member of the ACM and IEEE.

    http://raskar.info

    http://www.media.mit.edu/~raskar

  • 7/27/2019 Computational Imaging

    5/51

    Overview

    Unlocking Photography

    Not about the equipment but about the goal

    Capturing machine readable visual experience

    Goes beyond what you can see through the viewfinder

    Push the envelope with seemingly peripheral techniques and advances

    Think beyond post-capture image processing Computation well before image processing and editing

    Learn how to build your own camera-toys

    Emphasis

    Most recent work in graphics/vision (2006 and later)

    Research in other fields: Applied optics, novel sensors, materials Review of 50+ recent papers and projects

    What we will not cover

    Minimum discussion of graphics/vision papers before 2006

    Epsilon photography (improving camera performance by bracketing)

    Film Cameras, Novel view rendering (IBR), Color issues, Traditional imageprocessing/editing

  • 7/27/2019 Computational Imaging

    6/51

    Traditional Photography

    Courtesy: Shree Nayar

    Lens

    Detector

    Pixels

    Image

  • 7/27/2019 Computational Imaging

    7/51

    Traditional Photography

    Lens

    Detector

    Pixels

    Image

    Mimics Human Eye for a Single Snapshot:Single View, Single Instant, FixedDynamic range and Depth of fieldfor given Illumination in a Static

    world

  • 7/27/2019 Computational Imaging

    8/51

    Computational Photography:Optics, Sensors and Computations

    GeneralizedSensor

    GeneralizedOpticsComputations

    Picture

    4D Ray Bender

    Upto 4D

    Ray Sampler

    Ray Reconstruction

    Merged braketed photos, Coded sensing

  • 7/27/2019 Computational Imaging

    9/51

    Computational Photography

    Novel CamerasGeneralized

    Sensor

    Generalized

    OpticsProcessing

  • 7/27/2019 Computational Imaging

    10/51

    Computational Photography Novel Illumination

    Novel CamerasGeneralized

    Sensor

    Generalized

    OpticsProcessing

    Light Sources

  • 7/27/2019 Computational Imaging

    11/51

    Computational Photography Novel Illumination

    Novel Cameras

    Scene: 8D Ray Modulator

    Generalized

    Sensor

    Generalized

    OpticsProcessing

    Light Sources

  • 7/27/2019 Computational Imaging

    12/51

    Computational Photography Novel Illumination

    Novel Cameras

    Scene: 8D Ray Modulator

    Display

    Generalized

    Sensor

    Generalized

    OpticsProcessing

    Recreate 4D Lightfield

    Light Sources

  • 7/27/2019 Computational Imaging

    13/51

    Computational Photography Novel Illumination

    Novel Cameras

    Scene: 8D Ray Modulator

    Display

    Generalized

    Sensor

    Generalized

    OpticsProcessing

    4D Ray BenderUpto 4D

    Ray Sampler

    RayReconstruction

    GeneralizedOptics

    Recreate 4D Lightfield

    Light Sources

    Modulators

    4D Incident Lighting

    4D Light Field

  • 7/27/2019 Computational Imaging

    14/51

    What is Computational Photography?

    Create photo that could not have been taken by a traditional Camera (?)

    Goal: Record a richer, multi-layered visual experience

    1. Overcome limitations of todays cameras

    2. Support better post-capture processing

    Relightable photos, Focus/Depth of field, Fg/Bg, Shape boundaries

    3. Enables new classes of recording the visual signal

    Moment [Cohen05], Time-lapse, Unwrap mosaics, Cut-views

    4. Synthesize impossible photos

    Wrap-around views [Rademacher and Bishop 1998]), fusion of time-lapsed events [Raskar et al 2004],

    motion magnification [Liu et al 2005]), video textures and panoramas [Agarwala et al 2005].

    5. Exploit previously exotic forms of scientific imaging

    Coded aperture [Veeraraghavan 2007, Levin 2007], confocal imaging [Levoy 2004], tomography[Trifonov 2006]

  • 7/27/2019 Computational Imaging

    15/51

    Computational Photography

    1. Epsilon Photography Low-level vision: Pixels Multi-photos by perturbing camera parameters HDR, panorama, Ultimate camera

    2. Coded Photography Single/few snapshot Reversible encoding of data Additional sensors/optics/illum Scene analysis: (Consumer software?)

    3. Essence Photography Beyond single view/illum Not mimic human eye New art form

  • 7/27/2019 Computational Imaging

    16/51

    Epsilon Photography

    Dynamic range Exposure bracketing [Mann-Picard, Debevec]

    Wider FoV

    Stitching a panorama Depth of field

    Fusion of photos with limited DoF [Agrawala04]

    Noise Flash/no-flash image pairs

    Frame rate

    Triggering multiple cameras [Wilburn04]

  • 7/27/2019 Computational Imaging

    17/51

    Goal: High Dynamic Range

    Short Exposure

    Long Exposure

    Dynamic Range

  • 7/27/2019 Computational Imaging

    18/51

    Epsilon Photography

    Dynamic range Exposure braketing [Mann-Picard, Debevec]

    Wider FoV

    Stitching a panorama

    Depth of field

    Fusion of photos with limited DoF [Agrawala04]

    Noise

    Flash/no-flash image pairs [Petschnigg04, Eisemann04]

    Frame rate

    Triggering multiple cameras [Wilburn05, Shechtman02]

  • 7/27/2019 Computational Imaging

    19/51

    Computational Photography

    1. Epsilon Photography Low-level Vision: Pixels Multiphotos by perturbing camera parameters HDR, panorama Ultimate camera

    2. Coded Photography Mid-Level Cues:

    Regions, Edges, Motion, Direct/global

    Single/few snapshot Reversible encoding of data

    Additional sensors/optics/illum Scene analysis

    3. Essence Photography Not mimic human eye Beyond single view/illum

    New artform

  • 7/27/2019 Computational Imaging

    20/51

    3D

    Stereo of multiple cameras

    Higher dimensional LF

    Light Field Capture

    lenslet array [Adelson92, Ng05], 3D lens[Georgiev05], heterodyne masks[Veeraraghavan07]

    Boundaries and Regions

    Multi-flash camera with shadows [Raskar08]

    Fg/bg matting [Chuang01,Sun06]

    Deblurring

    Engineered PSF

    Motion: Flutter shutter[Raskar06], Camera Motion [Levin08]

    Defocus: Coded aperture [Veeraraghavan07,Levin07], Wavefront coding[Cathey95]

    Global vs direct illumination

    High frequency illumination [Nayar06]

    Glare decomposition [Talvala07, Raskar08]

    Coded Sensor

    Gradient camera [Tumblin05]

  • 7/27/2019 Computational Imaging

    21/51Marc Levoy

    Digital Refocusing usingLight Field Camera

    125 square-sided microlenses[Ng et al 2005]

  • 7/27/2019 Computational Imaging

    22/51

    3D

    Stereo of multiple cameras

    Higher dimensional LF

    Light Field Capture

    lenslet array [Adelson92, Ng05], 3D lens[Georgiev05], heterodyne masks[Veeraraghavan07]

    Boundaries and Regions

    Multi-flash camera with shadows [Raskar08]

    Fg/bg matting [Chuang01,Sun06]

    Deblurring

    Engineered PSF

    Motion: Flutter shutter[Raskar06], Camera Motion [Levin08]

    Defocus: Coded aperture [Veeraraghavan07,Levin07], Wavefront coding[Cathey95]

    Global vs direct illumination

    High frequency illumination [Nayar06]

    Glare decomposition [Talvala07, Raskar08]

    Coded Sensor

    Gradient camera [Tumblin05]

  • 7/27/2019 Computational Imaging

    23/51

    Depth

    Edges

    Left Top Right Bottom

    Depth EdgesCanny Edges

  • 7/27/2019 Computational Imaging

    24/51

    3D

    Stereo of multiple cameras

    Higher dimensional LF

    Light Field Capture

    lenslet array [Adelson92, Ng05], 3D lens[Georgiev05], heterodyne masks[Veeraraghavan07]

    Boundaries and Regions

    Multi-flash camera with shadows [Raskar08]

    Fg/bg matting [Chuang01,Sun06]

    Deblurring

    Engineered PSF

    Motion: Flutter shutter[Raskar06], Camera Motion [Levin08]

    Defocus: Coded aperture [Veeraraghavan07,Levin07], Wavefront coding

    [Cathey95]

    Global vs direct illumination

    High frequency illumination [Nayar06]

    Glare decomposition [Talvala07, Raskar08]

    Coded Sensor

    Gradient camera [Tumblin05]

  • 7/27/2019 Computational Imaging

    25/51

    Flutter Shutter CameraRaskar, Agrawal, Tumblin

    [Siggraph2006]

    LCD opacity switched

    in coded sequence

    C d d

  • 7/27/2019 Computational Imaging

    26/51

    Traditional

    CodedExposu

    re

    Image ofStaticObject

    DeblurredImage

    DeblurredImage

  • 7/27/2019 Computational Imaging

    27/51

    3D

    Stereo of multiple cameras

    Higher dimensional LF

    Light Field Capture

    lenslet array [Adelson92, Ng05], 3D lens[Georgiev05], heterodyne masks[Veeraraghavan07]

    Boundaries and Regions

    Multi-flash camera with shadows [Raskar08]

    Fg/bg matting [Chuang01,Sun06]

    Deblurring

    Engineered PSF

    Motion: Flutter shutter[Raskar06], Camera Motion [Levin08]

    Defocus: Coded aperture [Veeraraghavan07,Levin07], Wavefront coding

    [Cathey95]

    Decomposition Problems

    High frequency illumination, Global/direct illumination [Nayar06]

    Glare decomposition [Talvala07, Raskar08]

    Coded Sensor

    Gradient camera [Tumblin05]

  • 7/27/2019 Computational Imaging

    28/51

    "Fast Separation of Direct and Global Components of aScene using High Frequency Illumination,"S.K. Nayar, G. Krishnan, M. D. Grossberg, R. Raskar,ACM Trans. on Graphics (also Proc. of ACM SIGGRAPH),Jul, 2006.

    S ti R fl t C t ith

  • 7/27/2019 Computational Imaging

    29/51

    normal imagecross-polarized

    subsurfacecomponent

    polarization difference(primarily)

    specular component

    Separating Reflectance Components withPolarization-Difference Imaging

  • 7/27/2019 Computational Imaging

    30/51

    Computational Photography

    1. Epsilon Photography Multiphotos by varying camera parameters HDR, panorama Ultimate camera: (Photo-editor)

    2. Coded Photography

    Single/few snapshot Reversible encoding of data Additional sensors/optics/illum Scene analysis: (Next software?)

    3. Essence Photography High-level understanding

    Not mimic human eye Beyond single view/illum

    New artform

  • 7/27/2019 Computational Imaging

    31/51

  • 7/27/2019 Computational Imaging

    32/51

    Blind Camera

    Sascha Pohflepp,

    U of the Art, Berlin, 2006

  • 7/27/2019 Computational Imaging

    33/51

    Capturing the Essence of Visual Experience

    Exploiting online collections Photo-tourism [Snavely2006] Scene Completion [Hays2007]

    Multi-perspective Images Multi-linear Perspective [Jingyi Yu, McMillan 2004]

    Unwrap Mosaics [Rav-Acha et al 2008] Video texture panoramas [Agrawal et al 2005]

    Non-photorealistic synthesis Motion magnification [Liu05]

    Image Priors Learned features and natural statistics Face Swapping: [Bitouk et al 2008]

    Data-driven enhancement of facial attractiveness [Leyvand et al 2008]

    Deblurring [Fergus et al 2006, 2008 papers

  • 7/27/2019 Computational Imaging

    34/51

    Scene Completion Using Millions of PhotographsHays and Efros, Siggraph 2007

  • 7/27/2019 Computational Imaging

    35/51

    Capturing the Essence of Visual Experience

    Exploiting online collections Photo-tourism [Snavely2006] Scene Completion [Hays2007]

    Multi-perspective Images Multi-linear Perspective [Jingyi Yu, McMillan 2004]

    Unwrap Mosaics [Rav-Acha et al 2008] Video texture panoramas [Agrawal et al 2005]

    Non-photorealistic synthesis Motion magnification [Liu05]

    Image Priors Learned features and natural statistics Face Swapping: [Bitouk et al 2008]

    Data-driven enhancement of facial attractiveness [Leyvand et al 2008]

    Deblurring [Fergus et al 2006, 2008 papers

  • 7/27/2019 Computational Imaging

    36/51

    Andrew Davidhazy

    Unwrap Mosaics + Video Editing

  • 7/27/2019 Computational Imaging

    37/51

    Unwrap Mosaics + Video Editing

    Rav-Acha et al

    Siggraph 2008

  • 7/27/2019 Computational Imaging

    38/51

    Capturing the Essence of Visual Experience

    Exploiting online collections Photo-tourism [Snavely2006] Scene Completion [Hays2007]

    Multi-perspective Images Multi-linear Perspective [Jingyi Yu, McMillan 2004]

    Unwrap Mosaics [Rav-Acha et al 2008] Video texture panoramas [Agrawal et al 2005]

    Non-photorealistic synthesis Motion magnification [Liu05]

    Image Priors Learned features and natural statistics Face Swapping: [Bitouk et al 2008]

    Data-driven enhancement of facial attractiveness [Leyvand et al 2008]

    Deblurring [Fergus et al 2006, 2008 papers

  • 7/27/2019 Computational Imaging

    39/51

    Motion Magnification

    Liu, Torralba, Freeman, Durand, Adelson Siggraph 2005

  • 7/27/2019 Computational Imaging

    40/51

    Motion Magnification

    Liu, Torralba, Freeman, Durand, Adelson Siggraph 2005

  • 7/27/2019 Computational Imaging

    41/51

    Motion Magnification

    Liu, Torralba, Freeman, Durand, Adelson Siggraph 2005

  • 7/27/2019 Computational Imaging

    42/51

    Capturing the Essence of Visual Experience

    Exploiting online collections Photo-tourism [Snavely2006] Scene Completion [Hays2007]

    Multi-perspective Images Multi-linear Perspective [Jingyi Yu, McMillan 2004]

    Unwrap Mosaics [Rav-Acha et al 2008] Video texture panoramas [Agrawal et al 2005]

    Non-photorealistic synthesis Motion magnification [Liu05]

    Image Priors Learned features and natural statistics Face Swapping: [Bitouk et al 2008]

    Data-driven enhancement of facial attractiveness [Leyvand et al 2008]

    Deblurring [Fergus et al 2006, 2007-2008 papers]

  • 7/27/2019 Computational Imaging

    43/51

    Face Swapping

    Find Candidateface in DB andalign

    Tune pose,lighting, color

    and blend

    Keep result withoptimizedmatching cost

    [Bitouk et al 2008]

  • 7/27/2019 Computational Imaging

    44/51

    Computational Photography

    1. Epsilon Photography

    Low-level vision: Pixels Multi-photos by perturbing camera parameters HDR, panorama, Ultimate camera

    2. Coded Photography Mid-Level Cues:

    Regions, Edges, Motion, Direct/global

    Single/few snapshot Reversible encoding of data

    Additional sensors/optics/illum

    Scene analysis

    3. Essence Photography High-level understanding

    Not mimic human eye Beyond single view/illum

    New artform

  • 7/27/2019 Computational Imaging

    45/51

    Submit your questions ..

    1. Today

    What makes photography hard?

    What moments you are not able to capture?

    2. Future

    What do you expect in a camera orphoto-software you buy in 2020?

    Please submit by break at 3:30pm

    Panel Discussion at 5:10pm

    Siggraph 2006

  • 7/27/2019 Computational Imaging

    46/51

    Siggraph 200616 Computational Photography Papers

    Hybrid Images

    Oliva et al (MIT)

    Drag-and-Drop Pasting

    Jia et al (MSRA)

    Two-scale Tone Management forPhotographic Look

    Bae et al (MIT)

    Interactive Local Adjustment of Tonal Values

    Lischinski et al (Tel Aviv)

    Image-Based Material Editing

    Khan et al (Florida)

    Flash Matting Sun et al (Microsoft Research Asia)

    Natural Video Matting using Camera Arrays

    Joshi et al (UCSD / MERL)

    Removing Camera Shake From a Single Photograph

    Fergus (MIT)

    Coded Exposure Photography: Motion Deblurring

    Raskar et al (MERL)

    Photo Tourism: Exploring Photo Collections in 3D

    Snavely et al (Washington)

    AutoCollage

    Rother et al (Microsoft Research Cambridge)

    Photographing Long Scenes WithMulti-Viewpoint Panoramas

    Agarwala et al (University of Washington)

    Projection Defocus Analysis for Scene Capture andImage Display

    Zhang et al (Columbia University)

    Multiview Radial Catadioptric Imaging for Scene Capture Kuthirummal et al (Columbia University)

    Light Field Microscopy (Project)

    Levoy et al (Stanford University)

    Fast Separation of Direct and Global Components ofa Scene Using High Frequency Illumination

    Nayar et al (Columbia University)

  • 7/27/2019 Computational Imaging

    47/51

    Siggraph 2007

    19 Computational Photography Papers Image Analysis & Enhancement

    Image Deblurring with Blurred/Noisy Image Pairs

    Photo Clip Art

    Scene Completion Using Millions of Photographs Image Slicing & Stretching

    Soft Scissors: An Interactive Tool for Realtime High Quality Matting

    Seam Carving for Content-Aware Image Resizing

    Image Vectorization Using Optimized Gradient Meshes

    Detail-Preserving Shape Deformation in Image Editing

    Light Field & High-Dynamic-Range Imaging Veiling Glare in High-Dynamic-Range Imaging

    Ldr2Hdr: On-the-Fly Reverse Tone Mapping of Legacy Video and Photographs Appearance Capture & Editing

    Multiscale Shape and Detail Enhancement from Multi-light Image Collections Computational Cameras

    Active Refocusing of Images and Videos

    Multi-Aperture Photography

    Dappled Photography: Mask-Enhanced Cameras for Heterodyned Light Fields and Coded ApertureRefocusing

    Image and Depth from a Conventional Camera with a Coded Aperture Big Images

    Capturing and Viewing Gigapixel Images

    Efficient Gradient-Domain Compositing Using Quadtrees Video Processing

    Factored Time-Lapse Video

    Computational Time-Lapse Video (project page)

    Real-Time Edge-Aware Image Processing With the Bilateral Grid

  • 7/27/2019 Computational Imaging

    48/51

    Siggraph 2008

    19 Computational Photography Papers Computational Photography & Display

    Programmable Aperture Photography: Multiplexed Light Field Acquisition

    Glare Aware Photography: 4D Ray Sampling for Reducing Glare Effects of Camera Lenses

    Light-Field Transfer: Global Illumination Between Real and Synthetic Objects

    Deblurring & Dehazing

    Motion Invariant Photography

    Single Image Dehazing

    High-Quality Motion Deblurring From a Single Image

    Progressive Inter-scale and intra-scale Non-blind Image Deconvolution Faces & Reflectance

    Data-driven enhancement of facial attractiveness

    Face Swapping: Automatic Face Replacement in Photographs (Project)

    AppProp: All-Pairs Appearance-Space Edit Propagation

    Image Collections & Video

    Factoring Repeated Content Within and Among Images

    Finding Paths through the World's Photos

    Improved Seam Carving for Video Retargeting (Project)

    Unwrap Mosaics: A new representation for video editing (Project)

    Perception & Hallucination

    A Perceptually Validated Model for Surface Depth Hallucination

    A Perception-based Color Space for Illumination-invariant Image Processing

    Self-Animating Images: Illusory Motion Using Repeated Asymmetric Patterns

    Tone & Color

    Edge-preserving decompositions for multi-scale tone and detail manipulation

    Light Mixture Estimation for Spatially Varying White Balance

  • 7/27/2019 Computational Imaging

    49/51

    Ramesh Raskar andJack Tumblin

    Book Publishers: A K Peters

    Siggraph 2008 booth: 20% off

    Booth #821

    A ti l

  • 7/27/2019 Computational Imaging

    50/51

    More .. Articles IEEE Computer,

    August 2006 Special Issue

    Bimber, Nayar, Levoy, Debevec, Cohen/Szeliski

    IEEE CG&A, March 2007 Special issue

    Durand and Szeliski

    Science News cover story April 2007

    Featuring : Levoy, Nayar, Georgiev, Debevec American Scientist

    February 2008

    Siggraph 2008 19 papers

    HDRI, Mon/Tue 8:30am Principles of Appearance Acquisition and Representation

    Bilateral Filter course, Fri 8:30am

    Other courses .. (Citizen Journalism, Wedn 1:45pm)

    First International Conf on Comp Photo, April 2009

    Athale, Durand, Nayar (Papers due Oct 3nd)

    Class: Computational Photography Advanced Topics

  • 7/27/2019 Computational Imaging

    51/51

    Module 1: 105 minutes

    1:45: A.1 Introduction and Overview (Raskar, 15 minutes)

    2:00: A.2 Concepts in Computational Photography (Tumblin, 15 minutes)

    2:15: A.3 Optics: Computable Extensions (Raskar, 30 minutes)

    2:45: A.4 Sensor Innovations (Tumblin, 30 minutes)

    3:15: Q & A (15 minutes)

    3:30: Break: 15 minutes

    Module 2: 105 minutes

    3:45: B.1 Illumination As Computing (Debevec, 25 minutes)4:10: B.2 Scene and Performance Capture (Debevec, 20 minutes)

    4:30: B.3 Image Aggregation & Sensible Extensions (Tumblin, 20 minutes)

    4:50: B.4 Community and Social Impact (Raskar, 20 minutes)

    5:10: B.4 Panel discussion (All, 20 minutes)

    Class: Computational Photography, Advanced Topics

    Debevec, Raskar and Tumblin