Proceedings of the IEEE 2010 Antonio Torralba, MIT Jenny Yuen, MIT Bryan C. Russell, MIT

LabelMe: Online Image Annotation and Applications


Outline

Introduction

Web Annotation and Data Statistics
-A. Data Set Evolution and Distribution of Objects
-B. Study of Online Labelers

The Space of LabelMe Images
-A. Distribution of Scene Types
-B. The Space of Images
-C. Recognition by Scene Alignment

Beyond 2-D Images
-A. From Annotations to 3-D
-B. Video Annotation

Conclusion

Introduction

From small data sets to large data sets
In 2005, the online tool LabelMe was created.
LabelMe provides functionality for drawing polygons to outline the spatial extent of objects in images.
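To make the idea of a polygon annotation concrete, here is a minimal sketch of how one labeled polygon might be represented. The class name `PolygonAnnotation` and its fields are hypothetical, chosen for illustration only; this is not LabelMe's actual storage format.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class PolygonAnnotation:
    """One labeled object outline (hypothetical structure, not LabelMe's schema)."""
    label: str                          # free-text object description, e.g. "car"
    points: List[Tuple[float, float]]   # polygon vertices in image coordinates

    def area(self) -> float:
        """Polygon area in square pixels, computed with the shoelace formula."""
        total = 0.0
        n = len(self.points)
        for i in range(n):
            x1, y1 = self.points[i]
            x2, y2 = self.points[(i + 1) % n]
            total += x1 * y2 - x2 * y1
        return abs(total) / 2.0

    def bounding_box(self) -> Tuple[float, float, float, float]:
        """Axis-aligned box (x_min, y_min, x_max, y_max) around the polygon."""
        xs = [p[0] for p in self.points]
        ys = [p[1] for p in self.points]
        return min(xs), min(ys), max(xs), max(ys)
```

For example, `PolygonAnnotation("car", [(0, 0), (4, 0), (4, 3), (0, 3)])` describes a 4 by 3 pixel rectangle labeled "car".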

Web Annotation and Data Statistics
-A. Data Set Evolution and Distribution of Objects
-B. Study of Online Labelers

The Features of LabelMe Database
- Object class recognition
- Learning about objects embedded in a scene
- High-quality labeling
- Many diverse object classes
- Many diverse images
- Many noncopyrighted images
- Open and dynamic

Data Set Evolution and Distribution of Objects (1/2)

(a) Number of annotated objects
(b) Number of images with at least one annotated object
(c) Number of unique object descriptions

Data Set Evolution and Distribution of Objects (2/2)

The observation suggests two learning problems:
1) Learning from few training samples (N -> 1)
2) Learning with millions of samples (N -> ∞)

Study of Online Labelers

From July 7, 2008 to March 19, 2009

(a) Number of new annotations provided by individual users
(b) Distribution of the length of time it takes to label an object

The Space of LabelMe Images
-A. Distribution of Scene Types
-B. The Space of Images
-C. Recognition by Scene Alignment

Distribution of Scene Types (1/1)

Let’s start from cognitive psychology.
Next, we study how many configurations of 4 objects are present.
The distribution follows a power law.

(n = 1, 2, 4, 8)
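One common way to check a power-law claim is to fit a straight line to the rank-frequency data in log-log space. The sketch below does this with ordinary least squares. The function name `fit_power_law` and the rank-frequency setup are assumptions made for illustration; the slides do not say how the authors fitted their data.

```python
import math


def fit_power_law(counts):
    """Fit count ~ c * rank**(-alpha) by least squares in log-log space.

    `counts` holds one positive count per scene configuration (hypothetical
    input). Needs at least two positive counts. Returns (alpha, c).
    """
    ranked = sorted((c for c in counts if c > 0), reverse=True)
    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(c) for c in ranked]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope, math.exp(intercept)
```

A roughly straight line on the log-log plot, that is, a stable `alpha`, is consistent with a power law. A log-log regression is only a quick diagnostic, not a rigorous test.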

The Space of Images (1/3)

Define “Semantic Distance”:
1) Assign each pixel to a single object category
2) Divide the image into N × N nonoverlapping windows and build a histogram for each window
3) Use spatial pyramid matching over object labels
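The three steps above can be sketched in a simplified form. This version uses a single grid level with histogram intersection. Full spatial pyramid matching combines several grid resolutions with level weights, which is omitted here. The function names and the nested-list label map are assumptions for illustration, not the authors' implementation.

```python
from collections import Counter


def window_histograms(label_map, n):
    """Split a 2-D grid of per-pixel labels into n x n windows and return
    one normalized label histogram (a dict) per window."""
    rows, cols = len(label_map), len(label_map[0])
    hists = []
    for i in range(n):
        r0, r1 = i * rows // n, (i + 1) * rows // n
        for j in range(n):
            c0, c1 = j * cols // n, (j + 1) * cols // n
            counts = Counter(
                label_map[r][c] for r in range(r0, r1) for c in range(c0, c1)
            )
            total = sum(counts.values())
            hists.append({k: v / total for k, v in counts.items()} if total else {})
    return hists


def semantic_distance(map_a, map_b, n=2):
    """1 minus the mean histogram intersection over corresponding windows.

    0.0 means identical label layouts at this grid resolution; 1.0 means
    no window shares any label.
    """
    hists_a = window_histograms(map_a, n)
    hists_b = window_histograms(map_b, n)
    score = 0.0
    for a, b in zip(hists_a, hists_b):
        score += sum(min(a[k], b.get(k, 0.0)) for k in a)
    return 1.0 - score / (n * n)
```

Both label maps are assumed to have the same size, with one category name per pixel.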

Process of Defining Semantic Distance (2/3)

The Space of Images (3/3)

A visualization of 12201 images that are fully annotated

Recognition by Scene Alignment

Given a new image as input, we use the GIST descriptor to compute its distance to the images in the database

The Power of a Large Scale Database

An algorithm provides an upper bound: find the nearest neighbor of the input image and use it as a labeling of the input image.

This result gives us a hint about “How many more images do we need to label?”
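The nearest-neighbor idea can be sketched as a simple label transfer. The sketch assumes each database image already has a fixed-length descriptor vector; computing a GIST descriptor is not shown. The function names and the Euclidean distance are illustrative assumptions, not the authors' code.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length descriptor vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def transfer_labels(query_descriptor, database):
    """Return (labels, distance) of the database entry closest to the query.

    `database` is a list of (descriptor, labels) pairs, where `labels` is
    whatever annotation is stored for that image (hypothetical layout).
    """
    if not database:
        raise ValueError("database must contain at least one annotated image")
    best = min(database, key=lambda entry: euclidean(query_descriptor, entry[0]))
    return best[1], euclidean(query_descriptor, best[0])
```

The larger the annotated database, the more likely a close neighbor exists, which is why this kind of lookup benefits from more labeled images.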

Beyond 2-D Images
-A. From Annotations to 3-D
-B. Video Annotation

From Annotations to 3-D (1/7)

The object labels now carry some implicit information, which is recovered by analyzing the overlap between object boundaries.

Object types
- Ground objects
- Standing objects
- Attached objects

Relations between objects
- Supported-by
- Part-of

From Annotations to 3-D (2/7)

Learning the relationships between objects
1) part-of: evaluate the frequency of high relative overlap between polygons
2) supported-by: the bottom part of the object's polygon lies inside the supporting object
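Both cues can be approximated with a point-in-polygon test. The sketch below estimates relative overlap by grid sampling and checks whether the lowest vertices of one polygon fall inside another. The function names, the sampling step, and the 10% "bottom band" are assumptions for illustration; the slides do not give the thresholds the authors used.

```python
def point_in_polygon(x, y, polygon):
    """Ray-casting test: True if (x, y) lies inside the polygon."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge crosses the horizontal line through y
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def relative_overlap(inner, outer, step=1.0):
    """Fraction of `inner`'s area (sampled on a grid) that also lies in `outer`."""
    xs = [p[0] for p in inner]
    ys = [p[1] for p in inner]
    total = shared = 0
    y = min(ys) + step / 2
    while y < max(ys):
        x = min(xs) + step / 2
        while x < max(xs):
            if point_in_polygon(x, y, inner):
                total += 1
                if point_in_polygon(x, y, outer):
                    shared += 1
            x += step
        y += step
    return shared / total if total else 0.0


def bottom_inside(obj, support, band=0.1):
    """True if every vertex in the lowest `band` of `obj` is inside `support`.

    Image coordinates are assumed, so larger y means lower in the image.
    """
    ys = [p[1] for p in obj]
    threshold = max(ys) - band * (max(ys) - min(ys))
    bottom = [p for p in obj if p[1] >= threshold]
    return all(point_in_polygon(x, y, support) for x, y in bottom)
```

A part-of candidate would be a label pair whose `relative_overlap` is high in many images, and a supported-by candidate a pair for which `bottom_inside` often holds.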

From Annotations to 3-D (3/7)

From Annotations to 3-D (4/7)

Reconstructing a 3-D model for an input image
1) define object type
2) define polygon edge type
3) compute the real distance between objects

Object type
- Ground objects (green)
- Standing objects (red)
- Attached objects (yellow)

Edge type
- Contact (white)
- Attached (gray)
- Occlusion (black)

From Annotations to 3-D (6/7)

More labeling makes the quality better.
However, if the labeling goes wrong ...

Video Annotation (1/1)

Conclusion

LabelMe is a web-based tool that allows the labeling of objects and their location in images.

LabelMe has collected a large annotated database of images with many different scene and object classes.

LabelMe can recover the 3-D description of an image.

The next goal is expanding the database to video, which offers a promising direction for computer vision and computer graphics.

