13
Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Embed Size (px)

Citation preview

Page 1: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Object Recognition a Machine Translation

Learning a Lexicon for a Fixed Image Vocabulary

Miriam Miklofsky

Page 2: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Lexicons

A vocabulary of terms used in a subjectA specialized list of terms

Devices that predict one representation given another representation

Page 3: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Dataset

Aligned bitext Annotated images Images with regions Unknown which region of image goes

with which word from text

Page 4: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

EM

Page 5: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Clustering

K means clustering Vector quantize the image region

representation

Kullback-Leibler divergence Relative entropy Measure of difference of two

probability distributions over the same event space

Page 6: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Evaluation

Auto annotate images Quantize regions Use lexicon to determine word Annotate image with word

Page 7: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Results - Annotation

Base results 80 words of 371 word vocabulary

could be predicted

Retraining Similar results but some words with

higher recall and precision

Page 8: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Results(cont.)

Null probability Recall decreases Precision increases

Clustering of like words Recall values of clusters higher than

for single words

Page 9: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Results -Correspondence

Base results Some good words up to 70% correct

prediction

Null prediction Predict good words with greater

probability

Word clustering Prediction rate generally increases

Page 10: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

Evaluation

Human evaluation Images viewed by hand Somewhat subjective

Page 11: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky
Page 12: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

EM (cont.)

Page 13: Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky

KL Divergence