35
C.BYREGOWDA INSTITUTE OF TECHNOLOGY, KOLAR-563101 DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Technical Seminar on “ TagSense: Approach to automatic image taggingUnder the guidance of Presented by, Mr. Raja A Usha V N Asst. Professor 1CK10CS049 Dept.of CSE,

Tagsense ppt

Embed Size (px)

DESCRIPTION

TagSense : An approach to automatic image tagging

Citation preview

Page 1: Tagsense ppt

C.BYREGOWDA INSTITUTE OF TECHNOLOGY, KOLAR-563101

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Technical Seminar on

“ TagSense: Approach to automatic image tagging” Under the guidance of Presented by, Mr. Raja A Usha V N Asst. Professor 1CK10CS049 Dept.of CSE, CBIT, Kolar

Page 2: Tagsense ppt

ContentsBig PictureMotivationTagSenseProblem SpaceSystem ArchitectureDesign and ImplementationPerformance EvaluationLimitations/Future workConclusions

1

Page 3: Tagsense ppt

Big Picture: Application category

Built-in sensors in Smartphone : Accelerometer, Compass, Light sensor, Camera(Back-illuminated sensor), Microphone, GPS, Gyroscope, Gesture sensor, Barometer, Hygrometer, Thermometer, Magnetometer …

2

Page 4: Tagsense ppt

Motivation

Digital pictures are undergoing an explosionImage retrieval becomes crucial and they use tagsHuman tagging is accurate but slowImage based auto-tagging still has many constraintsHow to approximate the human tagging ability?

3

Page 5: Tagsense ppt

Smartphone the wildcardToday's Smartphone have powerful built-in sensorsPeople always carry their phones

4

Page 6: Tagsense ppt

Existing problem of auto-taggingAutomatic image tagging has improved by research in image processing and face recognition But,

cannot recognize individuals moving fastcan only identify individuals who have well-defined facial features

Picasa iphoto

5

Page 7: Tagsense ppt

TagSenseMain points of new automatic image tagging system

Better than image processing/face recognitionCreates tag including the people, activity and context in a picture

Tagsense: A Smartphone-based Approach to Automatic Image Tagging Leverages multiple sensing domains of Smartphone“Tag” Definition : keywords that describe the on-going scenario/event/occasion during which the picture was taken“Tag” Format :when-where-who-what 6

Page 8: Tagsense ppt

Problem Space

Sensing multiple dimensionsaccelerometer, compass, light sensor, camera, microphone, GPS, gyroscope

Basis for Comparison with iPhoto and Picasa good under bad lighting conditions

Because it does not depend on the physical features of a person’s face

TagSense generated the following tags :

November 21st afternoon, Nasher Museum, indoor, Romit, Sushma, Naveen, Souvik, Justin, Vijay, Xuan, standing, talking

7

Page 9: Tagsense ppt

System Architecture

People enter a common password of TagSense in respective phonesThis password acts as a shared session key, ensuring that sensed information is assimilated only from group members.

8

Page 10: Tagsense ppt

Example Scenario

Bob’s phone immediately broadcasts an active-sensor bacon, encrypted with the shared keyPhones in the group activate their respective sensorsOnce Bob clicks the picture, Bob’s camera sends a beacon with its local times-tamp and the phones record it 9

Page 11: Tagsense ppt

Example Scenario (contd…)

After a threshold time from the click, the phones deactivate their sensors, perform basic activity recognition on the sensed information, and send them back to Bob’s phone

Bob’s phone assimilates these per-person activities, and also infers some contextual information from its own sensors

10

Page 12: Tagsense ppt

PowerPoint TemplateSubtitle color

Example of a slide with a subheadSet the slide title in “title case”Set subheads in “sentence case”Generally set subhead to 36pt or smaller so it will fit on a single lineThe subhead color is defined for this template but must be selected. In PowerPoint 2007, it is the fourth font color from the left

Tag GenerationTag Generation

11

Page 13: Tagsense ppt

Design & ImplementationWho are in the picture?- includes only those in camera view 3 possible techniques enabled by multi-dimensional sensing

Accelerometer based motion signaturesComplementary compass directionsCorrelating visual and acceleration

12

Page 14: Tagsense ppt

Accelerometer based motion signature

People move into a specific posture during picture-click

Accelerometer based signature 13

Page 15: Tagsense ppt

Accelerometer based motion signature (contd..)

People inside the picture

The variance of accelerometer readings From 20pictures at different times and Locations

people outside the picture

picture 14

Page 16: Tagsense ppt

Complementary compass directionsPeople behave naturally when the picture is being taken

Complementary compass directions15

Page 17: Tagsense ppt

Complementary compass directions (contd..)

People in picture likely face cameraPersonal Compass Offset (PCO)

Use posing picture to calibrate PCO

16

Page 18: Tagsense ppt

Correlating visual and acceleration

People move actively like playing ping-ping, dancing, running

Correlating visual and acceleration17

Page 19: Tagsense ppt

Moving Subjects

TagSense matches the optical velocity with each of the phone’s accelerometer reading to identify the moving subjectsBasic idea1. Taking multiple snapshots from the camera2. Deriving the subject’s motion vector from these snapshots3. Correlating it to the accelerometer measurements recorded by different phone

18

Page 20: Tagsense ppt

Moving Subjects (contd..)

Extracting motion vectors of people from two successive snapshotsThe optical flow field showing the velocity of each pixelThe motion vectors form the two detected moving objects 19

Page 21: Tagsense ppt

Combining the oppurtunity

20

Page 22: Tagsense ppt

What are they doing ?Activity recognition with the aid of mobile phones has been an active area of research lately.Ex: SoundSense, Sensing Meets Mobile Social Networks The focus of this paper not on devising new activity recognition schemesSo, they start with a limited vocabulary of tags to represent a basic set of activities.

21

Page 23: Tagsense ppt

What are they doing (Contd..)Usage of Accelerometer Standing, Sitting, Walking, Jumping, Biking, Playing Clear signature from accelerometer Sitting Or Standing Accelerometer readings & location information walking, jumping, biking, playing

22

Page 24: Tagsense ppt

What are they doing (Contd..)Usage of Acoustic : Talking, Music, SilencePhoto + Audio Sample From acoustic sensor Easier to differentiate between two cases In TagSense prototype, it provide basic information regarding ambient sound when the picture is taken

23

Page 25: Tagsense ppt

Where is the picture takenLocation of a picture conveys semantic information about the picture It also enables location based photo search.GPS based location coordinates are suitable for these purposes.TagSense leverages mobile phone sensors and cloud services to approach these goals TagSense utilizes the light sensor on the camera phone to detect indoor/outdoor

24

Page 26: Tagsense ppt

Where is the picture taken (Contd..)The variation of light intensity measured at 400 different times across days and nights in outdoor and indoor environments.Feasible to compute light intensity thresholds Using the light intensity measurement (from the camera) during the picture-click And uses this information to tag the picture as “indoors” or “outdoors”.

25

Page 27: Tagsense ppt

Where is the picture taken (Contd..)Location + Phone Compasses combinationTo tag the backgrounds

California beach + Westward = Infer the ocean in the background

26

Page 28: Tagsense ppt

When is the picture taken?

27

Page 29: Tagsense ppt

AdvantagesEnvisioning an alternative opportunity towards automatic image tagging.Designing TagSense, an architecture for coordinating the mobile phone sensors, and processing the sensed information to tag images.

28

Page 30: Tagsense ppt

Limitations

TagSense does not generate captions and cannot tag pictures taken in the past.TagSense requires users to input a group password at the beginning of a photo session.Tag Sense vocabulary of tags is quite limited

29

Page 31: Tagsense ppt

Future Work

Combine with facial recognition, robust systemVideo-taggingAugmented Reality

30

Page 32: Tagsense ppt

ConclusionsTagSense leverages trend to automatically tag pictures with people and their activities.Mobile phones are Replacing traditional cameras. TagSense has somewhat lower precision and comparable fall-out but significantly higher recall than iPhoto/PicasaLimited vocabulary of tags to represent a basic set of activities like what they are doing.GPS-based location coordinates are used to tell where the picture is taken

31

Page 33: Tagsense ppt

References[1] “TagSense: Leveraging Smartphones for Automatic Image

Tagging”, IEEE TRANSACTIONS ON MOBILE COMPUTING, VOL. 13, NO. 1, JANUARY 2014

[2] H. Lu and et al, “SoundSense: scalable sound sensing for people centric applications on mobile phones,” in ACM MobiSys, 2009.

[3] A. Engstrom and et al., “Mobile collaborative live video mixing,”Mobile Multimedia Workshop (with MobileHCI), Sep 2008.

[4] M. Azizyan and et al., “Surround Sense: mobile phone localization via ambience fingerprinting,” in ACM MobiCom, 2009.

32

Page 34: Tagsense ppt

PowerPoint TemplateSubtitle color

Example of a slide with a subheadSet the slide title in “title case”Set subheads in “sentence case”Generally set subhead to 36pt or smaller so it will fit on a single lineThe subhead color is defined for this template but must be selected. In PowerPoint 2007, it is the fourth font color from the left

Page 35: Tagsense ppt

PowerPoint Template

Example of a slide with a subheadSet the slide title in “title case”Set subheads in “sentence case”Generally set subhead to 36pt or smaller so it will fit on a single lineThe subhead color is defined for this template but must be selected. In PowerPoint 2007, it is the fourth font color from the left