20
GTC 2017

PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

Embed Size (px)

Citation preview

Page 1: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

GTC 2017

Page 2: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

The Smartvid.io solution

OUR MISSION

we're unlocking the value of photos and videos to dramatically improve safety, quality and productivity in the AEC industry.

@

Page 3: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

An untapped resource

MEDIA FROM THE FIELD

The amount of pictures & videos captured every day in the field keeps getting bigger.

50 GB of data is generated on the typical project.

Much of it ends up unused, siloed across different systems and devices.

Page 4: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

How it works

Page 5: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of
Page 6: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

WE’RE USING MACHINE LEARNING TO AUTOMATICALLY IDENTIFY

“WHAT’S IN” CONSTRUCTION PHOTOS AND VIDEOS…

Page 7: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

The results

IMPACT

2016 Annual AI for Safety Photo Contest Typical Construction Project

# REVIEWED 15,000 photos

HUMAN EXPERT TIME 80 days

SMARTVID.IO TIME ~8 days

# REVIEWED 1,080 photos

HUMAN EXPERT TIME 4.5 hours

SMARTVID.IO TIME <10 minutes

Page 8: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

STRATEGY

Exponential Data Growth

Page 9: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

• Basic: Object recognition• Is object present in image, Yes/No?

• Example: Is there scaffolding in this picture? (Yes/No)

• How used: image search within and across projects for key imagery (e.g., find me scaffolding images b/c I’m looking at a bill for scaffolding and want to check it)

• Advanced: Object analytics and logic• Where are the objects? How many of them are there? What is

their volume? (Quantitative)

• Examples: Is each person wearing high vis safety gear? What is the location and volume of visual defects like cracks?

• How used: identifying and quantifying visual data • Safety (Hard hats, safety vests, more) , Quality (Cracks, more)

Our deep learning for…

IMAGE RECOGNITION

Page 10: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

EXAMPLE: ADVANCED IMAGE RECOGNITION FINDS PEOPLE (1) THEN DETERMINES IF

THEY ARE SAFE (2), THUS “FOCUSING” THE AI

Page 11: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

QUANTITATIVE DATA IS AVAILABLE FROM OUR COMPUTER VISION

LINEAL EXTENT OF CRACK INTEGRITY MEASURE

Page 12: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

And deep learning for…

SPEECH RECOGNITION

• Industry keywords automatically detected from speech in video

• Tags are linked to timeline of video for instant retrieval and easy sharing or collaboration

• How used

– Field worker narrates video using Smartvid.ioapp or native IOS or Android device

– Office user (manager) can search by keyword

– Example: see all installation of blocking, by location

Page 13: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

How it works

OUR TECHNOLOGY

Multiple AWS P2 instances for model training & runtime execution

Full spectrum deep learning for computer vision & speech

• 5-10+ instances at peak training

COMMODITYFind objects of interest

Locate & segment objects

PROPRIETARY

STATE OF THE ART

Multi-model & focal point approach

Quantify objects

Page 14: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

SYSTEMSARCHITECTURE

Page 15: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

IMAGE MLSTACK

Page 16: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

ML AT SCALE

• Gain access to data • Manage data access (ingestion) • Clean data• Manage data • Build data sets for training and evaluation

Page 17: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

MLINFRASTRUCTURE

Page 18: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

ALTERNATESYSTEMSARCHITECTURE

Page 19: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

CONCLUSION

• AEC industry is creating tremendous amounts of visual and audio data • Deep learning can unlock value for safety, quality, productivity • New techniques must be applied to handle complexity of imagery and

scale of data

Come by the Dell Booth to see Smartvid.io in action. Case studies available on cracks and hard hats at www.smartvid.io.

Josh Kanner, [email protected] True, [email protected]

Page 20: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

Where things are going…