PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of

GTC 2017

The Smartvid.io solution

OUR MISSION

We're unlocking the value of photos and videos to dramatically improve safety, quality and productivity in the AEC (architecture, engineering and construction) industry.

An untapped resource

MEDIA FROM THE FIELD

The number of pictures and videos captured in the field every day keeps growing.

50 GB of data is generated on the typical project.

Much of it ends up unused, siloed across different systems and devices.

How it works

WE’RE USING MACHINE LEARNING TO AUTOMATICALLY IDENTIFY “WHAT’S IN” CONSTRUCTION PHOTOS AND VIDEOS…

The results

IMPACT

2016 Annual AI for Safety Photo Contest

• # REVIEWED: 1,080 photos

• HUMAN EXPERT TIME: 4.5 hours

• SMARTVID.IO TIME: <10 minutes

Typical Construction Project

• # REVIEWED: 15,000 photos

• HUMAN EXPERT TIME: 80 days

• SMARTVID.IO TIME: ~8 days
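The review times above imply the following speedups. This is a quick arithmetic check only; "<10 minutes" is an upper bound and "~8 days" is approximate, so the first ratio is a lower bound and the second is a rough figure. The variable names are illustrative.

```python
# Figures copied from the slide; names are illustrative.

def speedup(human_time, machine_time):
    """Ratio of human review time to machine review time (same units)."""
    return human_time / machine_time

# 1,080-photo review: 4.5 hours (270 minutes) versus under 10 minutes.
small_review_speedup = speedup(4.5 * 60, 10)   # at least this much faster

# 15,000-photo review: 80 days versus roughly 8 days.
large_review_speedup = speedup(80, 8)          # approximate

# Implied human pace on the 1,080-photo review, in seconds per photo.
human_seconds_per_photo = 4.5 * 3600 / 1080

print(small_review_speedup, large_review_speedup, human_seconds_per_photo)
```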

STRATEGY

Exponential Data Growth

• Basic: Object recognition

– Is the object present in the image? (Yes/No)

– Example: Is there scaffolding in this picture? (Yes/No)

– How used: image search within and across projects for key imagery (e.g., find me scaffolding images because I’m looking at a bill for scaffolding and want to check it)

• Advanced: Object analytics and logic

– Where are the objects? How many of them are there? What is their volume? (Quantitative)

– Examples: Is each person wearing high-vis safety gear? What is the location and volume of visual defects like cracks?

– How used: identifying and quantifying visual data for safety (hard hats, safety vests, more) and quality (cracks, more)
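The basic yes/no recognition and the image search built on it can be sketched as follows. This is a minimal illustration, not Smartvid.io's implementation: the `Photo` record, the per-label confidence scores, and the 0.5 threshold are all assumptions made for the example.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Photo:
    """Hypothetical record: one photo plus per-label model confidences."""
    photo_id: str
    project: str
    scores: dict[str, float] = field(default_factory=dict)


def has_object(photo: Photo, label: str, threshold: float = 0.5) -> bool:
    """Basic recognition: is the object present in the image, yes or no?"""
    return photo.scores.get(label, 0.0) >= threshold


def search(photos: list[Photo], label: str, project: str | None = None,
           threshold: float = 0.5) -> list[str]:
    """Return IDs of photos showing `label`, within one project or across all."""
    return [
        p.photo_id
        for p in photos
        if (project is None or p.project == project)
        and has_object(p, label, threshold)
    ]


photos = [
    Photo("img-001", "tower-a", {"scaffolding": 0.92, "crane": 0.10}),
    Photo("img-002", "tower-a", {"scaffolding": 0.08}),
    Photo("img-003", "bridge-b", {"scaffolding": 0.71}),
]
print(search(photos, "scaffolding"))                      # across projects
print(search(photos, "scaffolding", project="tower-a"))   # within one project
```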

Our deep learning for…

IMAGE RECOGNITION

EXAMPLE: ADVANCED IMAGE RECOGNITION FINDS PEOPLE (1) THEN DETERMINES IF THEY ARE SAFE (2), THUS “FOCUSING” THE AI
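The two-step idea in this example can be sketched as a small pipeline: a first model proposes people, and a second model looks only at each cropped person. The slide does not describe the actual models, so `find_people` and `is_safe` below are hypothetical callables supplied by the caller, and the image is represented as a plain list of pixel rows for simplicity.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]   # (x, y, width, height) in pixels
Image = List[Sequence[int]]       # rows of pixel values; a stand-in for a real image type


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of the image."""
    x, y, w, h = box
    return [row[x:x + w] for row in image[y:y + h]]


def check_people(image: Image,
                 find_people: Callable[[Image], List[Box]],
                 is_safe: Callable[[Image], bool]) -> List[Tuple[Box, bool]]:
    """Stage 1 finds people; stage 2 judges each cropped person on its own."""
    results = []
    for box in find_people(image):                 # (1) locate each person
        results.append((box, is_safe(crop(image, box))))  # (2) focused safety check
    return results


# Toy stand-ins for the two models, only to show the data flow.
toy_image = [
    [0, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
toy_find_people = lambda img: [(0, 0, 2, 3), (2, 0, 2, 3)]
toy_is_safe = lambda region: any(1 in row for row in region)  # "1" marks safety gear

print(check_people(toy_image, toy_find_people, toy_is_safe))
```

Running the second model on crops rather than the whole frame is what keeps it "focused": it never has to reason about background clutter.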

QUANTITATIVE DATA IS AVAILABLE FROM OUR COMPUTER VISION

LINEAL EXTENT OF CRACK

INTEGRITY MEASURE
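One way a lineal extent could be derived from vision output is sketched below. The slide does not say how the measurement is made, so this assumes the crack has already been traced as a polyline of pixel coordinates and that the image scale (`metres_per_pixel`) is known; both are assumptions for illustration.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) in pixels


def lineal_extent(points: List[Point], metres_per_pixel: float) -> float:
    """Sum the segment lengths of a traced crack and convert to metres."""
    pixel_length = sum(math.dist(a, b) for a, b in zip(points, points[1:]))
    return pixel_length * metres_per_pixel


# Hypothetical trace: two segments of 5 px and 6 px at 2 mm per pixel.
trace = [(0, 0), (3, 4), (3, 10)]
print(lineal_extent(trace, metres_per_pixel=0.002))
```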

And deep learning for…

SPEECH RECOGNITION

• Industry keywords automatically detected from speech in video

• Tags are linked to timeline of video for instant retrieval and easy sharing or collaboration

• How used

– Field worker narrates video using the Smartvid.io app or a native iOS or Android device

– Office user (manager) can search by keyword

– Example: see all installation of blocking, by location
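Linking detected keywords to the video timeline can be sketched as a simple index from keyword to timestamps. This assumes a speech recognizer has already produced timed transcript segments; the segment format and the keyword list are hypothetical, and plain substring matching stands in for whatever keyword detection is actually used.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Segment = Tuple[float, str]  # (start time in seconds, transcribed text)


def index_keywords(segments: Iterable[Segment],
                   keywords: Iterable[str]) -> Dict[str, List[float]]:
    """Map each industry keyword to the timestamps where it was spoken."""
    keywords = [k.lower() for k in keywords]
    index = defaultdict(list)
    for start, text in segments:
        lowered = text.lower()
        for keyword in keywords:
            if keyword in lowered:   # simplification: substring match only
                index[keyword].append(start)
    return dict(index)


segments = [
    (0.0, "Starting the walkthrough on level two"),
    (12.5, "Installation of blocking is complete here"),
    (40.0, "Blocking still missing in the east corridor"),
]
index = index_keywords(segments, ["blocking", "scaffolding"])
print(index.get("blocking", []))   # timestamps an office user could jump to
```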

How it works

OUR TECHNOLOGY

Multiple AWS P2 instances for model training & runtime execution

• 5-10+ instances at peak training

Full spectrum deep learning for computer vision & speech

COMMODITY

Find objects of interest

Locate & segment objects

PROPRIETARY

STATE OF THE ART

Multi-model & focal point approach

Quantify objects

SYSTEMS ARCHITECTURE

IMAGE ML STACK

ML AT SCALE

• Gain access to data

• Manage data access (ingestion)

• Clean data

• Manage data

• Build data sets for training and evaluation

ML INFRASTRUCTURE

ALTERNATE SYSTEMS ARCHITECTURE
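The last step, building data sets for training and evaluation, could look like the sketch below. It is one common approach rather than the method described in the talk: each item is assigned to a split by hashing its ID, so the assignment stays stable as new media is ingested. The function names and the 20% evaluation fraction are assumptions.

```python
import hashlib
from typing import Dict, Iterable, List


def assign_split(item_id: str, eval_fraction: float = 0.2) -> str:
    """Deterministically place an item in 'train' or 'eval' from its ID."""
    digest = hashlib.sha256(item_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64   # uniform in [0, 1)
    return "eval" if bucket < eval_fraction else "train"


def build_datasets(item_ids: Iterable[str],
                   eval_fraction: float = 0.2) -> Dict[str, List[str]]:
    """Group item IDs into training and evaluation sets."""
    splits: Dict[str, List[str]] = {"train": [], "eval": []}
    for item_id in item_ids:
        splits[assign_split(item_id, eval_fraction)].append(item_id)
    return splits


ids = [f"photo-{n:04d}" for n in range(1000)]
datasets = build_datasets(ids)
print(len(datasets["train"]), len(datasets["eval"]))
```

Because the split depends only on the ID, re-running ingestion never moves an item from the evaluation set into the training set.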

CONCLUSION

• AEC industry is creating tremendous amounts of visual and audio data

• Deep learning can unlock value for safety, quality, productivity

• New techniques must be applied to handle complexity of imagery and scale of data

Come by the Dell Booth to see Smartvid.io in action. Case studies available on cracks and hard hats at www.smartvid.io.

Josh Kanner, [email protected] True, [email protected]

http://www.smartvid.io

Where things are going…
