What Does All This Data Mean?
September 20, 2012
IBM Innovation Center, Waltham MA
MassTLC Big Data Seminar
@masstlc #bigdata
What Does All This Data Mean?
Agenda
• Setting the Context
• Introducing the Panel
• Panel Discussion
• Q&A
– Hashtags: @masstlc #bigdata
Your Panel
• Richard Dale, Managing Director, Big Data Boston Ventures – Twitter: @rdale
• Irene Greif, Fellow, IBM Visualization – Twitter: @igreif
• Martin Leach, CIO, Broad Institute – Twitter: @mdleach
• Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/
Richard Dale
Managing Director, Big Data Boston Ventures
Micro-VC fund investing in big data companies located in or connected to the regional big data cluster
Database techie turned Entrepreneur turned VC
– Database Performance Guru, SQL Solutions
– Co-founder, Phase Forward
– Principal, Sigma Partners
– Founder & Managing Director, Big Data Boston Ventures
Setting the Context
• What is Big Data?
• Where does Big Data come from?
• Where is Big Data going?
What is Big Data?
a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools
(wikipedia)
What is Big Data?
3 V’s:
• volume
• velocity
• variety
(Doug Laney, Gartner)
What is Big Data?
Data easier and cheaper to collect than to analyze
(??)
What is Big Data?
Data that you can’t process on a single machine, however big your machine (and however long you wait)
or
Data growing faster than Moore’s law
(Richard Dale)
Where Does Big Data Come From?
Behavior
• Social Media
• User Generated Content
• Click streams
• Viewing, Purchasing, Liking, Sharing
• The Quantified Self
Where Does Big Data Come From?
Observation (in ever finer granularity)
• Machines
– Computers, Vehicles, Phones, Industrial Machines
• Environments
– RFID, Traffic flow, Nature (and our impact)
• People
– The Quantified Self
– Medical imaging
– Genetic sequencing
Where Does Big Data Come From?
Correlations
• Each data item, image or observation can be cross-correlated with any other
• Even if N is tractable, N x N x N x … is not
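The "N x N x N x …" point can be made concrete with a little arithmetic. The sketch below is only an illustration: the function name and the choice of N = 1,000,000 are assumptions made here, not figures from the talk. It counts how many ordered combinations exist when every item is cross-correlated with every other at each order.

```python
def cross_correlation_counts(n_items: int, max_order: int) -> dict[int, int]:
    """Count ordered k-way combinations of n_items for k = 1..max_order.

    Each extra order multiplies the count by n_items (N, N x N, N x N x N, ...).
    """
    return {k: n_items ** k for k in range(1, max_order + 1)}


if __name__ == "__main__":
    # N = 1,000,000 is an illustrative value, not taken from the slides.
    for order, count in cross_correlation_counts(1_000_000, 3).items():
        print(f"{order}-way combinations: {count:,}")
```

With a million items, a single pass touches a million records, but pairwise correlation already means a trillion combinations and three-way correlation 10^18, which is where a tractable N stops being tractable.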
Technology Landscape
• Infrastructure: Storing, Managing, Moving
• Analytics: Algorithms, Visualization, Machine Learning
• Applications: Horizontal and Vertical business or domain applications
• Data Services: Collecting, Collating, Correlating, Curating
Source:
A Sea of Choices for Data Viz
• BI packages
• Dashboard reporting tools
• Ad hoc infographics
• Whiteboards
• Napkin scribbles
Turning Big Data into Big Clarity
Art or Science? Let’s ask the Panel!
• Irene Greif, IBM Fellow – Twitter: @igreif
• Martin Leach, CIO, Broad Institute – Twitter: @mdleach
• Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/
IBM Center for Social Business
Irene Greif, IBM Fellow, Chief Scientist for Social Business
Many Eyes
• The Broad Institute is a non-profit biomedical research institute
• Ten core faculty members and approximately 150 associate members from across MIT and Harvard
• More than 1,900 research and administrative staff
Programs and Initiatives
focused on specific disease or biology areas
• Cancer
• Genome Biology
• Genome Sequencing and Analysis
• Cell Circuits
• Psychiatric Disease
• Metabolism
• Medical and Population Genetics
• Chemical Biology/Novel Therapeutics
• Infectious Disease
• Epigenomics
Platforms
focused technological innovation and application
• Genomics Platform
• Biological Samples
• Genome Sequencing
• Genetic Analysis
• Chemical Biology/Novel Therapeutics
• Imaging
• Metabolite Profiling
• Proteomics
• RNAi
• Therapeutics Discovery & Development
The Broad Institute of MIT & Harvard
Martin Leach, CIO
Big Data Visualization
Andrew Pandre, Ph.D., Principal
Sears Holdings Corporation
Google+ microblog: http://tinyurl.com/VisibleData
Data Visualization Blog: http://apandre.wordpress.com