35
© 2012 IBM Corporation Workload and technology trends driving future system design Bob Blainey May 2012

Workload and technology trends driving future system design

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Workload and technology trends driving future system design

© 2012 IBM Corporation

Workload and technology trends driving future system design

Bob BlaineyMay 2012

Page 2: Workload and technology trends driving future system design

© 2012 IBM Corporation

Agenda

� Context: Quick overview of IBM’s software business

� Software development in Canada

� Deep dive into business analytics

� Some systems technology trends

� Q & A

Page 3: Workload and technology trends driving future system design

© 2012 IBM Corporation

IBM Strategy & Performance: 2000-2010

Page 4: Workload and technology trends driving future system design

© 2012 IBM Corporation

IBM solutions are built on a core set of software capabilities

Turn Information into Insights

Business Analytics

Data Management

Data Warehousing

Enterprise Content Management

Information Governance

Information Integration and Federation

Enable Product and Service Innovation

Application Lifecycle Management

Complex and Embedded Systems

Design and Development

Enterprise Architecture and Portfolio Management

Enterprise Modernization

Security

Drive Business Integration and Optimization

Application Infrastructure

Business Process Management

Commerce

Connectivity and Integration

Enterprise Marketing Management

Optimize the Impact of Business Infrastructures and Services

Asset Management

Business Service Management

Cloud and Virtualization Management

Network and Service Assurance

Security

Storage Management

Systems Management

Connect and Collaborate Social Business Application Development

Social Collaboration

Unified Communications

Web Experience

Manage Risk, Security, and Compliance

Application and Process

Data and Information

Network, Server, and Endpoint

People and Identity

Physical Infrastructure

Security governance, risk management and compliance

Page 5: Workload and technology trends driving future system design

© 2012 IBM Corporation

IBM Software Solutions: Making New Markets, Reaching New Buyers

Page 6: Workload and technology trends driving future system design

© 2012 IBM Corporation

Smarter Cities: Manage, automate & optimize city operations

Page 7: Workload and technology trends driving future system design

© 2012 IBM Corporation

Watson and Next Generation Analytics

Page 8: Workload and technology trends driving future system design

© 2012 IBM Corporation

IBM’s Big Data Platform

Page 9: Workload and technology trends driving future system design

© 2012 IBM Corporation

Software Development in Canada

Page 10: Workload and technology trends driving future system design

© 2012 IBM Corporation10

Page 11: Workload and technology trends driving future system design

© 2012 IBM Corporation11

IBM Canada Lab History of Software Product Leadership Growth through Acquisition

Page 12: Workload and technology trends driving future system design

© 2012 IBM Corporation12

IBM Canada Lab History of Software Product Leadership Growth through Acquisition

Page 13: Workload and technology trends driving future system design

© 2012 IBM Corporation13

IBM Canada Lab History of Software Product Leadership Growth through Acquisition

Page 14: Workload and technology trends driving future system design

© 2012 IBM Corporation14

Brain

Cities

Water

HPC & Agile Computing

Energy

Applied Research +

Innovation Centre

IBM Canada Research and Development Centre

Page 15: Workload and technology trends driving future system design

© 2012 IBM Corporation15

� IBM is establishing an IBM Research “footprint” in Canada (initially Ontario based).

� Announced April 10, 2012

� Research to be focused on:• Agile Computing (moving towards Exascale computing)• Health (Brain Modeling, Streaming Analytics, Watson)• Intelligent Infrastructure (Smarter Water, Cities, Energy, ….)

• $210 Million total investment.

• The overall project Includes:• The building of a new IBM Data Centre in Barrie, Ontario.• Canada’s first Bluegene/Q in Canada (this will be the largest HPC system in the country

when installed).• A Cloud Computing Cluster.• 145 highly skilled jobs in Research, Development, and Operations.

• A collaborative approach to Research involving IBM and 7 leading universities:• University of Toronto• Western University• Queen’s University• University of Ottawa• University of Waterloo• McMaster University• University of Ontario Institute of Technology

IBM Canada Research and Development Centre

Page 16: Workload and technology trends driving future system design

© 2012 IBM Corporation

Deep Dive into Business Analytics

Page 17: Workload and technology trends driving future system design

© 2012 IBM Corporation

Davenport’s Taxonomy of Analytics

Source: Davenport & Harris, Competing on Analytics: The New Science of Winning, February 2007

Page 18: Workload and technology trends driving future system design

© 2012 IBM Corporation

Drilling down: Analytical application types

Functional Goal Example Applications

PrescriptionSupply chain management, Product scheduling, Logistics, Routing,Workforce management

PredictionRevenue prediction, Disease spread prediction, Semiconductor yield analysis, Predictive policing

ReportingRetail sales analysis, Financial reporting, Budgeting, System management analysis, Social network analysis

SimulationVLSI sensitivity analysis, Insurance risk modeling, Credit risk analysis, Physics/Biology simulations, Games

Pattern MatchingIntrusion Detection, Computational chemistry, Document management, Searching, Bio- and Chemo-informatics

RecommendationCross-sale analysis, Customer retention, Music/Video and restaurant recommendation

AlertingWeb-traffic analysis, Fraud detection, Geological Sensor networks, Geographical analytics (Maps)

Quantitative AnalysisCustomer relationship analysis, Weather forecasting, Medical informatics, Econometrics, Computational finance

Source: Bordawekar et al, Analyzing Analytics: A Survey of Business Analytics Models and Algorithms,IBM Research Report RC25186, 2011

Page 19: Workload and technology trends driving future system design

© 2012 IBM Corporation

Key Analytics Exemplars and Algorithms

Exemplar Functional Goals Key Algorithms

1Regression Analysis

Prediction, Quantitative analysis

Linear, Non-linear, Logistic, and Probit regression

2 ClusteringPattern matching, Recommendation, Prediction, Reporting

K-Means and Hierarchical clustering, Expectation-Maximization Clustering, Naive Bayes

3Nearest-neighbor search

Pattern recognition, Prediction, Recommendation

K-d, Ball, and Metric trees, Approximate Nearest-neighbor, Locality-sensitive Hashing, Kohonen networks

4Association rule mining

RecommendationApriori, Partition, FP-Growth, Eclat and MaxClique, Decision trees

5 Neural networks Prediction, Pattern matchingSingle- and Multi-level perceptrons, Radial-Basis Function (RBF), Recurrent, and Kohonen networks

6Support vector machines

Prediction, Pattern matchingSVMs with Linear, Polynomial, RBF, Sigmoid, and String kernels

7Decision tree learning

Prediction, Recommendation ID3/C4.5, CART, CHAID, QUEST

Source: Bordawekar et al, Analyzing Analytics: A Survey of Business Analytics Models and Algorithms,IBM Research Report RC25186, 2011

Page 20: Workload and technology trends driving future system design

© 2012 IBM Corporation

Exemplar Functional Goals Key Algorithms

8Time series processing

Prediction, Pattern matching, Reporting, Alerting

Trend, Seasonality, Spectral analysis, ARIMA, Exponential smoothing

9 Text analytics Pattern matching, ReportingNaive Bayes classifier, Latent semantic analysis, String-kernel SVMs, Non-negative matrix factorization

10Monte Carlo methods

Simulation, Quantitative analysis

Markov-chain, Quasi-Monte Carlo methods

11Mathematical programming

Prescription, Quantitative analysis

Primal-dual interior point, Branch & Bound methods, Traveling salesman, A* algorithm, Quadratic programming

12On-line analytical processing (OLAP)

Reporting, PredictionGroup-By, Slice and Dice, Pivoting, Rollup and Drill-down, Cube

13 Graph analyticsPattern matching, Reporting, Recommendation

Eigenvector Centrality (e.g., PageRank), Routing, Coloring, Searching and flow algorithms, Clique and motif finding

Key Analytics Exemplars and Algorithms

Source: Bordawekar et al, Analyzing Analytics: A Survey of Business Analytics Models and Algorithms,IBM Research Report RC25186, 2011

Page 21: Workload and technology trends driving future system design

© 2012 IBM Corporation

Computational Characteristics of Exemplars

Exemplar Computational Patterns Key Data Types, Data Structures and

Functions

1Regression Analysis

Matrix inversion, LU decomposition, Transpose, Cholesky factorization

Double-precision and Complex data, Sparse & dense matrices, Vectors

2 ClusteringMetric-based iterative convergence

Height-balanced tree, Graph, Distance functions (Euclidean,Manhattan, Minkowski, and Log-Likelihood), log functions

3Nearest-neighbor search

Non-iterative distance calculations via metric functions, Singular value decomposition, Hashing

Higher-dimensional data structures (k-d, and Metric trees), Hash tables, Euclidean and Hamming distance functions

4Association rule mining

Set intersections, Unions, and Counting

Hash-tree, Relational tables, Prefix trees, Bit vectors

5 Neural networks

Iterative Weighted Feedback networks, Matrix multiplication, Inversion, Cholesky factorization

Sparse/dense matrices, Vectors, Double-precision/Complex data, Gaussian, Multiquadric, Spline, Logistic, Smoothing functions

Source: Bordawekar et al, Analyzing Analytics: A Survey of Business Analytics Models and Algorithms,IBM Research Report RC25186, 2011

Page 22: Workload and technology trends driving future system design

© 2012 IBM Corporation

Computational Characteristics of Exemplars

Exemplar Computational Patterns Key Data Types, Data Structures and

Functions

6Support Vector Machines

Cholesky factorization, Matrix multiplication

Double-precision floats, Sparse matrices, Vectors, Kernel functions (e.g., Linear, Sigmoid, Polynomial, String)

7Decision Tree Learning

Dynamic programming, Recursive tree operations

Integers, Double-precision floats, Trees, Vectors, log function

8Time Series Processing

Smoothing via averaging, Correlation, Fourier and Wavelet transforms

Integers, Single-/Double-precision floats, Dense matrices, Vectors, cosine, sine, log functions, Distance and Smoothing functions

9 Text Analytics

Parsing, Bayesian modeling, String matching, Hashing, Singular value decomposition, Matrix multiplication, Transpose, Factorization

Integers, Single/Double precision, Characters, Strings, Sparse matrices, Vectors, Inverse indexes, String functions, Distance functions

10Monte Carlo Methods

Random number generators (e.g., Mersenne, Gaussian), Polynomial evaluation, Interpolation

Double-precision floats, Bit vectors, Bit-level operations (shift, mask), log, sqrt functions

Source: Bordawekar et al, Analyzing Analytics: A Survey of Business Analytics Models and Algorithms,IBM Research Report RC25186, 2011

Page 23: Workload and technology trends driving future system design

© 2012 IBM Corporation

Computational Characteristics of Exemplars

Exemplar Computational Patterns Key Data Types, Data Structures and

Functions

11Mathematical Programming

Matrix multiplication, Inversion, Cholesky factorization, Dynamic programming, Greedy algorithms, Backtracking-based search

Integers, Double-precision floats, Sparse Matrices, Vectors, Trees, Graphs

12On-line Analytical Processing

Grouping and ordering multi-dimensional elements, Aggregation over hierarchies

Prefix trees, Relational tables, OLAP Operators (e.g., CUBE), Strings, Sorting, Ordering, Aggregation operators (e.g., Sum or Average)

13 Graph Analytics

Graph traversal, Eigensolvers, Matrix-vector and Matrix-matrix multiplication, Factorization

Integer, Single-/Double-precision floats, Adjacency/incident lists, Trees, Queues, Dense/Sparse matrices

Source: Bordawekar et al, Analyzing Analytics: A Survey of Business Analytics Models and Algorithms,IBM Research Report RC25186, 2011

Page 24: Workload and technology trends driving future system design

© 2012 IBM Corporation

Some Systems Technology Trends

Page 25: Workload and technology trends driving future system design

© 2012 IBM CorporationDr. Bernard Meyerson

1.40

0.80

0.85

0.90

0.95

1.00

1.25

1.30

1.35

Re

lati

ve

Pe

rfo

rma

nc

e

1.20

1.05

1.10

1.15 Classical ScalingClassical Scaling

Higher Capacitance

Higher Resistance

Reduced Stress

Scaling NowScaling Now

Channel ScalingHigh-k / Metal GateBody Controlled DevicesReduced Gate HeightAdvanced BEOL Dielectric

PostPost--Classical ScalingClassical Scaling--Mitigating InnovationsMitigating Innovations--

Semiconductor Scaling in the Past vs. Scaling Now

Page 26: Workload and technology trends driving future system design

© 2012 IBM Corporation26

Post-Classical Systems Scaling

Content removed.See speaker notes.

Page 27: Workload and technology trends driving future system design

© 2012 IBM Corporation

Many core

GPU

ASIC, ASSP

FPGA

Parallelism + Heterogeneity

Scale-in: What’s Next in Processors?

Page 28: Workload and technology trends driving future system design

© 2012 IBM Corporation

Heterogeneous Systems: Software Optimization Challenges

� Parallelism– Many (thousands) processors to keep busy– Hierarchies of parallelism (e.g. SMP, multi-

core, super-scalar)– Different styles of parallelism (data, task, bit)– Differing memory/compute balance

� Function placement– Specialized processing patterns – Specific system connect points (e.g. close to

network, close to host memory)– Functional limitations (e.g. FPGA area,

floating point precision)

� Communication optimization– Optimize available bandwidth (blocking,

streaming)– Hide latency (reduce round-trips,

asynchronous communication)– Overlap communication and computation

Page 29: Workload and technology trends driving future system design

© 2012 IBM Corporation

� An array of logic processing elements connected into a grid on a single chip

� Logic elements can be programmed to execute simple binary functions

� Logic elements also include memory elements for simple state management (e.g. flip-flop)

� Special purpose elements such as block RAMs and DSP ALUs are also typically included

� Switches in the grid interconnect can be configured to create a network of logic elements

29

Typically “programmed” using HDL languages such as Verilog of VHDL

Typically “programmed” using HDL languages such as Verilog of VHDL

What is a Field Programmable Gate Array?

Page 30: Workload and technology trends driving future system design

© 2012 IBM Corporation

Reaching a tipping point in FPGA capability (logic & memory)

Source: Altera, “FPGA Coprocessing Evolution: Sustained Performance Approaches Peak Performance”, June 2009

Page 31: Workload and technology trends driving future system design

© 2012 IBM Corporation

The IBM Netezza 1000 Data Warehouse Appliance

High-performance databaseengine streaming joins,aggregations, sorts, etc.

SQL CompilerQuery PlanOptimizeAdmin

Processor &streaming DB logic

Slice of User DataSwap and Mirror partitionsHigh speed data streaming

SMP Hosts

Snippet Blades™

(S-Blades™)

Disk Enclosures

Page 32: Workload and technology trends driving future system design

© 2012 IBM Corporation

The IBM Netezza S-Blade™

Page 33: Workload and technology trends driving future system design

© 2012 IBM Corporation33

Accelerating Data Warehouse Queries on the S-Blade™

Page 34: Workload and technology trends driving future system design

© 2012 IBM Corporation

Summary

� IBM has a vast software business, covering virtually every aspect of enterprise information technology

– Major focus on extracting insight from massive amounts of data– Increasing focus on novel analytic applications such as Smarter Cities and Watson to

deliver value directly to line of business customers

� In Canada, IBM has a very large software development organization– Sites across the country, created and expanded through organic growth but also through

many acquistions– Many different development missions in Canada but notably a major focus on

information management and business analytics

� Business analytics is a huge driver of value & differentiation for IBM customers– Many different analytic applications commonly deployed– We have investigated these applications, extracted common analytical models

(“exemplars”) and identified key computational characteristics

� Systems technology is in a period of change– We are taking the opportunity to drive system design from the top-down– Specifically, we are building systems which are focused on important analytics

workloads, driven by our deepening understanding of these workloads and drawing on insights into the systems technology roadmap

Page 35: Workload and technology trends driving future system design

© 2012 IBM Corporation

Questions?