
8/11/2019 Big Data in Enterprise challenges & opportunities.pdf

http://slidepdf.com/reader/full/big-data-in-enterprise-challenges-opportunitiespdf 1/15

Big Data in Enterprise: challenges & opportunities

Yuanhao Sun

 

yuanhao sun@intel com

Software and Service Group


Big Data Phenomenon

20TB/hour: sensor output of a Boeing jet engine

$20+B: acquisitions in the last 12 months

1.8ZB in 2011

2 Days > the dawn of civilization to 2003

750M: photos uploaded to Facebook in 2 days

200PB: created by a Smart City project in China

Data are becoming the new raw material of business: an economic input almost on a par with capital and labor.

The Economist, 2010

Information will be the “oil of the 21st century”.

Gartner, 2010

966PB: stored in US manufacturing (2009)

200+TB: a boy’s 240,000 hours by an MIT Media Lab geek

$800B: in personal location data within 10 years

$300B/year: US healthcare savings from Big Data


Big Data in Telecom

• Lots of data

– One telco operator: 360TB of Call Data Records (CDR) within 6 months (in a provincial branch, 100M users)

– The other operator: ~300TB of web access logs from mobile phones within 6 months

• Keep growing

– ~2TB CDR/day in a provincial branch

• Various data sources:

– CDR (Voice, SMS, GPRS, 3G, WLAN, value-added services, etc.)

– Billing & accounting data, sales & marketing data, etc.

– Web access logs

– Network signaling data

– Base station sensor data
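The two CDR figures on this slide can be checked against each other with a little arithmetic. The sketch below is only a sanity check: the 180-day half year and the use of decimal units are assumptions, not something the slide states.

```python
# Sanity check: does ~2 TB of CDRs per day match 360 TB in six months?
# Assumptions (not from the slides): a half year is 180 days, decimal units.

DAILY_CDR_TB = 2            # ~2 TB CDR/day in a provincial branch (from the slide)
DAYS_IN_SIX_MONTHS = 180    # assumed
USERS = 100_000_000         # 100M users in the provincial branch (from the slide)


def six_month_volume_tb(daily_tb, days):
    """Total volume accumulated at a constant daily rate, in TB."""
    return daily_tb * days


def per_user_kb_per_day(daily_tb, users):
    """Average CDR volume per user per day, in kilobytes."""
    return daily_tb * 1e12 / users / 1e3


total_tb = six_month_volume_tb(DAILY_CDR_TB, DAYS_IN_SIX_MONTHS)
kb_per_user = per_user_kb_per_day(DAILY_CDR_TB, USERS)

print(f"six-month volume: {total_tb} TB")
print(f"per user per day: {kb_per_user} KB")
```

Under these assumptions the daily rate reproduces the 360TB six-month figure, and the average works out to roughly 20 KB of CDR data per user per day.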


[Chart, query/s: Open Source HBase (0.90.3) 700; Optimized HDFS I/O 3500]

How Hadoop helps

• Map/Reduce for data loading and data cleansing

• HBase as the data store

− Inserts: 10,000 records/second/server (2-way, 32GB) on average

− Reads from disk: >400 queries/second/server, with latency within one second (0.05s~0.8s under different loads)

• A query is a scan that retrieves all CDRs within one month for one user.

• Optimizations significantly increase the throughput of an 8-node cluster
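To illustrate why "all CDRs within one month for one user" can be served as a single contiguous scan, here is a minimal in-memory sketch. It assumes a row key of the form `<user_id>#<timestamp>`, which is a common pattern for sorted key-value stores but is not stated in the slides. The `CdrStore` class and the `row_key` helper are hypothetical stand-ins, not the HBase API.

```python
import bisect
from datetime import datetime


def row_key(user_id, ts):
    # Hypothetical key layout: zero-padded user id, then a sortable timestamp.
    return f"{user_id.zfill(12)}#{ts:%Y%m%d%H%M%S}"


class CdrStore:
    """Toy sorted store: keys are kept in lexicographic order, like table rows."""

    def __init__(self):
        self._keys = []      # sorted list of row keys
        self._values = {}    # row key -> record

    def put(self, user_id, ts, record):
        key = row_key(user_id, ts)
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = record

    def scan_month(self, user_id, year, month):
        """Return one user's records for one month via a [start, stop) key range."""
        start = row_key(user_id, datetime(year, month, 1))
        if month == 12:
            next_year, next_month = year + 1, 1
        else:
            next_year, next_month = year, month + 1
        stop = row_key(user_id, datetime(next_year, next_month, 1))
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, stop)
        return [self._values[k] for k in self._keys[lo:hi]]


store = CdrStore()
store.put("13800000001", datetime(2011, 11, 30, 23, 59), {"type": "SMS"})
store.put("13800000001", datetime(2011, 12, 1, 8, 0), {"type": "Voice"})
store.put("13800000001", datetime(2011, 12, 31, 23, 59, 59), {"type": "GPRS"})
store.put("13800000001", datetime(2012, 1, 1, 0, 0), {"type": "3G"})
store.put("13800000002", datetime(2011, 12, 15), {"type": "SMS"})

december = store.scan_month("13800000001", 2011, 12)
print([r["type"] for r in december])
```

Because the user id leads the key and the timestamp sorts lexicographically, one user's month occupies a contiguous key range, so the query touches only the rows it returns.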

2011/12/5

[Chart, insertion/s: Open Source HBase (0.90.3) 25000; Advanced Region Balancing 82000]


Value chain of big data in telecom

stage 1: Data collection & storage (several PB to hundreds of PB)

stage 2: Information retrieval

stage 3: Knowledge

stage 4: Wisdom

Data sources: Mobile Devices, Base Stations, Billing System, Web Logs, Network Signals

Processing: Data Fusion, ETL, Data Integration, Data Serving, Statistics, Query, Machine Learning, Data Mining, Predictive Analysis, Reporting, Visualization

• Web metrics analytics

• Network optimization

• Route optimization

• Failure detection

• Fraud detection

• … …

• Customer segmentation

• User behavior analysis

• Social analysis

• Ads targeting

• … …

• Customer churn analysis

• Customized actions

• New services and pricing

• ROI analysis

• … …


Healthcare: Care Coordination and Data Sharing for Improved Outcomes

[Diagram axes: Quality of Life, Cost of Care]

Care settings: Proactive Health and Wellness; Home Care; Residential/Community/Ambulatory Care; Acute Care

• Reduce illness. Promote wellness and empowerment.

• Reduce costly emergency care. Better manage chronic disease.

• Reduce hospital (re)admissions. Manage at home.

• Reduce ALOS. Earlier discharge to ambulatory environments.

Highest Quality of Life at the Lowest Possible Cost


Enabling Technologies for Coordinated Care


China EHR based RHIN platform

HIAL – Health Information Access Layer

Registration Services

EHR Storage Services & Medical Collaboration Service

EHR Service

Regional Health Information Exchange

Hospital Information Exchange Platform

Community Health Information Integration Platform

Public Health Information Systems

Data Warehouse


Challenges

Various data sources, unstructured: texts, images, videos, etc.

− Health records, lab reports, billing data, PACS images, physician orders, follow-ups, etc.

Difficult to standardize the data format

− Data must be stored for 50 years, while its format keeps changing.

− HL7 Clinical Document Architecture (XML) changes frequently.

Big data volume

− 10PB: 50 years of data for a medium-sized city in China (10M population)

No existing IT system in China will be able to process this data within 3~5 years.
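The 10PB estimate is easier to judge when broken down per person and per year. The following back-of-the-envelope calculation assumes decimal units (1 PB = 10^15 bytes) and data spread evenly over the 50 years; neither assumption comes from the slide.

```python
# Back-of-the-envelope breakdown of the 10PB / 10M people / 50 years estimate.
# Assumptions (not from the slides): decimal units, even spread over time.

TOTAL_PB = 10              # from the slide
POPULATION = 10_000_000    # from the slide
YEARS = 50                 # from the slide

total_bytes = TOTAL_PB * 10**15

# Lifetime record size per resident, in bytes.
per_person_bytes = total_bytes // POPULATION

# Average growth per resident per year, in megabytes.
per_person_per_year_mb = per_person_bytes / YEARS / 10**6

# Average growth for the whole city per year, in terabytes.
city_per_year_tb = total_bytes / YEARS / 10**12

print(f"per person over 50 years: {per_person_bytes / 10**9} GB")
print(f"per person per year:      {per_person_per_year_mb} MB")
print(f"whole city per year:      {city_per_year_tb} TB")
```

Under these assumptions the estimate corresponds to about 1 GB per resident over 50 years, or about 20 MB per resident per year.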


Opportunities… 

Improving efficiency and reducing costs

− Real-time information sharing among clinics, doctors, and patients

− Real-time status dashboard

Computer-aided diagnostics/research

− Disease classification, e.g. blood poisoning

Decision support system

− Trend analysis: cancer trend analysis, epidemic disease analysis, etc.

− Association analysis: adverse drug event analysis

Personalized Medicine

− Personalized prompting and coaching
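As one possible reading of "association analysis" for adverse drug events, the sketch below computes the classic support, confidence, and lift measures for a drug/event pair. The records are made-up toy data and the `association` function is a hypothetical helper; the slides do not say which method or data the project actually used.

```python
# Toy association analysis: how strongly is an adverse event linked to a drug?
# The records below are invented purely for illustration.

records = [
    ({"drug_a"}, {"rash"}),
    ({"drug_a", "drug_b"}, {"rash", "nausea"}),
    ({"drug_b"}, set()),
    ({"drug_a"}, set()),
    ({"drug_b"}, {"nausea"}),
    ({"drug_c"}, set()),
    ({"drug_a", "drug_c"}, {"rash"}),
    ({"drug_c"}, {"nausea"}),
]


def association(records, drug, event):
    """Return (support, confidence, lift) for the rule drug -> event."""
    n = len(records)
    with_drug = sum(1 for drugs, _ in records if drug in drugs)
    with_event = sum(1 for _, events in records if event in events)
    both = sum(1 for drugs, events in records
               if drug in drugs and event in events)

    support = both / n                                   # P(drug and event)
    confidence = both / with_drug if with_drug else 0.0  # P(event | drug)
    # Lift compares P(event | drug) with the baseline P(event).
    lift = confidence / (with_event / n) if with_event else 0.0
    return support, confidence, lift


support, confidence, lift = association(records, "drug_a", "rash")
print(f"support={support}, confidence={confidence}, lift={lift}")
```

A lift above 1 means the event appears more often with the drug than in the population overall. In practice such a signal would only be a starting point for clinical review.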


[Diagram labels: LIMS/RTIS, PACS/MIIS, POs, MPI/CP, Hadoop/HBase Cluster, Care Coordination, Data Mining]


Internet of Things


Challenges

Huge data volume

− City A: 500,000 cameras, 200PB video within 3 months

− City B: 12,000 ITS cameras, 2B traffic records per day, 1PB records in 3 months

Real-time processing

− Real-time data collection, scan, query and sharing

− Real-time event detection

− Near real-time predictive analysis

Large scale distributed processing

− A central data center is not affordable because of cost, space, power supply, air conditioning, etc.

− Application needs a uniform way to access the data
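The volume figures for the two cities translate into per-camera and per-second rates, which show why real-time, distributed processing is needed. This is a rough calculation assuming 90 days for "3 months" and decimal units; both are assumptions, not figures from the slide.

```python
# Rough per-camera and per-second rates implied by the two city examples.
# Assumptions (not from the slides): 3 months = 90 days, 1 PB = 10**15 bytes.

PB = 10**15
DAYS = 90
SECONDS_PER_DAY = 86_400

# City A: 500,000 cameras, 200PB of video within 3 months.
cameras_a = 500_000
video_bytes = 200 * PB
per_camera_per_day_gb = video_bytes / cameras_a / DAYS / 10**9
per_camera_mbit_s = video_bytes * 8 / cameras_a / (DAYS * SECONDS_PER_DAY) / 10**6

# City B: 2B traffic records per day, 1PB of records in 3 months.
records_per_day = 2_000_000_000
record_bytes_total = 1 * PB
records_per_second = records_per_day / SECONDS_PER_DAY
bytes_per_record = record_bytes_total / (records_per_day * DAYS)

print(f"City A: {per_camera_per_day_gb:.2f} GB per camera per day")
print(f"City A: {per_camera_mbit_s:.2f} Mbit/s average per camera")
print(f"City B: {records_per_second:.0f} records per second on average")
print(f"City B: {bytes_per_record:.0f} bytes per record on average")
```

The averages hide peak load: traffic records in particular are likely to arrive in bursts, so a real ingest path would need headroom well above the average rate.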


HBase as the infrastructure, but needs:

Global Table View

− Geographically distributed DCs, connected through a high-speed network

− One very big table across multiple data centers

Active-Active Availability

− Available for read/write even in case of data center failure(s)

Durability

− Automatic recovery from data center failure

Locality

− Reduce write latency

− Reduce network bandwidth requirement

Eventual Consistency across data centers
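One way to combine active-active writes, locality, and eventual consistency is to accept each write in the local data center and replicate it asynchronously, resolving conflicts with a last-write-wins rule. The sketch below illustrates that idea for two data centers. The `DataCenterReplica` class is hypothetical, and the slides do not say which conflict-resolution rule the design uses.

```python
# Toy model of an active-active table across two data centers.
# Assumption (not from the slides): conflicts are resolved last-write-wins
# by (timestamp, origin data center), so every replica picks the same winner.


class DataCenterReplica:
    def __init__(self, name):
        self.name = name
        self.cells = {}    # (row, column) -> (timestamp, origin, value)
        self.outbox = []   # local writes not yet shipped to the peer

    def put(self, row, column, value, timestamp):
        """Accept a write locally (low latency) and queue it for replication."""
        edit = (row, column, timestamp, self.name, value)
        self._apply(edit)
        self.outbox.append(edit)

    def _apply(self, edit):
        row, column, timestamp, origin, value = edit
        current = self.cells.get((row, column))
        # Keep the edit only if it is newer than what the replica already has.
        if current is None or (timestamp, origin) > current[:2]:
            self.cells[(row, column)] = (timestamp, origin, value)

    def ship_to(self, peer):
        """Asynchronous replication step: send queued edits to the peer."""
        for edit in self.outbox:
            peer._apply(edit)
        self.outbox.clear()

    def get(self, row, column):
        cell = self.cells.get((row, column))
        return cell[2] if cell else None


dc_a = DataCenterReplica("dc_a")
dc_b = DataCenterReplica("dc_b")

# Both data centers accept writes to the same cell while disconnected.
dc_a.put("cam42", "status", "online", timestamp=100)
dc_b.put("cam42", "status", "offline", timestamp=105)
before = (dc_a.get("cam42", "status"), dc_b.get("cam42", "status"))

# Once replication runs in both directions, the replicas converge.
dc_a.ship_to(dc_b)
dc_b.ship_to(dc_a)
after = (dc_a.get("cam42", "status"), dc_b.get("cam42", "status"))
print(before, after)
```

Each data center stays writable on its own, which covers the availability and locality points. The trade-off is that readers may briefly see different values until replication catches up.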


Big table over DCs (a reference architecture)
