16
Big Data & Hadoop Ecosystem Canburak Tümer

Big Data and Hadoop Ecosystem

Embed Size (px)

Citation preview

Page 1: Big Data and Hadoop Ecosystem

Big Data & Hadoop

EcosystemCanburak Tümer

Page 2: Big Data and Hadoop Ecosystem

• Ege University, BSc. Computer Engineering, ’07-’12• Libera Universitá di Bolzano, BSc. Computer Science,

’09-’10• İstanbul Technical University, MSc. Computer

Engineering, ’13-’16 (expected)• Turkcell Technology, ETL & DWH Developer, ’11-’12• Oracle, Consultant, ’12-’13• MAKEIT Software & Consulting, BI&DW Specialist ’14-...• www.canburaktumer.com/blog @canburakTblog

https://www.linkedin.com/in/canburaktumer

About MeCanburak Tümer

Page 3: Big Data and Hadoop Ecosystem

Agenda• Big Data• NoSQL• Hadoop• HDFS• MapReduce• Management Tools• Data Access Tools• Data Processing and Mining Tools

Page 4: Big Data and Hadoop Ecosystem
Page 5: Big Data and Hadoop Ecosystem
Page 6: Big Data and Hadoop Ecosystem

VOLUME VALUE

VARIETYVERIFICATION VELOCITY

Page 7: Big Data and Hadoop Ecosystem
Page 8: Big Data and Hadoop Ecosystem
Page 9: Big Data and Hadoop Ecosystem

- Open source big data platform- Started by developers from Yahoo!- Two main distributors now : Cloudera, Hortonworks- Both storage and processing

- HDFS for storage- MapReduce for processing- Spark engine is replacing MapReduce day by day

Page 10: Big Data and Hadoop Ecosystem
Page 11: Big Data and Hadoop Ecosystem

HDFS

Page 12: Big Data and Hadoop Ecosystem

Map Reduce

Page 13: Big Data and Hadoop Ecosystem

Managing Tools for Hadoop

Page 14: Big Data and Hadoop Ecosystem

Data Access Tools for Hadoop

Page 15: Big Data and Hadoop Ecosystem

Data Processing and Mining Tools

Page 16: Big Data and Hadoop Ecosystem