7
Open-BDA Trainings Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org Big Data Hadoop Developer Training 10 th & 11 th June 2015 @ NED University High Performance Computing Centre (HPCC) Why learn Big Data and Hadoop? Forrester predicts, CIOs who are late to the Hadoop game will finally make the platform a priority in 2015. Hadoop has evolved as a must-to-know technology and has been a reason for better career, salary and job opportunities for many professionals.

Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

Embed Size (px)

Citation preview

Page 1: Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

Open-BDA Trainings

Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org

Big Data Hadoop Developer Training

10th & 11th June 2015 @ NED University

High Performance Computing Centre (HPCC)

Why learn Big Data and Hadoop?

Forrester predicts, CIOs who are late to the Hadoop

game will finally make the platform a priority in 2015.

Hadoop has evolved as a must-to-know technology

and has been a reason for better career, salary and

job opportunities for many professionals.

Page 2: Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

Objective

Big Data & Hadoop Developer training course is designed to provide knowledge and skills to become a successful

Hadoop Developer. In-depth knowledge of core concepts will be covered in the course along with implementation on

varied industry use-cases.

Course Objectives

• At the end of the course, participants should be able to:

• Master the concepts of HDFS and MapReduce framework

• Understand Hadoop 2.x Architecture

• Setup Hadoop Cluster and write Complex MapReduce programs

• Learn the data loading techniques using Sqoop and Flume

• Perform Data Analytics using Pig, Hive and YARN

• Implement HBase and MapReduce Integration

• Implement Advanced Usage and Indexing

• Schedule jobs using Oozie

• Implement best Practices for Hadoop Development

• Work on a Real Life Project on Big Data Analytics

Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org

Open-BDA Trainings

Page 3: Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

Open-BDA Trainings

Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org

INTORDUCTION TO HADOOP

• Why Hadoop?

• Scaling

• Distributed Framework

• Hadoop v/s RDBMS

• Brief history of Hadoop

JOINS USING MAPREDUCE

• MapReduce Design Patterns Hands On

• Map Side joins

• Reduce side joins

CUSTOM TYPES

• Input Types in MapReduce

• Output Types in MapReduce

• Custom Input Data types

• Custom Output Data types

Training Course – Day 1 (9:00 am – 18:00 pm)

INTORDUCTION TO MAPREDUCE

MapReduce Code Walkthrough

• ToolRunner

• MR Unit

• Distributed Cache

• Combiner

• Practitioner

• Setup and Cleanup methods

INTORDUCTION TO HIVE

• HIVE Architecture Explanation

• Importing Data From Local Servers and HDFS To HIVE Tables

• XML Data Processing by using Hive

• Json Data Processing by using Hive

• Log Data Processing by using Hive

• Hive Manage Tables and External Tables

• Hive Partitions.

• Running DDL,DML and Joins by using HIVE.

• Creating HIVE UDF’S

• Hive Hands on

ADVANCE MAPREDUCE HANDS ON

• MR Unit hands on

• Distributed Cache hands on

• Practitioner hands on

• Combiner hands on

• Map Side joins hands on

• Reduce Side Joins

Page 4: Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

Open-BDA Trainings

Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org

COMPLETE HIVE DISCUSSION

INTORDUCTION TO PIG

• What is Pig?

• Interacting HDFS using PIG

• How Pig Works

• Map Reduce Programs through

PIG

• Simple processing using Pig

• PIG Commands

• Loading, Filtering, Grouping….

• Data types, Operators…..

• Joins, Groups….

• Advanced Processing Using Pig

• Pig Hands On

• Pig User defined functions

Creation

Training Course – Day 2 (9:00 am – 18:00 pm)

INTORDUCTION TO OOZIE

• Understanding Oozie

• Oozie Workflow

• Designing & Implementing Workflow

• Oozie Coordinator application implementation

• Oozie Bundle application implementation

• Oozie Application -Deploy, Test & Execute

INTORDUCTION TO SQOOP

• What is Sqoop?

• Import and Export data from RDBMS to HDFS

• Import and Export data from RDBMS to HIVE

• Applying Conditions on Sqoop querys

• Importing Appended files into HDFS

HANDS ON ASSESMENT

Audience

• Analytics Professionals

• BI /ETL/DW Professionals

• Project Managers

• Testing Professionals

• Mainframe Professionals

• Software Developers and Architects

• Graduates aiming to build a career in Big Data

Page 5: Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

The Trainers

Mr. Babu MandadiTrainer (India)

Mr. Babu Mandadi is a skilled, experienced and much sought after Certified

Hadoop Trainer and Consultant. He conducts regular workshops for

corporate clients on Hadoop Ecosystem and Big Data Technologies helping

them build in-house competencies. His clients include start-ups as well as

multi-national companies based in India & US. He has overall 7+ Years of

Experience in the Training Domain. He has managed the entire training

functions for the corporate sectors including Content development, and

Material preparation. Trained by more than 1000+professionals.

Mr. Babu Mandadi is also an entrepreneur and is in the process of setting up

his own Big Data Analytics company.

He comes from a core technical background and has extensive experience in architecting enterprise class solutions on

various technology platforms in complex domains and served as a consultant for implementation of Project

Management tools and best practices. His vast experience in product development, project management and IT

operations has helped him in key positions with well known IT companies, such as Infosys, Wipro, NGRIT and

Landmark (Tata group).

Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org

Open-BDA Trainings

Page 6: Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

Prof. Dr. Tariq MahmoodData Scientist & Trainer

Dr. Tariq Mahmood is the Chief Data Scientist for Big Data with NexDegree

Pvt. Ltd., Data Scientist and Trainer with Innovative Management Services,

both based in Karachi, Pakistan. He is also an Associate Professor at the

Karachi Institute of Economics and Technology (KIET). He has around 10

years of professional and research experience in the domains of Business

Intelligence, Data Warehousing, Data Mining and Advanced Analytics. He

also has 6 years of professional and consultancy experience in Big Data

Analytics, particularly using Open-Source technologies like the Apache

Hadoop platform and NoSQL databases. Notably, Dr. Tariq has designed Big

Data Infrastructures for the Healthcare, Telecommunication and Financial

sector in Pakistan. He has conducted numerous training and workshops on

Big Data, both for Government and Private Organizations, notably the one

held at Mariott, Karachi www.pakistanciosummit.com/bigdataworkshop.

He also heads the Big Data Research Group at KIET www.sites.google.com/site/bigdatabolt/BDAResearch with a multi-

focus on discovering optimized infrastructures for Big Data applications, and of porting current data mining algorithms to

the Big Data platform. His primary focus is to facilitate the spread of Big Data technologies in the corporate sector, both

in Pakistan and across the globe.

The Trainers

Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org

Open-BDA Trainings

Page 7: Open-BDA - Big Data Hadoop Developer Training 10th & 11th June

What are the pre-requisites for this Course?Knowledge on core Java and basic SQL query's required.

Hands-on/Lecture RatioThis Apache Hadoop class is 70% hands-on, 30% lecture, with the longest lecture segments lasting for 45 minutes.

Students will get a balanced diet of the necessary theoretical knowledge and practical Big Data skills.

Registration Fee: 21, 500/=Inclusive:

• Hands on Hadoop Training on Data sets

• Systems

• Hand-outs

• CPE Hours

• Lunch

• Refreshments

FAQs & Fee

Innovative Management Services – All Rights Reserved 2015 www.innovative-management.org

For Information:Innovative Management Services

M2, 82/C, 11th Commercial Street, DHA Phase 2 Extension,

Karachi – 75500, Call: +92333 3005411

Email: [email protected]

Url: www.innovative-management.org

Open-BDA Trainings