If you can't read please download the document
Upload
phungliem
View
221
Download
4
Embed Size (px)
Citation preview
1 Hadoop Online Training | www.iqonlinetraining.com
Hadoop Online Training
IQ training facility offers Hadoop Online Training. Our Hadoop trainers come with vast work
experience and teaching skills. Our Hadoop training online is regarded as the one of the best online
training in India. All our students were happy and able to find Jobs quickly in USA, UK, Singapore,
Japan and Europe. Hadoop training online is your one stop solution to learn Hadoop at the comfort
of your home with flexible schedules.
IQ Training offers the Hadoop Online Course in a true global setting.
Course Contents:
Introduction to Hadoop and Architecture
1. Hadoop 1.0 Architecture
Introduction to Hadoop & Big Data
Hadoop Evolution
Hadoop Architecture
Networking Concepts
Use cases - Where Hadoop fits into
2. Hadoop 2.0 Architecture
Limitations on Hadoop 1.0 Architecture
Features of Hadoop 2.0 Architecture
HDFS Federation
High Availability of Name Node
YARN Yet Another Resource Negotiator
Non MR applications on top of YAR
Quiz on Architecture Concepts
2 Hadoop Online Training | www.iqonlinetraining.com
Prerequisites for Hadoop Developer/Data Analysts/Admins
1. Linux
Introduction to Linux
Commands & Shell Scripts
Vi& Vim editor features
Case Study to develop a Shell Script
2. Java
Introduction to OOPS & JAVA
Discussion on Object, Class & Methods
Features & Concepts of Core Java for developing MR jobs
Familiarizing Eclipse
Case Study to develop a Java Code with the concepts learnt
3. Python
Introduction to concepts of Python
How different is Python from other Programming Languages
Complex data Types in Python (Tuple, List, Dictionary)
Inbuilt Modules available in Python
File handling functions using Python
Case Study to develop a Python Code with the concepts learnt
Cluster Installation
1. Hadoop Cluster Installation
Types of Hadoop Cluster
Installing Pseudo Mode Cluster
Walk through on inbuilt scripts, directories, configuration files and port numbers.
Discussion on Real Time Cluster Size
Detailed documentation on Installation Procedure
3 Hadoop Online Training | www.iqonlinetraining.com
Distributed File System - HDFS
1. HDFS Commands
Introduction to HDFS Commands
Discussion on scenarios where specific commands are applicable
Introduction to Advanced HDFS Commands including fine tuning of cluster
Detailed documentation on all the HDFS Commands
Custom Script building using HDFS & Unix commands
Quiz on HDFS Commands
Map Reduce - MR
1. Map Reduce using Java
Introduction to Map Reduce Architecture
Detailed discussion on different phases of MR
o Mapper
o Reducer
o Splitting
o Sorting
o Shuffling
o Combiner
o Spilling
o Partitioning
o Merging
Developing Map Reduce Application from Scratch using different use cases
Discussion of difference between Old MR API & New MR API
Introduction to different file formats and their internal features (Sequential,Binary
etc.,)
Developing MR code for Image Analytics
Case Study on Map Reduce (Customer Sentiment Analyser)
4 Hadoop Online Training | www.iqonlinetraining.com
2. Map Reduce using Python Streaming
Developing Map Reduce Application using Python
Discussion of different features available in Streaming
Case Study on Map Reduce Streaming (Analytics on Temperature Datasets)
Quiz on Map Reduce
Hadoop Eco System Components
1. Hive (Data Warehouse on top of HDFS)
Introduction to Hive Architecture
Configuring Hive Metadata store in different ways
Basic Queries in Hive (DDL,DML)
Advanced features of Hive
o Partitioning
o Bucketing
o Sampling
o Multi Table load Queries
o Serialize & De Serialize
Dealing with different formats of data (Flat file, JSON, CSV etc.,)
Query optimization using Hive.
Developing User Defined Functions (UDFs) in Java & Python
Case Study (Analytics on Telecom Datasets)
Quiz on Hive
2. PIG (Data Flow Language)
Introduction to Pig Latin
Basic Commands in Pig
Explanation of advanced features like Pig with real time scenarios
Different ways of using Pig Storage
Dealing with Unstructured data
Developing Regular Expressions
Developing User Defined Functions (UDFs) in Java & Python
Case Study (Analytics on Books Datasets)
Quiz on Pig
5 Hadoop Online Training | www.iqonlinetraining.com
3. SQOOP (Import Export utility)
Introduction to Sqoop Basic Sqoop Commands
Advanced Import Features
Advanced Export Features
o Upsert calls
o EVAL
o Compressed formats
Case Study (Analytics on Telecom Datasets)
Quiz on Sqoop
4. HBASE (Versioned Database)
Introduction to HBASE & NOSQL
Basic difference in Row Oriented and Column Oriented storage
Basic HBASE Commands
Advanced HBASE Features
o Versions
o Compression Techniques
o Bloom Filters
o Sequential Scans
Bulk Loads to HBASE Features
Case Study on HBASE
Quiz on HBASE
5. Flume
Flume Architecture
Configuring Flume Components
o Source
o Sink
o Channel
o Agents
6 Hadoop Online Training | www.iqonlinetraining.com
Building Flume Config files for different scenarios
o Basic Config File building
o Config file for connecting to different File Servers
o Config file for connecting to Web Servers
Quiz on Flume
6. Scheduler (OOZIE & Autosys)
Introduction to Oozie
Introduction to Autosys
Using Schedulers for Batch Processing
Quiz on OOZIE
Finally this series of Practical Sessions End with Quiz on entire Series.