Ph:+91 90528 20000, USA +1 209 207 3642 contact@attainonlinetraining.com www.attainonlinetraining.com
Hadoop Online Training Content - Attain Technologies, Madhapur, Hyderabad, 2013
Hadoop Course Content
(Includes theoretical as well as practical sessions)
Table of Contents - With Real Time Faculty
1. Basics of Parallel Programming (4 hours)
a. Multi-Threading
b. OpenMP (Open Multi-Processing) and MPI (Message Passing Interface)
c. Performance tuning and optimization
i. Matrix Multiplication
ii. Unique word count problem
2. Distributed computing concepts (2 hours)
3. Hadoop Overview (6 hours)
a. Why Hadoop?
b. Brief history of Hadoop
c. Architecture of Hadoop
d. Overview of HDFS (Hadoop Distributed File System) and the MapReduce (MR) framework
e. Overview of problems solved by Hadoop
i. Data Mining
ii. Web Mining
iii. Natural Language Processing
iv. K-means clustering
v. Sentiment Analysis
4. MapReduce Programming Model (8 hours)
a. Details of execution of the MapReduce framework
b. Word count problem solved using the MapReduce programming model
c. Data mining on the Wikipedia data set
5. Hadoop ecosystem (2 hours)
6. Hadoop Programming Languages (4 hours)
a. Pig
b. Hadoop Pipes (C++)
c. Hadoop Streaming
d. Hadoop and R
7. Distributed database concepts (4 hours)
a. RDBMS vs. NoSQL databases
b. Overview of HBase and Cassandra
8. Advanced MapReduce Programming (chaining Mappers and Reducers)
9. Case Studies
A. Data mining on the Wikipedia data set using:
a. Batch-mode processing (MapReduce)
b. Hive
c. HBase and Hive
B. Web Mining using Apache Nutch, Apache Solr and Hadoop
C. Web Log processing using Flume and Hadoop
D. Complex event processing using Flume, Hadoop and EPL (Event Processing Language)
E. Integrating Hadoop and RDBMS
Prerequisites:
(1) Hands-on programming experience in core Java, C++, R, or Python
(2) Hands-on parallel/multithreaded programming experience
(3) A query language (SQL or EPL) (optional)