14
HADOOP INFRASTRUCTURE FOR EDUCATION Darko Marjanović, [email protected] Miloš Milovanović, [email protected] Božidar Radenković, [email protected] University of Belgrade Faculty of Organizational Sciences Laboratory for E-business

Hadoop infrastructure for education

Embed Size (px)

DESCRIPTION

Hadoop infrastructure for education

Citation preview

Page 1: Hadoop infrastructure for education

HADOOP INFRASTRUCTURE FOR EDUCATION

Darko Marjanović, [email protected]

Miloš Milovanović, [email protected]

Božidar Radenković, [email protected]

University of Belgrade

Faculty of Organizational Sciences

Laboratory for E-business

Page 2: Hadoop infrastructure for education

Laboratory for E-business

• Exists within the Faculty of Organizational Sciences, University of Belgrade

• Organizes e-learning courses since 2001. by using Moodle LMS and blended learning concept

• More than 1000 students take our courses each year• Research areas:

E-business Internet and mobile technologiesBig DataCloud ComputingE-educationAdaptive e-services Internet of things Social media

Page 3: Hadoop infrastructure for education

Overview

• Introduction

• Hadoop model for education

• Implementation

• Cluster organizaton

• Conclusion

Page 4: Hadoop infrastructure for education

Introduction

• Education institutions need to have access to relevant information in order to offer high-quality education to students.

• Main problem – Information arrive to organizations

• from variety of sources

• with rapidly increasing speed

• in variety of types.

• Hadoop as a possible solution to this matter

Page 5: Hadoop infrastructure for education

Hadoop

• Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware.

• All the modules in Hadoop are designed with a fundamental assumption that hardware failures are common and thus should be automatically handled in software by the framework.

Page 6: Hadoop infrastructure for education

Big Data

• Big data is a blanket term for any collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.

Page 7: Hadoop infrastructure for education

Hadoop model for education

Guidelines used for deploying Hadoop model:

• Efficient data import

• Reliable manipulation

• Flexible output

Page 8: Hadoop infrastructure for education

Model for managing Big Data in educational institutions

Page 9: Hadoop infrastructure for education

Implementation

• Three node cluster

• Integration with Moodle LMS

• Distributed storage

• In its performance, Hadoop cluster consumes a significant amount of resources, and controlling them is inevitable.

Page 10: Hadoop infrastructure for education

Implemented Hadoop e-learning infrastructure

Page 11: Hadoop infrastructure for education

Cluster organization

• Central role that is responsible for Hadoop’s performance is represented by Master node.

• In order to optimize Hard Disk Drive Memory, the implementation described here contains Data Node installed on the Master Node

• Imposed mechanism for preventing data losing between nodes is to constantly monitor network infrastructure.

• Data replication as a mechanism for preserving data within cluster

Page 12: Hadoop infrastructure for education

Hadoop cluster organization in Laboratory for E-business

Page 13: Hadoop infrastructure for education

Conclusion

• A scalable platform that brings Big Data based on Hadoop to e-learning environment is presented.

• Main contribution of described paper is providing environment for manipulating data generated from variety of sources in education activities.

• Primary objective is improvement of e-learning process.

• Future research is directed to:-optimizing integration with e-learning services-integration with cloud platform

Page 14: Hadoop infrastructure for education

HADOOP INFRASTRUCTURE FOR EDUCATION

Darko Marjanović, [email protected]

Miloš Milovanović, [email protected]

Božidar Radenković, [email protected]

University of Belgrade

Faculty of Organizational Sciences

Laboratory for E-business