17
A Cloud Environment for Backup and Data Storage HUGO E. CAMACHO, J. ALFREDO BRAMBILA, ALFREDO PEÑA, JOSÉ M. VARGAS

A cloud enviroment for backup and data storage

Embed Size (px)

DESCRIPTION

IGeekS Technologies (Make Final Year Project) No: 19, MN Complex, 2nd Cross, Sampige Main Road, Malleswaram Bangalore- 560003. Phone No: 080-32487434 /9590544567 / 9739066172 Mail: [email protected] , [email protected] Land mark : Near to Mantri Mall, Malleswaram Bangalore

Citation preview

Page 1: A cloud enviroment for backup and data storage

A Cloud Environment for Backup and Data StorageHUGO E. CAMACHO, J. ALFREDO BRAMBILA, ALFREDO PEÑA, JOSÉ M. VARGAS

Page 2: A cloud enviroment for backup and data storage

Abstract

Currently derived from advances and technological developments can have Input-Output devices ever better able to store more information. The use of the disks of the nodes of a cluster as global storage system is an inexpensive solution for a cloud environment. The need for the available of information from anywhere is increasing; this represents a problem for many users who use applications such as databases, media, personal file, documents, etc. The I/O data demands of these applications get higher as they get larger. In order to improve performance of these applications can use parallel file systems. PVFS2 is a free parallel file system developed by a multi-institution team of parallel I/O, networking and storage experts. In this paper we present the design of an implementation for cloud environment for able to store and back up data through using remote servers that can be accessed through the Internet. The implementation aims to increase the availability of data and reduce in loss of information.

Page 3: A cloud enviroment for backup and data storage

Existing System

Nowadays due to the growth of technologies, we have different services and applications that allow users to perform tasks that improve productivity in their daily activities. However, the need to access any time and / or from a remote site has grown from a theoretical proposal a genuine need. This has given rise to seek alternatives to solve this problem. One of the alternatives is known by the term cloud computing; cloud computing can be defined as that service (software, platform or infrastructure) located on the Internet and is accessed from a mobile device or desktop computer, giving users a wide variety of applications (databases, middle office software, storage, etc.).

Page 4: A cloud enviroment for backup and data storage

Disadvantages of Existing System

The public cloud in the existing system is maintained and operated by third parties not related with the organization, for this reason, both the processes and data of various clients are mixed on servers, storage systems and other infrastructure of the cloud.

It is not suited for companies that need high confidentiality and data security.

In existing system, management of the cloud is carried out by a client that controls which applications should run. Servers, network, and storage devices are the property of the organization. So they decide which users are allowed to use the infrastructure.

Page 5: A cloud enviroment for backup and data storage

Proposed System

We propose a cloud environment to backup and store data on a private cloud using PVFS2 like file system for storage data in order to increase the performance of these applications. This option allows input / output parallel, so that will reduce the access times to data. On the client-end, a multiplatform application is developed using free software that allows data transfer fast and simple way.

Page 6: A cloud enviroment for backup and data storage

Advantages of Proposed System

The advantages of this implementation is that it can reuse existing infrastructure (servers, cluster, and other devices) to reduce costs, becoming an advantage compared to having to deploy a private cloud from nowhere.

The proposed system increases the availability of data and prevents information loss.

It also allows data sharing, without having to delete the original file.

Page 7: A cloud enviroment for backup and data storage

Problem Statement

The different services and applications that allow users to perform tasks that improve productivity in their daily activities. However, the need to access any time and or from a remote site has grown from a theoretical proposal a genuine need. This has given rise to seek alternatives to solve this problem. One of the alternatives is known by the term cloud computing; cloud computing can be defined as that service (software, platform or infrastructure) located on the Internet and is accessed from a mobile device or desktop computer, giving users a wide variety of applications (databases, middle office software, storage, etc.).

Page 8: A cloud enviroment for backup and data storage

Scope

Cloud Computing and cloud storage have become the preferred method for delivering information and online functionality. The implementation of a larger number of servers time access to data is reduced with respect to the implementation of a single server. This is due to the distribution of data across multiple servers, allowing to access parts of the file parallel. Better still is the fact that it allows to manage in a file different customers because each of them can manage a part of the file, so further increasing parallelism. This implementation is being developed in order to give users the experience of implementing and managing a private cloud environment that facilitates the backup and data storage, using infrastructure already available or low cost. Avoiding payment of fees or memberships required to contract this service. This allows full control of those who access the information, so maintaining the confidentiality of the data.

Page 9: A cloud enviroment for backup and data storage

Architecture

Page 10: A cloud enviroment for backup and data storage

Modules

The proposed system consists of the following modules:

Cloud Storage Module.

Backup File System Module.

Server Application Module.

PVFS2 Module.

Page 11: A cloud enviroment for backup and data storage

Description of modules

Cloud Storage Module

A network model where the data is backed up and stored by using an internet connection on remote servers is known by the name of cloud storage, generally are hosted by third parties. Hosting companies operate large data centers, and people who require their data to be hosted buy or lease storage capacity from them. The data center managers often use virtualization to reserve resources according to customer requirements and expose them as storage spaces to store files or data objects. Physically, the resource can extend along multiple servers and multiple locations. The safety of the files depends upon the hosting companies, and on the applications that leverage the cloud storage.

Page 12: A cloud enviroment for backup and data storage

Backup File System Module

This application that serves as storage and backup environment in the cloud, we selected as a file system PVFS2 for our cloud environment; to be free and open source, we have the freedom to use and modify according to our needs. So far, some features of the application-level cloud and the benefits to be gained by paying for them have been mentioned. But it is important to know how to actually manage the backup and storage of files within the input / output.

Page 13: A cloud enviroment for backup and data storage

Server Application Module

The server application will contain one thread per user, provided by the Thread class in Java, which through Sockets application will contact the client with whom share data and metadata, depending on the rules described in the previous section. The application server will receive data and metadata as strings of bytes encrypted the decrypt, write, replace or delete as applicable, and also send data and metadata in encrypted byte streams to the client application. This application will also feature a web application which can view and download the files in the directory of each user, and simple steps can make the user account, and update personal information and password. Metadata that will keep are: file name, size, path, last date modified, deleted mark, this information is stored in a MySQL table.

Page 14: A cloud enviroment for backup and data storage

PVFS2 Module

PVFS2 distinguishes between three different components:

1) Data servers or I/O servers constitute the PVFS store subsystem. In order to calculate the available store space the system first determines the available space in each data server and then it multiplies the smallest obtained value by the number of data servers.

 

2) Metadata servers store all the metadata information of the PVFS namespace files (metafiles).

 

3) Clients or processing nodes that access the file system through one of three application programmer interfaces (APIs): a kernel Linux module that allows access through the Linux virtual file system (VFS) with the POSIX I/O API; a user-space library that provides access through what it is called the PVFS system interface (pvfs2lib); and a high level library, MPI-IO.

Page 15: A cloud enviroment for backup and data storage

Minimum Hardware Configuration of the proposed system

Processor : Intel/AMD

Speed : 1.1 GHz

RAM : 1 GB

Hard Disk : 20 GB

Key Board : Standard Keyboard

Mouse : Standard Mouse

Monitor : SVGA/LCD

Page 16: A cloud enviroment for backup and data storage

Software Configuration of the proposed system

Operating System : Windows

Technology : Java and J2EE

Web Technologies : HTML, JavaScript, CSS

IDE : Eclipse

Web Server : Tomcat 6/7

Tool kit : Android Phone

Database : MySQL 5.5

Java Version : JDK 1.6/1.7/1.8

Page 17: A cloud enviroment for backup and data storage

Reference Papers

Mohammad Hamdaqa, Ladan Tahvildari, Cloud Computing Uncovered: A Research Landscape, In: Ali Hurson and Atif Memon, Editor(s), Advances in Computers, Elsevier, 2012, Volume 86, Pages 41-85, ISSN 0065-2458, ISBN 9780123965356, http://dx.doi.org/10.1016/B978-0-12-396535-6.00002-8.

Rajkumar Buyya, Chee Shin Yeo, and Srikumar, Venugopal. “Marketoriented cloud computing: Vision, hype, and reality for delivering it services as computing utilities”. CoRR, (abs/0808.3558), 2008.

P. J. Braam, "The Lustre Storage Architecture," November. 2002. Windows Azure : http://www.windowsazure.com/es-es/ Hybrid Cloud: http://www.redhat.com/products/cloudcomputing/cloudforms/