15
Chapter 6.5 Distributed File Systems Summary Junfei Wen Fall 2013

Chapter 6.5 Distributed File Systems Summary Junfei Wen Fall 2013

Embed Size (px)

Citation preview

Chapter 6.5 Distributed File Systems Summary

Junfei Wen Fall 2013

Outline

• 6.1 Characteristics of DFS• 6.2 DFS Design and Implementation• 6.3 Transaction and Concurrency Control• 6.4 Data and File Replication• Current Work• Future Work

6.1Characteristics of DFSDispersion • Dispersed Files

Location Transparent Location Independent

• Dispersed Clients login transparencyaccess transparency

Multiplicity • Multiple Files

Replication Transparency• Multiple Clients

Concurrency Transparency

Others (general)• Fault Tolerance – crash of server or client, loss of message• Scalability – Incremental file system growth • Efficient

6.2DFS Design and Implementation

• Hierarchy files structure• File mounting protocol– Explicit mounting, manual– Boot mounting, boot up time mounting– Auto mounting, mounting at use

• Distribute state information between server and clients. Stateless or stateful server

• File access– Space multiplexing, multiple copy of file

• Remote access• Cache access• Download/upload access

– Time multiplexing, concurrent control. Same file different time• Simple RW• Transaction• session

• File sharing semantics• Unix, update propagated immediately• Session, delayed update• Transaction, delayed update

COMPONENTS IN A FILE SYSTEM

6.3Transaction and Concurrency Control

• Distribdute Transaction Processing System:– Transaction Manager: correct execution of local

and remote transactions.– Scheduler: schedules operations to avoid conflicts

using locks, timestamps and validation managers.– Object Manager: coherency of replicas/caches;

interface to the file system.

• Serializability: A schedule is Serializable if the result of execution is equivalent to that of a serial schedule.

• Concurrency Control Protocol required to maintain ACID Semantics for Concurrent transactions. – Two phase locking– Timestamp ordering– Optimistic

6.4Data and File Replication

• Architecture• Client chooses one / more FSA to access data object.• FSA acts as front end to replica managers RMs to

provide replication transparency.• FSA contacts one or more RMs for actual updating

and reading of data objects.

• One-copy Serializability: – The execution of transaction on replicated objects

is equivalent to the execution of the same transactions on non-replicated objects

– Read Operations : Read-one-primary, Read-one ,Read-quorum

– Write Operations:Write-one-primary,Write-all,Write-all-available,Write-quorum,Write-gossip

• Quorum Voting : Uses Read Quorum, Write Quorum– Write-write conflict: 2 * Write quorum > all object copies– Read-write conflict: Write quorum + read quorum > all

object copies.

• Gossip Update Propagation:– Read: if TSfsa<=TSrm, RM has recent data, return it,

otherwise wait for gossip, or try other RM– Update :if Tsfsa>TSrm, update. Update TSrm send gossip.

Otherwise, process based on application, perform update or reject

– Gossip : update RM if gossip carries new updates.

Current work• A Distributed Cache for Hadoop

Distributed File System in Real-Time Cloud Service

• Intensive Workload Consolidation for the Hadoop Distributed File Systems

• An integrated high-performance distributed file system implementation on existing local network

• A Cost-Effective File Lookup Service in a Distributed Metadata File System

• The Mobile Agent-based Distributed Network File system

Future work

• Innovations in the area of security for Distributed/Cloud Computing

• Improve efficiency of Parallel/Distributed system Concurrency control protocol

• Improve Efficiency and Effectiveness of file replication scheme

• Integrate File Replication and Consistency Maintenance

Reference[1]Distributed Operating Systems and Algorithm Analysis, Andy Chow

& Theodore Johnson,1997[2] “Optimizing File Replication over Limited-Bandwidth Networks

using Remote Differential Compression” IEEE Infocom Conference, 2006.

[3] Transaction Management and Concurrency control by Connolly & Begg. Chapter 19. Third edition

[4] "Distributed File System Replication: Frequently Asked Questions";http://technet2.microsoft.com/WindowsServer/en/library/f9b98a0f-c1ae-4a9f-9724-80c679596e6b1033.mspx?mfr=true

[5]http://blogs.cs.st-andrews.ac.uk/angus/2009/09/[6]http

://www.quora.com/Distributed-Systems/What-is-the-future-of-file-systems -Future of File Systems

Thank you!