6.1 Characteristics of a DFS. Yichuan Wang [email protected]. Files. Computing system Persistent data objects File system. DFS. Transparency Directory service (name service) Caching and replication Access control and security. DFS Characteristics. Dispersion [Randy Chow, 1997]
Transparency [3]
• Access transparency: client programs should be unaware of the distribution of files.
• Location transparency: client programs should see a uniform namespace. Files should be able to be relocated without changing their path names.
• Mobility transparency: neither client programs nor system administration tables in the client nodes should need to change when files are moved, either automatically or by the system administrator.
• Performance transparency: client programs should continue to perform well while the load varies within a specified range.
• Scaling transparency: increases in storage size and network size should be transparent.
Name Service
• Name space: a collection of names
• Name resolution: mapping a name to an object
– clients may see the same or different views of a directory hierarchy
• Three traditional ways to name files in a distributed environment
– Concatenate the host name to the names of files stored on that host: system-wide uniqueness is guaranteed and a file is simple to locate; however, the scheme is neither network transparent nor location independent, e.g., /machine/usr/foo
– Mount remote directories onto local directories: once mounted, files can be referenced in a location-transparent manner
– Provide a single global directory: requires a unique file name for every file; location independent, but cannot encompass heterogeneous environments and wide geographical areas
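The second naming scheme (mounting remote directories) can be illustrated with a small name-resolution sketch. This is only an illustration under stated assumptions: the mount table, the host names `fileserver1` and `fileserver2`, and the `resolve` function are hypothetical and do not come from the slides or from any real system.

```python
from typing import Dict, Optional, Tuple

# Hypothetical mount table: local mount point -> (server host, exported directory).
MOUNTS: Dict[str, Tuple[str, str]] = {
    "/home": ("fileserver1", "/export/home"),
    "/home/projects": ("fileserver2", "/export/projects"),
}


def resolve(path: str) -> Tuple[Optional[str], str]:
    """Map a client path to (host, path on that host); host is None for local files."""
    best = None
    for mount_point in MOUNTS:
        # A mount point matches if it is the path itself or a parent directory of it.
        if path == mount_point or path.startswith(mount_point + "/"):
            # Prefer the longest (most specific) matching mount point.
            if best is None or len(mount_point) > len(best):
                best = mount_point
    if best is None:
        return None, path  # not under any mount point: a local file
    host, exported = MOUNTS[best]
    return host, exported + path[len(best):]
```

The client always uses the same path (for example `/home/alice/notes.txt`); moving the files to another server only requires changing the mount table, which is what makes the reference location transparent.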
Cache [2]
Four places to store files
• Server's disk: slow performance
• Server caching in main memory
– cache management issues: how much to cache, replacement strategy
– still slow due to network delay
– used in high-performance web-search engine servers
• Client caching in main memory
– can be used by diskless workstations
– faster to access from main memory than from disk
– competes with the virtual memory system for physical memory space
– three options (Fig. 13-10)
• Client cache on a local disk
– large files can be cached
– virtual memory management is simpler
– a workstation can function even when it is disconnected from the network
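The cache-management questions above (how much to cache, which replacement strategy) can be made concrete with a minimal client-side block cache. This sketch assumes a least-recently-used (LRU) policy, which the slides do not name; `BlockCache` and the `fetch` callback standing in for a remote read are hypothetical.

```python
from collections import OrderedDict
from typing import Callable, Tuple


class BlockCache:
    """In-memory cache of file blocks with LRU replacement (assumed policy)."""

    def __init__(self, capacity: int, fetch: Callable[[str, int], bytes]):
        self.capacity = capacity  # "how much to cache", in blocks
        self.fetch = fetch        # stand-in for a remote read over the network
        self.blocks: "OrderedDict[Tuple[str, int], bytes]" = OrderedDict()
        self.hits = 0
        self.misses = 0

    def read(self, path: str, block_no: int) -> bytes:
        key = (path, block_no)
        if key in self.blocks:
            self.hits += 1
            self.blocks.move_to_end(key)  # mark as most recently used
            return self.blocks[key]
        self.misses += 1
        data = self.fetch(path, block_no)  # remote access only on a miss
        self.blocks[key] = data
        if len(self.blocks) > self.capacity:
            self.blocks.popitem(last=False)  # evict the least recently used block
        return data
```

Every hit avoids one remote access, so the hit and miss counters give a rough picture of how much network traffic and server load the cache removes for a given capacity.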
Cache
1. Reduces remote accesses, which reduces network traffic and server load.
2. Total network overhead is lower for transferring big chunks of data (caching) than for a series of responses to specific requests.
3. Disk access can be optimized better for large requests than for random disk blocks.
4. The cache-consistency problem is the major drawback. If writes are frequent, the overhead due to the consistency problem is significant.
5. The OS is simpler for remote service.
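The consistency overhead in point 4 can be seen in a small sketch. It assumes one possible scheme, client-initiated validation against a per-file version number, which is not described in the slides; `Server`, `ValidatingClient`, and their methods are hypothetical names.

```python
class Server:
    """In-memory stand-in for a file server that versions each file."""

    def __init__(self):
        self.files = {}  # path -> (version, data)

    def write(self, path, data):
        version = self.files.get(path, (0, b""))[0] + 1
        self.files[path] = (version, data)

    def get_version(self, path):
        return self.files[path][0]

    def read(self, path):
        return self.files[path]


class ValidatingClient:
    """Client cache that checks the server's version before using a cached copy."""

    def __init__(self, server):
        self.server = server
        self.cache = {}        # path -> (version, data)
        self.validations = 0   # small messages: "is my copy still current?"
        self.transfers = 0     # full data transfers from the server

    def read(self, path):
        self.validations += 1
        current = self.server.get_version(path)
        cached = self.cache.get(path)
        if cached is None or cached[0] != current:
            # Cached copy is missing or stale: fetch the file again.
            self.transfers += 1
            cached = self.server.read(path)
            self.cache[path] = cached
        return cached[1]
```

When writes are rare, most reads cost only a validation message. When writes are frequent, nearly every read also triggers a full transfer, so the cache saves little while still paying for the checks.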
NFS
The Network File System (NFS) was developed to allow machines to mount a disk partition on a remote machine as if it were a local hard drive. This allows fast, seamless sharing of files across a network.
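[Figure: NFS architecture. Client computer: application programs make system calls into the UNIX kernel, where the virtual file system passes local requests to the UNIX file system (or another file system) and remote requests to the NFS client. Server computer: the NFS server, virtual file system, and UNIX file system in the UNIX kernel. The NFS client and NFS server communicate via the NFS protocol.]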
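The layering in the NFS architecture figure can be sketched as a virtual file system that routes each request either to a local file system or to an NFS client. This is a simplified illustration, not the real NFS implementation: the classes are hypothetical, files are held in dictionaries, and a direct method call stands in for the NFS protocol exchange between client and server.

```python
class LocalFS:
    """Stand-in for a UNIX file system: maps paths to file contents."""

    def __init__(self, files):
        self.files = files

    def read(self, path):
        return self.files[path]


class NFSServer:
    """Serves files below an exported directory of the server's own file system."""

    def __init__(self, fs, export_root):
        self.fs = fs
        self.export_root = export_root

    def handle_read(self, relative_path):
        return self.fs.read(self.export_root + relative_path)


class NFSClient:
    """Forwards requests to the server; a real client would send an RPC here."""

    def __init__(self, server):
        self.server = server

    def read(self, relative_path):
        return self.server.handle_read(relative_path)


class VirtualFileSystem:
    """Routes each path to the local file system or to a mounted NFS client."""

    def __init__(self, local_fs):
        self.local_fs = local_fs
        self.remote_mounts = {}  # mount point -> NFSClient

    def mount(self, mount_point, nfs_client):
        self.remote_mounts[mount_point] = nfs_client

    def read(self, path):
        for mount_point, client in self.remote_mounts.items():
            if path.startswith(mount_point + "/"):
                return client.read(path[len(mount_point):])  # remote
        return self.local_fs.read(path)                      # local
```

An application calls `read` on the virtual file system with the same interface for both cases, which is how the mounted remote partition appears to be local.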
Google FS [Sanjay G. et al., 2003]
• The system is built from many inexpensive commodity components that often fail.
• Big file optimization
• Large, sequential writes; files are seldom modified
• Multiple concurrent appends
• High sustained bandwidth is more important than low latency
GFS architecture
• A GFS cluster consists of a single master and multiple chunkservers and is accessed by multiple clients.
• The master stores all the metadata: namespace, access control information, the mapping from files to chunks, and the current locations of chunks.
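The two mappings held by the master can be sketched as a pair of dictionaries. This is a toy model, not GFS code: the `Master` class, its method names, and the chunk handles and chunkserver names in any usage are hypothetical. The 64 MB chunk size is the value reported in the GFS paper and is not stated in the slides above.

```python
CHUNK_SIZE = 64 * 1024 * 1024  # 64 MB, per the GFS paper (not from the slides)


class Master:
    """Toy model of the master's file-to-chunk and chunk-location metadata."""

    def __init__(self):
        self.file_chunks = {}      # file name -> ordered list of chunk handles
        self.chunk_locations = {}  # chunk handle -> list of chunkserver names

    def add_chunk(self, filename, handle, servers):
        self.file_chunks.setdefault(filename, []).append(handle)
        self.chunk_locations[handle] = list(servers)

    def lookup(self, filename, byte_offset):
        """Translate (file, byte offset) into (chunk handle, chunkserver list)."""
        index = byte_offset // CHUNK_SIZE  # which chunk holds this offset
        handle = self.file_chunks[filename][index]
        return handle, self.chunk_locations[handle]
```

In this model a client asks the master only for the chunk handle and locations, and would then read the data from one of the listed chunkservers, so file contents never pass through the single master.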