17
File Organization Techniques

File Organization Techniques

Embed Size (px)

DESCRIPTION

File Organization Techniques. Topics Covered: File Organization Techniques of file organization Serial file organization Sequential file organization Direct file organization Indexed sequential file organization. File Organization: - PowerPoint PPT Presentation

Citation preview

Page 1: File Organization Techniques

File Organization Techniques

Page 2: File Organization Techniques

Topics Covered:

File Organization

Techniques of file organization

◦ Serial file organization

◦ Sequential file organization

◦ Direct file organization

Indexed sequential file organization

Page 3: File Organization Techniques

File Organization:

File organization refers to the relation ship of the key of the record to the physical

location of that record in the computer file. Two important characteristics of files are Data Organization and Method of Access.

Data organization, refers to the way the records of the file are organized on the

backing storage device.

Method of access, refers to the way in which records are accessed. Some

organizations are more versatile than others. A file with an organization of Indexed

or Relative may still have its records accessed sequentially; but records in a file

with an organization of Sequential, cannot be accessed directly.

Page 4: File Organization Techniques

Techniques of File Organization:

1. Serial - Records organized serially in any order.

2. Sequential - Records organized serially in ascending or descending order.

3. Direct - Relative record number based organization.

4. Indexed - Index based organization.

Serial Organization

Creation of file -> In a Sequential file the records are arranged one after another.

There is no relationship between key field values of consecutive records. In other

words records can be stored in an order.

Page 5: File Organization Techniques

Access -> Only way to access records in a serial file, is linear search. i.e. key field

value to be located is compared with key field value of each record starting at the

first record and read all the succeeding records until the required record is found or

until the end of the file is reached.

Insertion -> Since no space is left between the records at the time of creation,

insertion can be performed at the end of the file. However to insert a record in

between method of rewriting can be used.

Deletion -> To delete any record first it is located then the record is marked for

deletion.

Media Used -> Serial organization may be implemented on magnetic tape or on

hard disk. i.e. on serial access device as well as direct access device.

Page 6: File Organization Techniques

Advantages->

Easy to use

Maximum utilization of space.

Algorithms are easy to implement.

Less storage cost

Disadvantages->

Slow organization because only linear access is possible.

It is not really feasible to delete and update records in a serial file.

Page 7: File Organization Techniques

Sequential Organization

Creation of file -> In a Sequential file the records are arranged one after another. In

ascending or descending order of key field value.

Access -> Records in a sequential file can be accessed either by using linear search

in which all the records are searched starting at the first record and read all the

succeeding records until the required record is found or until the end of the file is

reached. The alternative method is skip search in which given key field is compared

with key field of record after skipping a fixed number of records.

Insertion -> Insertion can be performed either at the end of the file or by rewriting

the file. But in any case the sequence must be maintained.

Page 8: File Organization Techniques

Deletion -> To delete any record first it is located either sequentially or by skip

search and then the record is marked for deletion.

Media Used -> Sequential organization may be implemented on magnetic tape or

on hard disk. i.e. on serial access device as well as direct access device.

Advantages->

Fast access as compared to serial organization because skip search can be used.

Maximum utilization of space.

Algorithms are easy to implement.

Disadvantages->

Sequence of records is to be maintained for which extra time & efforts are

required.

Page 9: File Organization Techniques

Direct Organization (Random or Relative)

Creation of file -> In direct file organization the records are placed randomly on

backing storage device without any sequence of key field value. Key field value at

given record is converted to address on backing storage device and then given

record is stored at calculated address.

Access -> Any record can be accessed from its storage location or address. The

previous records need not to be accessed here i.e all records can be retrieved

independently.

Insertion -> Insertion can be performed randomly at any position by calculating

the address.

Page 10: File Organization Techniques

Deletion -> Deletion can be performed randomly from any position by calculating

the address.

Media Used -> Direct organization can be created only on direct-access storage

device like magnetic disk.

Advantages->

Immediate access to records for updating is available.

Transactions need not be sorted.

Random inquiries which are too frequent in business can be handled easily.

Updating of any record does not require the rewriting of the entire file.

Direct file organization is suitable for interactive online applications such as

airlines or railway reservation or banking applications.

Page 11: File Organization Techniques

Disadvantages->

Expensive hardware and software resources are required.

Complex in programming

Less efficient in terms of use of storage space as compared to sequential

organization.

Since it is used in online applications problem of security may arise.

Page 12: File Organization Techniques

Indexed sequential organization

Creation of file -> In this organization the records are organized in sequence but

direct access is possible to individual records through an index. Here storage area is

divided into three parts one is prime area, second is overflow area, third is

indexed area.

Prime area: It covers the backing storage device. In prime area records are placed

in sequential order i.e. ascending or descending order of key field value. Records

are written in prime area when the file is reorganized or created.

Overflow area: It is one in which records are stored when prime area is full.

Indexed area: It is used to store index of the file. Index contains track number and

highest key field value on that track.

Page 13: File Organization Techniques

e.g. if we have a storage device with 5 tracks and 5 records can be stored on each

track then indexed organization can be indicated as follows:

1

2

3

4

5

Track

Prime area Index area

Overflow area

1 4 6 30 31

35 42 47 51 59

60 61 63 67 72

74 76 81 84 90

95 97 98 99 102

Track no

HKF Value

12345

31597290

102

Page 14: File Organization Techniques

Access -> Records can be accessed in linear form in which all the records are

searched starting at the first record and read all the succeeding records until the

required record is found or until the end of the file is reached.

In second method given key field value is compared with index to obtain track

number for which highest key field value is greater than or equal to given key field

value then that particular track is searched sequentially to obtain the record.

Insertion -> Insertion can be performed either at the end of the file or insertion can

be performed in between by shifting the records within the file. During insertion if

prime area is full records are stored in overflow area. e.g. in the above example if

record with key field value 64 is to be inserted then 67 (on track no. 3)will be

shifted to next position and 72 will be shifted to overflow area.

Page 15: File Organization Techniques

1

2

3

4

5

Track

Prime area Index area

Overflow area

1 4 6 30 31

35 42 47 51 59

60 61 63 64 67

74 76 81 84 90

95 97 98 99 102

Track no

HKF Value

12345

31597290

102

72

Page 16: File Organization Techniques

Deletion -> For deletion first the record is searched using indexed sequential

method and then marked for deletion.

Media Used -> Indexed sequential organization can be created only on direct-

access storage device like magnetic disk.

Advantages->

This process is faster as compared to other organization.

It combines positive aspects of both sequential and direct access files.

Disadvantages->

Wastage of space in creating index.

Slow retrieval as compared to direct access as searching of index requires time.

Page 17: File Organization Techniques

Thanks!!