26
1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

Embed Size (px)

Citation preview

Page 1: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

1

INTRO TO DATABASES PART II

IS 340

BY

CHANDRA S. AMARAVADI

Page 2: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

2

DBMS evolution file processing, hierarchical, relational

ER Models Distributed databases Multi-media databases Data warehouses

IN THIS PRESENTATION..

Page 3: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

3

DBMS EVOLUTION

Page 4: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

4

EVOLUTION OF DBMS

File processingsystems

Hierarchicalsystems

Relationalsystems

1 March 2002 Greece2 July 2003 Italy3 ……………..

Person

Photo1 Photo2

Greece Italy

PhotoId

Page 5: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

5

EVOLUTION OF DBMS..FILE PROCESSING SYSTEMS:

Data in the form of “flat files” (ascii files) Each program had its own specific files Data was duplicated across files.

A system where data was managed by the programin the form of flat files.

Page 6: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

6

EVOLUTION OF DBMS..PROBLEMS CAUSED BY FILE PROCESSING

Uncontrolled redundancy Poor data quality (a.k.a.?) Lack of data sharing

Page 7: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

7

HIERARCHICAL MODEL

structure is hierarchical (not flat) one or more root segments child segments are linked hierarchically

difficult to write queries but very efficient (fast)!

A model for storing data:

Page 8: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

8

HIERARCHICAL MODEL

NAME TOTAL_ENROLL NO_OF_PHD

ASNAME JOB_TITLE FNAME RANK DEGREE

(D) DEPARTMENT

(A) ADMIN-STAFF (F) FACULTY

What do you call this?

Page 9: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

9

Data organized into tablesData retrieved by using SQL, 4th GL

joins tables by equating values of cross reference keys.

RELATIONAL MODEL

8895 Smith 4/16/00 $35,000

8896 Farley 4/22/01 $300

8897 Gomez 1/10/00 $2,000

ACCT

ACCT# NAME DT. OPENED BALANCE

Page 10: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

10

ER MODELS (Data Models)

Page 11: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

11

DEFINITIONSEntity – Individual example of person, place or thing.Entity Class – Collection of related entities.Attributes – Properties of entity classes about which we would like to collect information.

Page 12: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

12

AN EXAMPLE ER CHART

CUSTOMER

ORDERS

places

Cust# Name

PRODUCTSare for

ENTITY CLASS

ATTRIBUTE

RELATIONSHIP

Ord# Ord dt

Page 13: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

13

Draw Data Models (ER models) corresponding to the followingSituations:

Athletes participate in events (qualifier, semi-finals, finals) in various sports (swimming, boxing etc.).

Clients can book one or more properties with a rental agency. Eachproperty can be booked by more than one client (for different dates).

Trains arrive and depart from platforms in train stations, corresponding to the cities that they stop in. A city can have many Stations.

Airlines operate a number of flights which arrive and depart fromGates. There could be several flights from a gate, but each flightis assigned only one gate. Each airline operates a gate or a set of Gates which are leased to them by the airport.

DISCUSSION

Page 14: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

14

Design a database for the following situation: Bellsouth, atelephone company needs a database to maintain records ofits automobiles, repairs, mechanics and special equipment neededto repair cars. Mechanics are assigned skill codes based on thetype of repairs they can carry out. For e.g. Class III mechanicscan perform reborings, engine overhauls and transmission repairs.The repair shops are located in 300 cities and towns throughout the state. Mechanics can be assigned to only one repair shop at atime. The application requires knowing what equipment is (fore.g. diagnostic equipment, hydraulic machinery) is at what locationand also the repairs carried out on each vehicle. For each vehiclethe application requires knowing its identification number, mileage and repair history.

DISCUSSION

Page 15: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

15

DISTRIBUTED ANDMULTI-MEDIA DATABASES,

DATA WAREHOUSES

Page 16: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

16

CENTRALIZED DATABASES

The database is in one physical location. All applications regardless of whether the clients are located in Seoul or Detroit need to access data from that physical location. What are the limitations?

Centralized databases

Page 17: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

17

DISTRIBUTED DATABASES

Replicated -- copies of the database maintained in multiple sites Partitioned -- database is physically divided into chunks

The data is physically stored in multiple geographical locations

Page 18: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

18

REPLICATED DATABASES..

Page 19: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

19

PARTITIONED DATABASES..

Emp ID Emp Name Emp Address

11049 Richard 200 Meadow brook

11051 Cathy 13 E. Willow

11054 Hugh 1400 E. Washington

Page 20: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

20

MULTI-MEDIA DATABASES

Emp ID Emp Name Emp. Picture Emp Address

A database that is able to handle multiple data types: Conventional information Images Audio & video

Capable of storing, retrieving and updating this type ofinformation

Emp.jpg

Page 21: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

21

MULTI-MEDIA DATABASES..

APPLICATIONS

Travel bureaus Hospitals Manufacturing Training

Page 22: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

22

MULTI-MEDIA DATABASES..

Storage space Retrieval Modification

ISSUES

Page 23: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

23

DATA WAREHOUSE: Historical data organized for analysis and decision making.

DATA MART: smaller version of data warehouse, specialized by

functional area e.g. marketing

ON-LINE ANALYTICAL PROCESSING (OLAP): organizes data into a “cube” for decision analysis.

DATA WAREHOUSES

Page 24: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

24

HISTORICALINFORMATION

INTERNALDATASOURCES

EXTERNALDATASOURCES

OPERATIONAL,HISTORICAL DATA

DATA WAREHOUSE

EXTRACT,TRANSFORM

OLAP

DATA MINING

DATA WAREHOUSING & OLAP

Page 25: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

25

DISCUSSION QUESTIONS

Are Databases and DBMSs one and the same? Does a record consist of files? What is the smallest unit of data in the database? Does data integrity refer to data quality? What does the detail line of a report consist of? Is an “employee” an entity class or an attribute? Is the data dictionary useful while creating the database? Why do we use SQL? Is it necessary to use SQL to produce database reports? Why do we use data models? Would multi-national organizations prefer centralized databases?

Page 26: 1 INTRO TO DATABASES PART II IS 340 BY CHANDRA S. AMARAVADI

26

THAT’S ALL FOLKS!