57
1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part. CHAPTER SIX DATA BUSINESS INTELLIGENCE

1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

Embed Size (px)

Citation preview

Page 1: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

1

© 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied,

scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.

CHAPTER SIX

DATA

BUSINESS INTELLIGENCE

Page 2: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

2

CHAPTER OVERVIEW

SECTION 6.1 – Data, Information, Databases• The Business Benefits of High-Quality Information• Storing Information Using a Relational Database

Management System• Using a Relational Database for Business Advantages• Driving Websites with Data

SECTION 6.2 – Business Intelligence• The Business Benefits of Data Warehousing• Performing Business Analysis with Data Marts• Uncovering Trends and Patterns with Data Mining• Supporting Decisions with Business Intelligence

Page 3: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

3

© 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied,

scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.

SECTION 6.1

DATA, INFORMATION,

AND DATABASES

Page 4: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

4

LEARNING OUTCOMES

1. Explain the four primary traits that determine the value of information

2. Describe a database, a database management system, and the relational database model

3. Identify the business advantages of a relational database

4. Explain the business benefits of a data-driven website

Page 5: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

5

THE BUSINESS BENEFITS OF HIGH-QUALITY INFORMATION

Information is everywhere in an organization

Employees must be able to obtain and analyze the many different levels, formats, and granularities of organizational information to make decisions

Successfully collecting, compiling, sorting, and analyzing information can provide tremendous insight into how an organization is performing

Page 6: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

6

THE BUSINESS BENEFITS OF HIGH-QUALITY INFORMATION

Levels, Formats, and Granularities of Information

Page 7: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

7

Information Type: Transactional and Analytical

• Transactional information – Encompasses all of the information contained within a single business process or unit of work, and its primary purpose is to support the performing of daily operational tasks

• Analytical information – Encompasses all organizational information, and its primary purpose is to support the performing of managerial analysis tasks

Page 8: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

8

Information Type: Transactional and Analytical

Page 9: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

9

Information Type: Transactional and Analytical

Page 10: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

10

Information Timeliness

Timeliness is an aspect of information that depends on the situation• Real-time information – Immediate,

up-to-date information• Real-time system – Provides real-

time information in response to requests

Page 11: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

11

Information Quality

Business decisions are only as good as the quality of the information used to make the decisions

You never want to find yourself using technology to help you make a bad decision faster

Page 12: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

12

Information Quality

Characteristics of High-quality Information• Accurate• Complete• Consistent• Unique• Timely

Page 13: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

13

Information Quality

Low Quality Information Example

Page 14: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

14

Understanding the Costs of Using Low-Quality Information

The four primary sources of low quality information include

1. Customers intentionally enter inaccurate information to protect their privacy

2. Different entry standards and formats

3. Operators enter abbreviated or erroneous information by accident or to save time

4. Third party and external information contains inconsistencies, inaccuracies, and errors

Page 15: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

15

Understanding the Costs of Using Low-Quality Information

Potential business effects resulting from low quality information include• Inability to accurately track customers• Difficulty identifying valuable customers• Inability to identify selling opportunities• Marketing to nonexistent customers• Difficulty tracking revenue• Inability to build strong customer relationships

Page 16: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

16

Understanding the Benefits of Good Information

High quality information can significantly improve the chances of making a good decision

Good decisions can directly impact an organization's bottom line

Page 17: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

17

STORING INFORMATION IN A RELATIONAL DATABASE

Information is everywhere in an organization

Information is stored in databases

• Database – maintains information about various types of objects (inventory), events (transactions), people (employees), and places (warehouses)

Page 18: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

18

STORING INFORMATION IN A RELATIONAL DATABASE

Database management systems (DBMS) –Allows users to create, read, update, and delete data in a relational database

Page 19: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

19

STORING INFORMATION IN A RELATIONAL DATABASE

Data element – The smallest or basic unit of information

Data model – Logical data structures that detail the relationships among data elements using graphics or pictures

Metadata – Provides details about data

Data dictionary – Compiles all of the metadata about the data elements in the data model

Page 20: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

20

Storing Data Elements in Entities and Attributes

Entity – A person, place, thing, transaction, or event about which information is stored

• The rows in a table contain entities

Attribute (field, column) – The data elements associated with an entity

• The columns in each table contain the attributes

Record – A collection of related data elements

Page 21: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

21

Creating Relationships Through Keys

Primary keys and foreign keys identify the various entities (tables) in the database• Primary key – A field (or group of fields)

that uniquely identifies a given entity in a table

• Foreign key – A primary key of one table that appears an attribute in another table and acts to provide a logical relationship among the two tables

Page 22: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

22

USING A RELATIONAL DATABASE FOR BUSINESS ADVANTAGES

Database advantages from a business perspective include

Page 23: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

23

Increased Flexibility A well-designed database should

• Handle changes quickly and easily• Provide users with different views• Have only one physical view

Physical view – Deals with the physical storage of information on a storage device

• Have multiple logical viewsLogical view – Focuses on how

individual users logically access information to meet their own particular business needs

Page 24: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

24

Increased Scalability and Performance

A database must scale to meet increased demand, while maintaining acceptable performance levels• Scalability – Refers to how well a

system can adapt to increased demands• Performance – Measures how quickly a

system performs a certain process or transaction

Page 25: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

25

Reduced Information Redundancy

Databases reduce information redundancy

• Information redundancy – The duplication of data or storing the same information in multiple places

Inconsistency is one of the primary problems with redundant information

Page 26: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

26

Increase Information Integrity (Quality)

Information integrity – measures the quality of information

Integrity constraint – rules that help ensure the quality of information

• Relational integrity constraint

• Business-critical integrity constraint

Page 27: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

27

Increased Information Security

Information is an organizational asset and must be protected

Databases offer several security features

• Password – Provides authentication of the user

• Access level – Determines who has access to the different types of information

• Access control – Determines types of user access, such as read-only access

Page 28: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

28

DRIVING WEBSITES WITH DATA

Data-driven websites – An interactive website kept constantly updated and relevant to the needs of its customers using a database

Content creator Content editor Static information Dynamic information Dynamic catalog

Page 29: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

29

DRIVING WEBSITES WITH DATA

Page 30: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

30

DRIVING WEBSITES WITH DATA

Data-driven website advantages

• Easy to manage content

• Easy to store large amounts of data

• Easy to eliminate human errors

Page 31: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

31

DRIVING WEBSITES WITH DATA

Page 32: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

32

© 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied,

scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.

SECTION 6.2

BUSINESS INTELLIGENCE

Page 33: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

33

LEARNING OUTCOMES

5. Define a data warehouse and provide a few reasons it can make a manager more effective

6. Explain ETL and the role of a data mart in business

7. Define data mining and explain the three common forms for mining structured and unstructured data

8. Identify the advantages of using business intelligence to support managerial decision making

Page 34: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

34

THE BUSINESS BENEFITS OF DATA WAREHOUSING

Data warehouses extend the transformation of data into information

In the 1990’s executives became less concerned with the day-to-day business operations and more concerned with overall business functions

The data warehouse provided the ability to support decision making without disrupting the day-to-day operations

Page 35: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

35

THE BUSINESS BENEFITS OF DATA WAREHOUSING

Data warehouse – A logical collection of information – gathered from many different operational databases – that supports business analysis activities and decision-making tasks

The primary purpose of a data warehouse is to aggregate information throughout an organization into a single repository for decision-making purposes

Page 36: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

36

THE BUSINESS BENEFITS OF DATA WAREHOUSING

Page 37: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

37

THE BUSINESS BENEFITS OF DATA WAREHOUSING

Page 38: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

38

PERFORMING BUSINESS ANALYSIS WITH DATA MARTS

Extraction, transformation, and loading (ETL) – A process that extracts information from internal and external databases, transforms the information using a common set of enterprise definitions, and loads the information into a data warehouse

Data mart – Contains a subset of data warehouse information

Page 39: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

39

PERFORMING BUSINESS ANALYSIS WITH DATA MARTS

Page 40: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

40

Multidimensional Analysis Databases contain information in a series of

two-dimensional tables

In a data warehouse and data mart, information is multidimensional, it contains layers of columns and rows

• Dimension – A particular attribute of information

• Cube – Common term for the representation of multidimensional information

Page 41: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

41

Multidimensional Analysis

Cubes of Information

Page 42: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

42

Information Cleansing or Scrubbing

An organization must maintain high-quality data in the data warehouse

Information cleansing or scrubbing – A process that weeds out and fixes or discards inconsistent, incorrect, or incomplete information

Page 43: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

43

Information Cleansing or Scrubbing

Contact Information in an Operational System

Page 44: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

44

Information Cleansing or Scrubbing

Standardizing Customer Name from Operational Systems

Page 45: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

45

Information Cleansing or Scrubbing

Information Cleansing Example

Page 46: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

46

Information Cleansing or Scrubbing

Cost of Accurate and Complete Information

Page 47: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

47

UNCOVERING TRENDS AND PATTERNS WITH DATA MINING

Data mining – The process of analyzing data to extract information not offered by the raw data alone

Data-mining tools – use a variety of techniques to find patterns and relationships in large volumes of information

• Classification

• Estimation

• Affinity grouping

• Clustering

Page 48: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

48

UNCOVERING TRENDS AND PATTERNS WITH DATA MINING

Structured data – Data already in a database or a spreadsheet

Unstructured data – Data does not exist in a fixed location and can include text documents, PDFs, voice messages, emails

Text mining – Analyzes unstructured data to find trends and patterns in words and sentences

Web mining – Analyzes unstructured data associated with websites to identify consumer behavior and website navigation

Page 49: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

49

UNCOVERING TRENDS AND PATTERNS WITH DATA MINING

Common forms of data-mining analysis capabilities include

• Cluster analysis• Association detection• Statistical analysis

Page 50: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

50

Cluster Analysis

Cluster analysis – A technique used to divide an information set into mutually exclusive groups such that the members of each group are as close together as possible to one another and the different groups are as far apart as possible

Page 51: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

51

Association Detection

Association detection – Reveals the relationship between variables along with the nature and frequency of the relationships

• Market basket analysis

Page 52: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

52

Statistical Analysis Statistical analysis – Performs

such functions as information correlations, distributions, calculations, and variance analysis

• Forecast – Predictions made on the basis of time-series information

• Time-series information – Time-stamped information collected at a particular frequency

Page 53: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

53

The Problem: Data Rich, Information Poor

Businesses face a data explosion as digital images, email in-boxes, and broadband connections doubles by 2010

The amount of data generated is doubling every year

Some believe it will soon double monthly

Page 54: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

54

The Solution: Business Intelligence

Improving the quality of business decisions has a direct impact on costs and revenue

BI enables business users to receive data for analysis that is:• Reliable• Consistent• Understandable• Easily manipulated

Page 55: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

55

The Solution: Business Intelligence

BI Can Answer Tough Questions

Page 56: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

56

Visual Business Intelligence Informing – Accessing large amounts of

data from different management information systems

Infographics – Displays information graphically

Data visualization – Allows users to “see” or visualize data to transform information into a business perspective

Data visualization tools – Sophisticated analysis techniques such as pie charts, controls, instruments, maps, time-series graphs, and more

Page 57: 1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner

57

LEARNING OUTCOME REVIEW

Now that you have finished the chapter please review the learning outcomes in your text