34
Data Representation

Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Embed Size (px)

Citation preview

Page 1: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Data Representation

Page 2: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

What is data?

Data is information that has been translated into a form that is more convenient to process

As information take different forms, the most efficient way is to represent all forms of information using a universal format.

Page 3: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Data Types

Word processing programs

Engineering programs

Image processing programs

Audio play program

Video display

programs

Multimedia

Page 4: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Information coding and decoding

• Human senses deal with a variety of information (signals).

• Input devices of computer translates these information into electrical signals, why electrical?.

• Electrical signals are then translated into universal format (0s,1s), this is known as coding.

• After processing, output devices transform back data into their original form, this is known as decoding

Page 5: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different
Page 6: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Bit Pattern• A bit is the smallest unit of data that the computer

deals with.• a bit can take two values (0 or 1).• A two-state electrical switch (transistor) is used to

represent a bit (on state →1, off state →0).• To store 16 bits you need 16 switches, to store

million bits you need million switches.• In computer memory data are stored as blocks of

bits (bit-patterns), the length of bit-patterns is the number of bits in the bit-patterns.

• A bit-pattern of 8 bits length is called a byte

Page 7: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Representing Data:1. Text Representation

• Written text is made of alphabetical symbols (letters). For example, in English there are 26 uppercase and 26 lowercase symbols.

• Each of those symbols is represented by distinctive bit-pattern (code), ex table A1, P337.

• Once alphabetical symbols are represented by a bit-pattern, any word that is made of combination of letters can be represented.

Page 8: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Representation of word “BYTE”

Ex: 34 Page13

Page 9: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Number of bits in bit-pattern• The number of possible bit-patterns (symbols)

made of N number of bits, M is given by: M = 2N • Inversely, the number of bits needed to construct

M number of symbols is given by: N= Log2 M ≈ 3.2 Log10 M

(Note: N must be rounded to next bigger integer)• Ex: for M = 26, what is the min number of bits? N= Log2 26 = 3.2 Log10 26 = 4.5 = 5 bits

Page 10: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Code systems for text representation

• There are about 5 code systems used to represent alphabetical symbols:1. ASCII (American Standard Code for Information

Interchange)2. Extended ASCII3. EBCDIC (Extended Coded Decimal Interchange

Code)4. Unicode (Universal Code)5. ISO (International Organization for Standardization)

Page 11: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

(1 )ASCII

• In ASCII codes each code is made of 7 bits.• Number of possible codes M = 27 = 128 codes.• Bit-patterns ranging from 0000000 to 1111111• The first pattern represents (null character)• The last pattern represents (delete character)• Appendix A

Page 12: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

(2 )Extended ASCII

• Is invented to make the bit-pattern length equal to 8 bits (Byte), by adding a bit to the left of the ASCII code representation.

Ex. If ASCII code is 1111111 the extended ASCII code is 01111111.

• Extended ASCII is not used because it is not standardized as each manufacturer has different 8-bits system.

Page 13: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

(3 )EBCDIC

• Uses 8 bit patterns → # of codes = 28=256• Just used in IBM mainframe (system)

Page 14: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

(4 )Unicode• To represent more languages’ character

beside English, Unicode is invented.• Uses 16 bit pattern → # of codes = 216=65536 enough to represent all world’s languages.• Some codes are allocated for geographical

and special symbols.• Java uses Unicode, Microsoft uses the first 256

symbols• Appendix B

Page 15: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

(5 )ISO

• ISO uses 32 bit patterns → # of codes = 232=4,294,967,296 symbols

enough to represent all world’s symbols.

Page 16: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Representing Data:2. Image Representation

Image representation methods

Page 17: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

1. Bitmap Graphic• Image is divided into matrix of pixels.• A pixel represents a dot which is the smallest

unit of the image.• Image resolution depends on the number of

pixels in the image.• Higher resolution images require larger memory.• Once image is divided into pixels, each pixel is

given a bit-pattern.• The pixel bit-pattern determines the color of the

pixel

Page 18: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Black & white)) Pixel Color• For black and white images, only two bit-

patterns are needed, one to represent a black pixel and the other to represent a white pixel.

• In this case, the length of the pattern could be only one bit, i.e. 1 pattern to represent a black pixel and 0 pattern to represent a white pixel.

• The rows of patterns are then stored in the memory.

Page 19: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Bitmap graphic method of a black-and-white image

Page 20: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

(gray scale )Pixel Color• To represent a gray-scale image of 4 colors (for

example) we need to increase the length of bit-pattern representing the pixel to be 2 bits.

• In this case 00→ black pixel 01→ dark gray pixel 10→ light gray pixel 11→ white pixel

Page 21: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

(colored pixel )Pixel Color• Any visible color could be constructed from the 3

basic colors Red, Green, Blue (RBG)• The difference between one color another depends

on the intensity of the RBG colors in the color• Therefore, to represent a colored image, each pixel

in the image must be represented by 3 different bit-patterns. Each of them represent the intensity of the basic colors.

• The length of a bit pattern representing each basic color is usually 8 bits

Page 22: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Representation of color pixels

Page 23: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

1. Vector Graphic• Image is decomposed into lines and curves.• Each curve and line is represented by a mathematical

formula.• The mathematical formula is sorted.• No bit-patterns are stored• For example a line is described by its coordinates, the

circle is described by it’s the coordinates of its centre and length of the radius.

• The advantage of vector representation is that image can be scaled by multiplying the formula by the scale factor without effecting the image resolution as in bitmap representation

Page 24: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Representing Data:3. Audio Representation

• Audio is sound• Sound signal is analog signal• The representation of audio signal requires

converting analog signal into digital signal (A/D)

Page 25: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Audio representation

Page 26: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Representing Data:4. Video Representation

• Video is a series of images (frame) shown sequentially (one after another)

• Thus video data representation is basically the representation of images changed with time.

• Video files are multimedia files

Page 27: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Binary Notation

• Is a way to write binary numbers• In this way we assign a symbol for multiples of

successive bits that makes the binary number• We are going to learn two binary notation

systems:– Octal notation: a symbol for 3 bits.– Hexadecimal notation: a symbol for 4 bits.

Page 28: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Decimal numbers

• A decimal number is made of digits• A digit takes the value between (0-9)• Each of the digits is multiplied by its weight

which is 10 to the power of 0 for the first digit from the right, 1 for the second digit, 2 for the third digit …. etc

Page 29: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different
Page 30: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Binary numbers

• A binary number is made of digits• A digit takes the value of either 0 or 1• Each of the digit is multiplied by its weight

which is 2 to the power of 0 for the first digit from the right, 1 for the second digit, 2 for the third digit …. Etc

Page 31: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different
Page 32: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Octal Notation1. Oct means eight in Greek2. In Octal notation, successive 3 bits are given a symbol (0,

1, 2, 3, 4, 5, 6, 7).3. In binary to octal transformation, if the number of bits in a

bit pattern is not a multiple of three, we fill with 0s added to the lift of bit-pattern to make the total number of bits multiple of three.

4. Converted octal notation must be distinguished by either:1. adding o or O in front of the octal number2. Adding subscript 8 to the base of the octal number

Page 33: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Hexadecimal Notation1. hexadec means 16 in Greek2. In hexadecimal notation, successive 4 bits are given a symbol

(0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D,E, F).3. In binary to hexadecimal transformation, if the number of

bits in a bit pattern is not a multiple of four, we fill with 0s added to the lift of bit-pattern to make the total number of bits multiple of four.

4. Converted hexadecimal notation must be distinguished by either:

1. Adding x or X in front of the hexadecimal number2. Adding subscript 16 to the base of the hexadecimal number

Page 34: Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different

Note

1. Octal or binary notation is just a way to represent binary numbers (i.e. they are not a numbering systems)

2. You have to make sure the converted number is always distinguished.