CArcMOOC 02.03 - Encodings of non-numerical sets

Carc 02.03

alessandro.bogliolo@uniurb.it

02. Information theory02.03. Representation of non-numerical sets

• Texts

• Images

• Signals (Audio/Video)

• Redundancy and compression

Computer Architecture

Carc 02.03

1. A text is a sequence of characters

2. Each character is taken from a finite alphabete

3. Using a constant-size encoding for the characters, a text is encoded as a concatenation of character codes

4. ASCII: 7-bit encoding

5. Extended ASCII: 8-bit encoding

Carc 02.03

Images

1. An image is a matrix of points with assigned colors

2. An image contains infinite points and each point may take infinite colors

3. Both space and color discretization required

4. Discretized points are called pixels

5. Pixels are organized on a matrix

6. Using a constant size encoding for each pixel, an image is a concatenation of pixels, to be read in a given order

Carc 02.03

Color (gray) levels

The encoding associates a unique code with an

interval of gray levels

All gray levels within the interval are associated

with the same code, thus loosing informationThe original gray level cannot be exactly

reconstructed from the code

Encoding associates each code with a unique gray

level (representative of a class)

Carc 02.03

2D images

Gray level

levyx nnnsize 2log

Carc 02.03

Example

100x100x1bit100x100x8bit

50x50x1bit50x50x8bit

10x10x8bit 10x10x1bit

Carc 02.03

Analog and digital signals

• Signal: time-varying physical quantity• Analog: continuous-time, continuous-value

• Digital: discrete-time, discrete-value

• The digital encoding of a continuous signal entails:• Sampling (i.e., time discretization)

• Quantization (i.e., value discretization)

sizerate sTssize

Sampling rate

Duration

Sample size

Carc 02.03

Audio: time series

levratesizerate nTssTssize 2log

Carc 02.03

yxcolratesizerate nnnlogTssTssize 2

srate = frame rate

ncol = number of colors

nxny = frame size

Carc 02.03

Redundancy

• Redundant encoding: encoding that makes use of more than the minimum number of digits required by an exact encoding

MN Slog

• Motivations for redundancy:

– Providing more expressive/natural encoding/decoding rules

– Reliability (error detection)

Ex: parity encoding

– Noise immunity / fault tolerance (error correction)

Ex: triplication

Carc 02.03

• Parity encoding:

– A parity bit is used to guarantee that all codewords have an

even number of 1’s

– Single errors are detected by means of a parity check

Redundancy: examples

0010 00101

000000111000

parity check

Irredundant codeword

• Triple redundancy:

– Each character is repeats 3 times

– Single errors are corrected by means of a majority voting

000000111010

0 0 1 0 voting result

Carc 02.03

Compression

• Lossy compression• Compression achieved at the cost of reducing the accuracy of the

representation

• The original representation cannot be restored

• Always effective

• Lossless compression• Compression achieved by either removing redundancy or

leveraging content-specific opportunities

• The original representation can be restored

• Not always effective

CArcMOOC 02.03 - Encodings of non-numerical sets

Education

Conditional Positional Encodings for Vision Transformers

Succinct Randomized Encodings and their Applications

Legacy & Not-So-Legacy Character Sets & Encodings · Legacy & Not-So-Legacy Character Sets & Encodings

Text Encodings

CArcMOOC 03.02 - Switching networks and combinational circuits

akciosujsag.hu - Lidl, 2016.01.28-02.03

Image encodings

Www.service.electrolux.com Files It It IT Lb 02.03 1

CArcMOOC 04.01 - Von Neumann and CPU micro-architecture

CArcMOOC 03.04 - Gate-level design

ICRP concept on Protection of Environment 02.03

Latex Font Encodings

NCM F.02.03 - 2005

L 02.03 Strategic Hedging.ppt

Investigating Direct Manipulation of Graphical Encodings

Temporality and Encodings

Encodings of cladograms and labeled trees

02.03 adult art monitoring, changing gsn

CArcMOOC 01.01 - Automated information processing

A Study of 0/1 Encodings Prosser & Selensky A Study of 0/1 Encodings Prosser & Selensky