163
Slide 1 © Gerd Stumme 2002 Tutorial Formal Concept Analysis Data Mining with Formal Concept Analysis Tutorial at ECML/PKDD 2002 Gerd Stumme Conceptual Knowledge Discovery

Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 1© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Data Mining withFormalConcept Analysis

Tutorial at ECML/PKDD 2002

Gerd Stumme

Conceptual Knowledge Discovery

Page 2: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 2© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Coffee Break

Page 3: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 3© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 4: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 4© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Formal Concept Analysis

arose around 1980 in Darmstadt as a mathematical theory, which formalizes the concept of ‚concept‘.

Since then, FCA has found many uses in Informatics, e.g. for

• Data Analysis,

• Knowledge Discovery,

• Software Engineering.

Based on datasets, FCA derives concepthierarchies.

FCA allows to generate and visualize the concept hierarchies.

Page 5: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 5© Gerd Stumme 2002

Tutorial Formal Concept Analysis

• FCA models concepts as units of thought, consisting of two parts:

• The extension consists of all objects belonging to the concept.

• The intension consists of all attributes common to all those objects.

• FCA is used for data analysis, information retrieval, and knowledge discovery.

• FCA can be understood as conceptual clustering method, which clusters simultanously objects and their descriptions.

• FCA can also be used for efficiently computing association rules.

Page 6: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 6© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA focusses on the concept level, the representationlevel plays only a minor role.

ISO 704: Terminology Work: Principles and Methods

Name

Conceptattribute aattribute battribute c

DefinitionRepresentation level

Object level

Concept level

object 1

property Aproperty Bproperty Cproperty D

object 3

property Aproperty Bproperty Cproperty F

object 2

property Aproperty Bproperty Cproperty E

Page 7: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 7© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Some typical applications:

• analysis of children suffering from diabetes

• IT security management system

• database marketing in a Suiss department store

• email management system

• developing qualitative theories in music estethics

• analysis of flight movements at Frankfurt airport

Page 8: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 8© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Links

• The Karlsruhe FCA page:km.aifb.uni-karlsruhe.de/fca

•FCA Mailing List: http://www.aifb.uni-karlsruhe.de/mailman/listinfo/fca-list

•The Darmstadt Research Group on FCA: www.mathematik.tu-darmstadt.de/ags/ag1/

•Research Center Conceptual Knowledge Processing, Darmstadt: http://www.fzbw.de/

•Ernst Schröder Center, Darmstadt: http://www.mathematik.tu-darmstadt.de/ags/esz/Welcome-en.html

• NaviCon GmbH:www.navicon.de

•The Dresden FCA page: http://www.math.tu-dresden.de/~ganter/fba.html

•Uta Priss‘ FCA page: http://php.indiana.edu/~upriss/fca/fca.html (will be moving)

•Michel Liquiere‘s FCA page: http://www.lattices.org/

Page 9: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 9© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Bibliography

• R. Wille: Restructuring lattice theory: an approach based on hierarchies ofconcepts. In: I. Rival (ed.): Ordered sets. Reidel, Dordrecht-Boston, 1982, 445-470

• B. Ganter, R. Wille: Formal Concept Analysis – Mathematical Foundations. Springer, Heidelberg 1999 (Translation of: Formale Begriffsanalyse –Mathematische Grundlagen. Springer, Heidelberg 1996)

• G. Stumme, R. Wille (Eds.): Begriffliche Wissensverarbeitung - Methoden und Anwendungen. Springer 2000

• Proc. Intl. Conf. on Conceptual Structures (ICCS), Springer, Heidelberg (contain FCA topics since 1995)

• More books: http://www.math.tu-dresden.de/~ganter/FCAbooks.html

• List of publications: http://www.mathematik.tu-darmstadt.de/ags/ag1/Literatur/literatur_en.html

Page 10: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 10© Gerd Stumme 2002

Tutorial Formal Concept Analysis

KDD/FCA specific bibliography

Page 11: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 11© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 12: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 12© Gerd Stumme 2002

Tutorial Formal Concept Analysis

These references are taken from

G. Stumme: Efficient Data Mining Basedon Formal Concept Analysis. Proc. 13thIntl. Conf. on Database and ExpertSystems Applications (DEXA 2002). LNCS, Springer, Heidelberg 2002 (InvitedTalk, in press)

Page 13: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 13© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 14: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 14© Gerd Stumme 2002

Tutorial Formal Concept Analysis

National Parks in California

Formal Concept Analysis

Def.: A formal contextis a tripel (G,M,I), where

• G is a set of objects,

• M is a set of attributes

• and I is a relationbetween G and M.

• (g,m)�I is read as „object g has attribute m“.

Page 15: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 15© Gerd Stumme 2002

Tutorial Formal Concept Analysis

National Parks in California

For A � G, we define

A´:= { m�M | �g�A: (g,m)�I }.

For B � M, we define dually

B´:= { g�G | �m�B: (g,m)�I }.

A

Page 16: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 16© Gerd Stumme 2002

Tutorial Formal Concept Analysis

National Parks in CaliforniaFor A, A1, A2 � G holds:

• A1 � A2 � A‘2 � A‘1• A � A‘‘

• A‘ = A‘‘‘

For B, B1, B2 � M holds:

• B1 � B2 � B‘2 � B‘1• B � B‘‘

• B‘ = B‘‘‘

A

Page 17: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 17© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Intent B

National Parks in California

Exte

nt A

Def.: A formal concept

is a pair (A,B) where

• A is a set of objects (the extent of the concept),

• B is a set of attributes(the intent of the concept),

• A‘ = B and B‘ = A.

The last condition isequivalent to A�B being a maximal rectangle in the binary relation (i.e., A and B are maximal with A�B � I ).

Page 18: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 18© Gerd Stumme 2002

Tutorial Formal Concept Analysis

National Parks in California

The blue concept is a subconcept ofthe yellow one, since its extent is contained in the yellow one.

( � the yellow intentis contained in theblue one.)

Page 19: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 19© Gerd Stumme 2002

Tutorial Formal Concept Analysis

National Parks in California

The concept lattice of the National Parks in California

Page 20: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 20© Gerd Stumme 2002

Tutorial Formal Concept Analysis

• Def.: The concept lattice [Begriffsverband] of a formal context (G,M,I) is the set of all formal concepts of (G,M,I), together with the partial order

(A1,B1 ) � (A2,B2 ) : � A1 � A2 (� B1 � B2 ) .

The concept lattice is denoted by (G,M,I) .

• Theorem: The concept lattice is a lattice, i.e. for two concepts (A1,B1 ) and (A2,B2 ), there is always

•a greatest common subconcept: (A1A2 , (B1 B2 )´´)

•and a least common superconcept: ((A1 A2 )´´, B1B2 ) .

More general, it is even a complete lattice, i.e. the greatest common subconcept and the least common superconcept exist for all (finite and infinite) sets of concepts.

Corollary: The set of all concept intents of a formal context is a closure system. The corresponding closure operator is h(X) := X‘‘.

Page 21: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 21© Gerd Stumme 2002

Tutorial Formal Concept Analysis

In the power set of M, the conceptintents are always the largest setsamong those with the sameclosure.

Example: h( {a,b} ) = h( {a,b,c} ) = h( {a,b,c,e} ) = {a,b,c,e}

123

a b c e

a

bc

e

1

2

3 intent

Page 22: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 22© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: An implicationX � Y holds in a context, ifevery object having all attributes in X also has all attributes in Y.

• Examples:

Swimming � Hiking

Implications

Boating � Swimming, Hiking, NPS Guided Tours, Fishing

Bicycle Trail, NPS Guided Tours � Swimming, Hiking

Page 23: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 23© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: Let X � M. The attributes in X areindependent, if thereare no trivial dependenciesbetween them.

Example:• Fishing• Bicycle Trail• Swimming

are independent attributes.

Independency

Page 24: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 24© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Lemma: Attributes are independent if they span a hyper-cube.

Example:

• Fishing• Bicycle Trail• Swimming

are independent attributes.

Independency

Page 25: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 25© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: A subset T � M respects an implication A � B, if A � T or B � T.

T respects a set of implications, if T respects every single implication in

Lemma: An implication A � B holds in a context iff B � A‘‘ . It is then respected by all concept intents.

Concept Intents and Implications

/

Page 26: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 26© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Lemma: Is a set of implications in M, then

( ) := { X � M | X respects }

is a closure system.

The related closure operator is constructed as follows: For a set X � M let

X := X � { B | A � B � , A � X }.

Compute X , X , X ,..., until a set

(X) := X ...

with (X) = (X) (i.e., a fix point) is reached. (for infinite contexts this may be an

infinite process). (X) ist then the closure of X with respect to the closure system ( ).

Page 27: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 27© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: An implication A � B is (semantically) entailed from a set of implications, if

every subset of M respecting also respects A � B.

A family of implications ist called closed if every implication entailed from is already

contained in .

Lemma: A set of implications on M is closed iff the following conditions (Amstrong

rules) are fulfilled for all W, X, Y, Z � M :

1. X � X � ,

2. If X � Y � , then X Z � Y � ,

3. If X � Y � and Y Z � W � , then X Z � W � .

Page 28: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 28© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: A set of implications of a context (G, M, I) is called complete, if every implication of (G, M, I) is entailed from .A set of implications is called non-redundant, if no implication is entailed from the others.

Def.: P ���� M is called pseudo intent of (G, M, I) if P � P “ and for every pseudo intent Q ���� P with Q � P holds Q“���� P.

Theorem: The set of implications

:= { P ���� P“ | P Pseudoinhalt }

is non-redundant and complete. We call stem basis.

Page 29: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 29© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example: Membership of developing countries in supranational groups (Source: Lexikon Dritte Welt. Rowohlt-Verlag, Reinbek 1993)

Taken from: B. Ganter, R. Wille: Formal Concept Analysis -Mathematical Foundations. Springer, Heidelberg 1999

Page 30: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 30© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 31: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 31© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 32: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 32© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Stem basis of the 3rd World context:

OPEC �MSAC �

Non-Alligned �Group of 77, Non-Alligned, MSAC, OPEC �Group of 77, Non-Alligned, LLDC, OPEC �

Group of 77, Non-AllignedGroup of 77Group of 77LLDC, AKPMSAC, AKP

Page 33: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 33© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Clustering

7. Exercises

8. FCA-Based Mining of Association Rules

9. Application Examples II

Page 34: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 34© Gerd Stumme 2002

Tutorial Formal Concept Analysis

IT-Security Management

���� Supports the analysis of security risks in IT units� status quo test for establishing guidelines and checklists

Page 35: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 35© Gerd Stumme 2002

Tutorial Formal Concept Analysis

More examples ...

... on the overhead projector.

Page 36: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 36© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Clustering

7. Exercises

8. FCA-Based Mining of Association Rules

9. Application Examples II

Page 37: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 37© Gerd Stumme 2002

Tutorial Formal Concept Analysis

There exist a number of algorithms for computing concept lattices:

• Naive approach

• Intersection method

• Titanic [Stumme et al 2001]

• Next-Closure [Ganter 1984]

• and some incremental algorithms

Page 38: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 38© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Naive Approach

Theorem: Every concept of the context (G,M,I) is of the form (X‘‘, X‘) for some X � G (and of the form (Y‘, Y‘‘) for at least one Y � M ).

On the other hand, each such pair is a concept.

„Algorithm“: Determine for each subset Y of M the pair (Y‘, Y‘‘).

But: Too many concepts are created too often.

Page 39: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 39© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Intersection Method

This method is also suitable for manual computation. [Wille 1982]

It provides the best worst-case time complexity. [Nourine, Raynoud 1999]

It uses the following theorem:

Theorem: Each intent is intersection of attribute intents. I.e., the closure system of all intents is generated by the attribute intents.

The question is which intersections of attribute intents to take.

� Example „Faces“ on the Blackboard

Page 40: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 40© Gerd Stumme 2002

Tutorial Formal Concept Analysis

How to compute/draw a concept lattice (manually):

• From left to right, consider all intersections of each column extent with every column extent to the left of it. If the resulting extent is not already a column, add it as column at the right end of the context. Repeat this until the last (added) column is reached.

• Add a full column, unless there is already one. (Now each column stands for one concept.)

•Draw a circle for the full column.

•Draw for each column, starting for the ones with a maximal number ofcrosses, a circle, and link it with a line to the circles where the column comprises the current column.

•Attach every attribute label to the circle of the corresponding column.

•Attach every object label to the circle laying exactly below the circles of theattributes in its intent.

Page 41: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 41© Gerd Stumme 2002

Tutorial Formal Concept Analysis

How to check the drawing of a concept lattice:

• Is it really a lattice? (This test is usually skipped.)

• Is every concept with exactly one upper neighbor labeled by at least one attribute?

• Is every concept with exactly one lower neighbor labeled by at least one object?

• Is, for all g � G and all m � M, the label of object g below the label ofattribute m iff (g,m)�I ?

Page 42: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 42© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

computes the closure system of all (frequent) concept intents using the support function:

Def.: The support of an attribute set (itemset) X � M is given by

GX

(X)´

supp �

Only concepts with a support above a threshold minsupp � [0,1].

Page 43: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 43© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC makes use of some simple facts about the support function:

Page 44: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 44© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

tries to optimize the following three questions:

1. How can the closure of an itemset be determined based on supports only?

2. How can the closure system be computed with determining as few closures aspossible?

3. How can as many supports as possible be derived from already known supports?

Page 45: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 45© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

1. How can the closure of an itemset be determined based on supports only?

X‘‘ = X � { x� M \ X | supp(X) = supp(X � x ) }

Example: { b,c }‘‘ = { b, c, e }, since

supp( { b, c } ) = 1/3

supp( { a, b, c } ) = 0/3

supp( { b, c, e } ) = 1/3,

123

a b c e

Page 46: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 46© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

2. How can the closure systembe computed with determining as few closures as possible?

• We determine only the closuresof the minimal generators.

123

a b c e

a

bc

e

1

2

3

Page 47: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 47© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

2. How can the closure systembe computed with determining as few closures as possible?

We determine only the closures ofthe minimal generators.

• A set is minimal generator iff itssupport is different of the supportsof all its lower covers.

• The minimal generators are an order ideal (i.e., if a set is notminimal generator, then none of itssupersets is either.)� Apriori like approach

In the example, TITANIC needs two runs (and Apriori four).

Page 48: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 48© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANICApriori likeapproach

End

i � 1i � singletons

Determine support for all C � i

Determine closures for all C � i - 1

Prune non-minimal generators from i

i � i + 1i � Generate_Candidates( i - 1 )

i empty?

no

yes

Page 49: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 49© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

We only generate candidates forminimal generators.

If the support is too low or equal to thesupport of a lowercover, thecandidate is pruned.

compared with Apriori

End

i � 1i � singletons

Determine support for all C � i

Determine closures for all C � i - 1

Prune non-minimal generators from i

i � i + 1i � Generate_Candidates( i - 1 )

i empty?

no

yes

Determine support for all C � i - 1

Page 50: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 50© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

1. How can the closure of an itemset be determined based on supports only?

X‘‘ = X � x� M \ X | supp(X) = supp(X � x )

2. How can the closure system be computed with determining as few closures as possible?

Approach à la Apriori

3. How can as many supports as possible be derived from already known supports?

Page 51: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 51© Gerd Stumme 2002

Tutorial Formal Concept Analysis

3. How can as many supports as possible be derived from already known supports?

Theorem: If X is no minimal generator, then

supp(X) = min { supp(K) | K is minimal generator, K � X } .

123

a b c e

Example: supp( { a, b, c } ) = min { 0/3, 1/3, 1/3, 2/3, 2/3 } = 0, since the set is no minimal generator, and since

supp( { a, b } ) = 0/3, supp( { b, c } ) = 1/3supp( { a } ) = 1/3, supp( { b } ) = 2/3supp( { c } ) = 2/3

Remark: It is sufficient to check the largest generators K with K � X, i.e. here { a, b } and { b, c} .

Page 52: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 52© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

1. How can the closure of an itemset be determined based on supports only?

X‘‘ = X � x� M \ X | supp(X) = supp(X � x )

2. How can the closure system be computed with determining as few closures as possible?

Approach à la Apriori

3. How can as many supports as possible be derived from already known supports?

If X is no minimal generator, then

supp(X) = min { supp(K) | K is minimal generator, K � X } .

Page 53: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 53© Gerd Stumme 2002

Tutorial Formal Concept Analysis

For pot. min. generators: count in database. Else supp(X) = min { supp(K) | K � X , K m.g.}.

X‘‘ = X � { x� M \ X | supp(X) = supp(X � {x}) }

A la Apriori

iff supp(X) � supp(X \ {x}) f.a. x �X

A la Apriori

End

i � 1i � singletons

Determine support for all C � i

Determine closures for all C � i - 1

Prune non-minimal generators from i

i � i + 1i � Generate_Candidates( i - 1 )

i empty?

no

yes

Page 54: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 54© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

Page 55: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 55© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

Page 56: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 56© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC

Page 57: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 57© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example of TITANIC

Page 58: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 58© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 59: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 59© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 60: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 60© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 61: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 61© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 62: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 62© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Next-Closure

It determines the concept intents in lectical order.

was developed by B. Ganter (1984).

It can be used

• to determine the concept lattice or

• to determine the concept lattice together with the stem basis or

• for interactive knowledge acquisition.

Page 63: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 63© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Let M = {1, ..., n}. A � M is lectically smaller than B � M, if B � A if the smallest element where A and B differ belongs to B :

A < B :� � i � B\A: A {1, 2, ..., i-1} = B {1, 2, ..., i-1}

12345

12

12

34

5

45

345

13

2414

123

23

124

34

234134

1234

2515

125235

135

35

1235 1245

1245145

23451345

Page 64: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 64© Gerd Stumme 2002

Tutorial Formal Concept Analysis

We need the following:

A <i B :� i � B \A A {1, 2, ..., i-1} = B {1, 2, ..., i-1}

A � i := ( A {1, 2, ..., i-1} ) � {i}

Theorem: The smallest concept intent, which according to the lectical order is larger as a given set A � M, is

(A � i )‘‘,

where i is the largest element of M with A <i (A � i )‘‘.

Page 65: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 65© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Algorithm Next-Closure for determining all concept intents:

1) The lectically smallest concept intent is ‘‘.

2) Is A a concept intent, then we find the lectically next intent, by checking all attributes i � M \ A , starting with the largest, und then in decreasing order, until A <i (A � i )‘‘holds. Then (A � i )‘‘ is the lectically next concept intent.

3) If (A � i )‘‘ = M, then stop, else A � (A � i )‘‘ and goto 2).

Page 66: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 66© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example: on blackboard

A i A � i (A � i )‘‘ A <i (A � i )‘‘ ? new concept intent

XX X

X XX

Sinus 44Nokia 6110T-Fax 301T-Fax 360 PC

Han

dy (1

)Te

lefo

n (2

)Fa

x (3

)Fa

x w

. n.p

aper

(4)

Page 67: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 67© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANIC vs. Next-Closure

• Next-Closure needs almost no memory.

• Next-Closure can exploit known symmetries between attributes.

• Next-Closure can be used for knowledge acquisition.

• TITANIC has far better performance, especially on large data sets.

Page 68: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 68© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 69: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 69© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Exercise: Compute the concept lattices of the following formal contexts

Star AlliancePartners

Page 70: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 70© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 71: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 71© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example: Civil Engineering regulations in Nordrhein-Westfalen

Problem: Concept latticescan grow exponential in thesize of the context.

Answer:

• One method for reducing the complexity of the diagram is conceptual scaling.

• The idea is to consider only few attributes at a time.

•If combinations are of interest, they can be put together again.

Page 72: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 72© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 73: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 73© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example: Civil Engineering regulations in Nordrhein-Westfalen

Problem: Concept latticescan grow exponential in thesize of the context.

Answer:

• One method for reducing the complexity of the diagram is conceptual scaling.

• The idea is to consider only few attributes at a time.

•If combinations are of interest, they can be put together again.

Page 74: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 74© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 75: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 75© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 76: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 76© Gerd Stumme 2002

Tutorial Formal Concept Analysis

In general, attributes may not only be properties which are or are not related to an object, but they may allow for different values. We call such attributes, as e.g. „color“, „sexe“, „weight“, many-valued attributes.

Def.: A many-valued context (G, M, W, I) consists of sets G, M, and W and ternary relation I between G, M and W (i.e. I� G x M x W), where the following holds:

(g, m, w) � I and (g, m, v) � I imply w = v.

Many-valued Contexts and Conceptual Scaling

The elements of G are called objects, the elements of M (many-valued) attributesand the elements of W attribute values.

(g, m, w) � I is read as „attribute m has value w for object g“.

Many-valued attributes can be considered as partial mappings from G to W, hence we note m(g) = w instead of (g, m, w) � I.

Page 77: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 77© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example.: This many-valued context lists different drive concepts for cars:.

Conventional Front-Wheel All-WheelMid-EngineRear-Wheel

In: Antriebskonzept für Personenkraftwagen. Quelle: Schlag nach! 100 000 Tatsachen ausallen Wissenschaftsgebieten. BI-Verlag Mannheim, 1982

Page 78: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 78© Gerd Stumme 2002

Tutorial Formal Concept Analysis

• The many-valued context is transformed by conceptual scaling (as describedbelow) to a one-valued context, for which one then can compute formal concepts.

How to derive concepts from many-valued contexts?

• Conceptual Scaling involves the human expert, as s/he has several choices how to interpret the data:

• For scaling, each attribute of the many-valued context is represented by a formal context, called conceptual scale.

Def.: A (conceptual) scale for attribute m of the many-valued context is a (one-valued) context Sm := (Gm, Mm, Im) with m(G) � Gm. The attributes of a scale are called scale values, the attributes scale attributes.

Page 79: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 79© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Using the following conceptual scales, we obtain the derived context on thefollowing slide:

Plain Scaling

Page 80: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 80© Gerd Stumme 2002

Tutorial Formal Concept Analysis

From the many-valued context at the top, we obtain the following derived context:

Page 81: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 81© Gerd Stumme 2002

Tutorial Formal Concept Analysis

very poorpoor

goodexcellent

Page 82: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 82© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Concept lattice for the derived context of drive concepts

Page 83: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 83© Gerd Stumme 2002

Tutorial Formal Concept Analysis

As in the example above one obtains from a many-valued context (G, M, W, I) and the conceptual scales Sm, m�� M, the derived context as follows:

The object set G remains unchanged. Every many-valued attribute m is replaced by the scale attributes of the scale Sm. Every attribute value m(g) is replaced by the corresponding row of the scale context Sm.

The formal definition is on the next slide.

Plain Scaling

Page 84: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 84© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: For a many-valued context (G, M, W, I) and scale contexts Sm, m � M, the derived context is (G, N, J) with

N := U Mm,m � M

andgJ(m, n) : � ( m(g) = w and wImn) .

( Mm stands for { m } x Mm in order to distinguish attribute values of different many-valued attributes.)

Page 85: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 85© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: Elementary Scales

We use the abbreviation n := { 1, ..., n }

Nominal scales Nn := (n, n, =)are used for scaling attributes whose values exclude each other. (E.g., an attribute having the values masculine, feminine, neuter will be scaled nominally.) Then the concept extents are a partition of the object set.

1 2 3 41 x2 x3 x4 x

Die nominal scale N4

Any context can be a scale, there is no formal difference. However, we will call onlythose contexts ‚scales‘ which have a clear conceptual structure. Some very simple contexts are often used as scales:

Page 86: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 86© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Ordinal scales On := (n, n, ���� )are used for attributes with ordered values, where each value implies the smaller values. (E.g., loud, very loud, extremely loud.) The result is a chain of conceptextents which can be interpreted as ranking.

1 2 3 41 x x x x2 x x x3 x x4 x

The ordinal scale O4

Page 87: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 87© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Interordinal scales In := (n, n, � ) | (n, n, � )are e.g. used in questionaries where one can select values on a scale like activ-passiv or agree-disagree. The concept intents are exactly the intervals of scale values - this reflects conceptually the between relation.

Page 88: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 88© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Often these attributes can also be scaled bi-ordinally:

Biordinal scales Mn,m := (n, n, �) � (m, m,�)are used when the objects are assigned to one of two poles, and this with a different degree. (E.g., very silent, silent, loud, very loud: loud and silent exlude each other, very loud implies loud, and very silent implies silent.) The result is a partition with ranking.

The bi-ordinal scale O4,2 (e.g. for German school grades)

Page 89: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 89© Gerd Stumme 2002

Tutorial Formal Concept Analysis

The dichotomic scale D := ( 0, 1 , 0,1 , =)is a special case, since it is isomorphic to the scales N2 und M1,1 and closely relatedto I2 . It is used most often for scaling yes-no.

0 10 x1 x

Page 90: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 90© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example: This context will be scaled ordinally.

Page 91: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 91© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 92: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 92© Gerd Stumme 2002

Tutorial Formal Concept Analysis

The concept lattice of the derived context

Page 93: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 93© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Additive Line Diagrams

Page 94: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 94© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Additive Line Diagrams

Def.: An attribute m � M is irreducible, if there are no other attributes m1, m2 � M with m1 � m � m2 and m1‘ m2‘ = m‘. The set of all irreducible attributes is denoted by Mirr

We define the mapping irr : B(G,M,I) � P(Mir) by

irr(A,B) := { m � B | m irreducible }

Let vec : Mirr � R � R<0.

Then pos : B(G,M,I) � R2 with pos(A,B) := �x� irr(A,B) vec(m) is an additive line diagram of the concept lattice B(G,M,I).

Page 95: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 95© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Nested line diagrams are used for

• for visualizing larger concept lattices

• for emphasizing sub-structures and regularities.

• for combining conceptual scales on-line.

The basic idea is to „summarize“ parallel lines and display it as just one line.

Nested Line Diagrams

Page 96: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 96© Gerd Stumme 2002

Tutorial Formal Concept Analysis

These line diagrams all show the same concept lattice.

Nested Line Diagrams

Page 97: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 97© Gerd Stumme 2002

Tutorial Formal Concept Analysis

A nested line diagram consists thus of an outer line diagram, which contains in eachnode inner diagrams.

In the simplest case the inner diagrams of two connected nodes of the outerdiagram are congruent. The connecting line of the outer diagram indicates then that each node in an inner diagram is connected with the corresponding node in the other inner diagram.

A double line between two nodes indicates that each element within the upper nodeis larger than each element in the lower node.

Nested Line Diagrams

Page 98: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 98© Gerd Stumme 2002

Tutorial Formal Concept Analysis

We also allow that the inner diagrams are not congruent, but only substructures of congruent diagrams.

The congruent diagrams are then drawn as „background structure“, having some unrealized concepts.

Unrealized concepts indicate implications as we will see below.

Nested Line Diagrams

Page 99: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 99© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example for a nested line diagram with non-congruent components.(Details below)

Page 100: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 100© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Reading Implications in Nested Line Diagrams

• Implications within the inner scale are read from the inner diagram at the upmost concept:

{Treppen} � {Treppenraum} {stairs} � {staircase}

• Implications within the outer scale are read directly from it:

{Wand} � {Brandwand} {wall} � {firewall}

{Decke, Brandwand} � {Wand, Brandwand} {ceiling, firewall} � {wall, firewall}

{Decke, Fundament} � ? {ceiling, foundation} � ?

• Implications between inner and outer scale are indicated by non-realized concepts. The premise is the intent of the non-realized concept, and the conclusion is theintent of the largest realized subconcept:

{Decke, Kellerfußboden} � {Treppenraum} {ceiling, cave floor} � {staircase}{Treppenraum, Schornstein} � {Decke, Wand, Brandwand, Dach}

{staircase, chimney} � {ceiling, wall, firewall, roof}{Wand, Dach, Schornstein} � ? {wall, roof, chimney} � ?

Page 101: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 101© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Def.: The least common superconcept of two concepts c1 und c2 is called Supremum of c1and c2 (denoted c1 ���� c2). The greatest common subconcept of c1 and c2 is the Infimumvon c1 und c2 (denoted c1 ���� c2). A mapping f : V � W between two lattices V and W is called supremum-preserving, iff (x � y) = f (x) � f (y).

Remark.: If a mapping preserves suprema, it also preserves the partial order, since x � y � x � y = y � f (x) � f (y ) = f (x � y) = f (y) � f (x) � f (y)

Theorem: Let (G, M, I) be a context and M = M1 � M2. The mapping

(A, B) � ((( B M1)‘, B M1), (( B M2)‘, B M2))

is an supremum-preserving embedding of B(G, M, I) in the direct product B(G, M1, I G x M1) B(G, M2, I G x M2).

Construction of Nested Line Diagrams

Page 102: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 102© Gerd Stumme 2002

Tutorial Formal Concept Analysis

• For constructing nested line diagrams, one first splits the attribute set: M = M1 U M2.

• The sets need not be disjoint, it is more important that they are grouped meaningfully.

• For many-valued contexts, the sets M1 and M2 the attribute sets of the conceptual scales.

Construction of Nested Line Diagrams

• Then the direct product of B(K1) and B(K2) is drawn. Draw a large diagram for B(K1), where the nodes are large ellipses, in which diagrams of B(K2) are drawn.

• One draws the concept lattices of the smaller contexts

Ki := (G, Mi, I G x Mi), i � 1, 2 ,

and labels them with the objects and attributes as usual.

Page 103: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 103© Gerd Stumme 2002

Tutorial Formal Concept Analysis

The concept lattice B(G, M, I) is embedded in this direct product as a V-semi-lattice (according to the previous theorem).

Mark it as follows:

• Put all object labels to the right positions.

• Compute all suprema, and mark them as realized concepts.

Page 104: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 104© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 105: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 105© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 106: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 106© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example on

Blackboard

Page 107: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 107© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 108: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 108© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Application Examples

• Database Marketing at Jelmoli AG, Zürich

• Analysis of flight movements at Frankfurt Airport

• Information Retrieval at the library of the Center for InterdisciplinaryTechnology Research (ZIT), TU Darmstadt

• Analysis of children suffering from diabetes, McGill Hospital Montréal

• Conceptual Email Management

Page 109: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 109© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Database Marketing at Jelmoli AG, Zürich

� Analysis of the user behavior of customers using the Shopping Bonus Card

� Supporting of Cross-Selling via Direct Mailing

Page 110: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 110© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Analysis of flight movements at Frankfurt Airport

�Ermöglichen von Ad-hoc-Anfragen an die Datenbank

� Visualisierung von Zusammenhängen

more on the overhead slides

Page 111: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 111© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Information Retrieval at the ZIT library, TU Darmstadt

Example: Search for older literature about automation in the most important industrial countries

Scale Change of Production

Page 112: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 112© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Scale Important Industrial Countries

(restricted to books with the catchwords Automation and Rationalisierung

Page 113: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 113© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Scale Publishing Year

restricted to books with the catchwords Deutschland, Automation and Rationalisierung)

Page 114: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 114© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 115: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 115© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Scales Coma and pH-level of the blood

Analysis of children suffering from diabetes, McGill Hospital Montréal

Page 116: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 116© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Scales Coma, pH-Wert of the blood and Symptom Duration

Analysis of children suffering from diabetes, McGill Hospital Montréal

Page 117: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Conceptual Email Manager

In CEM an email can beassigned to several „folders“.

Page 118: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

© Gerd Stumme 2002

Tutorial Formal Concept AnalysisConceptual Email Manager

This allows for multiple searchpaths:

• Darmstadt/KVO/KVO_Members

• KVO/Darmstadt/KVO_Members

• KVO/KVO_Members/Darmstadt

Conceptual Email Manager

Page 119: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Mails from subfolders can als be found in the more general folders.

Conceptual Email Manager

This allows for multiple searchpaths:

• Darmstadt/KVO/KVO_Members

• KVO/Darmstadt/KVO_Members

• KVO/KVO_Members/Darmstadt

Page 120: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Conceptual Email Manager

Nested line diagrams allow the combination of views.

Page 121: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 121© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 122: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 122© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Conceptual Clustering

Conceptual Clustering methods are clustering methods which generate simultaneously descriptions of the clusters.

• Examples: Michalski & Stepp 1983; Lebowitz 1987; Fisher 1987; Gennari et al 1989• Advantages of conceptual clustering against non-conceptual clustering:

• A cluster is not only a set of objects, but there also exists an intensional description.

• Disadvantages: • The language used to describe the clusters restricts the type of clusters which can be built.• The computation has usually higher complexity.

Page 123: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 123© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Iceberg concept lattices only allow conjunctions of attributes as descriptions.

• Recall: the support of an itemset X � M is given by

• Def.: The iceberg concept lattice of a formal context (G,M,I) for a givenminimal support minsupp is the set

{ (A,B) � B(G,M,I) | supp(B) � minsupp }

• It can be computed with TITANIC. [Stumme et al 2001]

GX

(X)´

supp �

Page 124: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 124© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Iceberg Concept Lattices

For minsupp = 85% the seven most general of the 32.086 concepts of the Mushrooms database http:\\kdd.ics.uci.edu are shown.

minsupp = 85%

Page 125: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 125© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Iceberg Concept Lattices

minsupp = 85%

minsupp = 70%

Page 126: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 126© Gerd Stumme 2002

Tutorial Formal Concept Analysis

minsupp = 55%

With decreasing minimum support the information gets richer.

Page 127: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 127© Gerd Stumme 2002

Tutorial Formal Concept Analysis

The visualization as a nested line diagramindicates implications.

Page 128: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 128© Gerd Stumme 2002

Tutorial Formal Concept Analysis

TITANICmodified for computing iceberg concept lattices

The only modification is theadditional condition that

a candidate is also removed if its support is too low.

End

i � 1i � singletons

Determine support for all C � i

Determine closures for all C � i - 1

Prune non-minimal generators from i

i � i + 1i � Generate_Candidates( i - 1 )

i empty?

no

yes

Page 129: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 129© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Iceberg Concept Lattices and Frequent Itemsets

Iceberg concept lattices are a condensed representation of frequent itemsets:

supp(X) = supp(X‘‘)

Differences between frequent concepts and frequent itemsets in the mushrooms database.

Page 130: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 130© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 131: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 131© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Association Rules

{ veil color: white, gill spacing: close } � { gill attachment: free }

Support: 78,52 % Confidence: 99,6 %

The input data of association rules algorithms can be written as a formal context (G,M,I):

• M is a set of items,

• G consists of the transaction IDs,

• and the relation I is the list of transactions.

Page 132: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 132© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Association Rules

The support is the percentage of all objects having all attributes in premise and conclusion:

Def.: The support of an attribute set X � M is given by

The support of an association rule X � Y is given by supp (X � Y) := supp (X � Y).

The confidence is the percentage of all objects fulfilling the premiseamong all objects fulfilling both premise and conclusion.

Def.: The confidence of a rule X � Y is given by

GX

(X)´

supp �

)(supp)(supp)conf(

XYXYX �

��

{ veil color: white, gill spacing: close } � { gill attachment: free }

Support: 78,52 % Confidence: 99,6 %

Page 133: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 133© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Our task: Find a basis of rules, i.e., a minimal set of rules out of which all other rules can be derived.

Classical Data Mining Task: Find, for given minsupp, minconf � [0,1], all rules with support and confidence above these thresholds

Bases of Association Rules

{ veil color: white, gill spacing: close } � { gill attachment: free }

Support: 78,52 % Confidence: 99,6 %

Page 134: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 134© Gerd Stumme 2002

Tutorial Formal Concept Analysis

• From B‘ = B‘‘‘ follows

)´´supp(´´´´

)supp( BGB

GB

B ���

Theorem: X � Y and X‘‘� Y‘‘ have the same support and the same confidence.

Hence for computing association rules, it is sufficient to compute the supports of allfrequent sets with B = B´´ (i.e., the intents of the iceberg concept lattice).

Page 135: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 135© Gerd Stumme 2002

Tutorial Formal Concept Analysis

� more efficient computation (e.g. TITANIC)

� fewer rules (without information loss!)

32 frequent itemsets are represented by 12 frequent concept intents

minsupp = 70%

Advantage of the use of iceberg concept lattices (compared to frequent itemsets)

Page 136: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 136© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Association rules can be visualized in the iceberg concept lattice:

• exact rules

• approximate rules

conf = 100 %

conf < 100 %

Advantage of the use of iceberg concept lattices (compared to frequent itemsets)

Page 137: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 137© Gerd Stumme 2002

Tutorial Formal Concept Analysis

can be derived from the stem basis (Sect. 2).

In concept lattices, they can be directly read from the diagram:

• Lemma: An implication X � Y holds iff the largest concept which is belowall concepts generated by the attributes in X is below all concepts generated by attributes in Y.

• Examples:• Swimming � Hiking

(supp=10/19 � 52.6%, conf = 100%)

• Boating � Swimming, Hiking, NPS Guided Tours, Fishing (supp=4/19 � 21.0%, conf = 100%)

• Bicycle Trail, NPS Guided Tours � Swimming, Hiking (supp=4/19 � 21.0%, conf = 100%)

Exact Association Rules

Page 138: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 138© Gerd Stumme 2002

Tutorial Formal Concept Analysis

The concept latticeof the National Parks in California

Exact Association Rules

Page 139: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 139© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Each arrow indicates a rule of the basis, e.g. the rightmost arrow stands for{ veil type: partial, gill spacing: close, veil color: white } � { gill attachment: free }(conf = 99.6 %, supp = 78.52 %)

Def.: The Luxenburger basis consists of all valid association rules X � Y such that there are concepts (A1, B1) and (A2, B2) where (A1, B1) is a direct upper neighbor of (A2, B2), X = B1, and X � Y = B2.

supp = 78.52 %

Approximate Association Rules

Page 140: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 140© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Satz: From the Luxenburger-Basis all approximate rules (incl. support und confidence) can be derived with the following rules:

• �(X � Y) = (X � Y\ Z), für � � { conf, supp }, Z � X

• �(X‘‘� Y‘‘) = �(X � Y)

• conf(X � X) = 1

• conf(X � Y) = p, conf(Y � Z) = q � conf(X � Z) = p·qfor all frequent concept intents X � Y � Z.

• supp(X � Z) = supp(Y � Z), for all X, Y � Z.

The basis is minimal with this property.

Page 141: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 141© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Example:

{ ring number: one } � { veil color: white }• has support 89.92 % (the support of the largest concept having both

attributes in its intent) • and confidence 97.5 % × 99.9 % � 97.4 %.

supp = 89.92 %

Page 142: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 142© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Some experimental results

Page 143: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 143© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

Page 144: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 144© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

For an overview see at

http://www.mathematik.tu-darmstadt.de/~plueschke/fcatools/programs.html

Page 145: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 145© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

• visualizes nested line diagrams

• accesses all ODBC databases (where the context is stored)

• the conceptual scales are stored in a proprietary format .csc

• the scales have to be prepared in advance, using Anaconda

• is part of the Navicon Decision Tool Suite (together with Anaconda and Cernato)

• available from Navicon (research licence): www.navicon.de

Page 146: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 146© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 147: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 147© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 148: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 148© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

• is the preparation tool for TOSCANA applications

• allows easy editing of formal contexts and concept lattices

• computes a concept lattice out of a context and provides an initial layout

• stores its data in the format .csc

• is part of the Navicon Decision Tool Suite (together with TOSCANA 2 and Cernato)

• available from Navicon (research licence): www.navicon.de

Page 149: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 149© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 150: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 150© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

• is a C++ re-implementation of TOSCANA

• accesses all ODBC databases (where the context is stored)

• stores its data in the format .csc

• the preparation software AnacondaJ is under development

• is available from the author: B. Grohhttp://www.itee.uq.edu.au/people/staffView.jsp?id=bernd

Page 151: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 151© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 152: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 152© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

• is an open source re-implementation of TOSCANA

• accesses all ODBC databases (where the context is stored)

• the conceptual scales are stored in an XML-based file format .csx

• the preparation software AnacondaJ is under development

• is downloadable from http://toscanaj.sourceforge.net/

• Join the effort !!!

Page 153: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 153© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 154: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 154© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

• allows ad-hoc visualization

• reads data from csv files (eg export of MS Excel)

• animates line diagrams during drawing

• is part of the Navicon Decision Tool Suite (together with Anaconda and TOSCANA 2)

• available from Navicon (research licence): www.navicon.de

Page 155: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 155© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 156: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 156© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

• computes implications for given contexts

• implements the knowledge acquisition technique Attribute Exploration

• is a DOS based tool

• downloadable from http://www.mathematik.tu-darmstadt.de/ags/ag1/Software/software_en.html

Page 157: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 157© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 158: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 158© Gerd Stumme 2002

Tutorial Formal Concept Analysis

FCA Tools• TOSCANA 2

• Anaconda

• Toscana 3

• ToscanaJ

• Cernato

• ConImp

• ConExp

• Concept Explorer is a mixture of ConImp and Cernato.

• written by Sergey Yevtushenko in Java

• comes with its own diagram editor and also supports implications.

• is still under heavy development .

• It is planned to extend the program, so that it can be used as in exchange for Anaconda within the ToscanaJ project.

• for more information, contact the author: [email protected]

Page 159: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 159© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises see extra exercises sheet

Page 160: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 160© Gerd Stumme 2002

Tutorial Formal Concept Analysis

1. Introduction

2. Formal Contexts & Concept Lattices

3. Application Examples I

4. Computing Concept Lattices

5. Exercises

6. Conceptual Scaling

7. Application Examples II

8. Conceptual Clustering

9. FCA-Based Mining of Association Rules

10. FCA Tools

11. Exercises

The End

Page 161: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Solutions to Sect. 5

Page 162: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 162© Gerd Stumme 2002

Tutorial Formal Concept Analysis

Page 163: Tutorial Formal Concept Analysis Conceptual Knowledge ... · • FCA is used for data analysis, information retrieval, and knowledge discovery. • FCA can be understood as conceptual

Slide 163© Gerd Stumme 2002

Tutorial Formal Concept Analysis

A non-nested diagram of the context „Planets“