Upload
lydieu
View
212
Download
0
Embed Size (px)
Citation preview
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Computational Intelligence Approaches to Systems Modelling and Control
Dr. M.N.H. Siddique email: [email protected],
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Computational Intelligence Approaches
Keep it simple:As simple as possible,
But not simpler.
- Albert EinsteinIntroduction to CI
Much is unknown about intelligence, and much will remain beyond human comprehension.
Fundamental nature of intelligence is only poorly understood
According to a dictionary, intelligence means the ability to comprehend, reason and learn.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
A demanding definition of intelligent system -An intelligent system is capable to comprehend with or without much a priori information about the process, reason and learn about the process
Other aspects of intelligence that describes human intelligence are creativity, skills, consciousness, emotion and intuition.
Highly complex mathematical description can seriously inhibit the ability to develop system models.
Required to cope with significant un-modelled and unanticipated changes in the plant,
Dissatisfaction with conventional modelling techniques is growing with increasing complexity.
Necessitates the use of more human expertise and knowledge.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Intelligent techniques are aimed at processes that are
ill-defined, complex, nonlinear, time varying and stochastic.
Central to intelligent systems is the construction of the process model.
Many real world processes are not amenable to mathematical modelling because
i. processes are too complex to represent mathematicallyii. process models are difficult and expensive to evaluate
iii. uncertainties in process operationiv. process is nonlinear, distributed, incomplete and stochastic in nature
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Paradigms of CI The area of computational intelligence is in fact interdisciplinary Combine and extend theories and methods from other disciplines Inter-relationship between these disciplines is illustrated as
Figure 1: Periphery of computational intelligent
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Approaches to CI
According to Engelbrecht there are five basic approaches to computational intelligence. These are
i. Fuzzy logicii. Neural networks
iii. Evolutionary computingiv. Swarm intelligencev. Learning theory
vi. Probabilistic methods
We will cover three basic methodologies: -Fuzzy logic-Neural networks-Evolutionary computing
Other methodologies such as Swarm intelligence, learning theory and probabilistic methods will be addressed in later date.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Introduction
Two justification for fuzzy systems
(1) Real world too complicated- Fuzziness introduced to obtain a reasonable model
(2) Dissatisfaction growing with conventional approaches- Human knowledge becoming more and more important
Simple Example
Figure 2: A real world situation.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
A fuzzy system can combine the two
Figure 3: Example of a practical system.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Notational Conventions
A notational convention of fuzzy sets for a discrete and finite universe of discourse X is:
(1) (2)
represents the degree of membership of x in A.
Where ‘+’ does not mean arithmetic addition or logical OR.
Membership Functions
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
A membership function (MF) characterizes the fuzziness of a fuzzy set A in X, which associate each point in X a real number in the interval [0,1] with the value of at x representing the grade of membership.
Choice of membership function is usually problem dependent and very often determined by heuristically and subjectively.
Triangular MF Triangular MF: specified by three parameters , Figure 5(a).
(3) Parameters with determine the coordinates of the three corners of
the underlying triangular MF.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 4(a): Triangular MF.
Trapezoidal MF Trapezoidal MF: specified by four parameters , Figure 5(b).
(4) Parameters with determine the coordinates of the four corners
of the underlying trapezoidal MF.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 4(b): Trapezoidal MF.
Gaussian MF Gaussian MF: specified by two parameters , Figure 5(c).
(5) Parameters and represent the centre and width of the Gaussian MF
respectively.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 4(c): Gaussian function. Figure 4(d): Gaussian MF.
Bell-shaped MF Bell-shaped MF: specified by three parameters , Figure 5(d).
(6) Parameters and represent the centre and width of the bell-shaped MF. Parameter a, usually positive, controls slope of the MF at crossover point.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 4(e): Bell-shaped function Figure 4(f): Bell-shaped MF
Gaussian and bell shaped MFs achieve smoothness - unable to specify asymmetric MFs, which are important in certain applications.
Sigmoidal MF, which is either open left or right, can be used in such applications.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Heuristic selection of parameters of membership functions are widely used and practised in fuzzy systems applications.
Different approaches to construct membership functions, such as
- heuristic selection, - clustering approach, - C-means clustering approach, - adaptive vector quantization, and - self-organising map.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Linguistic Variables and Hedges
Linguistic variable is a variable whose values are words or sentences, which serve as names of fuzzy subsets of a universe of discourse.
For example, Speed is a linguistic variable if its values are slow, not slow, very slow, not very slow etc.
Specifically, a linguistic variable is characterised by a quintuple of primary terms, connectives, hedges, and markers.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
For example: linguistic variable with hedges
Figure 5: Linguistic variables and hedges.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Purpose of the hedges is to generate a larger set of values for a linguistic variable from a small collection of primary terms.
Hedges are realised on primary terms through the processes
intensification or concentration, dilation, and fuzzification.
Example:Operator VERY is concentration (or intensification) operator
Very (Small) = Small2
Very(Very (u)) = ( Very (u))2 = (u2)2 = u4
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Linguistic hedges very and more or less – shown in Figure 6.
Figure 6: Dilation and Concentration
For example, more or less very nice can be represented as
More or less (Very (Nice)) = More or less (Nice2) = (Nice2)1/2 = Nice
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Linguistic variable can be a word or sentence
Such natural language expressions are fuzzy
For example, Slow OR Medium, Medium AND Fast.
Figure 7: Three fuzzy sets for speed – Slow, Medium and Fast
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 8: Expression for “Slow OR Medium”.
Figure 9: Expression for “Medium AND Fast”.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Some widely used Hedges
Hedge Meaning
About, around, near roughly Approximates a scalar
Above, more than Restricts a fuzzy region
Almost, definitely, precisely Contrast intensification
Below, less than Restricts a fuzzy region
Generally, usually Contrast diffusion
Neighbouring, close to Approximate narrowly
Not Negation or complement
Quite, rather, somewhat Dilute a fuzzy region
Very, extremely Intensify a fuzzy region
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Linguistic variable can be used with more than one hedge.
For example
- Almost very fast but generally below 100 km/hr- Close to 100m but not very high (but AND)- Not more than about zero Not (more than (about zero))
Linguistic variable and hedges allow constructing mathematical models for expression in natural language.
These models can than be used to write computer programs and process rules and simulate real world process and behaviour.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Fuzzy If-Then Rules
If-part of the rule ‘x is A1’ is called the antecedent or premise. Then-part of the rule ‘y is B2’ is called the consequent.
Example: Speed and pressure of a steam engine can be expressed with the following linguistic
conditional statementIf Speed is Slow Then Pressure should be High.
Graphically, this statement looks like
Figure 10: If-Then rule.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Rule Forms
In general, three forms of rule exist for any linguistic variables.
Assignment statemente.g. x is not large AND not very small
Conditional statement e.g. IF x is very big THEN y is medium
Unconditional statement e.g. set pressure high
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Compound Rules
Linguistic statement expressed by a human might involve compound rule structures.
Any compound rule structure may be decomposed and reduced to a number of simple Canonical rules.
Two forms of Canonical forms
Conjunctive antecedent Disjunctive antecedent
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Conjunctive antecedents
Multiple conjunctive antecedents of the form
IF x is A1 AND x is A2 … AND x is An THEN y is Bs
Can be rewritten in compact form as
IF x is As THEN y is Bs
As = A1 A2 A3 … An
As expressed by means of membership functions based on fuzzy intersection operation
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Disjunctive antecedents
Multiple disjunctive antecedents of the form
IF x is A1 OR x is A2 … OR x is An THEN y is Bs
Can be rewritten in compact form as
IF x is As THEN y is Bs
As = A1 A2 A3 … An
As expressed by means of membership function based on fuzzy union operation
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Aggregation of rules
Aggregated output y is found by fuzzy intersection of the entire individual rule consequent , where .
y = y1 AND y2 AND … AND yr
ory = y1 y2 … yr
y defined by the membership function
for
For disjunctive system of rules where at least one rule must be satisfied, the rules are connected by OR connectives.
Aggregated output y is found by fuzzy union of all the individual rule consequent , where .
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
y = y1 OR y2 OR … OR yr or
y = y1 y2 … yr
y defined by the membership function
for
Fuzzification
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Process that allows converting a numeric value (or crisp value) into a fuzzy input is called fuzzification.
Two ways to do fuzzification:
Aui is a fuzzy singleton
Aui is a fuzzy set such that
Singleton fuzzification is generally used in implementation where there is no noise.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 11: Fuzzification in different MFs
Defuzzification
Defuzzification is the reverse process of fuzzification.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Mathematically, the defuzzification of a fuzzy set is the process of conversion of a fuzzy quantity into a crisp value, i.e. rounding off from its location in the unit hypercube to the nearest vertex.
Fuzzy control engineers have many different ways of defuzzifying; fuzzy reasoning, however, can usually use quite simple methods.
There are many defuzzification methods available in the literature.
Decision as to which method to use is a fairly technical matter; but the experts should know that these method are available and should try to see which works best.
Max-membership method
The max-membership method is both simple and quick method.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
This method takes the peak value of each fuzzy set and builds the weighted sum of these peak values.
This method is given by the algebraic expression as
for all
Shown in Figure 12(a).
12(a): Max-membership defuzzification
Centre of Gravity method
In the literature also referred to as centre of area and centroid method. In the continuous case it is given by the expression as
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
For a discrete universe with m quantisation levels in the output
This method determines the centre of the area below the combined membership function. Shown in Figure 12(b).
12(b): Centre of gravity defuzzification Weighted average method
Only valid for symmetrical output membership functions. It is given by the algebraic expression as
where denotes an algebraic sum. Shown in Figure 12(c).
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
12(c): Weighted average defuzzification For example, the two functions shown in Figure 12(c) would result in the following
defuzzified value:
Mean-max membership
If the mean or average of all local maxima in X is taken, a single defuzzified output is generated by this mean of maximum approach through
where xj = max (x) and N is the number of times the membership function reaches the maximum support value. Shown in Figure
12(d).
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
12(d): Mean-max defuzzification The defuzzified output is then
Centre of sums
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
This process involves the algebraic sum of individual output fuzzy sets, say C1 and C2, instead of their union.
One drawback of this method is that the intersecting areas are added twice. The defuzzified value x* is given in discrete case by the following
expression:
Shown in Figure 12(e).
12(e): Centre of sums defuzzification
Inference Mechanism
Inference is the process of formulating a nonlinear mapping from a given input space to output space.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
The mapping then provides a basis from which decisions can be taken. The process of fuzzy inference involves all the membership functions, fuzzy logic
operators and if-then rules. There are three types of fuzzy inference, which have been widely employed in
various applications. The differences between these three fuzzy inferences, also called fuzzy models, lie
in the consequents of their fuzzy rules, aggregations and defuzzification procedures. These fuzzy inferences are
(1) Mamdani fuzzy inference(2) Sugeno fuzzy inference (3) Tsukamoto fuzzy inference
Mamdani Fuzzy Inference
Mamdani type fuzzy modelling was proposed as the first attempt to control a steam engine and boiler by a set of linguistic control rules by Mamdani and Assilian in 1974.
Max-min is the most common rule of composition.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
In max-min rule of composition the inferred output of each rule is a fuzzy set chosen from the minimum firing strength.
In max-product rule of composition the inferred output of each rule is a fuzzy set scaled down by its firing strength via algebraic product.
A typical rule in Mamdani-type fuzzy model with two-input single-output has the form
If x is A and y is B then z is C
In Mamdani’s fuzzy model crisp values are used as inputs and defuzzification is used to convert a fuzzy set to a crisp value.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 13: Two-input single-output Mamdani fuzzy model.
Sugeno Fuzzy Inference
The Sugeno fuzzy model, also known as the TSK fuzzy model, was proposed by Takagi, Sugeno and Kang (Takagi and Sugeno, 1985; Sugeno and Kang, 1988) in an
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
effort to develop a systematic approach to generate fuzzy rules from a given input-output data set.
A typical fuzzy rule in Sugeno fuzzy model has the form
If x is A and y is B then z=f(x,y)
where A and B are fuzzy sets in the antecedent part, while z=f(x,y) is crisp function in the consequent part.
Usually f(x,y) is polynomial in the input variables x and y but it can be any function as long as it can appropriately describe the output of the model within the fuzzy region specified by the antecedent of the rule.
http://www.infm.ulst.ac.uk/~siddique
Computational Intelligence Approaches: Fuzzy Systems
Figure 14: Two-input single-output Sugeno fuzzy model.
http://www.infm.ulst.ac.uk/~siddique
Computational Intelligence Approaches: Fuzzy Systems
Features of Sugeno-type Inference Systems
When f(x,y) is a first-order polynomial, the resulting fuzzy inference system is called a first-order Sugeno fuzzy model which was proposed by Sugeno and Takagi.
When f(x,y) is a constant, it is zero-order Sugeno fuzzy model, which can be considered as a special case of Mamdani fuzzy model, in which the consequent of each rule is specified by a fuzzy singleton or by a pre-defuzzified consequent or a special case of Tsukamoto fuzzy model in which the consequent of each rule is specified by an MF of a step function.
Output of zero-order Sugeno model is a smooth function of its input variables as long as neighbouring MFs in the antecedent have enough overlap.
Overlap of MFs in the consequent of a Mamdani model does not have a decisive effect on the smoothness.
Overlap of the antecedent MFs that determines the smoothness of the resulting input-output behaviour.
http://www.infm.ulst.ac.uk/~siddique
Computational Intelligence Approaches: Fuzzy Systems
Overall output of a Sugeno fuzzy model is obtained via weighted average of the crisp output, thus avoiding the time consuming process of defuzzification required by Mamdani model.
Zero-order Sugeno fuzzy model is functionally equivalent to a radial basis function network under certain minor constraints which was shown by Jang and Sun.
In practice, weighted average operator is sometimes replaced with weighted sum operator to reduce computation further, especially in the training of a fuzzy inference system.
Simplification could lead to the loss of MF linguistic meaning unless the sum of firing strengths is close to unity.
Tsukamoto Fuzzy Inference
http://www.infm.ulst.ac.uk/~siddique
Computational Intelligence Approaches: Fuzzy Systems
In the Tsukamoto fuzzy model, the consequent of each fuzzy if-then rule is represented by a fuzzy set with a monotonic MF.
As a result, the inferred output of each rule is defined as crisp value included by the rule’s firing strength.
The overall output is taken as the weighted average of each rule’s output.
Since each rule infers a crisp output, the Tsukamoto fuzzy model aggregates each rule’s output by the method of weighted average and thus avoids the time-consuming process of defuzzification.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique
Figure 15: Two-input single-output Tsukamoto fuzzy model.
Computational Intelligence Approaches: Fuzzy Systems
http://www.infm.ulst.ac.uk/~siddique