Upload
adryyandc
View
222
Download
0
Embed Size (px)
Citation preview
8/3/2019 An Adaptable Fuzzy Emotion Model for Emotion Recognition
1/6
An Adaptable Fuzzy Emotion Model for Emotion Recognition
Natascha Esau Lisa Kleinjohann
C-LAB
Fuerstenallee 11
D-33102 Paderborn, Germany
e-mail: {nesau, lisa, bernd}@c-lab.de
Bernd Kleinjohann
Abstract
Existing emotion recognition applications usually dis-
tinguish between a small number of emotions. How-ever this set of so called basic emotions varies from
one application to another depending on their accord-
ing needs. In order to support such differing appli-
cation needs an adaptable emotion model based on
the fuzzy hypercube is presented. In addition to exist-
ing models it supports also the recognition of derived
emotions which are combinations of basic emotions.
We show the application of this model by a prosody
based fuzzy emotion recognition system.
Keywords: Fuzzy emotion model, fuzzy hypercube,
fuzzy emotion recognition, basic emotion.
1 Introduction
Emotions are an evident part of interactions between
human beeings. But also for interactions of humans
with computer systems emotions play a major role,
since humans can never entirely switch off their emo-
tions. During the last years interest in emotions in-
creased considerably in various domains of computer
based systems. Examples are robots or virtual agents
that show emotions or human-computer interfaces that
consider human emotions in their interaction capabil-
ities. In Japan an entire stream called KANSEI infor-
mation processing [10] deals with subjective human
feelings when interacting with IT systems. These few
examples already reveal two major tasks of emotion
processing in IT systems, the recognition of human
emotions and the (re)production of artificial emotions.
Whereas robots or virtual agents often show emotions
themselves, for many other IT systems the recognition
of human emotions and appropriate reactions suffice
to improve the systems performance or acceptance.
Imagine for instance a user who is angry, because the
IT system does not behave in the expected way or she
tried several times to accomplish a task without suc-cess. In such a situation it would be very helpful and
increase the system acceptance, if an IT system could
recognize this emotion and react accordingly. Another
example is speech recognition. According to investi-
gations at the MIT a doubling of the word error rate to
about 32% was observed when people talk in an angry
way [4]. In such a case an appropriate system reaction
would be to redirect the user to a human operator or
to give hints how she could decrease the error rate.
Depending on the intended application domain dif-
ferent emotions are relevant for emotion recogni-tion. According to the observation described above,
the speech recognition system Mercury distinguishes
only two classes of emotions: frustration and neu-
tral. For other applications like personal robots or en-
tertainment robots certainly the recognition of some
more emotions like happiness or sadness would be
interesting to react with according robot behavior.
Since also psychologists have not yet agreed upon a
set of basic emotions (see Section 2) it is not likely
to identify a set of emotions, that is appropriate for
all computer based emotion recognition (CBER) sys-tems. Therefore, in this paper we propose an emotion
model for emotion recognition that is easily adaptable
to the selected set of basic emotions for the CBER
problem (see Section 3). However humans do not only
feel some basic emotions in their pure form but also
some more complex or derived emotions [16]. An
example is for instance curiosity which is according
to experiments by Plutchik a combination of accep-
tance and surprise. Accounting for this observation
our emotion model does not only support basic emo-
EUSFLAT - LFA 2005
73
8/3/2019 An Adaptable Fuzzy Emotion Model for Emotion Recognition
2/6
tions but also supports the representation of such de-
rived emotions or blends.
Furthermore different intensities or degrees of emo-
tions can be observed [16]. For modelling of different
degrees of membership to different classes in a classi-
fication system in many application domains, among
them also emotion recognition, fuzzy logic is a very
useful approach. Therefore we developed our adapt-
able emotion model using the fuzzy hypercube as ba-
sis (see Section 3). We show the applicability of our
approach by a system for emotion recognition from
prosody of natural speech in Section 4. Afterwards
we compare our approach with related work and give
a short conclusion.
2 Emotion Models in Psychology
Psychologists have tried to explain the nature of hu-
man emotions for decades or even centuries. Never-
theless no unique established emotion model exists.
However emotion is now usually seen as a dynamic
process that involves several modalities like motoric
expression, physiological arousal and subjective feel-
ing [13, 9]. For computer based emotion recogni-
tion (CBER), however models that help with the clas-
sification of emotions are more important. Among
these two major types of emotion models can be dis-tinguished (also mixtures of these types are found):
models that rely on basic emotions and emotion mod-
els that classify emotions according to different di-
mensions like valence, potency, arousal, intensity etc.
The first one has a major advantage for CBER, since it
considerably decreases recognition complexity due to
a small number of basic emotions to which the CBER
can be restricted. A well known earlier model of basic
emotions is the work of Plutchik [16]. He uses basic
emotions as a kind of building block for derived emo-
tions, so called secondary emotions. Plutchik evendistinguishes ternary emotions that are combinations
of secondary derived emotions. His model like many
others also describes the concept of emotion inten-
sity, that represents the strength by which an emo-
tion is felt. Although emotion models exist for sev-
eral decades, even now there is no general agreement
among psychologists how many basic emotions exist
and what they are. This is shown in table 1 which is
an excerpt from [15].
Due to the variety of basic emotions described in liter-
Table 1: Basic emotions distinguished by psycholo-
gists
Psychologist Basic Emotions
Plutchik Acceptance, anger, anticipaton, dis-
gust, joy, fear, sadness, surpriseEkman,
Friesen,
Ellsworth
Anger, disgust, fear, joy, sadness,
surprise
Frijda Desire, happiness, interest, surprise,
wonder, sorrow
Izard Anger, contempt, disgust, distress,
fear, guilt, interest, joy, shame, sur-
prise
James Fear, grief, love, rage
Mowrer Pain, pleasureOatley and
Johnson-
Laird
Anger, disgust, anxiety, happiness,
sadness
ature it seems reasonable to develop an emotion model
for emotion recognition that is easily adaptable to the
selected set of basic emotions for the CBER problem.
3 Fuzzy Emotion Model
As already stated, according to psychologists like
Plutchik humans do not only feel a single basic emo-
tion but have more complex emotional states, where
more than one basic emotion is involved with varying
strength or intensity. Therefore we propose a fuzzy
classification of emotional states using fuzzy hyper-
cubes [12]. Furthermore we assume that the inten-
sity of an emotion can be mapped to the interval [0, 1].First we define a fuzzy set corresponding to an emo-
tional state and then show how it is represented in a
fuzzy emotion hypercube.
Fuzzy set for emotional state. Let BE be a finite
base set ofn basic emotions e1, e2, . . . en and
{FEj : BE [0, 1], j = 1, 2, . . .} an infinite set offuzzy membership functions. Then each
FEj := {(ei, FEj(ei) | ei BE}, j = 1, 2, . . . de-fines a fuzzy set corresponding to one emotional state
Ej .
Fuzzy emotion hypercube. IfBE, FEj and FEjare defined as described above, we shall use the mem-
EUSFLAT - LFA 2005
74
8/3/2019 An Adaptable Fuzzy Emotion Model for Emotion Recognition
3/6
bership vector
(FEj(e1), FEj (e2), . . . , FEj(en)) =: (FEj(ei))to denote a point in an n-dimensional hypercube.
Each axis of the hypercube corresponds to one basic
emotion ei. Thus a membership vector (FEj(ei))corresponds to one emotional state Ej and can be in-
terpreted psychologically as vector of emotion inten-
sities (Iei) := (Ie1 , Ie2 , . . . , I en).
The number of distinguished emotions depends on
the psychological theory or in the case of computer
based emotion recognition on the intended applica-
tion. If for instance the three basic emotions happi-
ness h, anger a and surprise s shall be distinguished,
a three dimensional unit cube as depicted in Figure 1
is needed for modelling emotional states.
(1,0,0)
(0,1,0)
(0,0,1)
Happiness
Anger
Surprise
E1
E2
(0,0,0)
Figure 1: Fuzzy unit cube for three emotions happi-
ness, suprise and anger
The corners in the unit cube describe dual mem-berships (0 or 1) for all emotions, vertices desribe
dual memberships for two emotions and the third one
varies from 0 to 1. For example, the point E1 =(1.0, 0.2, 0.3) corresponding to the fuzzy setFE1 = {(h, 1.0), (a, 0.2), (s, 0.3)} represents ahappy emotional state. The point E2 = (0.2, 1.0, 0.9)corresponding to
FE2 = {(h,0.2), (a, 1.0), (s, 0.9)} certainly repre-sents an emotional state for a derived emotion from
anger and suprise. The point (0, 0, 0) represents the
entirely neutral state where no emotion is present.
Anger
Surprise
Happin.
Surprise+
Happin.
Neutral
Surprise+
Anger
Anger+
Happin.
Surprise+ Anger+ Happin.
Figure 2: Subdivisions of unit cube representing basic
and derived emotions
Figure 2 shows how the unit cube could be further di-
vided in order to represent basic emotions and their
mixtures. In the subcubes denoted by a single emo-
tion the membership function of this emotion takes
values in the interval [0.5, 1.0] whereas the member-
ship values for the other emotions respectively theirintensities are below 0.5. Therfore it is reasonableto associate the subcube with this basic emotion. In
the subcubes denoted with a sum of emotions (e.g.
Surprise + Happiness) memberships of these emo-
tions are in the interval [0.5, 1.0] whereas the mem-bership of the third emotion is below 0.5. Hencea derived emotion from these two basic emotions
(e.g. surprise and happiness) is assumed. The sub-
cube where the membership values of all basic emo-
tions are between 0.5 and 1.0 is denoted by the sum
Surprise + Anger + Happiness.
If a general n-dimensional emotion hypercube is re-
garded certainly not all combinations of up to n emo-
tions make sense. However, whether a combination
is reasonable or not is certainly a psychological ques-
tion. If combinations that do not make sense are rec-
ognized by a CBER this could for instance indicate an
error.
4 Application
This section deals with the application of our adapt-
able emotion model for the fuzzy rule based emotion
recognition system PROSBER [2].
4.1 Overview of PROSBER
PROSBER recognizes emotions from the prosody of
natural speech. It takes single sentences as input and
classifies them into the emotion categories happiness,
sadness, anger and fear. Furthermore a neutral emo-
tional state is distinguished. PROSBER automaticallygenerates the fuzzy models for emotion recognition.
Accordingly two working modes are distinguished,
training and recognition, as depicted in Figure 3.
During the training the training samples with well-
known emotion values are used to create the fuzzy
models for the individual emotions. For that purpose
sequences of acoustic parameters like fundmental fre-
quency or jitter are extracted. PROSBER extracts
about twenty parameters that have shown their rele-
vance for emotion recognition in psychological stud-
EUSFLAT - LFA 2005
75
8/3/2019 An Adaptable Fuzzy Emotion Model for Emotion Recognition
4/6
Featurecalculation
Parameterextraction
Preprocessing Membershipfunctionsgeneration
Featureselection
Fuzzyruleconstruc-tion
Fuzzyclassification
Featurecalculation
Parameterextraction
Preprocessing
Training
Recognition
peech
ignal
peechignal
Frames
Frames
Parametersequences
Featurevectors
Emotions of
training samples
Membership
functions
Fuzzy
rules
Emotion
Parameter
sequences
Featurevectors
Fuzzy model generation
Figure 3: Architecture of PROSBER
ies or in other speech based emotion recognition sys-
tems. The sequences of these acoustic parameters are
summarized by statistical analysis steps performed by
the feature calculation. The fuzzy model generation isbased on a fuzzy grid approach [11]. It performs the
following three steps on the training database. First
the membership functions for every feature are gener-
ated. Afterwards for each emotion up to six most sig-
nificant features are selected and then the fuzzy rule
system for each emotion is generated. These fuzzy
models are used in the emotion recognition process to
classify unknown audio data. A detailed description
of PROSBER can be found in [2].
4.2 Fuzzy Emotion Recognition in PROSBER
In order to describe the application of our emotion
model we shall now have a closer look at the fuzzy
classification. In principle it is structured as depicted
in Figure 4.
Emotion nintensity
Fuzzification DefuzzificationFuzzy Inference
Rule BaseEmotion n
Emotion n
Emotion 1
..
Emotion 1intensity
Featurevectors
Max
..Emotion
Figure 4: Principle structure of fuzzy classification
For each basic emotion ei, i = 1,...,n, a sepa-rate rule set is generated by an adapted fuzzy grid
method. Each rule takes the fuzzified features fj , j =1,...,K,K 6, as input and produces a fuzzy emo-tion value Iei as output. We represent each fea-
ture fj and emotion intensity Iei by five triangular
membership functions verylow, low,medium,high
and veryhigh as schematically depicted in Figure 5.
However, the actual start and end coordinates as well
as the maximum coordinates are generated automat-
ically during the training phase. This representationis simple enough to support real time emotion recog-
nition, yet allows to distinguish degrees to which a
feature or emotion is present in the current input sen-
tence. Furthermore, it is in line with psychologists
approaches who often use two up to ten levels for
characterizing psychological phenomena like emotion
intensities.
very
low low med. highvery
high
0
.5
0.25 0.5 0.75 1.0
.2
.8
1.0
Figure 5: Membership functions for features and emo-
tions
The rule set for the emotion ei is generated by a fuzzy
grid approach [11]. Since this approach uses only
the AND connector it generates 5K+1 rules of thefollowing form:
IFf1 IS verylow AND ... AND fK IS verylow THEN
Iei IS veryhigh
IFf1 IS verylow AND ... AND fK IS low THENIeiIS veryhigh
IFf1 IS verylow AND ... AND fK IS medium THEN
Iei IS medium
...
The number of rules could be reduced, if the OR con-
nector or rule pruning could be used. However, both
features are not yet supported by the fuzzy library we
use. For defuzzification of emotion values we use
the center of gravity (COG) method. By projecting
the COG to the x-axis we calculate the corresponding
emotion intensity. Hence a four dimensional vector
(Ih, Is, Ia, If) = (E(h), E(s), E(a), E(f))
containing the intensities of the four emotions hap-
piness h, sadness s, anger a, and fear f is generated.
This vector represents the membership values for each
EUSFLAT - LFA 2005
76
8/3/2019 An Adaptable Fuzzy Emotion Model for Emotion Recognition
5/6
emotion and hence determines a point in the four di-
mensional emotion hypercube.
Presently PROSBER recognizes a single basic emo-
tion. In order to select this emotion we determine the
emotion erec {h,s,a,f} with maximum intensityIerec = max{Ih, Is, Ia, If}. If the maximum cannotbe determined unambiguously, since two or more in-
tensity values are maximal, that emotion is selected
which was recognized for the previous sentence. The
neutral emotional state is identified by a hypercube
part near to the origin as depicted in Figure 2. We
plan to extend PROSBER for recognition of combined
emotions as described above.
5 Related Work
Up to now a variety of emotion models have been
described in literature. They are mainly dedicated
to computer based emotion (re)production or sim-
ulation in different application domains. A broad
application domain are virtual agents, that show
(pseudo)emotional behavior in their communication
with humans [3, 8, 5, 7]. They rely on a dimen-
sional model of emotions based on the event-appraisal
emotion model of Ortony et al. [14]. They usu-
ally distinguish the dimensions pleasure, arousal and
dominance and try to maintain their dynamics over
time. The PETEEI system [7] for simulation of a
pets evolving emotional intelligence similarly to our
system uses fuzzy sets for emotion representation.
However this system is different from our approach
since it associates certain types of events with posi-
tive or negative feelings in order to react with accord-
ing emotions whereas our approach is dedicated to
emotion recognition from certain features (of speech,
facial expression etc.). Emotion recognition as in-
vestigated in our approach is to some extent covered
by Kismet [6]. However Kismet recognizes inten-
tions rather than emotions. The emotional system de-
veloped for AIBO and SDR [1] like our model uses
basic emotions. Since it is intended for production
of emotional behavior it uses the dimensions plea-
sure and arousal as mentioned above. But the dom-
inance dimension is substituted by a confidence di-
mension representing the certainty of recognized ex-
ternal stimuli. The models described above only deal
with single emotions and do not allow to represent
combinations or blends of emotions like our approach.
The Cathexis model [17] supports this feature and
is also adaptable to different sets of basic emotions
or emotion families as they are called there by sup-
porting the coexistence of several active so called
emotion proto-specialists representing different emo-
tion families. However Cathexis is also dedicated tothe (re)production of emotional behavior in synthetic
agents whereas our adaptable emotion model is in-
tended for emotion recognition.
6 Conclusion and Outlook
This paper presented an adaptable emotion model for
emotion recognition. It uses the concept of an n-
dimensional fuzzy hypercube to represent emotional
states made up of n basic emotions. In contrast to
other approaches this allows not only the represen-
tation and recognition of a fixed set of basic emo-
tions but also supports the handling of derived emo-
tions. We showed the application of this model us-
ing the fuzzy prosody based emotion recognition sys-
tem PROSBER. As a first step we proposed a divi-
sion of the unit hypercube in equally sized subcubes
to distinguish basic emotions and their combinations
or blends. An interesting point for further investiga-
tion is whether this subdivision corresponds to human
recognition. This could for instance be done using a
learning approach that automatically finds such sub-
divisions and compares them with human interpreta-
tions of corresponding emotional states.
References
[1] R. C. Arkin, M. Fujita, T. Takagi, and R. Hasegawa.
An ethological and emotional basis for human-robot
interaction. In Robotics and Autonomous Systems,
vol. 42, no. 3, pages 191201. Elsevier Science, 2003.
[2] A. Austermann, N. Esau, L. Kleinjohann, andB. Kleinjohann. Prosody based emotion recognition
for mexi. In Proceedings of IEEE/RSJ Int. Confer-
ence on Intelligent Robots and Systems (IROS 2005),
Edmonton, Alberta, Canada, August 2005.
[3] J. Bates. The role of emotion in believable agents.
Communication of the ACM, 37(7), pages 122125,
1992.
[4] A. Boozer. Characterization of Emotional Speech
in Human-Computer-Dialogues. M.Sc. Thesis. MIT
Press, 2003.
EUSFLAT - LFA 2005
77
8/3/2019 An Adaptable Fuzzy Emotion Model for Emotion Recognition
6/6
[5] C. Breazeal. Affective interaction between humans
and robots. In Proc. of ECAL 01, pages 582591,
Prague, 2001.
[6] C. Breazeal and L. Aryananda. Recognition of affec-
tive communicative intent in robot-directed speech. In
Autonomous Robots 12, pages 83104. Kluwer Aca-demic Publishers, 2002.
[7] M. S. El-Nasr, J. Yen, and T. Ioerger. Flame - a
fuzzy logic adaptive model of emotions. In Automous
Agents and Multi-agent Systems 3, pages 219257,
2000.
[8] C. Elliot. The Affective Reasoner: A Process model of
emotions in a multi-agent system. Ph.D. Thesis. Insti-
tute for the Learning Sciences, Evanston, IL: North-
western University, 1992.
[9] N. H. Frijda. Neural Networks and Fuzzy Systems;
A Dynamical Systems Approach to Machine Intelli-gence. Cambridge University Press, 1986.
[10] S. Hashimoto. Kansei as the third target of informa-
tion processing and related topics. In Proceedings
of Intl. Workshop on Kansei Technology of Emotion,
pages 101104, 1997.
[11] H. Ishibuchi and T. Nakashima. A study on generat-
ing fuzzy classification rules using histogramms. In
Knowledge based Intelligent electronic Systems, Bd.
1. Prentice Hall, 1998.
[12] B. Kosko. Neural Networks and Fuzzy Systems; A Dy-
namical Systems Approach to Machine Intelligence.Prentice Hall, Englewood Cliffs, NJ, 1992.
[13] R. S. Lazarus. Emotion and Adaptation. Oxford Uni-
versity Press, 1991.
[14] A. Ortony, G. Clore, and A. Collins. The Cognitive
Structure of Emotions. Cambridge University Press,
1988.
[15] A. Ortony and W. Turner. Whats basic about basic
emotions? Psychological Review, pages 315331,
1990.
[16] R. Plutchik. The Emotions. University Press of Amer-
ica, Inc., revised edition, 1990.
[17] J. Velasquez. Modeling emotions and other motiva-
tions in synthetic agents. In Proceedings of the AAAI
Conference, pages 1015. Providence, RI, 1997.
EUSFLAT - LFA 2005
78