Upload
cataldo-musto
View
404
Download
5
Tags:
Embed Size (px)
Citation preview
A comparison of lexicon-based approaches for Sentiment Analysis
of microblog postsCataldo Musto, Giovanni Semeraro, Marco Polignano
(Università degli Studi di Bari ‘Aldo Moro’, Italy - SWAP Research Group)
DART 2014 8th Internation Workshop on
Information Filtering and Retrieval Pisa (Italy)
December 10, 2014
Outline• Background
• Sentiment Analysis • Lexicon-based approaches
• Methodology • State-of-the-art
lexicons • Experiments • Conclusions
2Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
BackgroundOne minute on the Web
3Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
BackgroundOne minute on the Web
4
Information Overload
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
5
BackgroundInformation Overload
Obstacleor Opportunity?Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
6
Opportunities(Social) Content Analytics
Insight: to aggregate rough human-generated data to get valuable people-based findings
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
7
Social Content AnalyticsApplications
- Online brand monitoring
- Social CRM- Real-time polls
All these applications share a common denominator
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
8
Social Content AnalyticsApplications
- Online brand monitoring
- Social CRM- Real-time polls
All these applications share a common denominator
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
They all need a methodology to automatically associate an opinion and/or a polarity to each piece of content
9
Social Content AnalyticsApplications
- Online brand monitoring
- Social CRM- Real-time polls
All these applications share a common denominatorThey all need a methodology to automatically associate
an opinion and/or a polarity to each piece of content
Solution: Sentiment Analysis
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
10
Sentiment AnalysisDefinition
“It is the field of study that analyzes people’s
opinions, sentiments, evaluations, appraisals, attitudes, and emotions towards entities such as
products, services, organizations, individuals, issues, events, topics, and
their attributes “ (*)
(Pang, Bo, and Lillian Lee. "Opinion mining and sentiment analysis." Foundations and trends in information retrieval, 2008)
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
11
Sentiment AnalysisDefinition
“It is the field of study that analyzes people’s
opinions, sentiments, evaluations, appraisals, attitudes, and emotions towards entities such as
products, services, organizations, individuals, issues, events, topics, and
their attributes “ (*)
(Pang, Bo, and Lillian Lee. "Opinion mining and sentiment analysis." Foundations and trends in information retrieval, 2008)
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
We will focus on the polarity detection task
12
Sentiment AnalysisState of the art
Supervised Approaches
(Machine Learning-based)
Unsupervised Approaches
(Lexicon-based)Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
13
Sentiment AnalysisSupervised approaches
Learn a classification model relying on labeled examples
?Man
Dog
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
14
Sentiment AnalysisUnsupervised approaches
Rely on external lexical resourcesthat associate a polarity score to each term.
Sentiment of the content depends on the sentiment of the terms which compose it.
joy +++
frustration - -
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
15
Sentiment AnalysisSupervised vs Unsupervised
Nakov, Preslav, et al. "Semeval-2013 task 2: Sentiment analysis in Twitter.” Proceedings of SemEval 2013Rosenthal, Sara, et al. "Semeval-2014 task 9: Sentiment analysis in Twitter." Proceedings of SemEval 2014.
(*)(**)
Pros Cons
Supervised Higher Accuracy (*) (**)
Pre-labeled examples
Unsupervised No TrainingAccuracy depends on lexical
resources
Several lexical resources available
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Pros Cons
Supervised Higher Accuracy (*) (**)
Pre-labeled examples
Unsupervised No TrainingAccuracy depends on lexical
resources
Several lexical resources available
We focus on lexicon-based approaches
16
Sentiment AnalysisSupervised vs Unsupervised
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
17
Contributions
We provide a comparison of
lexical resources for sentiment analysis of
microblog posts
We propose a novel unsupervised lexicon-
based approach for sentiment analysis
1.
2.
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
18
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lexicon-based approach
Insight:The polarity of a textual content (e.g. a
microblog posts) depends on the polarity of the microphrases which compose it.
19
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lexicon-based approach
Insight:The polarity of a textual content (e.g. a
microblog posts) depends on the polarity of the microphrases which compose it.
A microphrase is built whenever a splitting cue
is found in the text
20
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lexicon-based approach
Insight:The polarity of a textual content (e.g. a
microblog posts) depends on the polarity of the microphrases which compose it.
A microphrase is built whenever a splitting cue
is found in the text
Conjunctions, adverbs and punctuations are used as
splitting cues
21
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lexicon-based approach
Insight:The polarity of a textual content (e.g. a
microblog posts) depends on the polarity of the microphrases which compose it.
A microphrase is built whenever a splitting cue
is found in the text
Conjunctions, adverbs and punctuations are used as
splitting cues
example: “I don’t like this food, it’s terrible”
22
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lexicon-based approach
Insight:The polarity of a textual content (e.g. a
microblog posts) depends on the polarity of the microphrases which compose it.
A microphrase is built whenever a splitting cue
is found in the text
Conjunctions, adverbs and punctuations are used as
splitting cues
example: “I don’t like this food, it’s terrible”{ { m1 m2
splittingcue
23
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lexicon-based approach
Insight:
pol(T) = ∑ pol(mi)
The polarity of a textual content (e.g. a microblog posts) depends on the polarity of the microphrases which compose it.
i=1
k
Tweet microphrase
T={m1…mk}
24
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lexicon-based approach
Insight:
pol(T) = ∑ pol(mi)i=1
k
The polarity of a microphrase depends on the polarity of the terms which compose it.
pol(mi) = ∑ score(tj)j=1
term
n
T={m1…mk}
Mi={t1…tn}
Tweet microphrase
25
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Four variant proposed
Basic pol(T) = ∑ pol(mi)
i=1
k
pol(mi) = ∑ score(tj)j=1
n
score(tj)
26
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic pol(T) = ∑ pol(mi)
i=1
k
pol(mi) = ∑ score(tj)j=1
n
Normalized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑j=1
n
|mi|
Score of each microphrase is normalized according to its length
Four variant proposed
score(tj)
27
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic pol(T) = ∑ pol(mi)
i=1
k
pol(mi) = ∑ score(tj)j=1
n
Normalized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑j=1
n
|mi|
Emphasized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑ score(tj)j=1
n*w(tj)
Specific categories are provided with an higher weight
categories=adverbs, verbs, adjectives & valence &&
valence shifters (intensifiers & downtoners)Several weights have been evaluated
Four variant proposed
score(tj)
28
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic pol(T) = ∑ pol(mi)
i=1
k
pol(mi) = ∑ score(tj)j=1
n
Normalized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑j=1
n
|mi|
Emphasized Normalized-Emphasized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑ score(tj)j=1
n
pol(T) = ∑ pol(mi)
pol(mi) = ∑score(tj)|mi|
*w(tj) *w(tj)
Combination
Four variant proposed
score(tj)
29
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
We have a problemBasic
pol(T) = ∑ pol(mi)i=1
k
pol(mi) = ∑ score(tj)j=1
n
Normalized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑j=1
n
|mi|
Emphasized Normalized-Emphasized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑ score(tj)j=1
n
pol(T) = ∑ pol(mi)
pol(mi) = ∑score(tj)|mi|
*w(tj) *w(tj)
score(tj)
30
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic pol(T) = ∑ pol(mi)
i=1
k
pol(mi) = ∑ score(tj)j=1
n
Normalized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑j=1
n
|mi|
Emphasized Normalized-Emphasized pol(T) = ∑ pol(mi)
i=1
pol(mi) = ∑ score(tj)j=1
n
pol(T) = ∑ pol(mi)
pol(mi) = ∑score(tj)|mi|
*w(tj) *w(tj)
How to calculate score(tj) ?
We have a problem
31
Solution
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
32
Lexical ResourcesState of the art
We evaluated four state-of-the-art resources for sentiment analysis
SentiWordNet
WordNet Affect
SenticNet
MPQA
http://sentiwordnet.isti.cnr.it
http://wndomains.fbk.eu/wnaffect.html
http://sentic.net
http://mpqa.cs.pitt.edu
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
33
Lexical ResourcesSentiWordNet(*)
Each WordNet synset is provided with three different sentiment scores (positivity, negativity, objectivity)
(*) Baccianella, Stefano, Andrea Esuli, and Fabrizio Sebastiani. "SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining." LREC. Vol. 10. 2010.
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
34
Lexical ResourcesWordNet Affect(*)
Affective-related synsets are mapped with an A-Label
e.g. euphoria —> positive-emotion illness —> physical state
WordNet extension
(*) Strapparava, Carlo, and Alessandro Valitutti. "WordNet Affect: an Affective Extension of WordNet." LREC. Vol. 4. 2004.
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
35
Lexical ResourcesSenticNet(*)
(*) Cambria, Erik, Daniel Olsher, and Dheeraj Rajagopal. "SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis." Twenty-eighth AAAI conference on artificial intelligence. 2014.
Inspired by the Hourglass of Emotions model
Each term is represented of the ground of the intensity of four basic emotional dimensions (sensitivity, aptitude, attention, pleasantness)
The activation level of each dimension defines 16 basic emotions
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
36
Lexical ResourcesSenticNet(*)
According to the triggered emotions, each term is provided with an aggregated polarity score
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
37
Lexical ResourcesSenticNet(*)
SenticNet models a sentiment score for some bigrams and trigrams as well!
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
38
Lexical ResourcesMPQA(*)
(*) Wilson, Theresa, Janyce Wiebe, and Paul Hoffmann. "Recognizing contextual polarity in phrase-level sentiment analysis." Proceedings of the conference on human language technology and empirical methods in natural language processing. Association for Computational Linguistics, 2005.
Each term is (manually) provided
with a discrete sentiment score
+1 positive0 neutral
-1 negative
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
39
Lexical ResourcesComparison
Resource Coverage (terms)
SentiWordNet 117,659
WordNet Affect 200
SenticNet 14,000
MPQA 8,222
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
40Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
41
Lexical Resources
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Score calculation
SentiWordNetGiven a term, score(tj) is the
mean of the sentiment score of
all the possible synsets of tj
0.75 + 0 + 1 +1score(good) = = 4
0.687
score(benevolence) =
42
Lexical Resources
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Score calculationWordNet Affect
Given a term, score(tj), WordNet Affect hierarchy is
climbed until an A-Label which occur in SentiWordNet is found.
tj inherits the sentiment score of the A-Label
score(good) = 0.339
43
Lexical Resources
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Score calculation
SenticNetGiven a term,
score(tj), SenticNet APIs are queried and sentiment
score is extracted
0.883score(good) =
44
Lexical Resources
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Score calculation
MPQAGiven a term,
score(tj), MPQA Lexicon are queried and
sentiment score is extracted
1score(good) =
45
Methodology
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Experimental EvaluationResearch Hypothesis
46
1. How do the different versions of the algorithm perform with respect to state-of-the-art datasets?
2. What is the best lexical resource to detect the polarity of microblog posts?
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Experimental EvaluationDescription of the datasets
47
• SemEval-2013 • 14,435 Tweets
• 8,180 training • 3,255 test • Positive, Negative, Neutral
• STS Dataset • 1,600,000 Tweets
• only 359 test • Positive, Negative
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Experimental EvaluationStatistics about Coverage
48
Lexicon SemEval-2013-Test STS-Test
Vocabulary Size 18,309 6,711
SentiWordNet 4,314 883
WordNet-Affect 149 48
MPQA 897 224
SenticNet 1,497 326
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Experiment 1
49
Intra-Lexicons evaluation
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
significant (p < 0,0001)
Basic
Normalized
Emphasized
Norm-Emph
45 50 55 60 65
58,99
58,65
58,1
57,67
Experiment 1
50
SemEval :: SentiWordNet
Emphasis and Normalization improve the accuracy
norm vs norm+emph
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic
Normalized
Emphasized
Norm-Emph
45 50 55 60 65
55,08
53,95
55,05
53,92
Experiment 1
51
SemEval :: WordNet Affect
Emphasis and Normalization improve the accuracy
not significant
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic
Normalized
Emphasized
Norm-Emph
45 50 55 60 65
58,1
58,25
57,97
58,03
Experiment 1
52
SemEval :: MPQA
Emphasis improves the accuracy. Normalization doesn’t.
not significant
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic
Normalized
Emphasized
Norm-Emph
45 50 55 60 65
48,08
48,29
47,25
48,69
Experiment 1
53
SemEval :: SenticNet
No improvement
norm vs norm+emph
significant (p < 0,0001)
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Experiment 1
54
General OutcomesSentiWordNet WordNet Affect MPQA
Emphasis leads to improvements (7 out of 8 comparisons).
1.2.
SenticNet
Normalization doesn’t. (1 out of 4 comparisons)
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic
Normalized
Emphasized
Norm-Emph
60 63,75 67,5 71,25 75
71,59
71,31
72,42
71,87
Experiment 1
55
STS :: SentiWordNet
Normalization improves the accuracy. Emphasis doesn’t
not significantgaps
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic
Normalized
Emphasized
Norm-Emph
60 63,75 67,5 71,25 75
62,95
62,96
62,67
62,95
Experiment 1
56
STS :: WordNet Affect
not significantgaps
Emphasis improves the accuracy. Normalization doesn’tCataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic
Normalized
Emphasized
Norm-Emph
60 63,75 67,5 71,25 75
70,76
69,92
70,75
69,54
Experiment 1
57
STS :: MPQA
not significantgaps
Both Emphasis and Normalization improve the accuracy.Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Basic
Normalized
Emphasized
Norm-Emph
70 71,75 73,5 75,25 77
74,65
73,82
74,65
74,37
Experiment 1
58
STS :: SenticNet
Normalization improves the accuracy. Emphasis doesn’tCataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
not significant
Experiment 1
59
General OutcomesSentiWordNet WordNet Affect MPQA
Controversial behavior (normalization typically improves, emphasis doesn’t)
1.2.
SenticNet
Little statistical significance (small dataset)
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Experiment 2
60
Inter-Lexicons evaluation
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Experiment 2
61
Comparison between lexicons
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
Experiment 2
62
Comparison between lexicons
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
SentiWordNet is the best-performing configuration on SemEval data
Experiment 2
63
Comparison between lexicons
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
MPQA well-performs on SemEval data
Experiment 2
64
Comparison between lexicons
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
SenticNet has a controversial behavior: worst on SemEval - best on STS
Experiment 2
65
Comparison between lexicons
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
Reason: SenticNet can hardly classify neutral Tweets (threshold learning?)
Experiment 2
66
Comparison between lexicons
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
SentiWordNet and MPQA confirm their performance on STS
Experiment 2
67
Comparison between lexicons
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
Poor coverage negatively influences Wordnet-Affect performances
Experiment 2
68
Statistical Analysis
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
= not significant gap = significant gap
p < 0,11p < 0,0001bestbest p < 0,42p < 0,0001 p < 0,001 p < 0,50
Experiment 2
69
Conclusions
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Accu
racy
0
20
40
60
80
SemEval-2013 STS
70,76
58,2562,96
55,08
74,65
48,69
72,42
58,99
SentiWordNet SenticNet WordNet-Affect MPQA
p < 0,11p < 0,0001bestbest p < 0,42p < 0,0001 p < 0,001 p < 0,50
= best-performing lexicons
Conclusions
70Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
Lessons Learned
71
Comparison of 4 state-of-the-art resourcesSentiWordNet - SenticNet - MPQA - WordNet Affect
Evaluation.Research Question: What is the impact of each lexical resource in the task of polarity classification?
MPQA and SentiWordNet typically overcome other resources (interesting result, due to the smaller coverage of MPQA)
SenticNet behavior is worth to be deepen investigated
INVESTIGATION ABOUT THE EFFECTIVENESS OF LEXICAL RESOURCES IN POLARITY CLASSIFICATION OF MICROBLOG POSTS
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014
1.2.
Future Research
72
Evaluation against different datasets and with more lexical results;
Better tuning of parameters (classification threshold) , integration of more complex syntactic structures, merging lexical resources
Integration of the algorithm in a recommendation framework to exploit sentiment-based information to model user interests
Cataldo Musto, Giovanni Semeraro, Marco Polignano A comparison of lexicon-based approaches for sentiment analysis of microblog posts. DART 2014 Workshop, Pisa(Italy) 10.12.2014