Entity-Centric Topic-Oriented Opinion Summarization in Twitter


Date: 2013/09/03
Authors: Xinfan Meng, Furu Wei, Xiaohua Liu, Ming Zhou, Sujian Li and Houfeng Wang
Source: KDD'12
Advisor: Jia-ling Koh
Speaker: Yi-hsuan Yeh


Outline
- Introduction
- Topic Extraction
- Opinion Summarization
- Experiment
- Conclusion


Introduction

Microblogging services, such as Twitter, have become popular channels for people. People not only share daily updates and personal conversations, but also exchange opinions on a broad range of topics. However, people may express opinions towards different aspects, or topics, of an entity.


Introduction

Goal: produce opinion summaries that are organized by topic and that emphasize the insight behind the opinions.




Topic Extraction: #hashtags

#hashtags are created organically by Twitter users as a way to categorize messages and to highlight topics. We use #hashtags as candidate topics.


Topic Extraction

1. Collect person/location dictionaries from ODP and Freebase, and apply a rule-based classifier.
2. Split each #hashtag into multiple words, then check whether some of the words appear in the person/location dictionaries.
3. Tagness (threshold = 0.85). Example: occurrences of #fb = 95, total occurrences of its content = 100, so tagness = 95/100 = 0.95 > 0.85, and #fb is removed.
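The tagness filter in step 3 can be sketched as follows. A minimal sketch: the function names are mine; only the ratio and the 0.85 threshold come from the slide.

```python
def tagness(hashtag_occurrences: int, total_occurrences: int) -> float:
    """Fraction of a term's occurrences that appear as a #hashtag."""
    return hashtag_occurrences / total_occurrences

def keep_as_topic(hashtag_occurrences: int, total_occurrences: int,
                  threshold: float = 0.85) -> bool:
    # Hashtags used almost exclusively as tags (e.g. #fb) carry little
    # topical content and are filtered out.
    return tagness(hashtag_occurrences, total_occurrences) <= threshold

# The #fb example from the slide: 95 of 100 occurrences are the tag itself.
print(tagness(95, 100))        # 0.95
print(keep_as_topic(95, 100))  # False -> removed
```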


Graph-based Topic Extraction

Affinity Propagation algorithm
Input: pairwise relatedness matrix over #hashtags
Output: #hashtag clusters and the centroids of the clusters

Relatedness 1. Co-occurrence Relation
[Figure: a co-occurrence graph over hashtags h1-h6 is partitioned into clusters around centroid hashtags.]
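The clustering step above can be sketched with scikit-learn's AffinityPropagation. This is an assumption: the slides do not say which implementation is used, and the relatedness values below are invented to mimic the h1-h6 figure, not taken from the paper.

```python
import numpy as np
from sklearn.cluster import AffinityPropagation

# Toy pairwise relatedness matrix for six hashtags h1..h6, with two
# clearly related groups: {h1, h2, h3} and {h4, h5, h6}.
S = np.array([
    [1.0, 0.9, 0.8, 0.1, 0.0, 0.1],
    [0.9, 1.0, 0.7, 0.0, 0.1, 0.0],
    [0.8, 0.7, 1.0, 0.1, 0.0, 0.1],
    [0.1, 0.0, 0.1, 1.0, 0.8, 0.9],
    [0.0, 0.1, 0.0, 0.8, 1.0, 0.7],
    [0.1, 0.0, 0.1, 0.9, 0.7, 1.0],
])

# affinity="precomputed" lets us pass the relatedness matrix directly.
ap = AffinityPropagation(affinity="precomputed", random_state=0).fit(S)
print(ap.labels_)                   # cluster id per hashtag
print(ap.cluster_centers_indices_)  # centroid (exemplar) hashtags
```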


Relatedness 2. Context Similarity

Example term vectors for hi and hj over context terms t1-t4:
hi: (4, 0, 5, 3)
hj: (2, 3, 0, 6)

Cosine(hi, hj) = [(4*2) + (0*3) + (5*0) + (3*6)] / [(4^2 + 0^2 + 5^2 + 3^2)^(1/2) * (2^2 + 3^2 + 0^2 + 6^2)^(1/2)] = 26 / (7 * 50^(1/2)) ≈ 0.525
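A minimal sketch of the context-similarity computation; only the two example vectors come from the slide, the helper name is mine.

```python
import math

def cosine(u, v):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Context vectors for h_i and h_j over terms t1..t4, as in the example.
h_i = [4, 0, 5, 3]
h_j = [2, 3, 0, 6]
print(round(cosine(h_i, h_j), 3))  # 0.525
```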


Relatedness 3. Topic-Aware Distributional Similarity

Labeled LDA is applied to the other words in the tweets to estimate a word distribution for each #hashtag.

Example distributions for hi and hj over words w1-w4:
hi: (0.3, 0.1, 0.5, 0.1)
hj: (0.4, 0.3, 0.1, 0.2)

KL(hi, hj) = (ln(0.4/0.3) * 0.4) + (ln(0.3/0.1) * 0.3) + (ln(0.1/0.5) * 0.1) + (ln(0.2/0.1) * 0.2) ≈ 0.422
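The KL computation above can be reproduced directly. The helper name is mine; the distributions are the slide's example, which puts hj's probabilities in the numerator.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) = sum_w p(w) * ln(p(w) / q(w))."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Labeled-LDA word distributions for h_i and h_j over words w1..w4.
h_i = [0.3, 0.1, 0.5, 0.1]
h_j = [0.4, 0.3, 0.1, 0.2]

# The slide's example places h_j in the numerator:
print(round(kl_divergence(h_j, h_i), 3))  # 0.422
```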


Topic Labeling and Assignment

For a tweet with #hashtag(s), we assign it the topic(s) corresponding to every #hashtag in the tweet. For a tweet without #hashtags, we predict its topic with an SVM classifier using bag-of-words features.
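The bag-of-words SVM step can be sketched with scikit-learn. The library choice and the toy tweets are assumptions; the slides only specify an SVM classifier with bag-of-words features, trained on tweets whose topics are known from their #hashtags.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Invented training tweets, labeled with topics as #hashtags would label them.
tweets = [
    "new album drops tonight so excited",
    "loved the concert last night",
    "the match went to penalties",
    "great goal in the second half",
]
topics = ["music", "music", "sports", "sports"]

# Bag-of-words features feeding a linear SVM.
clf = make_pipeline(CountVectorizer(), LinearSVC())
clf.fit(tweets, topics)

print(clf.predict(["what a goal by the keeper"]))
```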


Outline
- Introduction
- Topic Extraction
- Opinion Summarization
  - Insightful Tweet Classification
  - Opinionated Tweet Classification
  - Summary Generation
- Experiment
- Conclusion


Insightful Tweet Classification

The Stanford Parser is used to match the pattern syntax trees against the tweet syntax trees. To create a high-coverage pattern set, we use a paraphrase generation algorithm. Example: "that is why" → "which is why".
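A heavily simplified stand-in for the step above: it matches surface strings rather than syntax trees, and the seed pattern and paraphrase table are invented; the paper matches parse trees produced by the Stanford Parser. The sketch only illustrates how paraphrase expansion grows the pattern set's coverage.

```python
# Seed patterns plus a paraphrase table (both invented for illustration;
# the "that is why" -> "which is why" pair is the slide's example).
SEED_PATTERNS = ["that is why"]
PARAPHRASES = {"that is why": ["which is why", "this is why"]}

def expand_patterns(seeds, paraphrases):
    """Grow the pattern set by adding paraphrases of each seed."""
    patterns = set(seeds)
    for seed in seeds:
        patterns.update(paraphrases.get(seed, []))
    return patterns

def is_insightful(tweet, patterns):
    """A tweet is flagged insightful if any pattern occurs in it."""
    text = tweet.lower()
    return any(p in text for p in patterns)

patterns = expand_patterns(SEED_PATTERNS, PARAPHRASES)
print(is_insightful("Prices doubled, which is why people are angry", patterns))  # True
```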


Opinionated Tweet Classification

A lexicon-based sentiment classifier relies on sentiment dictionary matching: it counts the occurrences of positive (cp) and negative (cn) words.

Negation expressions (e.g., "eliminate", "reduce"): if the distance in words between a negation expression neg and a sentiment word w is smaller than a predefined threshold (5), the sentiment orientation of w is inverted.
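A minimal sketch of the lexicon-based classifier with the negation window. The word lists are invented for illustration; only the window size of 5 and the inversion rule come from the slide.

```python
POSITIVE = {"good", "great", "love"}
NEGATIVE = {"bad", "terrible"}
NEGATIONS = {"not", "no", "never", "eliminate", "reduce"}
WINDOW = 5  # predefined distance threshold from the slide

def classify(tweet):
    words = tweet.lower().split()
    cp = cn = 0  # counts of positive / negative occurrences
    for i, w in enumerate(words):
        if w not in POSITIVE and w not in NEGATIVE:
            continue
        polarity = 1 if w in POSITIVE else -1
        # Invert the orientation if a negation expression is nearby.
        if any(n in NEGATIONS and abs(i - j) < WINDOW
               for j, n in enumerate(words)):
            polarity = -polarity
        if polarity > 0:
            cp += 1
        else:
            cn += 1
    if cp > cn:
        return "positive"
    if cn > cp:
        return "negative"
    return "neutral"

# "eliminate" within 5 words of "bad" flips its orientation:
print(classify("this will eliminate the bad traffic"))  # positive
```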


Target-Lexicon Dependency Classification

A binary SVM classifier determines whether a sentiment word (w) is used to depict the target (e). Features:
1. The distance in words between w and e
2. Whether there are other entities between w and e
3. Whether there are punctuation marks between w and e
4. Whether there are other sentiment words between w and e
5. The relative position of w and e: w is before or after e
6. Whether there is a dependency relation between w and e (MST Parser)
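Features 1-5 above can be extracted from plain token positions; a hedged sketch (feature 6 needs a dependency parse from the MST Parser and is omitted here; the example sentence and word sets are invented).

```python
import string

def target_lexicon_features(words, w_idx, e_idx, entities, sentiment_words):
    """Features 1-5 for a sentiment word at w_idx and a target at e_idx."""
    lo, hi = min(w_idx, e_idx), max(w_idx, e_idx)
    between = words[lo + 1:hi]  # tokens strictly between w and e
    return {
        "distance": hi - lo,                                        # feature 1
        "entity_between": any(t in entities for t in between),      # feature 2
        "punct_between": any(any(c in string.punctuation for c in t)
                             for t in between),                     # feature 3
        "sentiment_between": any(t in sentiment_words
                                 for t in between),                 # feature 4
        "w_before_e": w_idx < e_idx,                                # feature 5
    }

words = "the camera on the iPhone is amazing".split()
feats = target_lexicon_features(words, w_idx=6, e_idx=4,
                                entities={"iPhone"}, sentiment_words={"amazing"})
print(feats)
```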


Summary Generation

Select a subset of tweets P from the tweet set Tk for topic k.

1. Language style score
Example: "I am Avril Lavigne's biggest fan!! ❤" → L(ti) = 1 + (1/7) = 1.143


2. Topic relevance score

Term distributions of tweet ti and topic label lk over terms t1-t4:
ti: (0.2, 0.1, 0.6, 0.1)
lk: (0.1, 0.5, 0.2, 0.2)

KL(ti, lk) = (ln(0.1/0.2) * 0.1) + (ln(0.5/0.1) * 0.5) + (ln(0.2/0.6) * 0.2) + (ln(0.2/0.1) * 0.2) ≈ 0.654


3. Redundancy score

Word distributions of tweet ti and tweet tj over words t1-t5:
ti: (0.1, 0.35, 0.2, 0.15, 0.2)
tj: (0.4, 0.1, 0.15, 0.3, 0.05)

KL(ti, tj) = (ln(0.4/0.1) * 0.4) + (ln(0.1/0.35) * 0.1) + (ln(0.15/0.2) * 0.15) + (ln(0.3/0.15) * 0.3) + (ln(0.05/0.2) * 0.05) ≈ 0.525
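A sketch of how the topic-relevance and redundancy KL scores might drive greedy tweet selection. The weighted combination, the helper names, and all distributions below are assumptions; the slides define only the individual scores, and the language style score is left out here.

```python
import math

def kl(p, q):
    """KL(p || q) over two discrete word distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def summarize(tweets, topic_dist, k=2, redundancy_weight=1.0):
    """Greedily pick k tweets: high topic relevance (low KL to the topic
    label distribution), low redundancy (high KL to already-picked tweets)."""
    selected = []
    remaining = set(tweets)
    while remaining and len(selected) < k:
        def score(name):
            dist = tweets[name]
            relevance = -kl(dist, topic_dist)
            novelty = min((kl(dist, tweets[s]) for s in selected), default=0.0)
            return relevance + redundancy_weight * novelty
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy word distributions (illustrative, not from the paper).
topic = [0.25, 0.25, 0.25, 0.25]
tweets = {
    "t1": [0.25, 0.25, 0.30, 0.20],  # close to the topic
    "t2": [0.26, 0.24, 0.30, 0.20],  # near-duplicate of t1
    "t3": [0.10, 0.40, 0.10, 0.40],  # less relevant but novel
}
print(summarize(tweets, topic, k=2))  # picks t1, then the non-redundant t3
```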




Data

Tweets collected from 2011.9 to 2011.10.


Evaluation of Topic Extraction


Evaluation of Opinion Summarization


Language style score = 1




Conclusion

We propose an entity-centric topic-oriented opinion summarization framework, which is capable of producing opinion summaries in accordance with topics and emphasizing the insight behind the opinions in Twitter.

In the future, we will further study the semantics underlying #hashtags, which we can make use of to extract more comprehensive and interesting topics.
