Khalid El-Arini Carnegie Mellon University Joint work with: Ulrich Paquet, Ralf Herbrich, Jurgen Van...

Khalid El-Arini, Carnegie Mellon University

Joint work with: Ulrich Paquet, Ralf Herbrich, Jurgen Van Gael, Blaise Agüera y Arcas

Transparent User Models for Personalization

Personalization is ubiquitous.

• YouTube: 72+ hours/minute of new video
• Facebook: 950 million+ users
• Twitter: 400+ million tweets/day
• Shopping:
  [1994]: 500K unique consumer goods sold in U.S.
  [2010]: Amazon alone offered 24 million.

Personalization is invaluable.

Keyword search is not enough.

Personalization is often wrong.

“Basil…is not a neo-Nazi. Lukas…is not a shadowy stalker. David…is not Korean. … intent on giving them such labels.”
- J. Zaslow, November 26, 2002

“there's just one way to change its mind: outfox it.”
- J. Zaslow, November 26, 2002

What recourse do we have?

Can we do better?

You behave like a vegan hipster

Vegan? Really? Why?

You:
• tweeted with #meatlessmonday
• follow @WholeFoods
• …

We propose an alternative.

Why am I getting this?

You behave like a Brooklyn hipster

Goal: Achieve transparency via interpretable user features, learned from user activity

Badges

Approach Model Experiments Summary

1. Define a vocabulary of badges

Apple fanboy, vegan, runner, photographer

Rich, interpretable and explainable

2. Identify exemplars

How do I find vegans?

observed label

Take advantage of how users describe themselves

Most vegans don’t label themselves as “vegan” on Twitter…

we want to infer the attributes of these users
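A minimal sketch of how exemplars might be found from self-descriptions, assuming a profile bio is plain text and each badge comes with a hand-written list of label strings. The function name `has_label` and the matching rule (case-insensitive whole-word match) are illustrative assumptions, not something the slides specify.

```python
import re
from typing import Iterable


def has_label(bio: str, labels: Iterable[str]) -> bool:
    """Return True if any label string appears as a whole word in the bio."""
    text = bio.lower()
    return any(
        re.search(r"\b" + re.escape(label.lower()) + r"\b", text) is not None
        for label in labels
    )


# Hypothetical bios; the label list for the "vegan" badge is an assumption.
print(has_label("Vegan runner from Brooklyn", ["vegan"]))
print(has_label("Coffee, code, cats", ["vegan"]))
```

Plain string matching cannot tell a self-description from a negated or hostile mention, which is one reason to treat the profile label as a noisy observation rather than as ground truth.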

2. Identify exemplars
3. Model characteristic behavior
• Hashtags: #meatlessmonday
• Retweets: RT @WholeFoods

• We have no negative training examples. Use a generative model.
• Actions can be explained by multiple badges, even for the same user. Noisy-or to combine badges.
• How do we deal with user corrections? Observing a latent variable.
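To make the noisy-or combination concrete, here is a small sketch, assuming each badge (and the background) independently triggers an action with some probability. The numbers are made up for illustration and do not come from the talk.

```python
from typing import Sequence


def noisy_or(probs: Sequence[float]) -> float:
    """Probability that at least one independent cause produces the action."""
    p_none = 1.0
    for p in probs:
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must lie in [0, 1]")
        p_none *= 1.0 - p  # probability that this cause stays silent
    return 1.0 - p_none


# Hypothetical values: a "vegan" badge explains #meatlessmonday with
# probability 0.30, a "runner" badge with 0.05, the background with 0.01.
p_action = noisy_or([0.30, 0.05, 0.01])
print(round(p_action, 5))
```

Because the causes multiply only through their "silent" probabilities, adding a badge can never make an action less likely, which matches the idea that any one badge (or the background) is enough to explain it.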

Model sketch

[Plate diagram, built up over several slides: B badges (i=1…B), N users (u=1…N), F actions (j=1…F)]

• bi(u): Does user u have badge i?
• λi(u): Does user u have label for badge i in his profile?
• aj(u): Has user u performed action j?
• sij: Does badge i explain action j?
• φij: What’s the probability that a user with badge i performs action j?
• φbg: What is the background probability for each action?
• noisy or: Can at least one of my badges (or the background) explain it?
• Beta priors to control sparsity
• Beta prior to encode low recall (e.g., 10%)
• Beta prior to encode high precision (e.g., 99.9%)

Other symbols in the diagram: ηi, ωi, γiT, γiF; hyperparameters αφ, βφ, αη, βη, αω, βω, αT, βT, αF, βF.
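The generative story in the sketch can be simulated forward for one user. The code below is an illustration under stated assumptions, not the authors' implementation: it assumes ωi is the prior probability of holding badge i, that γiT and γiF are the probabilities of a profile label when the badge is held or not held, and that all parameters are given rather than drawn from their Beta priors. The function name `sample_user` is hypothetical.

```python
import random


def sample_user(omega, gamma_t, gamma_f, s, phi, phi_bg, rng):
    """Sample (badges, labels, actions) for one user.

    omega[i]    assumed prior probability that a user holds badge i
    gamma_t[i]  assumed P(label i in profile | badge i held)
    gamma_f[i]  assumed P(label i in profile | badge i not held)
    s[i][j]     1 if badge i can explain action j, else 0
    phi[i][j]   probability that a user with badge i performs action j
    phi_bg[j]   background probability of action j
    """
    n_badges, n_actions = len(omega), len(phi_bg)
    badges = [rng.random() < omega[i] for i in range(n_badges)]
    labels = [
        rng.random() < (gamma_t[i] if badges[i] else gamma_f[i])
        for i in range(n_badges)
    ]
    actions = []
    for j in range(n_actions):
        # Noisy-or: the action is absent only if the background and
        # every held badge that explains it all stay silent.
        p_none = 1.0 - phi_bg[j]
        for i in range(n_badges):
            if badges[i] and s[i][j]:
                p_none *= 1.0 - phi[i][j]
        actions.append(rng.random() < 1.0 - p_none)
    return badges, labels, actions


# Toy parameters (made up): 2 badges, 3 actions.
rng = random.Random(0)
print(sample_user(
    omega=[0.2, 0.1], gamma_t=[0.1, 0.1], gamma_f=[0.001, 0.001],
    s=[[1, 0, 0], [0, 1, 0]], phi=[[0.3, 0.0, 0.0], [0.0, 0.4, 0.0]],
    phi_bg=[0.01, 0.01, 0.05], rng=rng,
))
```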

Inference

• Collapsed Gibbs sampler (with MH steps)

[Plate diagram repeated]

You behave like a vegan hipster.
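The talk names a collapsed Gibbs sampler with MH steps but gives no update equations. As a simplified illustration only, the sketch below resamples one user's badges with every parameter held fixed (nothing is collapsed and there are no MH steps), so it should not be read as the authors' sampler. The roles assumed for `omega`, `gamma_t` and `gamma_f` (badge prior, label probability with and without the badge) and all function names are assumptions.

```python
import random


def action_likelihood(badges, actions, s, phi, phi_bg):
    """P(observed actions | badges) under the noisy-or model."""
    lik = 1.0
    for j, performed in enumerate(actions):
        p_none = 1.0 - phi_bg[j]
        for i, has_badge in enumerate(badges):
            if has_badge and s[i][j]:
                p_none *= 1.0 - phi[i][j]
        lik *= (1.0 - p_none) if performed else p_none
    return lik


def badge_posterior(i, badges, labels, actions,
                    omega, gamma_t, gamma_f, s, phi, phi_bg):
    """P(badge i held | other badges, profile label, actions)."""
    weights = []
    for value in (False, True):
        candidate = list(badges)
        candidate[i] = value
        prior = omega[i] if value else 1.0 - omega[i]
        g = gamma_t[i] if value else gamma_f[i]
        label_lik = g if labels[i] else 1.0 - g
        weights.append(
            prior * label_lik
            * action_likelihood(candidate, actions, s, phi, phi_bg)
        )
    total = weights[0] + weights[1]
    if total == 0.0:
        raise ValueError("observations have zero probability for both values")
    return weights[1] / total


def gibbs_sweep(badges, labels, actions,
                omega, gamma_t, gamma_f, s, phi, phi_bg, rng):
    """Resample each badge of one user in turn from its conditional."""
    badges = list(badges)
    for i in range(len(badges)):
        p = badge_posterior(i, badges, labels, actions,
                            omega, gamma_t, gamma_f, s, phi, phi_bg)
        badges[i] = rng.random() < p
    return badges
```

A user correction ("I am not a vegan") fits the same loop: the corrected badge is fixed to the stated value and skipped during resampling, which is what observing a latent variable amounts to here.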

Data description

• Start with 7 million Twitter users
• Manually define 31 sample badges by specifying labels
• Gather 2 million tweets from August 2011
• Recall: actions are hashtags and retweets

Remove infrequent actions and inactive users, leaving us with:

75,880 users
32,030 actions
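The filtering step might look like the sketch below. The slides do not give the cutoffs, so `min_action_users` and `min_user_actions` are hypothetical thresholds, and repeating the pruning until nothing changes is an assumption about how the two filters interact.

```python
from collections import Counter


def filter_sparse(user_actions, min_action_users=2, min_user_actions=2):
    """Drop rare actions and inactive users until the data stops changing.

    user_actions maps a user id to the set of actions (hashtags, retweets)
    that user performed. Both thresholds are illustrative, not from the talk.
    """
    data = {user: set(acts) for user, acts in user_actions.items()}
    while True:
        counts = Counter(a for acts in data.values() for a in acts)
        keep = {a for a, c in counts.items() if c >= min_action_users}
        pruned = {user: acts & keep for user, acts in data.items()}
        pruned = {
            user: acts for user, acts in pruned.items()
            if len(acts) >= min_user_actions
        }
        if pruned == data:
            return pruned
        data = pruned


# Toy example: u3 is inactive, and removing u3 makes "#c" too rare.
toy = {
    "u1": {"#a", "#b"},
    "u2": {"#a", "#b", "#c"},
    "u3": {"#c"},
}
print(filter_sparse(toy))
```

Removing a user can push an action below its threshold and vice versa, which is why the sketch loops to a fixed point instead of applying each filter once.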

Badge statistics

[Chart over the 31 badges (x-axis: Badges); called-out badges: artist, photographer, country music fan, book worm]

Can we learn badges?

Vegetarian badge

Runner badge

Hacker badge

Manchester United badge

Do all badges look this good?

No, but most do.

wine lover

Over-generalized

Overwhelmed

Ruby on Rails

Can we just use the labels directly?

Inferred Apple fanboy badge

Self-described Apple fanboys

Comparative Analysis

• Compare to labeled LDA [Ramage+ 2009]
  – LDA extension where each document is labeled with multiple tags
  – One-to-one mapping between topics and tags
  – Document explained only by topics associated with its tags
• Hold out random 10% of labels, treat as ground truth, and try to predict them
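One way to score the held-out evaluation is by the rank a model assigns to each hidden label. The sketch below assumes each model produces a per-user score for every badge (for instance, a posterior probability); the scoring interface, the tie handling and the function names are illustrative assumptions rather than the procedure used in the talk.

```python
from statistics import mean


def held_out_rank(scores, held_out):
    """1-based rank of the held-out badge when sorted by descending score.

    Ties are resolved in favour of the held-out badge (optimistic rank).
    """
    target = scores[held_out]
    return 1 + sum(1 for score in scores.values() if score > target)


def mean_held_out_rank(cases):
    """Average rank over (scores, held_out) pairs; lower is better."""
    return mean(held_out_rank(scores, held_out) for scores, held_out in cases)


# Hypothetical scores for one user whose "hacker" label was hidden.
scores = {"vegan": 0.9, "runner": 0.4, "hacker": 0.7}
print(held_out_rank(scores, "hacker"))
```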

Rank of held-out labels

Better predictive performance

Better predictions for active

Sparse badges

Apple fanboy (badges) vs. Apple fanboy (l-lda)

Leveraged how users describe themselves to build interpretable user features

You behave like a vegan hipster

Empirically showed we can infer a user’s attributes from his behavior

Thank you

What recourse do we have?

Collaborative filtering

Content-based filtering

Can we do better?

Most vegans don’t label themselves as “vegan” on Twitter…

…but what about non-vegans?

“I drink too much and hate vegans.”
