21
UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and Prediction of Problematic Users in Wikipedia Students - Charu Rawat , Arnab Sarkar, Sameer Singh Faculty Advisor – Rafael Alvarado Clients – Patrick Earley , Sydney Poore Advisor – Lane Rasberry Trust and Safety Team, Wikimedia Foundation 1

UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

  • Upload
    others

  • View
    18

  • Download
    0

Embed Size (px)

Citation preview

Page 1: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

UVA DSI Capstone Project

Wikimedia Foundation, Trust & Safety

Automatic Detection of Online Abuse and

Prediction of Problematic Users in Wikipedia

Students - Charu Rawat , Arnab Sarkar, Sameer Singh

Faculty Advisor – Rafael Alvarado

Clients – Patrick Earley , Sydney Poore

Advisor – Lane Rasberry

Trust and Safety Team, Wikimedia Foundation

1

Page 2: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

And roughly 80% of Americans say tech companies should do more to prevent it- survey from the Anti-Defamation League

1 in 3 Americans Suffered Online Harassment in 2018

Page 3: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

How can we as Data Scientists do what we do

best, and make these communities

safer from online

Page 4: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

The English Wikipedia over the last few years…

https://commons.wikimedia.org/wiki/File:Wikidata_Items_Map_2014_%E2%80%93_2017_gif.gif

• Languages: 250+

• Popularity: Alexa Rank #5

• Total Edits: 868k

• All Pages: 46 mill

• Articles 5 mill

• Registered Users: 35 mill

• Administrators: ~1,193

Page 5: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Online Harassment at WikipediaPew Research Centre Survey (America centric)

41% have experienced

personal attacks

66% observed attacks

directed towards others

47% reported a decrease in

contribution and engagement levels

Page 6: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Human Driven Process

Failure to identify bad behavior

Bias to overlook such behavior

Inability to understand context

and scope of problem

Our goal was to use machine learning to develop a model that can detect abusive content as well as predict

problematic users in the community.

To that end, we leveraged a variety of data to analyse the prevalence and nature of online harassment at

scale.

There is a need for a more robust process to combat

harassment, one that can scale well as the Wikipedia

community continues to grow in its size and diversity.

Problem

Page 7: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

User Comment Text

Data Repository

on AWS

User Blocks

Identified blocked

users

SQL Queries

Extracted user

comments

Python web scraper

Data Pipeline

ORES score dataUser Activity

Extracted activity

statistics

SQL Queries

Wiki automated edit

quality scores

Python web scraper

Google Ex: Machina

Human annotated wiki user

comments

CSV files

Page 8: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

• ~1.02 million unique users

have been blocked in

Wikipedia till Oct 2018

• 91.5% registered users

• 8.5% are anonymous

users

• Increase in user bocks in

2017, 2018

8

Block User Trends in Wikipedia

Page 9: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

For what reasons are users getting blocked over the years?

Page 10: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

90% of all blocked users are most active in the week that they get blocked

within their 8-week rolling window

90,442

10,3177,499 5,937 5,102 4,309 3,872 3,554

Active status of blocked users

Page 11: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Abuse Detection - Problem at hand

• Goal

Prediction of a toxicity score for each user comment

• Inputs

User Blocks, User Comment Text, User Activity, and Google Ex: Machina Corpus

• Challenges

Data Cleaning and Pre-processing

Ground truth labeling

Class imbalance

Page 12: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Model Building Methodology

• Corpus Aggregation + Annotation

• Feature Extraction

Natural Language Processing

Orthographical Features

• Implementing Machine Learning Algorithms

Classification Techniques

Train – Test split

K-fold cross validation

• Model comparison

Using Google Ex Machina dataset to compare best performing model

• Model Tweaking

Page 13: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Modeling Deep Dive – Feature Extraction

• Natural Language Processing

Char n-gram

Word n-gram

NLTK Sentiment Analyzer

Latent Dirichlet Allocation

Word Embedding - GloVe

Word Embedding - fastText

• Orthographical Features

Numeric Digits

Capital Characters

Special Characters

Average length of characters in word

Page 14: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

65.36%66.03%

63.54%

66.74%

68.26%

71.30%

50.00%

55.00%

60.00%

65.00%

70.00%

75.00%

Logistic Regression Linear SVM Random ForestClassifier

Gradient BoostingClassifier

XGBoost Classifier XGBoost Classifier(Theshold - 0.4)

Are

a U

nd

er

the

RO

C C

urv

e

Modeling Deep Dive – ML Algorithms

Page 15: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Toxicity Score Evolution

Page 16: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

• Goal

Prediction of propensity of user to be blocked in the future

• Inputs

User Blocks, User Comment Text, User Activity, Toxicity Scores, and ORES Scores

• Methodology

Take into account n-1 recent scores and activity features of each user to predict the

propensity for their nth comment to be abusive

Leveraging Naïve Bayes approach with different Machine Learning Algorithms

User Risk Model – Early Detection of Problematic Users

Page 17: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

User Risk Model - Early Detection of Problematic Users

Page 18: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

89.65%

91.98% 92.38%

87.85%89.40%

90.29%

50.00%

55.00%

60.00%

65.00%

70.00%

75.00%

80.00%

85.00%

90.00%

95.00%

Linear SVM Logistic XGBoost

Pe

rce

nta

ge

Accuracy

AUC ROC

User Risk Model – ML Algorithms

Page 19: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

• Optimizing for higher Recall values instead of Precision

Performance of XGBoost model at different thresholds

Page 20: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Wikipedia is

full of

morons. I’m

done with

this!

Thanks! This

is helpful

I agree with

him on the

editing trend

F%$@ I will

delete

everything

you post!!!

I don’t think

this is

relevant

How is this

going chicken

shit

0.5

0.3

0.8

0.6

0.1

0.0

WEEKLY STATS

edit count: 5

avg length: 500

active days : 4

WEEKLY STATS

edit count: 7

avg length: 1230

active days : 2

WEEKLY STATS

edit count: 9

avg length: 30

active days : 10

WEEKLY STATS

edit count: 4

avg length: 740

active days : 7

WEEKLY STATS

edit count: 3

avg length: 100

active days : 4

WEEKLY STATS

edit count: 30

avg length: 510

active days : 14

Page 21: UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety · 2019-06-20 · UVA DSI Capstone Project Wikimedia Foundation, Trust & Safety Automatic Detection of Online Abuse and

Thank you!