59
http://lora-aroyo.org @laroyo Harnessing Human Semantics at Scale Measurable, Reproducible, Engaging, Sustainable Crowdsourcing & Nichesourcing Lora Aroyo

Digital Humanities Benelux 2017: Keynote Lora Aroyo

Embed Size (px)

Citation preview

Page 1: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Harnessing Human Semantics at Scale

Measurable, Reproducible, Engaging, SustainableCrowdsourcing & Nichesourcing

Lora Aroyo

Page 2: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

20071998 2006 2009

from DVDs to data science

Page 3: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

20071998 2006 2009

Team BellKor wins Netflix Prize

Page 4: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

20061994 2003 2016 2017

from books to data science

Page 5: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

20061994 2003 2016 2017

from books to data science

Page 6: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

20061994 2003 2016 2017

from books to data science

Page 7: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

20061994 2003 2016 2017

from books to data science

Page 8: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

20061994 2003 2016 2017

from books to data science

Page 9: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

data is at the centre of every process

Page 10: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

data is essential to evolve with users

Page 11: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Ceci n'est pas … la mona lisa

Page 12: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Ceci n'est pas … la mona lisa

Louvre’s Mona Lisa

is only #14

Page 13: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

the battle of two worlds

9,3 million

Louvre

visitors 2014

14 million

website visitors

2,3 million

social media

Page 14: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

in the (very near) future

most visitors will be digital-born

not bound by time or location

native to new forms of co-makership

native to new mediaSiebe Weide, Max Meijer and Marieke Krabshuis (2012).

Agenda 2026: Study on the Future of the Dutch Museum Sector

Page 15: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

variety of meaningsmultitude of perspectivesabundance of sourcesendless contexts

know your data

Page 16: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

crowdsourcing to know your data at scale

Page 17: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

variety of typesmultitude of platformsabundance of interactionsendless characteristics

know your crowds

Page 18: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

https://www.rijksmuseum.nl/en/rijksstudio

Engage with Co-creation

Page 19: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Engage with Co-creativity

Page 20: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Engage with Co-curation

Page 21: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Engage the Expert Niche

http://annotate.accurator.nl

Page 22: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

expertise of Rijksmuseum professionals is in annotating their collection

with art-historical information, e.g. when they were created, by whom, etc.

Page 23: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

detailed domain-specific information about depicted objects, e.g. which species the

animal or plant belongs to,is in most cases not available

Page 24: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

use nichesourcing, i.e. niches of people with the right expertise, to add more specific

information

Page 25: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Keep Reproducing

http://annotate.accurator.nl

Page 26: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Engage with Games

training the general crowd to be a niche:game in which players can carry out an expert

annotation tasks with some assistance

Page 27: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

http://waisda.nl

Engage with Games

Page 28: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

http://waisda.nl

Engage with Games

Page 29: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

http://spotvogel.vroegevogels.vara.nl

Keep Reproducing

Page 30: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

CrowdTruth.org

Experiment with Paid Crowds

Page 31: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

CrowdTruth.org

Experiment with Paid Crowds

Page 32: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

CrowdTruth.org

Experiment with Paid Crowds

Page 33: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

http://crowdtruth.org/

Page 34: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

http://data.crowdtruth.org/

Page 35: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Challenges

Page 36: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Low reproducibility ratesDifficult to estimate & control the time to complete Difficult to assess & compare quality Demands continuous promotional effortActive learning (human-in-the-loop) needs different expertiseDifficult to incorporate results into existing content infrastructure

Challenges

Crowdsourcing typically undertaken in isolation

Page 37: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Assess Impact of Task Design

Page 38: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

InstructionsLayoutSequenceCrowdsPaymentCampaign

Assess Impact of Task Design

experiment with different designs

Page 39: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

for example

mapping music to mood

Page 40: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Otherpassionate, rollicking, literate, humorous, silly, aggressive, fiery, does not fit into

rousing, cheerful, fun, poignant, wistful, campy, quirky, tense, anxious, any of the 5

confident, sweet, amiable, bittersweet, whimsical, witty, intense, volatile, clusters

boisterous, good-natured autumnal, wry visceral

rowdy brooding

Choose one:

Which is the mood most appropriate

for each song?

Goal:

(Lee and Hu 2012)

1 song - 1 mood???

Page 41: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

If “One Truth” & “No Disagreement”

Worker Mood-C1 Mood-C2 Mood-C3 Mood-C4 Mood-C5

W1 1

W2 1

W3 1

W4 1

W5 1

W6 1

W7

W8

W9 1

W10 1

Totals 1 3 1 2 1

Page 42: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Worker Mood-C1 Mood-C2 Mood-C3 Mood-C4 Mood-C5 Other

W1 1 1 1

W2 1 1 1

W3 1 1 1

W4 1 1

W5 1 1

W6 1 1 1

W7 1 1 1

W8 1 1 1

W9 1 1

W10 1 1 1 1 1

Totals 3 5 6 5 2 8

If “Many Truths” & “Disagreement”

Page 43: Digital Humanities Benelux 2017: Keynote Lora Aroyo

Web & Media Group

http://lora-aroyo.org @laroyo

simplification of context

this all results in

Page 44: Digital Humanities Benelux 2017: Keynote Lora Aroyo

Web & Media Group

http://lora-aroyo.org @laroyo

Page 45: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

● Identify Crowdsourcing Goals through user log analysis

○ # queries, #unique queries, #queries of specific type

○ ranked by popularity

○ ranked by popularity and with error, e.g.

■ # queries entered over 50 times with 0 results

■ # queries of specific type with 0 results

○ which will have biggest impact

○ which has biggest urgency

● … or through other user analysis

Assess Impact of Results

Page 46: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

for example

in video search

Page 47: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

people search for fragmentsexperts annotate full videos

35% of search queries result in not found

people search for fragmentsexperts annotate full videos

35% of search queries result in not found

for example

in video search

Page 48: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Measure Quality

“On the Role of User-Generated Metadata in A/V Collections”, Riste Gligorov et al. KCAP2011

Page 49: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Measure Quality

“On the Role of User-Generated Metadata in A/V Collections”, Riste Gligorov et al. KCAP2011

time-based annotationbernhard

88% of the tags usefulfor specific genres

describe short segmentsoften not very specificdon’t describe program as a whole

Page 50: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

for example

in video search

video annotation is time-consuming5 times the video duration

experts use a specific vocabularythat is unknown to general audiences

video annotation is time-consuming5 times the video duration

experts use a specific vocabularythat is unknown to general audiences

Page 51: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

user vocabulary8% in professional vocabulary23% in Dutch lexicon89% found on Google

locations (7%)

engeland

persons (31%)

objects (57%)

Measure Quality

“On the Role of User-Generated Metadata in A/V Collections”, Riste Gligorov et al. KCAP2011

Page 52: Digital Humanities Benelux 2017: Keynote Lora Aroyo

Web & Media Group

http://lora-aroyo.org @laroyo

human subjectivity, ambiguity & uncertainty of expression

natural part of human semantics

Page 53: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

measure quality

quality is not just about spamquality is typically multi-dimensionalunderstand the diversity in crowd answers do not ignore multitude of interpretationsunderstand the variety of contextsidentify cases with high ambiguity, similarity, …experiment with explicit metricsexperiment with different designs

Page 54: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Measure Progress

6 months 2 years

340,551 tags 36,981 tags

137.421 matches

602 items 1.782 items

555 registered players 2,017 users (taggers)

thousands of anonymous players

12,279 visits (3+ min online)

44,362 pageviews

Riste Gligorov, Michiel Hildebrand, Jacco van Ossenbruggen, Guus Schreiber, Lora Aroyo

(2011). On the role of user-generated metadata in audio visual collections. International

conference on Knowledge capture K-CAP '11, Pages 145-152

Page 55: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

campaign, campaign, campaign

Page 56: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Page 57: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Page 58: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Page 59: Digital Humanities Benelux 2017: Keynote Lora Aroyo

http://lora-aroyo.org @laroyo

Measurable qualityReproducible resultsSustainable settingsEngaging interaction

Goals