Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Domain-Specific Insight Graphs (DIG)
Pedro SzekelyMay 2017
1
dig.isi.edu2
Use the web to answer investigative questions
3
Use Case: Human Trafficking
100 million pages>5,000 Web sites
help victims &prosecute traffickers
4
Investigating a Reported Victim
San Diego, where else?5
Locations Where A Potential Victim Was Advertised
6
DIG Technology
raw w messy w disconnected clean w organized w linkedhard to query, analyze & visualize easy to query, analyze & visualize
7
Steps To Build a DIG
Crawling ExtractionData Acquisition
Mapping ToOntology
Entity Linking& Similarity
Knowledge GraphDeployment
Query &Visualization
ElasticSearch
GraphDB
schema.org geonames
8
Data Acquisition
batch w real-time
Web pages w Web service database w CSV w Excel
XML w JSON
9
Information ExtractionText
Web pages
Web tables
Images
PDF10
“YOU don't wanna miss out on ME :) Perfect lil booty Green eyes Long curly black hair Im a Irish, Armenian and Filipino mixed princess :) ❤ Kim ❤7○7~7two7~7four77 ❤ HH 80 roses ❤ Hour 120 roses ❤ 15 mins 60 roses”
name: Kimeye-color: greenhair-color: black
phone: 707-727-7477rate: $60/15min
$80/30min$120/60min
11
12
13
Schema Alignment karma.isi.edu
ServicesRelationalSources
{ JSON-LD }
Hierarchical Sources
Schema.org
14
Linking Using Image Similarity
15
DIG ApplicationsHuman Trafficking Identify victims, prosecute traffickers
Cyber AttacksPredict cyber attacks from dark web data
Firearms TraffickingIdentify illegal sales
PatentsIdentify patent trolls
Securities FraudIdentify fraudulent stocks in the Penny Stock market 16