
Today

•  Finish Section on Linked Data
•  Begin data cleaning and pre-processing topic

Graphs: Social networks

https://www.flickr.com/photos/marc_smith/5592302165

Protein-Protein Interactions

http://www.nature.com/nrg/journal/v5/n2/fig_tab/nrg1272_F2.html

The Internet Graph (https://en.wikipedia.org/wiki/Opte_Project)

Linked Data

•  We need to connect data together, forming links
   –  A key part of the Semantic Web
   –  Also important for the Internet of Things (a predicted 26 billion things by 2020, each continuously producing data)

•  Principles of links from Tim Berners-Lee:
   1.  All kinds of conceptual things now have names that start with HTTP.
   2.  If I take one of these HTTP names and look it up, I will get back some data in a standard format: useful data that somebody might like to know about that thing, about that event.
   3.  When I get back that information, it's not just got somebody's height and weight and when they were born; it's got relationships. And whenever it expresses a relationship, the other thing it's related to is given one of those names that starts with HTTP.

Linked Data Examples

•  DBPedia
   –  ~5 million "things" from Wikipedia
   –  Can be linked to external datasets such as the CIA World Factbook and US Census data
   –  "Give me all cities in New Jersey with more than 10,000 people"
•  Freebase
•  FOAF (friend of a friend)
•  Google Knowledge Graph
   –  https://www.google.com/intl/bn/insidesearch/features/search/knowledge.html

Standards for Linked Data

•  Widely used standards (W3C Recommendations)
   –  JSON-LD (JSON Linked Data)
   –  RDF (Resource Description Framework)

JSON-LD (example from json-ld.org)

•  Provides mechanisms for specifying unambiguous meaning in JSON data
•  Provides extra keys marked with an "@" sign
   –  "@context" (defines the meanings of terms, mapping them to identifiers)
   –  "@type"
   –  "@id"
•  Use cases
   –  Google Knowledge Graph

JSON-LD Example (from https://en.wikipedia.org/wiki/JSON-LD)

{
  "@context": {
    "name": "http://xmlns.com/foaf/0.1/name",
    "homepage": {
      "@id": "http://xmlns.com/foaf/0.1/workplaceHomepage",
      "@type": "@id"
    },
    "Person": "http://xmlns.com/foaf/0.1/Person"
  },
  "@id": "http://me.example.com",
  "@type": "Person",
  "name": "John Smith",
  "homepage": "http://www.example.com/"
}
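Because JSON-LD is ordinary JSON, it can be loaded with Python's standard json module. The sketch below is illustrative only (full term expansion needs a dedicated JSON-LD processor); it resolves top-level keys against "@context" by hand:

```python
import json

# JSON-LD is plain JSON, so the standard json module can load it.
# This is only a sketch: it maps top-level terms to the IRIs declared
# in "@context", which is a simplified version of JSON-LD "expansion".
doc = json.loads("""
{
  "@context": {
    "name": "http://xmlns.com/foaf/0.1/name",
    "Person": "http://xmlns.com/foaf/0.1/Person"
  },
  "@id": "http://me.example.com",
  "@type": "Person",
  "name": "John Smith"
}
""")

context = doc["@context"]
# Replace each non-keyword key with its unambiguous IRI from the context.
expanded = {context.get(k, k): v for k, v in doc.items() if not k.startswith("@")}
print(expanded)  # {'http://xmlns.com/foaf/0.1/name': 'John Smith'}
```

The point of the "@context" is visible here: after expansion, "name" is no longer an ambiguous local key but the globally unique FOAF IRI.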

Graphs – RDF (Resource Description Framework) [materials from w3.org]

Serialisation of RDF Example Graph

This graph can be serialised as XML (don’t worry about syntax!)

<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:contact="http://www.w3.org/2000/10/swap/pim/contact#">
  <contact:Person rdf:about="http://www.w3.org/People/EM/contact#me">
    <contact:fullName>Eric Miller</contact:fullName>
    <contact:mailbox rdf:resource="mailto:em@w3.org"/>
    <contact:personalTitle>Dr.</contact:personalTitle>
  </contact:Person>
</rdf:RDF>

RDF – Triple Store

•  An alternative format for storing RDF-type data: the triple store

<http://www.w3.org/People/EM/contact#me> <http://www.w3.org/2000/10/swap/pim/contact#fullName> "Eric Miller" .
<http://www.w3.org/People/EM/contact#me> <http://www.w3.org/2000/10/swap/pim/contact#mailbox> <mailto:em@w3.org> .
<http://www.w3.org/People/EM/contact#me> <http://www.w3.org/2000/10/swap/pim/contact#personalTitle> "Dr." .
<http://www.w3.org/People/EM/contact#me> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/2000/10/swap/pim/contact#Person> .
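Each line above is one complete subject-predicate-object statement, which makes the format easy to process line by line. A naive Python sketch follows (real data should go through a proper RDF parser; this only handles the simple <iri> and "literal" forms shown above):

```python
# A minimal, naive N-Triples reader: one "subject predicate object ."
# statement per line. Only a sketch -- it handles just the <iri> and
# "literal" forms from the slide, not escapes, blank nodes, etc.
triples_text = '''
<http://www.w3.org/People/EM/contact#me> <http://www.w3.org/2000/10/swap/pim/contact#fullName> "Eric Miller" .
<http://www.w3.org/People/EM/contact#me> <http://www.w3.org/2000/10/swap/pim/contact#personalTitle> "Dr." .
'''

triples = []
for line in triples_text.strip().splitlines():
    # Drop the trailing " .", then split into exactly three fields.
    s, p, o = line.rstrip(" .").split(None, 2)
    triples.append((s.strip("<>"), p.strip("<>"), o.strip('"')))

# Query: collect all facts about one subject, keyed by local name.
subject = "http://www.w3.org/People/EM/contact#me"
facts = {p.rsplit("#", 1)[-1]: o for s, p, o in triples if s == subject}
print(facts)  # {'fullName': 'Eric Miller', 'personalTitle': 'Dr.'}
```

This triple-at-a-time view is exactly what makes a triple store queryable: every fact is a row, and queries are pattern matches over subjects, predicates and objects.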

Freebase

•  A large database that connects entities (facts, people, places, organizations, …) together as a graph
   –  www.freebase.com
   –  Freebase is the basis of the Google Knowledge Graph that is used to improve search
      •  https://developers.google.com/knowledge-graph/
•  Retrieving data from the Google Knowledge Graph
   –  Example adapted from http://www.nolan-nichols.com/knowledge-graph-via-sparql.html

Other formats for Graphs: Matrix Representation

Example graph with nodes A, B, C, D and edges A→C, C→B, D→B:

    A   B   C   D
A   0   0   1   0
B   0   0   0   0
C   0   1   0   0
D   0   1   0   0

A '1' in the matrix iff there is an edge from node X to node Y.

Or use a relational table:

Source   Destination
A        C
C        B
D        B
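The two representations hold the same information, so we can convert between them. A short Python sketch, using the node and edge names from the example above:

```python
# The example graph in both forms: an edge list ("relational table")
# and an adjacency matrix, converting one into the other and back.
nodes = ["A", "B", "C", "D"]
edges = [("A", "C"), ("C", "B"), ("D", "B")]  # (source, destination)

index = {n: i for i, n in enumerate(nodes)}
matrix = [[0] * len(nodes) for _ in nodes]
for src, dst in edges:
    matrix[index[src]][index[dst]] = 1  # 1 iff there is an edge src -> dst

print(matrix[index["A"]][index["C"]])  # 1: edge A -> C exists
print(matrix[index["A"]][index["B"]])  # 0: no edge A -> B

# Recover the relational table from the matrix:
recovered = [(nodes[i], nodes[j])
             for i in range(len(nodes))
             for j in range(len(nodes)) if matrix[i][j]]
print(recovered)  # [('A', 'C'), ('C', 'B'), ('D', 'B')]
```

The matrix makes edge lookups O(1) but costs space for every absent edge; the edge list stores only the edges that exist, which matters for sparse graphs.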

What you should know about data formats

•  Why do we have different data formats, and why do we wish to transform between them?
•  Motivation for using relational databases to manage information
•  Difference between a (standard) relational database and a NoSQL database
•  What is a CSV, what is a spreadsheet, and what is the difference?
•  Be able to write regular expressions in Python format (operators .^$*+|[])
•  Difference between HTML and XML, and when to use each
•  Motivation behind using XML and XML namespaces
•  Be able to read and write data in XML (elements, attributes, namespaces)
•  Be able to read and write data in JSON
•  Difference between XML and JSON; applications where each can be used
•  The purpose of using schemas for XML and JSON data
•  The motivation behind Linked Data and the purpose of using JSON-LD or RDF to represent it

Further reading

•  Relational databases
   –  Pages 403-409 of http://i.stanford.edu/~ullman/focs/ch08.pdf
•  XML
   –  http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SG.html
•  JSON and JSON-LD
   –  http://json.org
   –  http://crypt.codemancers.com/posts/2014-02-11-An-introduction-to-json-schema/
   –  https://cloudant.com/blog/webizing-your-database-with-linked-data-in-json-ld/#.Vtp_UMfB_Gw
•  RDF
   –  https://www.w3.org/DesignIssues/LinkedData.html
   –  http://www-sop.inria.fr/acacia/cours/essi2006/Scientific%20American_%20Feature%20Article_%20The%20Semantic%20Web_%20May%202001.pdf
   –  http://www.dlib.org/dlib/may98/miller/05miller.html

COMP20008 Elements of Data Processing

Data Pre-Processing and Cleaning

Why is pre-processing needed?

Name           Age         Date of Birth
"Henry"        20.2        20 years ago
Katherine      Forty-one   20/11/66
Michelle       37          5/20/79
Oscar@!!       "5"         13th Feb. 2011
-              42          -
Mike___Moore   669         -
巴拉克奥巴马   52          1961年8月4日

Why is pre-processing needed?

•  Measuring data quality
   –  Accuracy: correct or wrong, accurate or not
   –  Completeness: not recorded, unavailable
   –  Consistency: e.g. discrepancies in representation
   –  Timeliness: updated in a timely way
   –  Believability: do I trust that the data is correct?
   –  Interpretability: how easily can I understand the data?

Major data preprocessing activities

Data mining concepts and techniques, Han et al 2012

Terminology

Height   Weight   Age   Gender
1.8      80       22    Male
1.53     82       23    Male
1.6      62       18    Female

•  The 4 columns (Height, Weight, Age, Gender) are features or attributes
•  The data items (3 rows) are called instances or objects
•  Height, Weight and Age are continuous features
•  Gender is a categorical or discrete feature

Data integration

•  Bringing data from multiple sources together
   –  Resolve conflicts
   –  Detect duplicates
•  Will cover in depth in weeks 8 and 9

[Figure: two separate data sources combined into one integrated data source]

Data reduction

•  Decrease the number of features (columns) or instances (rows)
   –  Sampling strategies
   –  Remove irrelevant features and reduce noise
   –  Easier to visualise, faster to analyse
•  Will cover during the section on visualisation (weeks 5 and 6) and feature analysis (weeks 9 and 10)

http://bigdataexaminer.com/data-science/understanding-dimensionality-reduction-principal-component-analysis-and-singular-value-decomposition/

Data cleaning

•  Incomplete (missing) data
•  Noisy data
•  Inconsistent data
•  Intentionally disguised data

Data cleaning – The Process

•  Many tools exist (Google Refine, Kettle, Talend, …)
   –  Data scrubbing
   –  Data discrepancy detection
   –  Data auditing
   –  ETL (Extract Transform Load) tools: users specify transformations via a graphical interface
•  Our emphasis will be on understanding some of the methods employed by these tools

Missing or incomplete data

•  Lacking feature values
   –  Name=""
   –  Age=null
•  Types of missing data (Rubin 1976)
   –  Missing completely at random: data are missing independently of observed and unobserved data
      •  E.g. flipping a coin to decide whether or not to answer an exam question
   –  Missing not completely at random
      •  I create a dataset by surveying the class about how healthy they feel. What is the meaning of missing values for those who don't respond?
      •  I set an exam and ask a question in hard-to-understand language. What is the meaning of missing values for those who don't answer the question?

Example: USA Salary survey data

•  Is Person B's salary missing at random?
•  Very difficult to determine reasons for missingness
   –  In practice, report assumptions about missingness

Name       Salary
Person C   $59k
Person D   $63k
Person H   $99k
Person E   $102k
Person G   $140k
Person F   $150k
Person A   $180k
Person B   -

Causes of missing data

•  Why does it occur?
   –  Malfunction of equipment (e.g. sensors)
   –  Not recorded due to misunderstanding
   –  May not be considered important at time of entry
   –  Deliberate
•  How to handle it?
   –  We will look at a number of strategies

Extreme Missing data

•  Movie recommender systems

Person   Star Wars   Batman   Jurassic World   The Martian   The Revenant   Lego Movie   Selma   …
James    3           2        -                -             -              1            -
John     -           -        1                2             -              -            -
Jill     1           -        -                3             2              1            -

Users and movies: each user rates only a few movies (say 1%). Netflix wants to predict the missing ratings for each user.
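When ~99% of entries are missing, a dense table is mostly gaps. A common alternative is a sparse dict-of-dicts holding only the observed ratings; this is an illustrative sketch (the helper get_rating is made up for the example, and the data comes from the table above):

```python
# Sparse representation of the ratings table: store only observed
# ratings, so the ~99% missing entries cost nothing.
ratings = {
    "James": {"Star Wars": 3, "Batman": 2, "Lego Movie": 1},
    "John":  {"Jurassic World": 1, "The Martian": 2},
    "Jill":  {"Star Wars": 1, "The Martian": 3, "The Revenant": 2, "Lego Movie": 1},
}

def get_rating(user, movie):
    """Return the observed rating, or None if it is missing."""
    return ratings.get(user, {}).get(movie)

print(get_rating("James", "Star Wars"))  # 3
print(get_rating("James", "Selma"))      # None -- missing, to be predicted
```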

Noisy data

•  Truncated fields (e.g. exceeded an 80-character limit)
•  Text incorrectly split across cells (e.g. separator issues)
•  Salary="-5"
•  Some causes
   –  Imprecise instruments
   –  Data entry issues
   –  Data transmission issues

Inconsistent data

•  Different naming representations ("Melbourne University" versus "University of Melbourne", or "three" versus "3")
•  Different date formats ("3/4/2016" versus "3rd April 2016")
•  Age=20, Birthdate="1/1/2002"
•  Two students with the same student id
•  Outliers
   –  E.g. 62, 72, 75, 75, 78, 80, 82, 84, 86, 87, 87, 89, 89, 90, 999
      •  No good if it is a list of ages of hospital patients
      •  Might be OK for a listing of people's number of contacts on LinkedIn
   –  Can use automated techniques, but also need domain knowledge

Disguised data

•  Everyone's birthday is January 1st?
•  Email address is xx@xx.com
•  Adriaans and Zantinge:
   –  "Recently, a colleague rented a car in the USA. Since he was Dutch, his post-code did not fit the fields of the computer program. The car hire representative suggested that she use the zip code of the rental office instead."
•  How to handle
   –  Look for "unusual" or suspicious values in the dataset, using knowledge about the domain

Dealing with missing data

•  What are the consequences of missing data?
   –  May break application programs not expecting it
   –  Less power for later analysis
   –  May bias later analysis
•  So, how to handle it?

Strategy 1: Delete all instances with a missing value

•  Sometimes called case deletion
•  Effects
   –  Easy to analyse the new (complete) data
   –  May bias the analysis if the new sample size is small or structure exists in the missing data
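A minimal Python sketch of case deletion (instances are dicts, None marks a missing value, and the data is a made-up subset of the movie example):

```python
# Case deletion: keep only the instances (rows) with no missing values.
instances = [
    {"Name": "Mandy", "Star Wars": 1, "Batman": 2},
    {"Name": "James", "Star Wars": 3, "Batman": None},
    {"Name": "John", "Star Wars": None, "Batman": None},
]

# An instance survives only if every one of its values is present.
complete = [row for row in instances
            if all(v is not None for v in row.values())]
print([row["Name"] for row in complete])  # ['Mandy']
```

Note how aggressive this is: two of three instances are discarded because of a single missing value each, which is exactly the bias risk mentioned above.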

Case deletion

Before deletion:

Person   Star Wars   Batman   Jurassic World   The Martian   The Revenant   Lego Movie   Selma
Mandy    1           2        1                3             3              2            3
James    3           2        -                -             -              1            -
John     -           -        1                2             -              -            -
Jill     1           -        -                3             2              1            -

After deletion, only the complete instance remains:

Person   Star Wars   Batman   Jurassic World   The Martian   The Revenant   Lego Movie   Selma
Mandy    1           2        1                3             3              2            3

Strategy 2: Manually correct

•  A human eyeballs the missing value and fills it in using their expert knowledge


Strategy 3: Imputation

•  Impute a value (replace the missing value with a substitute one)
•  After imputing all missing values, standard analysis techniques for complete datasets can be used

Original ratings (with missing values):

Person   Star Wars   Batman   Jurassic World   The Martian   The Revenant   Lego Movie   Selma   …
James    3           2        -                -             -              1            -
John     -           -        1                2             -              -            -
Jill     1           -        -                3             2              1            -

After imputation:

Person   Star Wars   Batman   Jurassic World   The Martian   The Revenant   Lego Movie   Selma   …
James    3           2        2                2             1              1            1
John     3           2        1                2             2              1            1
Jill     1           1        1                3             2              1            1

Imputation: Fill in with zeros (or similar)

Person   Star Wars   Batman   Jurassic World   The Martian   The Revenant   Lego Movie   Selma   …
James    3           2        0                0             0              1            0
John     0           0        1                2             0              0            0
Jill     1           0        0                3             2              1            0

•  Simple
•  Won't break application programs
•  Limited utility for analysis
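A sketch of zero imputation in plain Python; the rows mirror the ratings table, with None marking a missing rating:

```python
# Zero imputation: replace every missing rating (None) with 0.
rows = [
    ["James", 3, 2, None, None, None, 1, None],
    ["John", None, None, 1, 2, None, None, None],
    ["Jill", 1, None, None, 3, 2, 1, None],
]

filled = [[0 if v is None else v for v in row] for row in rows]
print(filled[0])  # ['James', 3, 2, 0, 0, 0, 1, 0]
```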

Imputation: Fill in with mean value

•  Popular method
   –  Can be good for supervised classification
   –  Apply separately to each attribute

Name     Age
Daisy    10
Maisy    15
Harry    2
Jackie   -

Jackie's age is imputed to be (10+15+2)/3 = 9
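The same computation as a short Python sketch, with None marking the missing age:

```python
# Mean imputation for one attribute: fill each missing age with the
# mean of the observed ages.
ages = {"Daisy": 10, "Maisy": 15, "Harry": 2, "Jackie": None}

observed = [a for a in ages.values() if a is not None]
mean_age = sum(observed) / len(observed)  # (10 + 15 + 2) / 3 = 9.0

imputed = {name: (mean_age if a is None else a) for name, a in ages.items()}
print(imputed["Jackie"])  # 9.0
```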

Imputation: Fill in with mean value cont

•  Drawbacks
   –  Reduces the variance of the feature
   –  Gives an incorrect view of the distribution of that attribute
   –  Changes relationships to other features
•  Can also use the median instead of the mean (if the distribution is skewed)
•  Use mode (most frequent value) imputation for categorical features

Fill in with category mean

•  Take categories/clusters and compute the mean within each

Name     Age   Gender
Daisy    10    Female
Maisy    15    Female
Harry    2     Male
Jackie   -     Female

Jackie's age is imputed to be (10+15)/2 = 12.5 (considering the category "Female")
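A Python sketch of category-mean imputation, grouping the observed ages by Gender before filling:

```python
from collections import defaultdict

# Category-mean imputation: a missing age is filled with the mean age
# of that instance's category (here, Gender).
people = [
    {"Name": "Daisy", "Age": 10, "Gender": "Female"},
    {"Name": "Maisy", "Age": 15, "Gender": "Female"},
    {"Name": "Harry", "Age": 2, "Gender": "Male"},
    {"Name": "Jackie", "Age": None, "Gender": "Female"},
]

# Mean age per category, computed from observed values only.
by_category = defaultdict(list)
for p in people:
    if p["Age"] is not None:
        by_category[p["Gender"]].append(p["Age"])
means = {g: sum(v) / len(v) for g, v in by_category.items()}

for p in people:
    if p["Age"] is None:
        p["Age"] = means[p["Gender"]]

print(people[3]["Age"])  # 12.5 -- mean of the Female ages, (10 + 15) / 2
```

Using the category mean rather than the global mean preserves more of the between-group structure, at the cost of assuming the category is informative about the missing value.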

Time series: Last value carried forward

Day     Kilometres Walked
Day 1   8.9
Day 2   8.2
Day 3   9.6
Day 4   -
Day 5   11.6
Day 6   12.0

Kilometres walked on Day 4 = ?
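Last value carried forward as a Python sketch, with None marking the missing Day 4 entry:

```python
# Last value carried forward: a missing time-series entry takes the
# most recently observed value.
km_walked = [8.9, 8.2, 9.6, None, 11.6, 12.0]  # Days 1..6

filled = []
last = None
for value in km_walked:
    if value is None:
        value = last  # carry the previous observation forward
    filled.append(value)
    last = value

print(filled[3])  # 9.6 -- Day 4 takes Day 3's value
```

This only makes sense for ordered (time-series) data, and it silently assumes nothing changed during the gap.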

Acknowledgements

–  Data Mining: Concepts and Techniques. Han, Kamber and Pei. 3rd edition (chapter 3). Available through the library as an ebook.
–  Data Analysis Using Regression and Multilevel/Hierarchical Models. Gelman and Hill (chapter 25), 2006.

Next Week

•  Second workshop is available on the LMS
   –  Practice with JSON, XML and web scraping
•  Project will be released
•  Continue data pre-processing and cleaning
   –  Look at more complex techniques for value imputation (e.g. for the movie recommender system example)

Recommended