Machine Learning on Graphs


Diego Kozlowski
Faculty of Science, Technology and Medicine

November 18, 2020


About me


Networks

Deep Learning on Networks

Science of Science


Networks

Networks, or graphs, are a way of representing information where there is a relation between elements. The generality of this definition makes it a proper tool for many different scenarios:

- In social media, for example, user interactions are known to build a social network, but real-life interactions between people in a company, school, village, etc. can also be modeled with networks,
- in chemistry, the interactions between molecules can be thought of as a network,
- in transportation, moving from one point to another can be represented as a network where nodes are intermediate points and edges represent the possibility of movement,
- in economics, we can represent the banking system as a network of financial entities and money transfers; international trade can also be represented in this way, etc.


Network types

A network can be:

- directed, if the edge between vertices has a one-way direction,
- weighted, if edges are categorical or take values in R,
- attributed, if the nodes have features,
- signed, if edges can take negative values,
- heterogeneous, if there can be different types of edges/vertices.

Heterogeneous networks can represent multiple types of relations. For example, knowledge graphs represent the relations between concepts: a fact like "Paris is in France" could be represented as two nodes, Paris and France, connected by the relation is in (see the sketch below).
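As an illustration, a minimal sketch of such a graph with the networkx library (an assumed tool for this example, not mentioned in the talk):

    import networkx as nx

    # A tiny heterogeneous knowledge graph: typed nodes, a typed directed relation.
    G = nx.DiGraph()
    G.add_node("Paris", kind="city")
    G.add_node("France", kind="country")
    G.add_edge("Paris", "France", relation="is in")

    print(G["Paris"]["France"]["relation"])  # -> 'is in'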


Representation

[Figure: a small directed graph on nodes A through I.]

We can represent a network using an adjacency matrix, where a 1 at position ij indicates that there exists a link between nodes i and j. Note that there is no natural order for rows and columns.

        A B C D E F G H I
    A   0 0 0 0 0 0 1 0 0
    B   0 0 1 0 0 0 0 0 0
    C   0 0 0 1 0 0 0 0 0
    D   1 0 0 0 0 0 0 0 0
    E   0 1 0 0 0 1 0 0 0
    F   0 0 0 0 0 0 0 0 0
    G   0 0 0 0 0 0 0 0 1
    H   0 0 0 0 0 0 0 0 1
    I   0 0 0 0 0 0 0 0 0
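A minimal sketch of this construction in Python (numpy assumed), reading the edges off the matrix above:

    import numpy as np

    nodes = list("ABCDEFGHI")
    edges = [("A", "G"), ("B", "C"), ("C", "D"), ("D", "A"),
             ("E", "B"), ("E", "F"), ("G", "I"), ("H", "I")]

    idx = {n: i for i, n in enumerate(nodes)}    # an arbitrary node order
    A = np.zeros((len(nodes), len(nodes)), dtype=int)
    for u, v in edges:
        A[idx[u], idx[v]] = 1                    # 1 at position ij <=> edge i -> j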


Tasks on networks

In networks, given the flexibility of the representation, there exist many different tasks:

- network classification,
- node classification,
- link prediction,
- community detection,
- network generation.


Tasks on networks

Besides the different possible tasks, there is also an important distinction:

- transductive framework: all nodes, their features and their links are visible during training, and only the labels of the test set are removed. This is semi-supervised learning (see the sketch below),
- inductive setting: a fully supervised learning problem. The nodes/networks to classify are not seen during training. It is a more general, though more complex, task.
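A minimal sketch of the transductive setup (the mask-based bookkeeping is an assumption, though it is the common pattern for node classification):

    import numpy as np

    # Transductive split: the whole graph (nodes, features, edges) is visible
    # during training; only the labels of the test nodes are held out.
    num_nodes = 9
    labels = np.random.randint(0, 2, size=num_nodes)  # toy node labels
    train_mask = np.zeros(num_nodes, dtype=bool)
    train_mask[:5] = True                             # nodes with visible labels
    test_mask = ~train_mask                           # labels hidden here

    # The loss is computed on training nodes only, but the model still
    # propagates information over the full graph:
    # loss = loss_fn(predictions[train_mask], labels[train_mask])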


Deep Learning on Networks

Problem

Traditional Deep Learning models for images or text depend on regularities that not all networks have. The two main difficulties are:

- nodes have a variable number of neighbors,
- neighbors cannot be ordered.


Images as networks

Images can be thought of as regular networks, where each node is connected to the adjacent pixels following a regular spatial pattern.
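A sketch of this view (networkx assumed): a grid graph with one node per pixel.

    import networkx as nx

    # A 4x4 image as a regular network: nodes are pixels, edges connect
    # horizontally/vertically adjacent pixels.
    G = nx.grid_2d_graph(4, 4)
    print(G.degree[(0, 0)])  # corner pixel: 2 neighbors
    print(G.degree[(1, 1)])  # interior pixel: 4 neighbors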


Sequences as networks

A text can also be seen as a sequential network:

The cat is under the table

The same can be said of any sequential data, like time series, sensor data, etc.
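A sketch of the sentence above as a chain graph (networkx assumed):

    import networkx as nx

    tokens = "The cat is under the table".split()
    # Chain over word positions (positions avoid merging repeated words).
    G = nx.path_graph(len(tokens))
    nx.set_node_attributes(G, dict(enumerate(tokens)), "word")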


Embeddings

- For any of the tasks on networks mentioned before, the first step is to build an embedding representation of the nodes.
- We train an encoder that maps the nodes to a low-dimensional embedding space.
- The goal is that the distances in the embedding space preserve the distances in the network (a sketch follows below).

(Hamilton 2020)
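A minimal sketch of this idea under a simplifying assumption: a shallow encoder (one free vector per node) fitted so that embedding similarity approximates the adjacency matrix. This is not the specific model from the talk.

    import numpy as np

    # Shallow encoder: a lookup table Z with one d-dimensional row per node,
    # trained so that z_i . z_j is large exactly when nodes i and j are linked.
    rng = np.random.default_rng(0)
    n, d, lr = 9, 4, 0.01
    A = np.zeros((n, n))
    A[[0, 1, 2, 3, 4, 4, 6, 7], [6, 2, 3, 0, 1, 5, 8, 8]] = 1  # example edges
    A = np.maximum(A, A.T)                  # symmetrize for this sketch

    Z = rng.normal(scale=0.1, size=(n, d))  # the embeddings to learn
    for _ in range(500):
        grad = 4 * (Z @ Z.T - A) @ Z        # gradient of ||Z Z^T - A||_F^2
        Z -= lr * grad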


Spatial/Message Passing I

- In the spatial framework, we build the representation of each node as an AGGREGATE of its neighbors,
- at each iteration (layer), the embedding of a node encodes information from more distant neighbors,
- we can initialize the embeddings with the feature vector of each node (a sketch follows below).

(Hamilton 2020)
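A minimal numpy sketch of one message-passing layer, assuming a mean AGGREGATE and a simple UPDATE (the weight names are illustrative, not from the talk):

    import numpy as np

    def message_passing_layer(H, A, W_self, W_neigh):
        # AGGREGATE: mean of the neighbors' embeddings.
        deg = A.sum(axis=1, keepdims=True).clip(min=1)   # avoid dividing by zero
        M = (A @ H) / deg
        # UPDATE: combine each node's own embedding with its message.
        return np.tanh(H @ W_self + M @ W_neigh)

    # Layer 0: the embedding of each node is initialized with its features.
    rng = np.random.default_rng(0)
    n, f, d = 9, 5, 4
    H0 = rng.normal(size=(n, f))                         # node feature vectors
    A = (rng.random((n, n)) < 0.3).astype(float)         # toy adjacency matrix
    H1 = message_passing_layer(H0, A, rng.normal(size=(f, d)), rng.normal(size=(f, d)))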


Spatial/Message Passing II

- How we AGGREGATE the neighbors' information, and how we UPDATE the representation from the previous layer with it, defines the different types of models.
- Kipf and Welling 2017 use the average of the previous layer with the neighbors, while Hamilton, Ying, and Leskovec 2017 use a concatenation as update and the average as aggregate,
- Xu et al. 2019 use the sum of the neighbors (see the sketch below).

(Hamilton 2020)
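A schematic sketch of these choices (a rough reading of the cited papers, omitting normalization details):

    import numpy as np

    def aggregate(H, A, kind):
        if kind == "mean":   # averaging neighbors, GCN-like (Kipf and Welling 2017)
            return (A @ H) / A.sum(axis=1, keepdims=True).clip(min=1)
        if kind == "sum":    # summing neighbors, GIN-like (Xu et al. 2019)
            return A @ H
        raise ValueError(kind)

    def update_concat(H, M, W):
        # GraphSAGE-like UPDATE (Hamilton, Ying, and Leskovec 2017):
        # concatenate each node's embedding with its message, then project.
        return np.tanh(np.concatenate([H, M], axis=1) @ W)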


Further reading I

There are other frameworks for deep learning on graphs. For a timeline of different models and future directions:

[Figure: timeline of models: recurrent, random walks, convolutional (spectral and spatial, including GCN), autoencoders, attention, Graph-U-Nets.]


Further reading II

- recurrent (Scarselli et al. 2009),
- random walks (Perozzi and Skiena 2014; Grover and Leskovec 2016),
- spectral methods (Bruna et al. 2013),
- GCN (Kipf and Welling 2017),
- spatial (message passing) (Hamilton, Ying, and Leskovec 2017; Xu et al. 2019),
- attention (Veličković et al. 2018; Thekumparampil et al. 2018),
- autoencoders (Kipf and Welling 2016),
- pooling layers (Gao and Ji 2019).


Science of Science

"The Science of Science is based on a transdisciplinary approach that uses large data sets to study the mechanisms underlying the doing of science" (Fortunato et al. 2018)



How I use it

I use GNNs on research publications, where each article is a node with metadata and text, and articles are linked by their citations. I train the embedding for the link prediction task, i.e., I try to predict whether two articles are related through a citation (a decoder sketch follows the figure below).

[Figure: the autoencoder, Encoder -> z -> Decoder (Kozlowski et al. 2020).]
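A minimal sketch of a link-prediction decoder under the common inner-product assumption (as in Kipf and Welling 2016; not necessarily the exact decoder of the cited paper):

    import numpy as np

    def predict_link(Z, i, j):
        # Probability that articles i and j are related through a citation,
        # scored from their embeddings with an inner-product decoder.
        return 1.0 / (1.0 + np.exp(-Z[i] @ Z[j]))   # sigmoid(z_i . z_j)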


[Figure: articles embedding (Kozlowski et al. 2020).]


Computational Mechanics

- As a future project, we would like to explore the field of computational mechanics and build useful artifacts for exploring the topics within the field,
- for this, we want to share with you a drive sheet for suggestions on the most relevant journals from the field,
- you are welcome to join us!


Acknowledgement

The Doctoral Training Unit Data-driven computational modelling and applications (DRIVEN) is funded by the Luxembourg National Research Fund under the PRIDE programme (PRIDE17/12252781).

https://driven.uni.lu



Bibliography I

[1] William L. Hamilton. “Graph Representation Learning”. In: Synthesis Lectures on Artificial Intelligence and Machine Learning 14.3 (2020), pp. 1–159.

[2] Thomas N. Kipf and Max Welling. “Semi-supervised classification with graph convolutional networks”. In: 5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings. 2017, pp. 1–14. arXiv: 1609.02907.

[3] Keyulu Xu et al. “How powerful are graph neural networks?” In: 7th International Conference on Learning Representations, ICLR 2019 (2019), pp. 1–17. arXiv: 1810.00826.

[4] F. Scarselli et al. “The Graph Neural Network Model”. In: IEEE Transactions on Neural Networks 20.1 (2009), pp. 61–80.

[5] Bryan Perozzi and Steven Skiena. “DeepWalk: Online Learning of Social Representations”. In: (2014).


Bibliography II

[6] Aditya Grover and Jure Leskovec. “node2vec”. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD ’16. New York, New York, USA: ACM Press, 2016, pp. 855–864. isbn: 9781450342322. doi: 10.1145/2939672.2939754. url: http://dl.acm.org/citation.cfm?doid=2939672.2939754.

[7] Joan Bruna et al. Spectral Networks and Locally Connected Networks on Graphs. 2013. arXiv: 1312.6203 [cs.LG].

[8] William L. Hamilton, Rex Ying, and Jure Leskovec. “Inductive representation learning on large graphs”. In: Advances in Neural Information Processing Systems (2017), pp. 1025–1035. issn: 10495258. arXiv: 1706.02216.

[9] Petar Veličković et al. “Graph attention networks”. In: 6th International Conference on Learning Representations, ICLR 2018 - Conference Track Proceedings (2018), pp. 1–12. arXiv: 1710.10903.

[10] Kiran K. Thekumparampil et al. “Attention-based Graph Neural Network for Semi-supervised Learning”. In: (2018), pp. 1–15. arXiv: 1803.03735. url: http://arxiv.org/abs/1803.03735.


Bibliography III

[11] Thomas N. Kipf and Max Welling. “Variational Graph Auto-Encoders”. In: (2016), pp. 1–3. arXiv: 1611.07308. url: http://arxiv.org/abs/1611.07308.

[12] Hongyang Gao and Shuiwang Ji. “Graph U-Nets”. In: Proceedings of the 36th International Conference on Machine Learning. Long Beach, California, 2019.

[13] Santo Fortunato et al. “Science of science”. In: Science 359.6379 (2018). issn: 10959203. doi: 10.1126/science.aao0185.

[14] Diego Kozlowski et al. “Semantic and Relational Spaces in Science of Science: Deep Learning Models for Article Vectorisation”. In: arXiv preprint arXiv:2011.02887 (2020).


Other resources

Implementation of models: https://pytorch-geometric.readthedocs.io/
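A minimal sketch of a two-layer GCN with PyTorch Geometric (toy data; GCNConv and Data are the library's own classes):

    import torch
    import torch.nn.functional as F
    from torch_geometric.data import Data
    from torch_geometric.nn import GCNConv

    # Toy graph: 3 nodes with 8 features each; undirected edges are stored
    # in both directions, as PyTorch Geometric expects.
    edge_index = torch.tensor([[0, 1, 1, 2],
                               [1, 0, 2, 1]], dtype=torch.long)
    data = Data(x=torch.randn(3, 8), edge_index=edge_index)

    class GCN(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.conv1 = GCNConv(8, 16)
            self.conv2 = GCNConv(16, 2)   # e.g. two node classes

        def forward(self, data):
            h = F.relu(self.conv1(data.x, data.edge_index))
            return self.conv2(h, data.edge_index)

    logits = GCN()(data)                  # one score vector per node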
