KONECT – The Koblenz Network Collection

Preview:

DESCRIPTION

We present the Koblenz Network Collection (KONECT), a project to collect network datasets in the areas of web science, network science and related areas, as well as provide tools for their analysis. In the cited areas, a surprisingly large number of very heterogeneous data can be modeled as networks and consequently, a unified representation of networks can be used to gain insight into many kinds of problems. Due to the emergence of the World Wide Web in the last decades many such datasets are now openly available. The KONECT project thus has the goal of collecting many diverse network datasets from the Web, and providing a way for their systematic study. The main parts of KONECT are (1)~a collection of over 160 network datasets, consisting of directed, undirected, unipartite, bipartite, weighted, unweighted, signed and temporal networks collected from the Web, (2)~a Matlab toolbox for network analysis and (3)~a website giving a compact overview the various computed statistics and plots. In this paper, we describe KONECT's taxonomy of networks datasets, give an overview of the datasets included, review the supported statistics and plots, and briefly discuss KONECT's role in the area of web science and network science.

Citation preview

KONECTThe Koblenz Network Collection

Jérôme KunegisInstitute for Web Science and Technologies (WeST), University of Koblenz–Landau

With acknowledgments to everyone who has made network datasets available to the public

Jérôme Kunegis KONECT – The Koblenz Network Collection 2

What Is Koblenz?

Jérôme Kunegis KONECT – The Koblenz Network Collection 3

My PhD Thesis

Jérôme Kunegis KONECT – The Koblenz Network Collection 4

The Trick Is…

Jérôme Kunegis KONECT – The Koblenz Network Collection 5

Everything Is a NETWORK !

Jérôme Kunegis KONECT – The Koblenz Network Collection 6

Well, Only Almost Everything Is a Network

Communication

Authorship

Friendship

Interaction

Trust

Co-occurrence

Jérôme Kunegis KONECT – The Koblenz Network Collection 7

A Network Dataset Is Like a Gummi Bear

Jérôme Kunegis KONECT – The Koblenz Network Collection 8

A Network Dataset Is Like a Gummi Bear

Lots of contentto analyse

Evaluate prediction algorithms

Test network models

Test search and recommender

systems

Jérôme Kunegis KONECT – The Koblenz Network Collection 9

When You Have Tested One, You Have Tested All ?!

Jérôme Kunegis KONECT – The Koblenz Network Collection 10

Or Do You?

Jérôme Kunegis KONECT – The Koblenz Network Collection 11

Diversity of Network Datasets

Jérôme Kunegis KONECT – The Koblenz Network Collection 12

Overview

Total: 168 datasets

Jérôme Kunegis KONECT – The Koblenz Network Collection 13

Network Formats

U • Undirected D • Directed B • Bipartite

Jérôme Kunegis KONECT – The Koblenz Network Collection 14

Edge Weight and Multiplicity Types

= • Multiple + • Positive

w > 0

± • Signed

– • Unweighted

w ± 1

* • Rating

★★★★★

** • Multiple Ratings

★★★★★

Jérôme Kunegis KONECT – The Koblenz Network Collection 15

Timestamps

t = 1 t = 2 t = 3

Jérôme Kunegis KONECT – The Koblenz Network Collection 16

Example Dataset

Jérôme Kunegis KONECT – The Koblenz Network Collection 17

Network Comparison

Jérôme Kunegis KONECT – The Koblenz Network Collection 18

Network Comparison: Plots

Jérôme Kunegis KONECT – The Koblenz Network Collection 19

More Plots

Jérôme Kunegis KONECT – The Koblenz Network Collection 20

Download

Jérôme Kunegis KONECT – The Koblenz Network Collection 21

RDF

Jérôme Kunegis KONECT – The Koblenz Network Collection 22

Matlab Toolbox

Jérôme Kunegis KONECT – The Koblenz Network Collection 23

Handbook of Network Analysis

http://konect.uni-koblenz.de/publications

http://konect.uni-koblenz.de/

@KONECTproject

Give Me Datasets!

Recommended