42
HUMANITIES NETWORKED INFRASTRUCTURE (HUNI) JAILBREAKING AUSTRALIA’S CULTURAL DATA

Humanities Networked Infrastructure (HuNI)

Embed Size (px)

DESCRIPTION

A report on the progress of the Humanities Networked Infrastructure Project presented at the 2013 Digital Humanities conference held in Lincoln Nebraska.

Citation preview

Page 1: Humanities Networked Infrastructure (HuNI)

HUMANITIES NETWORKED INFRASTRUCTURE (HUNI)

JAILBREAKING AUSTRALIA’S CULTURAL DATA

Page 2: Humanities Networked Infrastructure (HuNI)

CRICOS Provider Code: 00113B

NATIONAL E-RESEARCH COLLABORATION TOOLS AND RESOURCES (NECTAR)

NeCTAR is a $47 million dollar, Australian Government project, conducted as part of the Super Science initiative and financed by the Education Investment Fund. The University of Melbourne is the lead agent, chosen by the Commonwealth Government.

Page 3: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY PROGRAM

Page 4: Humanities Networked Infrastructure (HuNI)

• Ensure that Australian cultural datasets and the research associated with them become part of the emerging international Linked Open Data environment.

• Enable research enquiries to move easily from: what is? to where is?

• Support the role of annotation and metadata in discovery of new knowledge or the means to elucidate new knowledge

• Position the idea of data as both a subject and object of analysis in humanities

• Contribute to debates around standards for development and implementation

HuNI BROAD BENEFITS

Page 5: Humanities Networked Infrastructure (HuNI)

• Enable humanities researchers to work with cultural datasets more efficiently and effectively, and on a larger scale;

• Encourage the systematic sharing of research data between humanities researchers (including the cultural dataset curators themselves), the community and cultural institutions;

• Encourage a greater level of cross-disciplinary and interdisciplinary research, both within the humanities/creative arts and between the humanities/creative arts and other disciplines, and the wider public;

• Support innovative methodologies such as network analysis, game theory and ‘virtual history’ that rely on large-scale datasets

HUNI: SPECIFIC BENEFITS

Page 6: Humanities Networked Infrastructure (HuNI)

1. Organisational level: the goals and processes of the institutions involved

2. The semantic level: meaning of the exchanged digital resources3. Technical level: implementing data interoperability requires

both data integration and data exchange processes as well as enabling effective use of the data that becomes available

Pasquale Pagano, ‘Data Interoperability’ (GRDI2020)4. Project level: The advent of more complex ‘big humanities’

projects requires multiple and multi-disciplinary personnel which in turn entails the organization of different workflows and expectations: e.g. challenge of developing a comprehensive or consortial approach, common definition of project method etc.

INTEROPERABILITY

Page 7: Humanities Networked Infrastructure (HuNI)

1. A PARTNERSHIP… a Deakin led consortium • Cultural data providers (10) – project co-operators• Humanities software developer (1) – project co-

developers• eResearch organisations (2) – lead development

agencies

Page 8: Humanities Networked Infrastructure (HuNI)

HUNI PARTNER DATASETS

AMHD

MAPCAARPBonzaAFIRCCircus OzAusStage

Media: film, cinema, theatre, newspapers, magazines, advertising, music, live performances

DAAOAustLitAWRADBDoS

Biographical: artists, designers, writers, significant people, scientists, Sydney demographics

EOAS

AUSTLANGMura

Indigenous languages

Page 9: Humanities Networked Infrastructure (HuNI)

AUSTLIT

Page 10: Humanities Networked Infrastructure (HuNI)

ADB

Page 11: Humanities Networked Infrastructure (HuNI)

DAAO

Page 12: Humanities Networked Infrastructure (HuNI)

AUSTLANG

Page 13: Humanities Networked Infrastructure (HuNI)

bonza

Page 14: Humanities Networked Infrastructure (HuNI)

AUSSTAGE

Page 15: Humanities Networked Infrastructure (HuNI)

EOAS

Page 16: Humanities Networked Infrastructure (HuNI)

TUGG

Page 17: Humanities Networked Infrastructure (HuNI)

Welcome to the Cinema and Audiences Research Project (CAARP) database: An online encyclopaedia of cinema-going in Australia.

DataThis site contains information on film screenings and venues in Australia. 430,137 screenings10,256 films1,978 cinemas1,649 companiesFrom 1846 to now

Page 18: Humanities Networked Infrastructure (HuNI)

• NeCTAR investment of $1.33M

• Partner contributions of $480,000

• Partner in-kind contributions amounting to >$1M

A FISCAL COLLABORATION

Page 19: Humanities Networked Infrastructure (HuNI)

COMMUNITY BUILDING• Collated user-stories (20) • Online showcase events – next one is 4th September

2013• Live link to the latest alpha prototype on huni.net.au;

feedback buttons• Wider beta launch at eResearch Australasia in October

2013• Stay up to date through our monthly Newsletter and

blog feed• Follow us on twitter - @HuNIVL

Page 20: Humanities Networked Infrastructure (HuNI)

Information design challenge to build an ontology and use linked data and controlled vocabularies for data to be aligned and related.

• Reading the data. Characteristics of the data determine the ontological components selected and the major “entities” (aka “access points”).

• Identified early as: people, organisations, events, relationships, places, dates, resources, and subjects.

• Components from ontologies already available and being reused or kept in our sights: CIDOC-CRM, FOAF, FRBR, FRBR-OO, BibFrame and PROV-O.

2. INTEGRATING MEANING

Page 21: Humanities Networked Infrastructure (HuNI)

PHASE ONE

Page 22: Humanities Networked Infrastructure (HuNI)

HUNI ONTOLOGY March 2013

Page 23: Humanities Networked Infrastructure (HuNI)

HUNI ONTOLOGY (all classes and object properties)

Page 24: Humanities Networked Infrastructure (HuNI)

ALIGNING ONTOLOGIES

Page 25: Humanities Networked Infrastructure (HuNI)

3. HuNI DATA ARCHITECTURE

Page 26: Humanities Networked Infrastructure (HuNI)

A total of 28 Australian datasets are being harvested for integration into HuNI

• Data gateway components, called HuNI Corbicula, deployed on the NeCTAR Cloud to harvest the XML feed data and transforming it into forms suitable for ingestion into two HuNI data aggregates: a Solr search server [HuNI Data], and a Jena RDF Triple Store [HuNI Linked Data]

DATA INTEGRATION

The harvesting process requires:• Live data feeds

deployed at the partner sites to publish updated partner data as XML

Page 27: Humanities Networked Infrastructure (HuNI)

TWO HUNI DATA AGGREGATES?Solr aggregate RDF aggregate

28

0

7

1

4

2

1

24

0

7

1

4

2

1

6

part

ner

data

set

part

ner

data

set

Page 28: Humanities Networked Infrastructure (HuNI)

TECHNOLOGY STACK

• front-end frameworks - AngularJS and Twitter Bootstrap single page web app

• tools hosting framework - Open Social via Apache Shindig

• back-end framework - SpringMVC via Roo.• layer integration - RESTful web services

Page 29: Humanities Networked Infrastructure (HuNI)

• Search the HuNI Data• Save their search results as a

private collection• Refine their collection through

additional searches• Analyse and annotate their

collection with their own assertions and commentary

• Export their collection for further analysis

• Publish and share their collection and research

RESEARCH ACTIVITIESA researcher with a HuNI account will be able to:

Page 30: Humanities Networked Infrastructure (HuNI)

Scholarly researchers will also be able to perform a “deep search” of the graphs in RDF Triple Store.The large-scale aggregation of Linked Data makes explicit the relationships and connections between related records across all the partner datasets, enabling the researcher to construct more complex semantic queries.

RESEARCH ACTIVITIES 2

Page 31: Humanities Networked Infrastructure (HuNI)

EARLY VLAB PROTOTYPE

Page 32: Humanities Networked Infrastructure (HuNI)
Page 33: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Discovery (part 1)

Page 34: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Discovery (part 2)

Page 35: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Discovery (part 3)

Page 36: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Analysis (part 1)

Page 37: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW – Analysis (part 2)

Page 38: Humanities Networked Infrastructure (HuNI)

VIRTUAL LABORATORY RESEARCHER WORKFLOW: Sharing

Page 39: Humanities Networked Infrastructure (HuNI)

4. THE PROJECT• project director/community liaison (20%)• project manager (100%)• technical coordinator (100%)• information services coordinator (90%)• community engagement (30%)• communication coordinator (20%)• administrative support (20%)• software developer(s)

NeCTAR Directorate

HuNI Steering

Committee

Team HuNI

Technical Working

Group

Expert Advisory

GroupExpert Data

Group

Page 40: Humanities Networked Infrastructure (HuNI)

PROJECT WEBSITE: huni.net.au

Page 41: Humanities Networked Infrastructure (HuNI)

PROJECT WIKI: apidictor.huni.net.au

Page 42: Humanities Networked Infrastructure (HuNI)

HuNI: a virtual laboratory for the humanities

http://huni.net.au/@HuNIVL