
CASE STUDY - Arcitecta


Powering nationwide shared research data project for better research outcomes

The challenge

Data science is essential as research shifts from a single discipline to collaboration across multiple disciplines and institutions.

However, our nationally significant research data assets and large amounts of critical data are either not available or difficult to access. This makes collaboration between research teams virtually impossible.

Collaboration and data are essential to discovery, as there is too much information for any one person to comprehend and manage.

The driver for Mediaflux

Growth in structured and unstructured data is outstripping storage capabilities, driving the need for better managed and more accessible information, particularly for data-intensive research, such as studies into...

The causes of cancer

Genetics and disease

Climate change

Neuroimaging analysis

Particle physics

The humanities and more

The Mediaflux solution

Improving research efficiency by streamlining data workflows

Breaking down barriers between disparate data pools, making data easier to share and re-use, securely

Enhancing collaboration across teams and institutions

Ensuring data integrity, as data is checked by Mediaflux as it is ingested

Simplifying data management

The result

Strengthening Australia’s data-intensive research capabilities with improved:

Research data workflows

Research outcomes

Data management

Collaboration

Enabling researchers to access, analyse and re-use Australian research nationally and internationally

55 petabytes of data

Hundreds of data sets

Eight nodes in eight different locations

CASE STUDY

The project’s aim

Strengthen Australia’s capabilities in data-intensive research and collaboration.

Deliver a shared data infrastructure for nationally significant research data.

Improve research efficiency with better data access and management.

About RDSI

The Research Data Storage Infrastructure Project is a $50m federally funded project, for which the University of Queensland is the lead agent. The RDSI Project, an initiative of the National Collaborative Research Infrastructure Strategy, is funded from the Education Investment Fund under the Super Science (Future Industries) Initiative.

www.rdsi.edu.au

The Research Data Storage Infrastructure Project

© Copyright Arcitecta Pty Ltd 2014 – All rights reserved. Arcitecta, Arcitecta Desktop and CAReHR are trademarks of Arcitecta Pty Ltd: Mediaflux is a registered trademark of Arcitecta Pty Ltd in the U.S.A. and a trademark of Arcitecta Pty Ltd in Australia. All other trademarks are the property of their respective holders.

The challenge

Modern research is characterised by large teams, complex experiments, large datasets, and lengthy data analysis.

Yet there are few tools to manage research data throughout its lifecycle, and none with the comprehensive and extensible features of Mediaflux® (also known as SGI® LiveArc™).

Using the powerful metadata management capabilities of Mediaflux, researchers can more easily collaborate across different data types and repositories, while retaining control of when, how and who can access their data.

Researchers can access and manipulate significant collections of data that were previously unavailable or difficult to access, from the eight RDSI-funded data centres, or Nodes.

Researchers can also search data collections by name, type, owner, date or linked publications, while increasing publication rates and streamlining data workflows.

Primary Nodes are in Adelaide, Brisbane, Canberra, Melbourne, Perth and Sydney, with additional Nodes in Tasmania and Townsville.

Working in collaboration with RDSI and SGI, Arcitecta is delivering scalable and customised data-connected research platforms.

Mediaflux, as the key data management engine for Australia’s Research Data Storage Infrastructure, is providing researchers with shared access to 55 petabytes of raw data.

Using the extensible and flexible distributed data management capabilities of Mediaflux, researchers can collect, store, analyse, and curate any data, in any format, securely, from any of the RDSI Nodes.

About Arcitecta™

Arcitecta’s core product, Mediaflux®, is an operating system for metadata, enabling distributed groups of people to share, manipulate and manage any type of data.

The company’s products are deployed in geospatial, eResearch and eHealth contexts, streamlining data-intensive workflows and simplifying data management for better business outcomes.

www.arcitecta.com

About SGI

SGI is a global leader in high performance solutions for compute, data analytics and data management that enable customers to accelerate time to discovery, innovation, and profitability. Visit www.sgi.com for more information.

The Mediaflux solution

The result

A case study about...