CARL definition Research data are defined here as the factual records (e.g. microarray, numerical and textual records, images and sounds, etc.) used as primary sources for research, and that are commonly accepted in the research community as necessary to validate research findings. Could be survey results, data collected automatically from computer programs, sensors or instruments NB ANDS does not define what they mean by “data” prefer to keep it open Image from ANDS
Data new paradigm
Data is no longer just a by product of researchMay be the starting point of researchData is an assetA deluge of dataWhy now?Some examples…..
Presenter
Presentation Notes
To paraphrase John Wilbanks, Vice President of Science Commons, ‘our capacity to measure, store, analyze, and visualize data is becoming the new reality to which research will have to adapt’ Ref: CARL Economist has a major piece on the data deluge Data, data everywhere. February 2010 Over the next five years, the world will produce more research data than has been created in all of human Why now? digital age data born digital supercomputers
Research Data as Research Productexamples…
The Human Genome project
Hubble telescope data
Linguistics scholars are concentrating on data capture as languages disappear
6
Presenter
Presentation Notes
Research data that is used collaboratively can be central to research Data intensive science
There are more research papers written by “second use” of the research data, than by the use initially proposed Papers that present analysis of HST data to reach a scientific conclusion. GO (General Observation) paper: At least one author was investigator on the General Observation proposal that obtained the data. AR(chive) paper: No overlap between the paper authors and investigators on the GO proposal that obtained the data. GO+AR: Combination of GO data sets with AR data sets.
BIG picture…trends
EResearch
EScience
Research 2.0
Gov 2.0
Open access to data
Presenter
Presentation Notes
What is eResearch? 'eResearch' is a broad term used to describe a set of activities that harness the power of advanced information and communication technologies (ICTs) for research. E-Science US use term cyberinfrastructure Image from ANDS
BIG picture…trends
EResearchresearch involving the collection and
manipulation of data….
1. Research collaboration2. Data management and sharing3. High-performance computing4. Visualisation and haptics (eresearchsa)
Presenter
Presentation Notes
What is eResearch? 'eResearch' is a broad term used to describe a set of activities that harness the power of advanced information and communication technologies (ICTs) for research. (from http://www.eresearchsa.edu.au/) Research collaboration Data management and sharing High-performance computing Visualisation and haptics (ie virtual touch)
BIG picture…trends
Government 2.0– Australia: Declaration of Open Government,
July 2010– Data being shared…
• ABS open data• GA (Geoscience Australia) data
– Change to Freedom of Information (FOI)
Presenter
Presentation Notes
What is eResearch? 'eResearch' is a broad term used to describe a set of activities that harness the power of advanced information and communication technologies (ICTs) for research. Research collaboration Data management and sharing High-performance computing Visualisation and haptics
Data visualisation…trends
Presenter
Presentation Notes
Visualisation of data on free bike usage in London Source http://www.oobrien.com/vis/bikes/
Data visualisation…trends
Presenter
Presentation Notes
Visualisation of data on phone calls to 311 in New York Source: Wired. November 2010
Open AccessOpen access to publicly funded
researchResearch funding bodies starting
to demand thisOpen access
–Institutional repositories–Data repositories
Presenter
Presentation Notes
Movement towards Open access Moral argument that: Open access to publicly funded research should be mandatory Scholarly literature used to be locked away, no longer Now, more pressing to have open access Funders demand this now with digital content availability Open access repository of published research outputs Open access repository of data
Starting point, the Code….
Australian Code for the Responsible Conduct of Research– S1 General principles of responsible
research– S2 Management of research data and
primary materials
Presenter
Presentation Notes
Australian Code for the Responsible Conduct of Research The central framework governing research integrity at the national level is the Australian Code for the Responsible Conduct of Research. The code has been developed jointly by the National Health and Medical Research Council (NHMRC), the Australian Research Council (ARC) and Universities Australia and has relevance across all research disciplines.��The code guides institutions and researchers in the responsible conduct of research and also explains the rights and responsibilities of researchers who witness research misconduct. It includes guidance on: How to manage research data and materials; How to publish and disseminate research findings, including proper attribution of authorship, How to conduct effective peer review; and How to manage conflicts of interest.
The Code = responsibilities
Responsible conduct of research– proper management of research data– retention of research data
Retaining research data important – may be all that remains at the end of the
research project
What is ANDS?
Australian National Data Service
Presenter
Presentation Notes
Australian research response to data management ANDS was established in 2008 under the NCRIS Platforms for Collaboration with initial funding of $24m
Presenter
Presentation Notes
ANDS funding ANDS is funded by the Australian Commonwealth Government's Department of Innovation, Industry, Science and Research (DIISR). The funding has been provided through the National Collaborative Research Infrastructure Strategy (NCRIS) as part of the Platforms for Collaboration Investment Plan.
ANDS’ goalstagline: more Australian researchers reusing
research data more often1. influence national policy in data
management in the Australian research community
2. inform best practice for curation of data 3. transform the disparate collections of
research data around Australia into a cohesive collection of research resources
Australian Research Data Commons Goal
From Data being:• Unmanaged• Unconnected• Unfindable• Not reusable
To form a nationally significant research resource
To Data being:• Managed• Connected• Findable• Reusable• Collected
Research Data Australia
Discovery interface– “Window on the Commons”
Data collections produced by or relevant to Australian researchers Makes research data collections visibleYou can see what has been done already
ANDS Projects
Seeding the Commons projects
Presenter
Presentation Notes
Research Data Commons Projects
Seeding the Commons projects
Create infrastructure within institutions
– collect and transform metadata about collections
– publish to Australian Research Data Commons (ARDC)
Opportunities for researchers Enable researchers to publish their data
Enable the institutions to publicise its research
Help build a data commons
Presenter
Presentation Notes
Australian researchers to have a comprehensive access to Australia's research data ANDS is working with both ThomsonReuters and Elsevier to investigate the feasibility of tracking and recording of data set use through DOIs, and to make that information available through Web of Science and Scopus. Both of these databases are used extensively world-wide as part of research assessment activities
Presenter
Presentation Notes
Image from ANDS
Data needs to be organised..how?RIF-CS schemaRIF-CS compliant datasets
ARD Commons Metadata
Roles for libraries
Contact and outreach to researchersAwareness raisingInformation gathering about available dataAdvice on metadata, descriptions, disposal
policy and sustainabilityPossible use of institutional repository for
holding descriptions or connecting publications to dataTraining and support
New roles for librarians
data librarian, metadata librarianresearch data librarianembedded librariandata manager research support officer data scientist data curator
QUT data support structures
Presenter
Presentation Notes
QUT, Monash and others have data management websites
QUT data support structures
Presenter
Presentation Notes
QUT, Monash and others have data management websites
This project is supported by the Australian National Data Service (ANDS)
ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy Program and the Education