9
Challenges and opportunities to open access to scientific documents and data Antonia Ferrer Sapena Tony Hernández-Pérez Maredata Research Group (maredata.net) Workshop on Open Data and Language Processing Technologies: An opportunity not to be missed BEST PRACTICES

Maredata iodc v2

Embed Size (px)

Citation preview

Challenges and opportunities to open access to scientific

documents and data

Antonia Ferrer Sapena

Tony Hernández-Pérez

Maredata Research Group (maredata.net)

Workshop on Open Data and Language Processing Technologies: An opportunity not to be missed

BEST PRACTICES

Proyecto CSO2015-71867-REDT financiado por:

Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

¿What’s Maredata?

Sinergiesbetween res. groups Identify research groups

producing Research data

Liasons w SpecialInterest Groups

Libraries, funders

Recommendations

Collaborative Work

Data, data, data everywhere

¿Experience transferable?

Promote open research data

Mª Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

From Open Access toOpen Research Data

Advantages & Obstacles forOpen Research Data

Advantages of Open Research Data• Increasing the efficiency of research.

• Promoting scholarly rigour and enhancements to the quality of research.

• Enhancing visibility and scope for engagement.

• Enabling researchers to ask new research questions.

• Enhancing collaboration and community-building.

• Increasing the economic and social impact of research.

Obstacles for researchers• Lack of evidence of benefits and rewards.

• Lack of skills, time and other resources.

• Cultures of independence and competition.

• Concerns about quality. ¿Peer reviewed of data? Fear of data misread or misapplied (methodological or without key contextual information).

• Ethical, legal and other restrictions on accessibility (anonymization, Licenses…)

Mª Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

Top-DownApproach

Bottom-upApproach

OPEN ACCESS

RESEARCH DATA

JOURNALS

REPOSITORIESInstitutional

Thematic

FundersEuropean C.NSF – NIH…

Sci communityUniversities

Mª Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

Lessons learnt and to learnfor a national strategy on RD

“Fragmentation, duplication of efforts, isolation of small research groups put at risks the competitive advantage”

Federated or central thematic & harvester,

institutional, mandated, rewarded

e.Infrastructre with EOSC& interoperable

Mª Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

ESFRI Roadmap European Strategy Forum on Research Infrastructures

Landmarks

Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

Infrastructures for LanguageProcessing Technologies

• Language resources (corpora, lexical, linguistic data, audio)• Linguistic tools (concordance, clusters, keywords…)

http://linguistlist.org/sp/GetWRListings.cfm?wrtypeid=2• Not only for translations nor only for linguists• Text and data mining (TDM)

• Sentiment analysis (extracted from social media)• Script analysis (extracted from tv or cinema scripts)• Discourse analysis (extracted from all types of discourses)

Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

http://www.consorciomadrono.es/pagoda/index2.php

Tools – Data literacy

AntConc

Stanford CoreNLP

DATA REPOSITORY1 OR N

23 things RDAMetadata

PrivacyLicences

PreservationCiting DataCommunityof practices

Research Data & Libraries

Antonia Ferrer y Tony Hernández (Maredata) – 5 de octubre de 2016

@maredataproject

http://www.maredata.net