43
04/07/22 1 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal Proposal : : SkTech.RC/IT/Madnick SkTech.RC/IT/Madnick

19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

Embed Size (px)

Citation preview

Page 1: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 1

Semantic WEB Scientific Data Integration

Vladimir SerebryakovComputing Centre

of the Russian Academy of Science

ProposalProposal: : SkTech.RC/IT/MadnickSkTech.RC/IT/Madnick

Page 2: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 2

Table of Contents• What is the Computing Centre of the Russian

Academy of Science (CCRAS)• Unified Information space of the Russian

Academy of Science• Information system “Research institution”• Its extensions

– LibMeta– GeoMeta– Linked Open Data

Page 3: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 3

What is the Computing Centre of the Russian Academy of Science (CCRAS)

Directions of study– Numerical methods– Informatics

• Pattern recognition

• Information systems

• Computer algebra

Page 4: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 4

The Unified Research Information Space (URIS)

The Unified Research Information Space (URIS) of the Russian Academy of Sciences (RAS) is an integrated information space of distributed and local digital resources of RAS organizations and hardware and software tools that support its functionality and control.

Page 5: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 5

The main tasks of URIS RAS• Development of a unified metadata model on the basis of modern

technologies and implementation of a global search mechanisms on its;

• Active scientific communications;

• Building of distributed catalogs of scientific information;

• Information support of research study;

• Development and publication of corporate standards;

• Development and implementation of a software package “The RAS Basic Institution”;

• Construction of points for access these information (portals);

• Implementation of access and integration to information resources of RAS organizations;

• Security support;

• Interconnection with other information systems.

Page 6: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 6

URIS architecture

Institution RAS division

Access Access Access

RAS integrated system of information resources

Access control

Other information systems National

o Science o Education o

Foreing information systems

Regional information systems

Application field information systems

Physics Geology Economy

Division node: Metainformation Search indices

Institition node: Metainformation Search indices

Scientific information resources

Library

Library node: Metainformation Search indices

Institution

Institition node: Metainformation Search indices

Institution

Institition node: Metainformation Search indices

Institution

Institition node: Metainformation Search indices

Page 7: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 7

RAS Institution Library Publishing Digital library Administrative departments Scientific secretary Data basis of scientific results Publications Scientific reports Innovations Conferences Learning

RAS Institution RAS Institution

RAS dividion

Administratuve information

Administratuve information

RAS Presidium

Access Access Access

RAS Integrated system of information resources

Access control

Other information systems National

o Science o Education o

Foreing information systems

RAS regional information systems

Application information systems

Page 8: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 8

URIS Information Bus

The basis of the URIS RAS is an Information Bus that is a set of hardware, software and administrative tools that support:

• Resources and services supplement• Security• Metadata actualization• Data integration• Global search.

Page 9: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 9

URIS Information bus: architecture

Metadata Metadata

RAS Devision

RAS Institution

Regional IS

External IS

Devision IS

Search Security

RAS URIS Information Bus

Services Information resorces

Regional IS

External IS

Devision IS

Page 10: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 10

•Organizatios

•Persons

•Publications

•Projects

•Spatial data

•Application data

URIS Information bus: resources

Page 11: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 11

Technologies

The information model of URIS RAS is based on Semantic Web – RDF, RDFS, OWL ontology of scientific information. This includes:

• Scientific activity, in particular projects as a process, conferences, seminars etc.

• Participants of Scientific activity, like persons, working groups, organizations etc.

• Results of Scientific activity, like data bases, software projects, innovations etc.

• Documents and publications, like papers, dissertations etc

Page 12: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 14

URIS Metadata requirements

•Include basic resource types•Provide access to resources•Provide extensibility•Provide data integration•Provide identification•Provide searching in distributed environment•Use Semantic Web approach.•Provide interoperobility

Page 13: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 15

URIS Metadata standards

Semantic Web – RDF, RDFS, OWL DCMI - Dublin Core Metadata Initiative (dublincore.org)

PRISM - Publishing Requirements for Industry Standard Metadata (Adobe,…)

AGLS Metadata Standard

vCard – “visit card” in RDF.

FOAF open initiative Friend Of A Friend (personal information) BIBLINK, bibTeX, Math-Net, UKOLN CLD …

CERIF 2000, MARC и RUSMARC, CIDOC …

Page 14: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 16

The software package“RAS Institution”

Page 15: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 17

RAS institutions information tasks

• Inner administrative tasks• Institution as a research RAS

organization• Support of a research process• Public representation

Page 16: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 18

The software package“RAS Institution”

The software package “The RAS Institution” is intended to supply institutions with a modern information system that supports internal requirements (publication of scientific information, administrative processes and information etc.) from one hand, and external ones (representation of the information in URIS RAS and Internet) from another hand. It includes:

Infrastructure services that supports

•Data storage

•Global identification

•Data exchange and replication

•Security

•Indexing and search.

Page 17: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 19

The software package“RAS Institution”

Base components include subsystems

•Administrative directory

•Publications

•Projects.

•Interaction components

•News

•Forums

•Private communication

•Application components

•Publishing department

•Library

•Electronic library

•Library of dissertations.

Page 18: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 20

“RAS institution” applications

• Portal RAS• RAS organizations information

systems• Thematic information systems • Bridges

Page 19: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 21URIS Portal

Page 20: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 22RAS Portal

Page 21: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 23Division’s of Mathematics portal

Page 22: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 24Moscow State University dept’s of applied mathematics portal

Page 23: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 25RAS Institution’s of America and Canada portal

Page 24: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 26

LibMeta• Requirements

– Integration into URIS– Distributed environment– International standards

• OAIS

• Dublin Core

• CIDOC-CRM

• OAI-PMH

Page 25: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 27

LibMeta Fuctionality• User

– Searching• Full text• Attribute• Directories

– Navigation– Accessing

• Administrator– Content control– Rights control– Directories management

• OAI PMH metadata exchange

Page 26: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 28

LibMeta Profile

URIS basic Profile -Kernel -Person -Project -Organization -Publication

LibMeta Profile -Full texts (scanned) -Contents -Multimedia objects -Museum objects -Collections

URIS Library extension -Resumed publication -Publication collections -Bibliography -Series

Page 27: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 29RAS scientific heritage digital library portal

Page 28: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 30

GeoMeta portal

Page 29: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 31

Purposes– Metadata support (keeping and

editing)– Metadata harvesting– Integration of scientific spatial data– Searching of spatial data and

services– Spatial data visualisation (maps,

pictuers etc)

Page 30: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 32

Architecture

• Implementation is based on RAS Institution

• Based on ISO 19115/19139 standards

Page 31: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 33

GeoMeta Functionality• In addition to main URIS resources

(person, publication, organization, project) the system supports spatial data

• Main functions:– Resource cataloging, harvesting, loading,

searching;– Keeping spatial data in a repository and

access to these data;– Access via standard protocols (WFS, WMS);– Data (maps) visualization;– Directories management.

Page 32: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 34

http://geometa.ru

Page 33: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 35

Page 34: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 36

Protected Sites Information System

Page 35: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 37

Information system on protected sites

• Is based on GeoMeta

• Functionality– Data model is based on ISO ISO

19101 Reference model, 9109 Rules for Application Schema, INSPIRE

– Loading data– Navigation, searching– Queries

Page 36: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 38

ProposalsThe integration of scientific data in the

common scientific information space, the integration of this space with a distributed system of scientific digital libraries.

The challenge is to develop formalisms, methods of implementation and a pilot implementation, particularly:

Page 37: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 39

Proposals

• Ontologies for some scientific domains. Science is big, so the claim to universal coverage is not realistic. Therefore, we should focus on specific subject areas, such as spatial data, for which there are well-developed standards for describing and organizing data.

Page 38: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 40

Proposals

The formal means of data integration based on domain ontologies. The integration in particular, should include data binding, i.e. linkages based on the data identification.

Page 39: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 41

Proposals

• Creating key information (metadata) in storing it in special (data) centers, in particular, information about the relationships between data.

Page 40: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 42

Proposals

• Establishment of protocols that work with distributed information, in particular, searching.

Page 41: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 43

Proposals

• Development of means for extracting information from sources and loading appropriate meta information into the global environment (storage centers).

Page 42: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 44

Proposals

• Development of user interfaces in the format of digital libraries, ie, digital libraries, working with the metadata of the global environment and having the ability to extract data from sources.

Page 43: 19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick

21/04/23 45

Thank you!