35
disgenet2r The DisGeNET R package Núria Queralt Rosinach Integrave Biomedical Informacs Group (IBI) Research Programme on Biomedical Informacs (GRIB) Hospital del Mar Research Instute (IMIM) Pompeu Fabra University (UPF) Barcelona

disgenet2r: The DisGeNET R package

Embed Size (px)

Citation preview

Page 1: disgenet2r: The DisGeNET R package

disgenet2rThe DisGeNET R package

Núria Queralt RosinachIntegrative Biomedical Informatics Group (IBI)

Research Programme on Biomedical Informatics (GRIB)Hospital del Mar Research Institute (IMIM)

Pompeu Fabra University (UPF) Barcelona

Page 2: disgenet2r: The DisGeNET R package

DisGeNET

Page 3: disgenet2r: The DisGeNET R package

http://www.disgenet.org/

• Piñero et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database (2015) Vol. 2015: article ID bav028, (2015)

• Knowledge platform on human gene-disease associations (GDAs)

Integrates information from the literature (text mining) and expert-curated

databases

• All disease areas

• Supporting evidence

• Analysis tools

Page 4: disgenet2r: The DisGeNET R package

DisGeNET – 2016 release (v4.0)

New sources

Updated ontology

New annotation

New indexes

New mappings

New RDF and nanopublications distributions

Page 5: disgenet2r: The DisGeNET R package

New Sources

* diseases, disease groups and phenotypes

Page 6: disgenet2r: The DisGeNET R package

New Sources

* diseases, disease groups and phenotypes

BeFree is the major source

All sources updated

Page 7: disgenet2r: The DisGeNET R package

Data Model

Gene-DiseaseAssociation

Disease Gene

Gene-DiseaseAssociation

Ontology-based integration ID normalization Use of standards

Page 8: disgenet2r: The DisGeNET R package

Data Model

Gene-DiseaseAssociation

Disease Gene

EvidenceScore

Gene-DiseaseAssociation

SourcePubMed Sentence SNP

Ontology-based integration ID normalization Use of standards

Page 9: disgenet2r: The DisGeNET R package

DisGeNET ontology

Gene Association Disease

PO SP O

http://semanticscience.org/ontology/sio.owl

DisGeNET Association Type Ontology

rdf:type

Page 10: disgenet2r: The DisGeNET R package

DisGeNET ontologyhttp://semanticscience.org/ontology/sio.owl

DisGeNET Association Type Ontology

Page 11: disgenet2r: The DisGeNET R package

New Annotation

Gene-DiseaseAssociation

Disease Gene

Gene-DiseaseAssociation

MeSH ClassUMLS STY DO Class HPO Class

Disease Ontology (DO) Human Phenotype Ontology (HPO)

Page 12: disgenet2r: The DisGeNET R package

New Indexes

Gene-DiseaseAssociation

Disease Gene

Gene-DiseaseAssociation

Protein PathwayPANTHER

ClassDisease

SpecificityPleiotropy

DisGeNET Disease Specificity DisGeNET Pleiotropy

Page 13: disgenet2r: The DisGeNET R package

New Mappings

COVERAGE

Experimental Factor Ontoloty (EFO) <= BioHackathon 2015

Disease

Page 14: disgenet2r: The DisGeNET R package

New RDF and Nanopublications datasets• RDF

Metadata description (W3C HCLS) Interlinking

• Trusty Nanopublications

• Access• Download Data Dump • SPARQL Endpoint• Faceted Browser• Open PHACTS

• Nanopublication Network

• FAIR (ELIXIR and NIH)

http://lod-cloud.net/; Aug 2014DisGeNET - Tutorial

Page 15: disgenet2r: The DisGeNET R package

Tools for exploration

Page 16: disgenet2r: The DisGeNET R package

disgenet2r

Page 17: disgenet2r: The DisGeNET R package

disgenet2r

What is it? R package To query and expand DisGeNET data To analyze and visualize the results within the

powerful R framework To engage with the R/Bioconductor community Launched within the release of DisGeNET v4.0

(April, 2016)

Page 18: disgenet2r: The DisGeNET R package

disgenet2r

How is it implemented? R programming language S4 Object System Free open source To be added to the Bioconductor software project Data

Query: DisGeNET Expand: DisGeNET-RDF

Page 19: disgenet2r: The DisGeNET R package

disgenet2r

Who is developing it? DisGeNET project

The IBI Lab, GRIB-IMIM-UPF; Barcelona http://ibi.imim.es/

Developers Alba Gutierrez-Sacristan, PhD student Janet Pinero, PhD Nuria Queralt-Rosinach, PhD Emilio Centeno, Bioinformatician Laura I. Furlong, PhD (PI)

Maintainer: Alba Gutierrez-Sacristan Contact: Laura Furlong, [email protected] BioHackathon contact: Nuria Queralt (speaker),

[email protected]

Page 20: disgenet2r: The DisGeNET R package

disgenet2r

Why is it developed? New tool on Bioconductor to analyze high-

throughput genomics data Interaction with other R/Bioconductor packages

AtlasRDF, RpathVisio, DOSE,... Integration in workflows

KNIME

Page 21: disgenet2r: The DisGeNET R package

disgenet2r

Where to find it? https://bitbucket.org/ibi_group/disgenet2r Bitbucket repository used for package distribution

and testing until it is ready to be published in Bioconductor

Please test it! Feedback will be very welcome

Page 22: disgenet2r: The DisGeNET R package

disgenet2r - Functions

Query Gene-Disease Associations Query Variant-Disease Associations Query Disease-Phenotype Associations Query Disease-Disease Associations Query DisGeNET in the Linked Open Data

Query federation with WikiPathways and ChEMBL More to be added… + Visualization funcionalities

Page 23: disgenet2r: The DisGeNET R package

disgenet2r – Functions and Visualization

Query Gene-Disease Associations By Gene(s) or by Disease(s) Filters: database and score Visualization: network and heatmap

Page 24: disgenet2r: The DisGeNET R package

disgenet2r – Functions and Visualization

Query Gene-Disease Associations Visualization: grouping by class

MeSH disease class PANTHER protein class

Page 25: disgenet2r: The DisGeNET R package

disgenet2r - Functions and Visualization

Query Variant-Disease Associations

Page 26: disgenet2r: The DisGeNET R package

disgenet2r – Functions and Visualization

Query Disease-Disease Associations By disease(s)

Disease-Disease Network Comorbidity Network

Page 27: disgenet2r: The DisGeNET R package

disgenet2r – Functions and Visualization

Query Disease-Disease Associations By disease(s)

Disease-Disease Network

Page 28: disgenet2r: The DisGeNET R package

disgenet2r – Functions and Visualization

Query Disease-Disease Associations By disease(s)

Comorbidity Network

Page 29: disgenet2r: The DisGeNET R package

disgenet2r – Functions and Visualization

Query Disease-Disease Associations By disease(s)

Comorbidity Network

Page 30: disgenet2r: The DisGeNET R package

disgenet2r - Functions from RDF

IDs and URIs Query Disease-Phenotype Associations

disease2phenotype or phenotype2disease

Query DisGeNET in the Linked Open Data Query federation with WikiPathways and ChEMBL

disease2pathway or pathway2disase disease2compound or compound2disease

Disease Mappings UMLS to other ontologies and viceversa

Ontologies: MeSH, OMIM, ORPHANET, DO, ICD9, EFO, NCIT, DECIPHER, HPO

Page 31: disgenet2r: The DisGeNET R package

ANALYSISANALYSIS

KNOWLEDGE DISCOVERY

ACTIONABLEINFORMATION

Evidence

• Which genes are associated to Marfan syndrome?

• Which disease genes have approved drugs annotated?

• Which disease genes have differential expression?

• Which disease genes share a pathway?

• Is there genetic variation related to the MECP2 and Rett Syndrome association?

• What evidence supports the association between APP gene and Alzheimer Disease?

• Which genes and evidence support the comorbidity between Chronic Kidney disease and Diabetes Mellitus, Type 2?

Research Questions

Page 32: disgenet2r: The DisGeNET R package

Availability

● DisGeNET

http://www.disgenet.org

● disgenet2r

https://bitbucket.org/ibi_group/disgenet2r

● Open PHACTS, OpenLifeData, Pubannotation, FAIR data port (ELIXIR)

Page 33: disgenet2r: The DisGeNET R package

AcknowledgmentsIBI Group

Alba Gutiérrez-SacristánÀlex BravoAngela LeisEmilio CentenoJanet PiñeroNúria Queralt RosinachSantiago de la PenaAlexia GiannoulaMiguel A. MayerLaura I. FurlongFerran Sanz

Special thanksMichel DumontierSimon JuppNick JutyTobias KuhnandDisGeNET users!!!

Page 34: disgenet2r: The DisGeNET R package

Especially

OrganizersToshiaki KatayamaShin KawanoShuichi KawashimaJin-Dong KimYuji KoharaMari MinowaHiroyuki Mishima

Yuki MoriyaToshihisa TakagiToshiaki TokimatsuHongyan WuAtsuko YamaguchiYasunori Yamamoto

Page 35: disgenet2r: The DisGeNET R package

Thanks for your attention!Questions are welcome!