Swan River foreshore, Perth, Western Australia University of Western Australia Biomedical, Biomolecular and Chemical Sciences

Swan River foreshore, Perth, Western Australia

University of Western Australia Biomedical, Biomolecular and Chemical Sciences

Ian Small Murray Badger David Day Harvey Millar

Steve Smith Barry Pogson Jim Whelan




SUBA: SUBcellular location database for Arabidopsis proteins

Sandra Tanz and Ian Castleden

4th March 2011

Why protein localisation?

• Contributes towards the understanding of protein function and of biological inter-relationships, i.e. only proteins in the same location can interact.

• Separate subcellular locations often represent distinct cellular environments: proteins share similar attributes and play roles in defining the function of a subcellular compartment.

• To build hypotheses or models: large-scale phenotyping screens, microarray experiments and protein-protein interaction assays rely on protein localisation information.

How to localise proteins?

• Prediction
• In vitro uptake
• In vivo (GFP)
• Enzyme activity measurements
• Western blot
• Immunogold labeling
• Subcellular proteomics (MS)
• Protein-protein interaction

Images modified from Millar et al., 2009

SUBA: SUBcellular location database for Arabidopsis proteins

What does SUBA document?

Venn diagram of experimental localisations:
GFP (2135): 1193 GFP only, 942 shared with MS
MS (6398): 5456 MS only, 942 shared with GFP

                                               SUBA II (2007)   SUBA III (2011)
Combined sub-location data                     250’719          1’022’040
Calls by PPI                                   0                6673
Calls by experiments (GFP, MS)                 8273             19’528
Distinct proteins localised by GFP and/or MS   4531             8533
Bioinformatic predictions by                   10 predictors    24 predictors


Data mining

• Search of the NCBI PubMed (Medline) and Entrez (GenBank) databases using keywords

• Alert via Email

• Search each publication to extract localisation information, giving fully curated data
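The keyword search against PubMed can be scripted through the NCBI E-utilities ESearch endpoint. The sketch below only builds the request URL; the function name and the keywords are illustrative assumptions, not the query SUBA actually uses.

```python
from urllib.parse import urlencode

# NCBI E-utilities ESearch endpoint (returns PubMed IDs matching a query).
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_pubmed_search_url(keywords, retmax=20):
    """Return an ESearch URL for PubMed records matching all keywords.

    Hypothetical helper: the keyword list is an example, not SUBA's query.
    """
    term = " AND ".join(keywords)
    params = {"db": "pubmed", "term": term, "retmax": retmax, "retmode": "xml"}
    return ESEARCH_URL + "?" + urlencode(params)


url = build_pubmed_search_url(["Arabidopsis", "subcellular", "GFP"])
print(url)
```

Fetching the URL returns an XML list of PubMed IDs, which a curator can then work through paper by paper.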

SUBA III interface http://suba.plantenergy.uwa.edu.au/

SUBA III flatfile

Analysis of SUBA III data – on the way…

Do data become more or less consistent over time?

Experimental data (MS vs GFP)
• How reliable are experimental localisation data? Has the overlap of data changed with increasing data sets?

How reliable are GFP localisation data?

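Total GFP localisations confirmed by MS
GFP (2554): 1844 GFP only, 710 shared with MS
MS (9016): 8306 MS only, 710 shared with GFP

Total GFP localisations disputed by MS
GFP (1844): 1386 GFP only, 458 shared with MS
MS (74172): 73714 MS only, 458 shared with GFP

1386 neither confirmed nor disputed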
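The counts on this slide can be checked with a few lines of arithmetic. The sketch assumes the reading that 710 of the 2554 GFP localisations are confirmed by MS, 458 of the remaining 1844 are disputed, and 1386 are neither; the variable names are illustrative.

```python
# Counts read from the two Venn diagrams on this slide.
gfp_total = 2554   # all GFP localisations
confirmed = 710    # GFP localisations also supported by MS
disputed = 458     # GFP localisations contradicted by MS
unresolved = 1386  # neither confirmed nor disputed

# The second diagram starts from the GFP localisations left after confirmation.
remaining = gfp_total - confirmed
assert remaining == 1844
assert disputed + unresolved == remaining

# Share of all GFP localisations in each category.
shares = {
    "confirmed": confirmed / gfp_total,
    "disputed": disputed / gfp_total,
    "unresolved": unresolved / gfp_total,
}
for label, share in shares.items():
    print(f"{label}: {share:.1%}")
```

On these numbers roughly 28% of GFP localisations are confirmed by MS, about 18% are disputed, and a little over half have no MS evidence either way.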

Analysis of SUBA III data – on the way…

Do data become more or less consistent over time?

Experimental data (MS vs GFP)
• How reliable are experimental localisation data? Has the overlap of data changed with increasing data sets?
• Does evidence for multiple locations mean the protein is dual targeted/dynamic or is it a false positive?

Prediction vs experimental data
• How reliable are predictors today?

PPI data
• What do PPI data tell us about sub-cellular location?
• Organellar proteome: Can we discover novel organellar proteins?

SUBA under the hood

• Why a Web interface?
• GeneInvestigator, Mapman
• AHM chemicals (Apache JPA)
• For the foreseeable future databases are going to be “Web” based (HTTP, JavaScript, HTML, CSS)
• Need to be maintained by a minimum number of developers (i.e. one!)

SUBA Tables (predictors)

SUBA Tables (“original” sources) http://www.ce4csb.org/amigo/

SUBA Tables (publications)

http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=pubmed&retmode=xml&id=18453549
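An EFetch URL like this one returns the publication record as XML, which can be turned into a row for a publications table. The sketch below rebuilds the URL and parses a hand-written stand-in response; the sample XML, its placeholder title and the helper names are assumptions for illustration, not the real record or SUBA's loader.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

EFETCH_URL = "http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_url(pmid):
    """Build the EFetch URL for one PubMed ID, as shown on the slide."""
    return EFETCH_URL + "?" + urlencode({"db": "pubmed", "retmode": "xml", "id": pmid})


# Hand-written stand-in for an EFetch response. The title is a placeholder,
# NOT the real record behind this PubMed ID.
SAMPLE_XML = """<PubmedArticleSet>
  <PubmedArticle>
    <MedlineCitation>
      <PMID>18453549</PMID>
      <Article>
        <ArticleTitle>Placeholder title</ArticleTitle>
      </Article>
    </MedlineCitation>
  </PubmedArticle>
</PubmedArticleSet>"""


def parse_publications(xml_text):
    """Extract (pubmed id, title) records from PubMed article XML."""
    root = ET.fromstring(xml_text)
    records = []
    for article in root.iter("PubmedArticle"):
        records.append({
            "pubmed": int(article.findtext("MedlineCitation/PMID")),
            "title": article.findtext("MedlineCitation/Article/ArticleTitle"),
        })
    return records


print(efetch_url(18453549))
print(parse_publications(SAMPLE_XML))
```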

SUBA Tables (automation)

Julian Tonti-Filippini

Why Bother?

SELECT suba3.suba3.*, suba3.src_ppi_1.*
FROM suba3.suba3
LEFT OUTER JOIN suba3.src_ppi AS src_ppi_1
  ON suba3.suba3.locus = src_ppi_1.`locusA`
WHERE EXISTS (
  SELECT 1
  FROM suba3.src_ppi
  WHERE suba3.suba3.locus = suba3.src_ppi.`locusA`
    AND suba3.src_ppi.`locusB` IN ('AT3G62420.1')
)

“denormalisation” src_msms
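The shape of this query is easier to see on a toy database: the EXISTS clause selects loci with at least one interaction partner in the list, and the outer join then returns all interaction rows for those loci. The sketch uses an in-memory SQLite database with simplified table names and made-up loci (only AT3G62420.1 comes from the slide), so it illustrates the pattern rather than the SUBA schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE suba3 (locus TEXT PRIMARY KEY);
CREATE TABLE src_ppi (locusA TEXT, locusB TEXT);
-- Hypothetical loci for illustration only.
INSERT INTO suba3 VALUES ('AT1G00001.1'), ('AT1G00002.1'), ('AT1G00003.1');
INSERT INTO src_ppi VALUES
  ('AT1G00001.1', 'AT3G62420.1'),
  ('AT1G00001.1', 'AT1G00003.1'),
  ('AT1G00002.1', 'AT1G00003.1');
""")

QUERY = """
SELECT suba3.locus, src_ppi_1.locusB
FROM suba3
LEFT OUTER JOIN src_ppi AS src_ppi_1 ON suba3.locus = src_ppi_1.locusA
WHERE EXISTS (
    SELECT 1 FROM src_ppi
    WHERE suba3.locus = src_ppi.locusA AND src_ppi.locusB IN (?)
)
ORDER BY suba3.locus, src_ppi_1.locusB
"""

# Only AT1G00001.1 interacts with the target, but both of its
# interaction rows come back through the outer join.
rows = conn.execute(QUERY, ("AT3G62420.1",)).fetchall()
print(rows)
```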


Suzanne M. Embury and Peter M.D. Gray

Computational Systems Biology Centre of Excellence

@suba.json
def query(filter, offset=0, limit=1000):
    return Session().query(Suba3).filter(json2sqla(filter))\

{success: True, result: [
  { locus: 'AT1G54321.1', mwt: 81454, ….
    ppi: [{locusA: 'AT1G54321.1', locusB: 'AT1G04234.1', pubmed: 14567845}]},
  { locus: 'AT1G63021.1', mwt: 91454, ….
    ppi: [{locusA: 'AT1G63021.1', locusB: 'AT1G04234.1', pubmed: 34567767}]},
  …
]}
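One way to get from flat joined rows to the nested response shown here is to group the interaction rows under their locus and wrap the result in a success envelope. The sketch below uses only the standard library; `json_envelope` is a hypothetical stand-in for the `@suba.json` decorator, and the row layout is an assumption based on the fields visible in the response.

```python
import json
from functools import wraps


def json_envelope(func):
    """Hypothetical stand-in for @suba.json: wrap the result in a JSON envelope."""
    @wraps(func)
    def wrapper(*args, **kwargs):
        try:
            return json.dumps({"success": True, "result": func(*args, **kwargs)})
        except Exception as exc:
            return json.dumps({"success": False, "error": str(exc)})
    return wrapper


# Flat rows as an outer join might return them: (locus, mwt, locusB, pubmed).
# The values are the ones shown in the example response.
ROWS = [
    ("AT1G54321.1", 81454, "AT1G04234.1", 14567845),
    ("AT1G63021.1", 91454, "AT1G04234.1", 34567767),
]


@json_envelope
def query(rows, offset=0, limit=1000):
    """Group interaction rows under each locus, then apply paging."""
    nested = {}
    for locus, mwt, locus_b, pubmed in rows:
        record = nested.setdefault(locus, {"locus": locus, "mwt": mwt, "ppi": []})
        if locus_b is not None:  # an outer join yields NULL when there is no PPI
            record["ppi"].append({"locusA": locus, "locusB": locus_b, "pubmed": pubmed})
    return list(nested.values())[offset:offset + limit]


response = json.loads(query(ROWS))
print(response["success"], len(response["result"]))
```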

(Near) Future

• The large number of predictors often give conflicting predictions… what to do?
• Bayesian analysis…
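As a rough illustration of what a Bayesian treatment of conflicting predictors could look like, the sketch below combines calls with a naive Bayes rule. Everything in it is an assumption for illustration: the predictor names, the 0.8 accuracies, the flat prior, the independence between predictors, and the error model in which wrong calls are spread evenly over the other locations. It is not the analysis planned for SUBA.

```python
def combine_calls(calls, accuracy, locations):
    """Posterior probability of each location given one call per predictor.

    Assumes independent predictors, a flat prior, and that a wrong call is
    equally likely to land on any of the other locations.
    """
    n_other = len(locations) - 1
    scores = {}
    for loc in locations:
        score = 1.0 / len(locations)  # flat prior
        for predictor, call in calls.items():
            acc = accuracy[predictor]
            score *= acc if call == loc else (1.0 - acc) / n_other
        scores[loc] = score
    total = sum(scores.values())
    return {loc: s / total for loc, s in scores.items()}


LOCATIONS = ["mitochondrion", "plastid", "nucleus"]
# Hypothetical predictors: two call mitochondrion, one calls plastid.
calls = {
    "predictor_a": "mitochondrion",
    "predictor_b": "mitochondrion",
    "predictor_c": "plastid",
}
accuracy = {"predictor_a": 0.8, "predictor_b": 0.8, "predictor_c": 0.8}

posterior = combine_calls(calls, accuracy, LOCATIONS)
best = max(posterior, key=posterior.get)
print(best, round(posterior[best], 3))
```

With equally accurate predictors this reduces to a weighted majority vote; the approach becomes more useful once each predictor's accuracy is estimated per location from experimental data.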

Ian Small Harvey Millar

Joshua Heazlewood Julian Tonti-Filippini

Thanks for your attention!!