53
Professor Paul Layzell, Principal of Royal Holloway University of London For UUK annual conference 2014 10/09/2014 Jisc’s new shared data centre

Jisc's new shared data centre

  • Upload
    jisc

  • View
    260

  • Download
    3

Embed Size (px)

DESCRIPTION

By Phil Richards, Martyn Harrow, Professor Nick Luscombe, eMedLab and Professor Paul Layzell. Presented at the UUK annual conference 2014.

Citation preview

  • 1. 10/09/2014 Jiscs new shared data centreProfessor Paul Layzell, Principal of Royal Holloway University of LondonFor UUK annual conference 2014

2. 10/09/2014 Jiscs new shared data centreProfessor Martyn Harrow, chief executive officer, JiscFor UUK annual conference 2014 3. www.luscombelab.orgNicholas LuscombeProfessor of Computational Biology, UCLSenior Group Leader, Francis Crick Institute 4. Our researchApplying computational methods to genomic datato understand how genes are switched on and off. 5. Human genome encodes 23,000 genes 6. Right gene at right time and place 7. Right gene at right time and place 8. Switching is controlled by regulatory genes 9. We discovered 1,300 regulators inhumans 80% have no known target genes Foundation for many downstreamstudies33 human tissuesWhat are the regulators?[Vaquerizas (2019) Nat Rev Genet] 10. What do they control?predicted expression eve gene isexpressed in flyembryos Regulator-targetrelationshipsunknown Modellingidentifiesrelationships[Ilsley (2013) eLife]actual expressioneven-skipped 11. No hnRNP CNonsensicaltranscriptshnRNP C bindspre-mRNAAlus suppressedAnd if it goes wrong? hnRNP C loss causes 1000sof nonsense transcripts Mutations in binding sites cancause genetic disorders[Zarnack (2013) Cell] 12. The data challenge 13. Human genome is hugeThis is really hardRelationship between regulators and genes are notstraightforwardWe still dont know what 90% of regulators control 14. Human genome is hugeThis is really hardRelationship between regulators and genes are notstraightforwardWe still dont know what 90% of regulators control3 billion letters = 119 volumes! 15. GenBank growth since 1989# nucleotides1989timeDatasets keep growing1999 16. 1999GenBank growth since 1989# nucleotides1989timeDatasets keep growing1999 2010 17. 1999GenBank growth since 1989# nucleotides1989timeDatasets keep growing1999 2010 2010 2014 18. How do you transport them?data transfer 19. How do you transport them?data transfer 20. Discovery without boundaries 21. Francis Crick Institute Opening 2015 6 partners (MRC, CancerResearch UK, WellcomeTrust, UCL, KCL, ICL) World-class biomedicalinstitute withinterdisciplinary research Aiming for >100 dry"scientists 22. Central London location 23. A solution 24. Shared offsite data centreCrick computingCollaboratorproject 25. eMedLabData driven discovery for Personalised Medicine 26. eMedLab8.9M MRC award for medical bioinformaticseMedLab infrastructureCapacity building 27. eMedLab8.9M MRC award for medical bioinformaticseMedLab infrastructureCapacity buildingSecure storage,coordination, analysis 28. eMedLab8.9M MRC award for medical bioinformaticseMedLab infrastructureCapacity buildingSecure storage,coordination, analysisResearch outputClinical outcomes 29. Partners provide unique expertise's Interface between bioinformaticsand clinic Novel bioinformatics methods andinterface with wet lab Genomics of health and disease Public data access andchemoinformatics 30. eMedLab infrastructureCrick 31. eMedLab infrastructureCrickSangerUCLPartnersEBISecurecollaborativespace 32. eMedLab infrastructureCrickSangerUCLPartnersEBISecurecollaborativespace 33. eMedLab infrastructureCrickSangerUCLPartnersEBISecurecollaborativespace 34. Infrastructure enables scienceResearch cant be achieved without reliable infrastructure 35. 10/09/2014 Jiscs new shared data centreDr Phil Richards, Chief Innovation Officer, JiscFor UUK annual conference 2014 36. OutlineBackgroundBenefits of scaleThe human barriersPartnersThanks 37. BackgroundGovernment focus on shared services 2011HEFCE Universities Modernisation Fund (UMF) Feasibility work around shared data centres Technical proofs of concept via Janet networkThen a pause 38. Benefits of scale 39. Benefits of large scale construction of extremely large-scale,commodity-computer data centres atlow-cost locations uncovered thefactors of 5 to 7 decrease in cost ofelectricity, bandwidth, operations,software and hardware at these verylarge economies of scale.Armburst, Armando Fox et al.,Above the Clouds, Berkeley 40. Example industrial-scale data centresOwner Location Square feetApple Maiden, North Carolina 500,000Facebook Forest City, North Carolina 300,000Amazon Dublin, Ireland 240,000Google Hamina, Finland 300,000HP Winyard, UK 305,000Source: Greenpeace report How dirty is your data?, April 2011 41. The human barriers 42. The human barriersDistributed Data-centres Under Desks(DDUDs) 43. The Janet network our national grid for big data and computingIndustrial-scale data centre Industrial-scale data centreUniversities and research institutions 44. Partners 45. PRs candidate big themes and possible projects Lifting the student number cap Break replacement cycle for Student Record Systems Open source SRS modelling student lifecycle Backdoor to HESA for easier data entry and benchmarking Exorcising the ghost of the MAC initiative? MOOCs for the masses National platform to complement FutureLearn FutureLearn platform lite or EdX instance? Scalable approaches to Research Data and Equipment National site licences for commercial big data National Kit Catalogue Joining the big data to the meta-data Going beyond short-term compliance Will policy be diluted as true costs emerge? 46. Partners 47. Extra slides 48. Further opportunity 49. Further opportunityCollaborative research data sharingConsolidation of research computation supportScale to 100Ms PA saving for the sector Why does any HEI need its own data centre? Can we all start benefitting from large scale?Through the Janet network, we can! 50. Thanks 51. Thanks Colleagues Tim Marshall Bob Day Jeremy Sharpe Dan Perry Organisations Hefce Infinity 52. Find out moreDr Phil RichardsChief Innovation [email protected] Castlepark Tower Hill Bristol BS2 0JAT 020 3697 [email protected] jisc.ac.ukExcept where otherwise noted, this work is licensed under CC-BY-NC-ND 53. Panel discussion