11
Qimin Yan (LBNL, UC Berkeley) The Materials Project Database Funded by the DOE BES program Grant # EDCBEE Additional computational resources by NERSC / XSEDE Kristin Persson (UCB), Gerd Ceder (UCB), Mark Asta (UCB), Daryl Chrzan (UCB), Jeff Neaton (UCB), Dan Gunter (LBNL), Anubhav Jain (LBNL), Maciej Haranszyk (LBNL), Shyue- Ping Ong (UCSD), Anthony Gamst (UCSD), Stefano Curtarolo (Duke), Jeff Snyder (NW)

4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

QiminYan(LBNL,UCBerkeley)

TheMaterialsProjectDatabase

FundedbytheDOEBESprogramGrant#EDCBEEAdditionalcomputationalresourcesbyNERSC/XSEDE

KristinPersson (UCB),Gerd Ceder (UCB),MarkAsta (UCB),DarylChrzan (UCB),JeffNeaton (UCB),DanGunter(LBNL),Anubhav Jain(LBNL),Maciej Haranszyk (LBNL),Shyue-PingOng(UCSD),AnthonyGamst (UCSD),StefanoCurtarolo (Duke), JeffSnyder (NW)

Page 2: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

Supercomputing Resources

Input processing & transformations

ICSD Otherexperimentaldatabases Usersubmissions

StructureNotationalLanguage (SNL)

Workflow Manager Post-processing and

error-checking

Analysis

•Robustmaterialsanalysis

pymatgen

• Self-healingerrorrecovery

Custodian

• Smartworkflowmanagement

Fireworks

Webapps

MaterialsAPI

~100Tb

A. Jain*, S.P. Ong*, G. Hautier, W. Chen, W.D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder, K.A. Persson,“The Materials Project: A materials genome approach to accelerating materials innovation” APL Materials 1, 011002 (2013)

S. P. Ong, et. al., Computational Materials Science, 2015, 97, 209–215.

S. P. Ong, et. al.,Computational Materials Science, 68, 314–319, 2013

A. Jain, et. al., Concurrency and Computation: Practice and Experience, 1532, 2015

Page 3: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

Progress To Date and Future Data• > 66,000 relaxed compounds: validated energy, phase diagrams. etc.• > 70,000 Pourbaix diagrams1

• > 43,000 band structures• > 2,300 elastic tensors2

• > 900 piezoelectric tensors• Dielectric tensor workflow complete. Target release 2016

DISSEMINATION

DESIGN • MPComplete; >400 community submissions to date• Design of novel functional materials (photocatalysts, thermoelectrics)

• Close to 17,000 registered users !• Ten Apps enabling material searching and design• First Materials data API ; community issues > 1.3M requests/month• MPContribs framework: ALS, NREL EFRC, MAST, for data sharing

High-Quality Materials

DATA

https://materialsproject.org/ https://github.com/materialsproject

1 K. A. Persson, et al, “ Prediction of solid-aqueous equilibria: Scheme to combine first-principles calculations of solids with experimental aqueous states”, Phys. Rev. B 85, 235438 (2012)2 M. de Jong, et al. “Charting the complete elastic properties of inorganic crystalline compounds” Scientific Data 2, 150009 (2015)

Page 4: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

"Raw"MPSimulationResults

Incremental"builders"

RESTandWebfriendlyresults

ValidationAddanew"rule"

Checkandreportdaily

DailyValidationSSomethingiswrong!

Verifyandunderstandtheproblem

Page 5: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

Provenanceforeverymaterial

Page 6: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

Reportseverymorning;spanningalldb1/10/2016 Berkeley Lab (Univ of California) Mail - [matgen-validate] Validation Report

https://mail.google.com/mail/u/0/?ui=2&ik=19a06e26c2&view=pt&q=dkgunter%40lbl.gov&qs=true&search=query&th=1522ada47d1f9023&siml=1522ada47d1f9023 1/1

Kristin Persson <[email protected]>

[matgen­validate] Validation Report 1 message

[email protected] <[email protected]> Sun, Jan 10, 2016 at 1:24 AMReply­To: matgen­[email protected]: matgen­[email protected]

Materials Project Validation ReportReport time

2016­01­10 08:00:02Report user

dangReport host

matgen1Database

mg_core_devLimit

0Elapsed time

5085.58s

Collection "tasks"

Collection "materials"

Constraint Violations A

Condition{'task_id': 'mp­20379'}

Id TaskId Field Constraint Value53be0ae9e051814d8fe721b3 mp­20379 icsd_id size = sequence

Page 7: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

MPContribs:Collaborativeplatform/templateforuserdata

“I have this great dataset, but need help sharing it with the world”

YourMaterialsData

YourOwnApp

Huck, P.; Cholia, S.; Gunter, D.; Winston, D.; N'Diaye, A. T.; Persson K.; “User Applications Driven by the Community Contribution Framework MPContribs in the Materials Project”Proceedings of 10th Gateway Computing Environments Workshop (2015), Concurrency in Computation: Practice and Experience, arXiv:1510.05727

Page 8: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

MPCONTRIBS Process

• pre-processingofuseroutputdataandconversionintoMPFile• visualanditerativecheckingofMPFilebyuser(“getdatainshape”)• MPFile submissionviacommandlineorwebportal (throughREST)• contributeddata canbeeasilydisplayed onMP

MPContribsUserspre-submission MPFile

MPFileViewer

ContributionDetailsPage

MPContribsUserspost-submission

Page 9: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

MPComplete:Crowd-sourcingMPLaunched Sept 2015

• Motivation: new compounds supplied directly by community

• Users suggest structures; MP checks for uniqueness and runs full suite of calculations.

• Ensures user-relevant materials with consistent provenance

PoweredbyXSEDE

Page 10: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

MPComplete UseCases• One-at-a-time, or in bulk:

oOne user contributed four new lead halide PV compounds, one at a time. (design)

oOne user submitted 64 ABN3perovskites, associating with publication. (data sharing)

o Another user submitted 131 structures to check for stability against all MP compounds (validation and data sharing)

• User dashboard links to workflow details è can monitor progress

Search/Analyze

Calculate

Manipulate

Page 11: 4 Yan Materials Project database - Northwestern University...Progress To Date and Future Data • > 66,000 relaxed compounds: validated energy, phase diagrams. etc. • > 70,000 Pourbaixdiagrams1

Thankstothecommunityandforyourattention!