8
Jens Thomas Lensfield Quixote

Jens Thomas Lensfield Quixote. Quixote Project An international, open-source, open-data collaboration to design, test and

Embed Size (px)

Citation preview

Page 1: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and

Jens Thomas

Lensfield Quixote

Page 2: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and

Quixote Projecthttp://quixote.wikispot.org

An international, open-source, open-data collaboration to design, test and deploy a modular, open source system of tools to organize, share and query computational chemistry data.

CCP1GUIAvogadro

Jens ThomasSTFC Daresbury

Laboratory.

Page 3: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and

What and Why

- countless cpu-hours are spent calculating data most of which is discarded/"archived" never to be found again

- data is the bread-and-butter of computational chemistry, yet most of it is never seen by the community

- we want to provide the infrastructure to archive and share chemical data

Expensivecomputerresources

Cumbersomearchaicoutput

BlackholeGenerate data

from scratch

Page 4: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and

Who and How a bottom-up pragmatic approach - build useful, open-

source, modular tools and encourage their adoption

the same tools for managing local data as running central repositories - encourages uptake

a community-based project, internet-based, owned by no-one (everybody).

Page 5: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and
Page 6: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and
Page 7: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and

countless cpu-hours are spent calculating data most of which is discarded/"archived" never to be found again

Results Databases prevent needless duplication of effort codes can be benchmarked/validated and new

methods compared with existing data supporting information for publications archived for re-

use/reproducibility new science will emerge from data-mining

Chemical Markup Language (CML) enormously simplifies implementing databases computational codes can be easily "chained" together data can be automatically validated tools (e.g. GUIs) automagically work with all codes

Page 8: Jens Thomas Lensfield Quixote. Quixote Project  An international, open-source, open-data collaboration to design, test and

modular, open source tools - collaborate with existing projects where possible

lightweight, easy to install, flexible, easy to update, simple user interface & support the major community codes

same infrastructure to manage a local datastore or a public data repository

lensfield2 monitors filesystem and manages file transformation - conversion of legacy output -> CML

automate file uploads/downloads to local/remote repositories - use of EMMA/SWORD if required

currently have groups testing the alpha software collaborating to use tools to do science in the open:

"The Cyclobutadiene Mysteries"