Upload
emonocot
View
819
Download
2
Tags:
Embed Size (px)
Citation preview
Grassbase: the data volume challengeMaria Vorontsova 26 May 2011
Grassbase: the first botanical agglomerate database behemoth?
Strengths: coherent classification systemcomplete and up to date
Weaknesses: divided stakeholder communitydated softwarepoor web functionalitylimited usefulness for identificationno plan for data exploitation
CATE Araceae2,000 descriptions
Solanaceae Source1,000 descriptions
Grassbase11,161 descriptions
area of boxes proportional to number of descriptions included
Grass genera are defined by the variability in spikelet and floret composition
pooid spikelet similar to Bromus (Holcus)
panicoid spikelet common in tropical grasses (Panicum)
glume 2
lowerlemma
glume 1
upperpalea
upperlemma lower
palea
FAO 2009: global consumption of 10 major vegetal foods (2003-2005)
Grass taxonomy at Kew
Otto Stapf: Flora of Tropical Africa. 1934.
Charles Hubbard: Grasses (of Britain). 1954.
C.R. Metcalfe: Anatomy of the Monocotyledons: Gramineae. 1960.
N.L. Bor: The Grasses of India. 1960.
Derek Clayton: Flora of Tropical East Africa. 1970. Flora of West Tropical Africa. 1972. Genera Graminum. 1986. World Grass Flora: “The Kew View” 1985 onwards.
1985Lazarides, Clayton &
PalmerWorld Grass Species
600 characters
2011Grassbase
GrassWorld1017 characters
1977Clifford & WatsonAustralian Grass
Genera332 characters
Evolutionary reconstruction of DELTA grass datasets
1992Watson & Dallwitz
Grass Genera of the World
Australian National University morphological data
Clayton Genera Graminum dataset
25 years full time
data entry
NAMESspecies + infra
63,000
NAMESgeneric2,000
DESCRIPTIONSspecies
DESCRIPTIONSgenera
INTKEYspecies
INTKEYgenera
ACCEPTED SPECIES
11,000
ACCEPTED GENERA
700
Access SYNON groups names into homotypic groups
an average of 88 pieces of information per species in DELTA descriptive language
species description webpages linked by a single index page
NAMESspecies + infra
63,000
NAMESgeneric2,000
DESCRIPTIONSspecies
DESCRIPTIONSgenera
INTKEYspecies
INTKEYgenera
ACCEPTED SPECIES
11,000
ACCEPTED GENERA
700
NAMESspecies + infra
63,000
NAMESgeneric2,000
DESCRIPTIONSspecies
DESCRIPTIONSgenera
INTKEYspecies
INTKEYgenera
ACCEPTED SPECIES
11,000
ACCEPTED GENERA
700TRIBES
TYPESspecies +
infra
AREASTDWG
countries
NAMESspecies + infra
63,000
NAMESgeneric2,000
DESCRIPTIONSspecies
DESCRIPTIONSgenera
INTKEYspecies
INTKEYgenera
ACCEPTED SPECIES
11,000
ACCEPTED GENERA
700
programs to check coding and tidy descriptions
programs for renaming files
programs to output natural language and INTKEY files
ca. 50 permanent and temporary queries
buttons to output species and generic lists
code to check data presence and spellingwithin and between tables
simplified version for website generated with one button
special program for putting name lists into two columns!
NAMESspecies + infra
63,000
NAMESgeneric2,000
DESCRIPTIONSspecies
DESCRIPTIONSgenera
INTKEYspecies
INTKEYgenera
ACCEPTED SPECIES
11,000
ACCEPTED GENERA
700TRIBES
TYPESspecies +
infra
AREASTDWG
countries
IPNI updates for newly described names only
coding from literatureholding files for new items
custom set of programs to sync Access and DELTA
multi-stage import procedure via a series of tables
macros for different data type imports
Grassbase: coding a new species
Grassbase: coding a new species
Grassbase: coding a new speciesProgram “Check” confirms internal consistency of data
Grassbase: coding a new species
Recent changes in grass names affect common species
Bromus sterilis Anisantha sterilis
“Panicum” as used by Grassbase includes numerous evolutionary lineages with simple panicoid spikelets
widening gap between morphological and phylogenetic: ca. 15% in species and generic names
Grassbase: The Kew View, an authoritative system with morphologically defined genera
Grass Phylogeny Working Group, GrassWorld, and others: multiple research groups in USA and Australia