Upload
urooj-sabar
View
494
Download
0
Tags:
Embed Size (px)
DESCRIPTION
A walk through biomart..!!!
Citation preview
1 of 38
BIOMART
2 of 38
BIOMART
• Developed jointly by EBI & CSHL• BioMart is a search engine that can
find multiple terms and put them into a table format.
• Such as: human gene (IDs), chromosome and base pair position
• No programming required!
3 of 38
BIOMART
• A wide variety of analyses and tasks:
SNP (single nucleotide polymorphism)
selection for candidate gene screening
microarray annotationrecovery of disease links, sequence
variations and expression patterns
4 of 38
General or Specific Data-Tables
• All the genes for one species
• Or… only genes on one specific region of a chromosome
• Or… genes on one region of a chromosome associated with a disease
5 of 38
BioMart Data Sets
• Ensembl genes• Vega genes• SNPs• Markers• Phenotypes• Gene expression information• Gene ontology• Homology predictions• Protein annotation
6 of 38
Web Interface
7 of 38
Simple Text-based Search Engine
8 of 38
‘Mouse Gene’ Gives Us Results
9 of 38
A More Complex Query is Not as Useful
10 of 38
BioMart Walkthrough
• Glucose-6-phosphate dehydrogenase (G6PD) human gene located on chromosome X in cytogenetic band q28.
• Which are the other genes in relevance to human diseases locate to the same band?
• Find out their Ensembl Gene IDs and Entrez Gene IDs?
• And also find out their cDNA sequences?
11 of 38
Information Flow
• Choose the species of interest (Dataset)
• Decide what you would like to know about the genes (Attributes)(sequences, IDs, description…)
• Decide on a smaller geneset using Filters.(enter IDs, choose a region …)
12 of 38
Choose ‘Ensemble Genes 66’as a primary database
13 of 38
Choose ‘Homo sapiens’ as the species of interest
14 of 38
On the left narrow the gene set by clicking “Filters”. In front of “REGION”, click on the “+” to expand the choices.
Filters: what we know
15 of 38
Select “Chromosome X”
Select “Band Start q28” and “End q28”
16 of 38
Expand the “GENE” panel.
17 of 38
Limit to genes with MIM disease ID’. These associations have been determined using MIM (Online Mendelian Inheritance in Man).
18 of 38
The filters have determined our gene set. Click ‘Count’ to see how many genes have passed these filters.
19 of 38
The ‘Count’ results show 26 human genes out of 56478 total genes passed the filters.
Click on ‘Attributes’ to select output options (i.e. what we would like to know about our gene set).
20 of 38
Expand the ‘GENE’panel.
21 of 38
Select, along with the default options, ‘Associated Gene name’ (this shows the gene symbol from HGNC).
Note the summary of selected options. The order of attributes determines the order of columns in the result table.
22 of 38
Expand the ‘EXTERNAL’ panel to select External References.
23 of 38
Select ‘EntrezGene ID’ and ‘Mim Morbid Accession’ and‘MIM Morbid Description’.
24 of 38
Click ‘RESULTS’ to preview the output.
25 of 38
To save a file of the complete table, click ‘Go’.
Go back and change Filters or Attributes if desired. Or, View ALL rows as HTML…
26 of 38
Result
27 of 38
Select ’Sequences’ and then expand the ‘SEQUENCES’ section.
To view sequences, go back to ‘Attributes’
28 of 38
Expand the ‘SEQUENCES’ panel and select cDNA sequences
Expand the ‘Header Information’ section.
29 of 38
Choose ‘Ensembl Gene ID’, ‘Associated Gene Name’, ‘Chromosome’, and ‘Ensembl Transcript ID’
30 of 38
Click ‘Results’
31 of 38
32 of 38
Many BioMarts have now been installed by external groups, in large part because of its automated deployment tools and compatibility cross different platforms. Some of the groups are model organism databases such as Gramene, Dictybase, Wormbase, HapMap variation.
33 of 38
Central Server
www.biomart.org
34 of 38
WormBase
35 of 38
HapMap
Population frequencies
Inter- population comparisons
Gene annotation
36 of 38
DictyBase
37 of 38
Uniprot, MSD
38 of 38
GRAMENE
Rice, Maize, Arabidopsis genomes…
39 of 38
Q&AThanks