23
Developed by James Estill, Dept. of Plant Biology, University of Georgia

Developed by James Estill, Dept. of Plant Biology, University of Georgia

  • Upload
    agnes

  • View
    28

  • Download
    1

Embed Size (px)

DESCRIPTION

Developed by James Estill, Dept. of Plant Biology, University of Georgia. TriAnnot. France. IOB Cluster: UGA. Pipeline Annotate Wheat Sequences. PERL. GAME XML. BLAST –m 8 -d MIPS. BLAST –m 8 -d RB_pln. BLAST –m 8 -d TIGRGram. BLAST –m 8 -d TREP9nr. >HEX0014K09 GCAATACT CGGCACTT. - PowerPoint PPT Presentation

Citation preview

Page 1: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Developed by James Estill, Dept. of Plant Biology, University of Georgia

Page 2: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Pipeline Annotate Wheat Sequences

PERL

TriAnnot

FranceIOB Cluster: UGA

GAME XML

Page 3: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Annotation PipelineBLAST –m 8-d MIPSBLAST –m 8

-d RB_plnBLAST –m 8-d TIGRGramBLAST –m 8

-d TREP9nr>HEX0014K09GCAATACTCGGCACTT

Gene Annotation TE Annotation

De Novo HomologyFindmiteLTR_StrucLTR_SeqFind_LTRLTR_Finder

HMMERRepeatmaskerTE NestBLAST

De Novo HomologyGENSCANGENIDFGENESH

BLASTBLATSIM4

Page 4: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Individual Program Procedure

Directoryof FASTAFiles

Configuration File

Run Program

RawResults

GFFFormated

Page 5: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 6: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 7: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 8: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 9: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 10: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Developed by James Estill, Dept. of Plant Biology, University of Georgia

Page 11: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 12: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 13: Developed by James Estill, Dept. of Plant Biology, University of Georgia

!! THIS DOCUMENT IS UNDER CURRENT DEVELOPMENT!!

This program manual and the scripts that make up the DAWG-PAWS package are under current development. Everything is subject to change without notice at this point. This software comes as is, without any expressed or implied warranty. Use at your own risk.

Page 14: Developed by James Estill, Dept. of Plant Biology, University of Georgia

File requirements:1. Each fasta file contains a single record2. BAC scaffolds need to be merged to a single sequence3. Short header

Page 15: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Repeat masking with RepeatMasker and TREP1. Softmask (using RepeatMasker)2. Convert softmask to hardmask because many gene prediction programs

are not softmasked aware

Page 16: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Structural feature annotation: Includes currently only the annotation of gaps

Page 17: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Gene annotation:1. Conduct gene prediction using TriAnnot pipeline2. Run individual gene prediction programs

Page 18: Developed by James Estill, Dept. of Plant Biology, University of Georgia

GenMarkHMM: can be run locally (free license required)GENSCAN: Run on web server & convert output to .gff fileFGeneSH: Run on web server & convert output to .gff file

Page 19: Developed by James Estill, Dept. of Plant Biology, University of Georgia

NCBI-Blast: Most time-consuming step in the pipeline

Page 20: Developed by James Estill, Dept. of Plant Biology, University of Georgia
Page 21: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Transposable element annotation:1. By homology: RepeatMasker, NCBI-Blast2. By structural criteria: LTR-finder

Page 22: Developed by James Estill, Dept. of Plant Biology, University of Georgia

De Novo LTR Annotation Software

PubYear

Source

Availabili

OperatingS

ystem

Speed

Param

eterC

ontrol

License

TSD

LTR

Dinucleotides

PB

S

GA

G IN RT

RH

PP

T

LTR_Struc 2003

LTR_Seq 2006

find_ltr 2007

LTR_Finder 2007

Computation Annotation

Best Good Neutral Bad Crap

Page 23: Developed by James Estill, Dept. of Plant Biology, University of Georgia

Preparing the computational results for Apollo1. Audit the computational results2. Concatenate the .gff files