21
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill [email protected]

Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill [email protected]

Embed Size (px)

Citation preview

Page 1: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Introduction toBioinformatics

Dr. Rybarczyk, PhDUniversity of North Carolina-Chapel Hill

[email protected]

Page 2: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu
Page 3: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

What is Bioinformatics?

Page 4: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

What is Bioinformatics?

Conceptualizing biology in terms of molecules and then applying “informatics” techniques from math,

computer science, and statistics to understand and organize the information associated with these

molecules on a large scale

Page 5: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Objectives:

• Identify appropriate search tools for your needs

• Perform searches using ‘unknown’ sequence data

• Analyze protein structure using Protein Explorer

Page 6: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Why do we use Bioinformatics?

• Store/retrieve biological information (databases)

• Retrieve/compare gene sequences

• Predict function of unknown genes/proteins

• Search for previously known functions of a gene

• Compare data with other researchers

• Compile/distribute data for other researchers

Page 7: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Central Dogma

DNA

RNA

Protein

Genomic sequence

RNA function & structure

Protein sequence

Protein structure

Protein Function

Phenotype

Page 8: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Computer Tools Used in Bioinformatics

NCBI

GenBank

Internet resources

Sequence retrieval:

Sequence comparison programs:

Protein Structure:

BLAST

modeling prediction programs – RasMol, Protein Explorer

GCG MacVector

Page 9: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu
Page 10: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Similarity Searching:

A tool for searching gene or protein sequence databases for related genes of interest

The structure, function, and evolution of a gene may be determined by such comparisons

Alignments between the query sequence and any given database sequence, allowing for mismatches and gaps, indicate their degree of similarity

Page 11: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu
Page 12: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Algorithms

Finds best matching pair comparisons of sequences

input output

Computer comparisons using mathematical probabilities

Page 13: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

MRCKTETGAR

MRCGTETGAR

% similarity

90%

CATTATGATA

GTTTATGATT

70%

Page 14: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Sequence Analysis:

Determine correct sequence of DNA

Compare sequence with known sequences

Translate DNA into protein sequence

Understand protein function in context

Page 15: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Current Tools Used in Bioinformatics

Strengths:

Accessibility

Growing rapidly

User friendly

Weaknesses:

Sometimes not up-to-date

Limited possibilities

Limited comparisons and information

Not accurate

Page 16: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Need for improved Bioinformatics

Genomics: Human Genome ProjectGene array technologyComparative genomicsFunctional genomics

Proteomics:

Global view of protein function

Protein motifs

Structural databases

Page 17: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Data Mining

Handling enormous amounts of data

Sort through what is important and what is not

Manipulate and analyze data to find patterns and variations that correlate with biological function

Page 19: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

3-D Protein Modeling

• Uses information determined by crystal structure methods

• Visualization of protein structure

• Make protein-protein comparisons

• Used to determine:

conformation/folding

antibody binding sites

protein-protein interaction sites

computer aided drug design

Page 20: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

Summary

• Introduction to Bioinformatics tools:

Genbank, BLAST, GCG, GeneCard, Chime

• How Bioinformatics is used

• Advantages and disadvantages of tools

• Applications

Page 21: Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill brybar@med.unc.edu

bioinformatics

students educators

researchers institutions