18
Gene transfer & pathogenicity: a big data approach to influenza macroevolution Paul Agapow Dept. of Genomics of Common Disease [email protected] October 2015

Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Embed Size (px)

Citation preview

Page 1: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Gene transfer & pathogenicity: a big data approach to

influenza macroevolution

Paul AgapowDept. of Genomics of Common Disease

[email protected]

October 2015

Page 2: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

The dull yet necessary background facts

• A respiratory pathogen

• Negative-sense ssRNA virus

• 3 types (A, B, C)

• 8 chromosomes encoding 11 proteins

Page 3: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Cliches of influenza evolution

Shift

• Acquisition of large functional units via horizontal gene transfer

• Sudden adaptive leaps

• Infrequent (?) but fast

• Pandemics

The success of a strain of influenza depends on its ability to subvert or subdue host immunity, which in turn depends on evolution:

Drift

• Conventional molecular mutation

• Stepwise / incremental

• Continual but slow

• Seasonal flu

Page 4: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Shift happens

Page 5: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

The problem

Reassortment

• Only a potential or freak event?

• If not, under what circumstances?

Recombination

• “...plays an important role in influenza evolution...”

• “...never happens...”

• Does it happen?

• Where & when does it happen?

• What happens?

Page 6: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Detecting HGT

Ordinarily the genes in two related organisms have the same most recent common ancestor. HGT disrupts this.

Page 7: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Correlating evolutionary distances

Construct "sequences" composed just of 3rd positions from coding regions. Then:

● For every pair of segments (HA vs NA, HA vs PA, etc):

● For every pair of isolates:

1. extract distance between homologous segments

2. graph the distance for gene A versus gene B

Where gene A and gene B share the same time to their MRCA, the observed evolution in A should be proportional to that in B.

Page 8: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Visualising correlation of evolutionary distances

The distance between two isolates is ΔA = kΔ

B+ x

Page 9: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

The datasets

For now

• 67 swine-lineage H1N1

• 167 avian-lineage H5N1

• 525 human-lineage H3N2

Use for

• Testing reassortment between segments

• Testing recombination within segments by splitting into two

Page 10: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Does it work (in humans)?

Page 11: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Does it work (in pigs)?

Page 12: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Does it work (in birds)?

Page 13: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Recombination?

No (probably).

Page 14: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

How to quantify reassortment?

With great difficulty ...

• Limited to comparisons

• What is a reassortment anyway?

• What is a rate?

Ad-hoc method

• Count populations (3+) of similar likelihood

• Measure “years” in phylogeny

• Measure relative population size as “success”

Page 15: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Reassortment across hosts and strains

Page 16: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Reassortment across segments

Page 17: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Conclusions

• There is no significant recombination

• Reassortment occurs in all compartments but varies

• Humans < pigs < birds

• structural < antigenic & replication

• More persistent in birds?

Page 18: Gene Transfer & pathogenicity: a big data approach to influenza macroevolution

Open questions & next steps

• More (segments / strains / hosts)

• Wherefore recombination?

• What is driving the different dynamics?

• Unknown implication of sampling

• How to best compare host & strains?

• Integration into modelling