SNP Allele Designations (Bio::SNP::Inherit)
Christopher Bottoms
BOSC 2010



5 million data “items”

one CPU: 2+ days
eight CPUs: 1-2 days


SNP ID  Sample ID  Base1  Base2
1       1          A      A
1       2          A      A
1       3          A      G
…       …          …      …
1       5000       A      A
2       1          C      C
…       …          …      …
…       …          …      …
1106    5000       GG     GG


“Matrix” data file format

SNP ID  1   2   3   …  5000
SNP1    AA  AA  AG  …  AA
SNP2    CC  GG  GG  …  CG
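The move from one row per SNP and sample to one row per SNP can be sketched as a simple pivot. The Python below is only an illustration, not the code used in the talk: the function names, the tab-separated output, and the "--" placeholder for a missing call are all assumptions.

```python
import csv
import io
from collections import defaultdict


def long_to_matrix(long_rows):
    """Pivot (snp_id, sample_id, base1, base2) records into one entry per SNP."""
    matrix = defaultdict(dict)
    for snp_id, sample_id, base1, base2 in long_rows:
        # Join the two base columns into a single genotype call, e.g. "AG".
        matrix[snp_id][sample_id] = base1 + base2
    return matrix


def write_matrix(matrix, sample_ids, out):
    """Write one header line, then one tab-separated line per SNP."""
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(["SNP ID"] + sample_ids)
    for snp_id, calls in matrix.items():
        # "--" is a hypothetical placeholder for a sample with no call.
        writer.writerow([snp_id] + [calls.get(s, "--") for s in sample_ids])


# Small illustrative input in the long format (values chosen for the example).
long_rows = [
    ("SNP1", "1", "A", "A"),
    ("SNP1", "2", "A", "A"),
    ("SNP1", "3", "A", "G"),
    ("SNP2", "1", "C", "C"),
    ("SNP2", "2", "G", "G"),
    ("SNP2", "3", "G", "G"),
]

matrix = long_to_matrix(long_rows)
buffer = io.StringIO()
write_matrix(matrix, ["1", "2", "3"], buffer)
matrix_text = buffer.getvalue()
print(matrix_text)
```

With this layout, a program can read each SNP as a single line instead of collecting thousands of separate records.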


Using new data format

12 million data items
one CPU: ~30 min


ID’s file

ID  Name    Group
1   B73     B73
2   B73xZ1  NAMF1
3   Mo17    Control
4   M100    IBM
5   Bob     B73xZ1


“Human Parsed” ID’s file

ID Name Group A (ID) B (ID) AxB (ID)

1 B73 B73

2 B73xZ1 NAMF1

3 Mo17 Control

4 M100 IBM 1 3

5 Bob B73xZ1 1 2
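Once a person has filled in the parent IDs, a program can look up each sample's parents and label its genotype calls relative to them. The Python below is a minimal sketch of that idea, not the Bio::SNP::Inherit implementation. The tab-separated layout, the placement of the parent IDs in the "A (ID)" and "B (ID)" columns, the function names, the example genotype calls, and the A/B/H labelling scheme are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical tab-separated version of part of the "Human Parsed" IDs file.
# Putting the two parent IDs in the "A (ID)" and "B (ID)" columns is an assumption.
IDS_TEXT = (
    "ID\tName\tGroup\tA (ID)\tB (ID)\tAxB (ID)\n"
    "1\tB73\tB73\t\t\t\n"
    "3\tMo17\tControl\t\t\t\n"
    "4\tM100\tIBM\t1\t3\t\n"
)


def read_parents(text):
    """Map each sample ID to (parent A ID, parent B ID) when both are given."""
    parents = {}
    for row in csv.DictReader(io.StringIO(text), delimiter="\t"):
        if row["A (ID)"] and row["B (ID)"]:
            parents[row["ID"]] = (row["A (ID)"], row["B (ID)"])
    return parents


def is_homozygous(call):
    """True for two-letter calls with identical bases, such as "AA"."""
    return len(call) == 2 and call[0] == call[1]


def designate(call, call_a, call_b):
    """Label a call as "A", "B", "H" (one allele from each parent) or "-".

    "-" means the SNP is not informative for this pair of parents, or the
    call matches neither parent. This scheme is an illustrative assumption.
    """
    if not (is_homozygous(call_a) and is_homozygous(call_b)) or call_a == call_b:
        return "-"
    if call == call_a:
        return "A"
    if call == call_b:
        return "B"
    if sorted(call) == sorted(call_a[0] + call_b[0]):
        return "H"
    return "-"


parents = read_parents(IDS_TEXT)
snp_calls = {"1": "AA", "3": "GG", "4": "AG"}  # hypothetical calls for one SNP
a_id, b_id = parents["4"]
label = designate(snp_calls["4"], snp_calls[a_id], snp_calls[b_id])
print(label)
```

This reflects the division of labour the slides describe: a person records the pedigree once, and the program repeats the lookup for every SNP and sample.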


Lessons learned

Explore other solutions before deciding on parallel processing

File format changes can simplify work

When appropriate, divide work
  Humans: Complicated but “once-only” task
  Computers: Repetitive boring work


Acknowledgements

Advisors
  Mike McMullen
  Sherry Flint-Garcia

Hardware support
  Arturo Garcia

Funding
  National Science Foundation Plant Genome Program, Grant DBI-0820619
  USDA-ARS


Programming support
  You (CPAN)
  You (stackoverflow.com)
  You (perlmonks.org)


End