Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
",1�zXMg~�D�������GWAS�",0)�/6*��#37�
�čĽ|
éĩøĈ�§�Ð
p�q��'#��#37@m
2
Yang et al. (2003) Am. J. Hum. Genet 73: 627
• ACTN3+�¹��*<BNj¤¸MjZBÑα-ctinin-3:E�S�$�7• <efX+�ÖÉESj)4!$α-ctinin-3�Ä7�&�%�(�• a-actinin-3*đ�+a-actinin-2)4!$į��87��ACTN3*Įùç*·��5�ę�n(7&®98$��
126,559i�*�(�E��\dC��GWAS
• Fig. 1*3"*SNPs+��*¯�ėÏęŊ('&*<LG?�Gcj%2¦Ý�8��Rietveld et al. 2014, PNAS 13790, Ward et al. 2014 PLoS ONE e100248)
• ����íĶŊ+ġâ)ć��R2ĸ0.02%�• ûæ*pČÇ)Èě�8���*v�2ý�� �Fig. 2�
3
Rietveld et al. (2013) Science 340: 1467
\K�9u_V����
hG9u^�Rf
5Yano et al. (2016) Nature Genetics 48: 927
�'#��#37@m (association analysis)�'#��#37SI (association study)
……T……C……A……G……T………………A…………A………
• pČË«3rÔóÃ)�.87ħÔ3¢Đ:ľ�$��85%�Å�87DNAû &Ĥ û �*���5�ªtpČÇ*¦Ý:Ê/7ıIJ�±ěÒ©:ĢĿ&�(��
……T……C……A……A……T………………A…………A……………T……A……G……G……T………………A…………A……………G……C……A……A……T………………C…………T……………T……A……A……G……T………………A…………A……………T……C……A……A……T………………A…………A……………T……C……A……G……T………………A…………T……………T……A……G……G……T………………C…………A………
……G……C……A……G……C………………A…………T……………T……A……A……G……C………………A…………A……………T……C……A……G……C………………C…………T……………T……A……A……A……C………………C…………A……………T……C……G……G……C………………A…………T……………G……A……A……G……C………………A…………A……………T……C……A……A……C………………C…………A………
ħÔA
ħÔB
ħÔC
:
::
DNAû
ÑĊQ�Mŋ�,�200ħÔă*Ħ�ĉ´çħÔ*�¸
70/80
23/120
ňĊQ�M
Ĥ û
ŋ�,�200ħÔ*õß
6
�������� �*&�����$"%,��+2��+)������������8 �
�� �����'5!��10/
�� ������'5!��10/
�� �����'5�7 �-(#
T
�� �����'5�7 �-(#
C
",0)�&6�#37
Watanabe et al. (2005)Ann Bot. 95:1131
pČÇ
DNAû
�Åā%ñğ��*v�:ī�y�$pČĊęŊ:ĥ{�7�1�ñğ)zĘ2��7�
ÀĜÊ©
ûæ*DNAû )�#�Ļø
Ļøā%ñğ
�"%2�'�%2� 8)%2�ÀĜÊ©:¶9�)ñğ�%�7
DNAû :ĖŊ&�$�Õň3ħÑ:Ļø
wJ:c���lY
Garcia-Ruiz et al. Proc Natl Acad Sci 113(28): E3995-4004
“The most dramatic response to genomic selection was observed for the lowly heritable traits DPR, PL, and SCS. Genetic trends changed from close to zero to large and favorable, resulting in rapid genetic improvement in fertility, lifespan, and health in a breed where these traits eroded over time.”
'-�GS`U
Genomic prediction�9u_V%�4�+7 ��<�
BO for genomic screening of plant germplasm
*�(@m����
£øQ�M*į�
IE<y
GWAS & GSbQejC
ÙĀµö*�ë
GWAS
GSbQejC
ĻøêĎĥ{
12GSbQejC+łØ
sAF���@myi = u+β j xij + ei
¬Ĭ¡:Ĉňy��ãİ
1311*sx�û �SNPs�
All materials can be downloaded fromhttp://ricediversity.org/
sAFQ?
�������SNPs�Ĺm)… �.�)_jXOMj��
�;()���;pČÇ� 7*�
0e+00 1e+08 2e+08 3e+08
05
1020
30
position (bp)
−log
10(p
)
|etZn�W}N�DL
• IndicaħÔ��ä�)+Áą�M>\�û��JaponicaħÔ�ĝ�)+Ĩ³*M>\�û�
]{�����9uZn
ûļç�·�$2�'*¢Đ�2ē��ń% 8,�ŀç+é�(��
����ÒÂ+…ĪÙĀ� !�6� }¢µö� !�6�
7�→ �ŀç:é�7ªt&(7 16
ĪÙĀ)47Ĥ *o�:�Õ��7
|et�>OZn�[�� AF
¥w��A:bQf)ô/º0
• Yu et al. (2006) Nat. Genet. 38: 203
yi = u+β j xij + vkqikk=1
K
∑ +αi + ei
a ~ N(0, Aσα2 )
�'#��#37@m���H�k��H;k��HxT�
�� ���������
�� ��������
ijď+uç�ªtpČÇ%(��
A B (þ�Ô*�°�
ijď+ŀç�ªtpČÇ% 7�
C (þ�Ô*�°� D
����false positive rate: FPR�= B / (A + B)
����false negative rate: FNR�= C / (C + D)
�� ��false discovery rate: FDR�= B / (B + D)
ijď+uç(*)�ŀç�ªtpČÇ�&�$¦Ý�87�¸
ijď+ŀç�ªtpČÇ�(*)�uç&�$¦Ý�87�¸
ŀç�ªtpČÇ�&�$Ш�8�2**ă*�UKbW�uç�*�¸
�ŀçŅ&�uçŅ+Rg�S@[*��) 7���!$
�ШŅ:sĈ)EjRh�f�7ıIJ2¿587 18
large p small n�r
æï�æĴ æģ
y1y2
:yn
n 1
x1x2
:xn
=
n p
w + e
X’X+Ĕn�Û��+%�(��
Zda�Mæ�_�A��
Fj\fæ
p >> n
y X
w = (X'X)−1X'y
�IPTWK�%,�O8�
!����
�%,�
�IPTW
PRESS
Se
1��Oam_RO�IPTWP;fajZ0:NFYQFXSL/CMXA;�*am_N�FX�%,�P=X0:�Z�N��FX<
fajO0:�
Ļøñğ:¶�GS%+� $+.6*Ň�%+(��ĻøêĎ*Ň�:ĥ{�7�&�ÛĿ
0 2 4 6 8 10
02
46
810
12
0 2 4 6 8 10
02
46
810
12
x
y
0 2 4 6 8 10
02
46
810
12
xbby 10 +=
å=
+=7
10
k
kk xbby
96�46�3��.�����:
�*am_N@DX�%,�O4���� 3$�
(n-fold cross-validation)1. am_Zn^`bN� 2. i'(O^`bZ9>Ifajdhem_Z��
3. i'(O^`bNH>Ip2J#UGfajJ�% Z2+
4. 2, 3Zn�-W6Fq5. �IOam_NH>Ip�%K�% Z"5EI,�Z4�FX,�NP�%K�%7O)8V;�.O�O2��nPRESSoMLZ&>X
nAam_�OKBpleave-one-outnr�Bo[k]ciam\glK>?
iy
iyiy
iyiy
4)$AF
argminw
(yi − xiTw)2
i∑ +λ w 2
yi = xijwjj
M
∑ + ei = xiTw+ ei
���æ�ý��(7�&)^TfP=:���¾Þĕà
ûæ*Zda�M�Îĺ)Ē�*:Ł�7
��ƽ*ĕà ^TfP=
w 2= wj
2
j
M
∑
ĭæñÿ)47MLR&n(6�ò$*SNPs�bQf)�.87
λ��ƽ&^TfP=*YdjI:&7
LASSO
argminw
(yi − xiTw)2
i∑ +λ w
1
yi = xijwjj
M
∑ + ei = xiTw+ ei
^TfP=
w1= wj
j
M
∑
-2 -1 0 1 2
0.0
0.5
1.0
1.5
2.0
x
y
eOH��&4�Ì7��îüā*Ō)Ġŋ��^TfP=:��7
0e+00 1e+08 2e+08 3e+08
−0.0
40.
000.
02
Ridge
Position (bp)
Coe
ffici
ents
0e+00 1e+08 2e+08 3e+08
−0.2
0.0
0.1
0.2
LASSO
Position (bp)
Coe
ffici
ents
ridge, LASSO AFLj�yB
ridge ��)Ġ-LASSO+46��0)ÕÜ�shrink��$�7�&�Ī�7
GS3
.��LASSO*¤~:/7&GS3*²~+ý����*ú*ʼnq2
Ļø)Ńľ�8$�7↓
DW`i>S_�A�:Ńľ��Ļø*ĢĿç:ͼ
ridge �+²~*Þ�(pČÇ����; 7á¸)�LASSO+Ġ�Ċ²~*ý�(pČÇ� 7á¸)ċ�$�7
¬Ĭ¡*pČÇ
@a� ��2*5=��o� ��2*5=�8�
DW`i>S_�A�
Ĥ DW`i>S_�A�
Ĥ �GS model
ăå+Ī�5(�$4��ĻøêĎ�·�8,(;%24��
ªtpČÇ*lĂ3²~('�ă*aAUJ`)�ĵ� 7
�GWAS model
�è�(t~��*Ěk�ķĊ*�1�ÙĀµö)47�ŀç(')ÚĪĄm�$�ë
:¶�
t~��)+Ćì�ĵ�(���è�(Ļø�bQfy*ķĊ
GS�vPb��R.)!�$è÷yð¡��• ridge regression, LASSO, elastic net
• glmnet etc.»¸bQf• BLUP
• rrBLUP���ו SVM, RVM �A�VfIJ�
• kernlab etc.• random forest
• randomForest etc
]>JIJ• Bayesian linear regression (Bayesian ridge, Bayesian LASSO)
• BLR etc• RKHS regression �A�VfIJ�
• RKHSw�Crossa et al. (2010) Genetics 186: 713*@jd>jËņ)R\hCd` 6�
Äĩ)��7GSÓIJêĎĠ�)��$+Zhong et al. (2009) Genetics 182: 355Crossa et al. (2010) Genetics 186: 713Iwata and Jannink (2011) Crop Sci 51: 1915Heffner et al. (2011) Crop Sci 51: 2597('