Date: 2014/05/27 Author: Xiangnan Kong, Bokai Cao, Philip S. Yu Source: KDD’13 Advisor: Jia-ling Koh Speaker: Sheng-Chih Chu Multi-Label Classification by Mining Label and Instance Correlations from Heterogeneous Information Networks


Date: 2014/05/27

Author: Xiangnan Kong, Bokai Cao, Philip S. Yu

Source: KDD’13

Advisor: Jia-ling Koh

Speaker: Sheng-Chih Chu

Multi-Label Classification by Mining Label and Instance Correlations from Heterogeneous Information Networks


Outline

• Introduction
• Meta-path-based Correlation
• PIPL Algorithm
• Experiment
• Conclusion


Introduction

• Label correlations are usually not given and can be hard to learn from moderate-sized data.
• Use heterogeneous information networks to facilitate the multi-label classification process.


Single-label Classification

• Ex: Single-label Classification

          d1  d2  d3
Economy    1   0   0
Art        0   1   0
Polity     0   0   1

• Ex: Multi-label Classification

          d1  d2  d3
Economy    1   1   0
Art        0   1   1
Polity     1   0   1
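The multi-label table above can also be stored as one label set per document and converted back into the 0/1 matrix. The following is a minimal sketch in Python; the helper names `indicator_matrix` and `is_single_label` are hypothetical and only illustrate the encoding.

```python
# Label sets per document, read off the multi-label table (columns d1..d3).
LABELS = ["Economy", "Art", "Polity"]
doc_labels = {
    "d1": {"Economy", "Polity"},
    "d2": {"Economy", "Art"},
    "d3": {"Art", "Polity"},
}

def indicator_matrix(doc_labels, labels):
    """Return a 0/1 matrix with one row per label and one column per document."""
    docs = sorted(doc_labels)
    return [[1 if lab in doc_labels[d] else 0 for d in docs] for lab in labels]

def is_single_label(doc_labels):
    """Single-label data is the special case where every set has exactly one label."""
    return all(len(s) == 1 for s in doc_labels.values())

matrix = indicator_matrix(doc_labels, LABELS)
for lab, row in zip(LABELS, matrix):
    print(lab, row)
```

In the single-label table every column sums to 1, so `is_single_label` would return True for it; here it returns False.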


Multi-label Classification

• Ex: Drug-Target Binding Prediction

[Figure: example with instances and labels marked]


Heterogeneous Information Networks

• Ex: [figure]
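A heterogeneous information network has several node types and several edge types. A minimal sketch of such a structure in Python follows; the class `HIN`, the node names and the edge-type names (`binds`, `in_pathway`, `ppi`) are all hypothetical and are not taken from the paper's dataset.

```python
from collections import defaultdict

class HIN:
    """Toy heterogeneous information network: typed nodes, typed undirected edges."""

    def __init__(self):
        self.node_type = {}            # node id -> type name
        self.adj = defaultdict(set)    # (node id, edge type) -> neighbor ids

    def add_node(self, node, ntype):
        self.node_type[node] = ntype

    def add_edge(self, u, v, etype):
        # Store the edge in both directions because it is undirected.
        self.adj[(u, etype)].add(v)
        self.adj[(v, etype)].add(u)

    def neighbors(self, node, etype):
        return set(self.adj.get((node, etype), set()))

net = HIN()
for node, ntype in [("drugA", "drug"), ("drugB", "drug"),
                    ("gene1", "gene"), ("gene2", "gene"),
                    ("pathway1", "pathway")]:
    net.add_node(node, ntype)
net.add_edge("drugA", "gene1", "binds")
net.add_edge("drugB", "gene2", "binds")
net.add_edge("gene1", "pathway1", "in_pathway")
net.add_edge("gene2", "pathway1", "in_pathway")
net.add_edge("gene1", "gene2", "ppi")
```

Keying the adjacency on (node, edge type) keeps each link type separate, which is what distinguishes this from a homogeneous graph.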


Framework

Meta-path Construction

Meta-path-based Label and Instance Correlation

Training Initialization

Bootstrap

Model

Iterative Inference

Output




Label and Instance Correlation

Label:
• Same-gene correlation
• Share similar pathways
• Interconnected through PPI (protein-protein interaction) links

Instance:
• Similar side effects
• Chemical ontologies
• Similar substructures (features)


Meta-path-based Correlation

• Meta-path-based Label Correlation
• Meta-path-based Instance Correlation
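One way to read a meta-path-based correlation: two nodes are correlated if a path following a given sequence of link types connects them. The sketch below illustrates this on a toy adjacency; the function `follow_meta_path`, the node names and the edge types are hypothetical, and the paper's exact definition may differ.

```python
# Toy typed adjacency: (node, edge type) -> neighbors. All names are hypothetical.
ADJ = {
    ("gene1", "in_pathway"): {"pathway1"},
    ("gene2", "in_pathway"): {"pathway1"},
    ("gene3", "in_pathway"): {"pathway2"},
    ("pathway1", "in_pathway"): {"gene1", "gene2"},
    ("pathway2", "in_pathway"): {"gene3"},
    ("gene1", "ppi"): {"gene3"},
    ("gene3", "ppi"): {"gene1"},
}

def follow_meta_path(start, edge_types, adj=ADJ):
    """Return the nodes reached from `start` by traversing `edge_types` in order."""
    frontier = {start}
    for etype in edge_types:
        frontier = {v for u in frontier for v in adj.get((u, etype), set())}
    frontier.discard(start)  # a node is not counted as correlated with itself
    return frontier

# Two example meta-paths between labels (genes).
same_pathway = ["in_pathway", "in_pathway"]   # gene -> pathway -> gene
ppi_link = ["ppi"]                            # gene -> gene

print(follow_meta_path("gene1", same_pathway))
print(follow_meta_path("gene1", ppi_link))
```

The same traversal applies to instance correlations by choosing link types that connect instances, for example shared side effects.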




PIPL Algorithm

• Meta-path Construction


• Training Initialization

• Yi: the label set of each instance.
• Pj(i): the neighbors linked to the i-th label through meta-path j

Array (2-dimensional)

Besides xi itself, also consider the labels related to xi and the instances related to xi.
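The idea of combining an instance's own features with information from related labels and related instances can be sketched as follows. This is an illustration only: the function `extended_features`, the data layout, and the use of simple fractions as the aggregation are assumptions, not the paper's exact construction.

```python
def fraction(hits, total):
    """Share of hits among total, defined as 0.0 when there is nothing to count."""
    return hits / total if total else 0.0

def extended_features(i, k, X, Y, label_paths, instance_paths):
    """Feature vector for deciding whether instance i has label k.

    X: list of feature lists; Y: list of label-index sets (one per instance).
    label_paths[j][k]: labels linked to label k via label meta-path j.
    instance_paths[j][i]: instances linked to instance i via instance meta-path j.
    """
    feats = list(X[i])                                  # x_i itself
    for path in label_paths:                            # labels related to label k
        related = path.get(k, set())
        feats.append(fraction(len(related & Y[i]), len(related)))
    for path in instance_paths:                         # instances related to x_i
        related = path.get(i, set())
        feats.append(fraction(sum(k in Y[n] for n in related), len(related)))
    return feats

# Toy data: 3 instances, 2 features, 2 labels, one meta-path of each kind.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
Y = [{0, 1}, {1}, {0}]
label_paths = [{0: {1}, 1: {0}}]
instance_paths = [{0: {1, 2}, 1: {0}, 2: {0}}]

row = extended_features(0, 0, X, Y, label_paths, instance_paths)
print(row)
```

Each meta-path contributes one extra column, so the vector grows by the number of label meta-paths plus the number of instance meta-paths.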


• Bootstrap & Iterative Inference

• μ: unlabeled instances
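Bootstrap followed by iterative inference can be sketched as a loop: first guess labels for the unlabeled instances from their own features, then repeatedly re-predict each one using the current labels of its neighbors until nothing changes. The code below is a generic sketch of that pattern, not the paper's PIPL procedure; the function names and the toy `vote` rule that stands in for a trained model are assumptions.

```python
def iterative_inference(unlabeled, neighbors, predict_content, predict_relational,
                        labels, max_iter=10):
    """Bootstrap label sets from content only, then refine them using neighbors.

    unlabeled: list of instance ids (the set written μ on the slide).
    neighbors: dict instance id -> set of related instance ids.
    predict_content(i) -> label set from the instance's own features.
    predict_relational(i, neighbor_label_sets) -> label set.
    labels: dict instance id -> known label set (training instances).
    """
    current = dict(labels)
    for i in unlabeled:                      # bootstrap step
        current[i] = set(predict_content(i))
    for _ in range(max_iter):                # iterative inference
        changed = False
        for i in unlabeled:
            neigh = [current[n] for n in neighbors.get(i, ()) if n in current]
            new = set(predict_relational(i, neigh))
            if new != current[i]:            # updates are applied immediately
                current[i], changed = new, True
        if not changed:
            break
    return {i: current[i] for i in unlabeled}

def vote(i, neighbor_label_sets):
    """Toy stand-in for a trained model: keep labels held by at least half the neighbors."""
    counts = {}
    for s in neighbor_label_sets:
        for lab in s:
            counts[lab] = counts.get(lab, 0) + 1
    return {lab for lab, c in counts.items() if 2 * c >= len(neighbor_label_sets)}

known = {"a": {"L1"}, "b": {"L1"}}
links = {"u": {"a", "b"}, "v": {"u"}}
result = iterative_inference(["u", "v"], links, lambda i: set(), vote, known)
print(result)
```

In the toy run, "v" has no labeled neighbor of its own and only receives a label because "u" was updated first, which is the point of iterating.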




Experiment

• Heterogeneous information network: 290K nodes, 720K edges (SLAP)
• Gene-Disease Association Prediction: 1943 instances, 300 features, 50 labels
• Drug-Target Binding Prediction: 5651 instances, 1500 features, 50 labels
• 5-fold cross-validation
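For reference, 5-fold cross-validation splits the instances into five parts and uses each part once as the test set. A minimal sketch follows; the function `k_fold_indices`, the shuffling and the seed are assumptions, since the slides do not say how the folds were drawn.

```python
import random

def k_fold_indices(n, k=5, seed=0):
    """Yield (train, test) index lists for k-fold cross-validation over n instances."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)          # fixed seed for a reproducible split
    folds = [idx[f::k] for f in range(k)]     # k disjoint folds of near-equal size
    for f in range(k):
        test = folds[f]
        train = [i for g in range(k) if g != f for i in folds[g]]
        yield train, test

# 1943 is the instance count of the Gene-Disease task listed above.
splits = list(k_fold_indices(1943))
for train, test in splits:
    print(len(train), len(test))
```

Every instance appears in exactly one test fold, so each reported score averages over five disjoint test sets.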


Evaluation Metrics

• Micro-F1: higher is better (↑)
• Hamming loss: lower is better (↓)
• Subset loss: lower is better (↓)
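The three metrics can be computed directly from true and predicted label sets. The sketch below uses the standard definitions, assuming the slides follow them; the function names and the toy data are hypothetical.

```python
def micro_f1(true_sets, pred_sets):
    """F1 over all (instance, label) pairs pooled together."""
    tp = sum(len(t & p) for t, p in zip(true_sets, pred_sets))
    fp = sum(len(p - t) for t, p in zip(true_sets, pred_sets))
    fn = sum(len(t - p) for t, p in zip(true_sets, pred_sets))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def hamming_loss(true_sets, pred_sets, n_labels):
    """Fraction of (instance, label) pairs that are predicted wrongly."""
    wrong = sum(len(t ^ p) for t, p in zip(true_sets, pred_sets))
    return wrong / (len(true_sets) * n_labels)

def subset_loss(true_sets, pred_sets):
    """Fraction of instances whose predicted label set is not exactly right."""
    misses = sum(t != p for t, p in zip(true_sets, pred_sets))
    return misses / len(true_sets)

# Toy example with 3 instances and 3 labels (indices 0..2).
y_true = [{0, 2}, {1}, {0, 1}]
y_pred = [{0, 2}, {1, 2}, {0}]

print(micro_f1(y_true, y_pred))
print(hamming_loss(y_true, y_pred, n_labels=3))
print(subset_loss(y_true, y_pred))
```

Subset loss is the strictest of the three: a single wrong label makes the whole instance count as an error, while Hamming loss charges only for that one label.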




Conclusion

• The paper proposes using heterogeneous information networks to facilitate multi-label classification by mining label correlations and instance correlations from the network.
• It also proposes a novel solution to multi-label classification, called PIPL, which exploits complex linkage information in heterogeneous information networks.