37
Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Cluster-based Strategy Anran Lu Huanzhong Xu Atharva Parulekar Stanford University May 8, 2018 1

Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

  • Upload
    others

  • View
    5

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Cluster-based Strategy

Anran Lu Huanzhong Xu Atharva Parulekar

Stanford University

May 8, 2018

1

Page 2: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Summary

Review of classic Statistical Arbitrage

Main idea and steps of cluster-based strategy

Current result and future work

2

Page 3: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Summary

Review of classic Statistical Arbitrage

Main idea and steps of cluster-based strategy

Current result and future work

3

Page 4: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Summary

Review of classic Statistical Arbitrage

Main idea and steps of cluster-based strategy

Current result and future work

4

Page 5: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Review of Statistical Arbitrage

5

Page 6: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Figure: Stock chart of United Airline, Delta Airline and XTN

6

Page 7: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Main Idea of Pairs Trading

Basic model:

dPt

Pt= αdt + β

dQt

Qt+ dXt

"Pair": two stocks with similar characteristicsα: negligible compared to dXtβ: determined by regressionXt : key part of the portfolio

"Generalized Pair": a stock and an ETF

7

Page 8: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Main Idea of Pairs Trading

Basic model:

dPt

Pt= αdt + β

dQt

Qt+ dXt

"Pair": two stocks with similar characteristics

α: negligible compared to dXtβ: determined by regressionXt : key part of the portfolio

"Generalized Pair": a stock and an ETF

8

Page 9: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Main Idea of Pairs Trading

Basic model:

dPt

Pt= αdt + β

dQt

Qt+ dXt

"Pair": two stocks with similar characteristicsα: negligible compared to dXt

β: determined by regressionXt : key part of the portfolio

"Generalized Pair": a stock and an ETF

9

Page 10: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Main Idea of Pairs Trading

Basic model:

dPt

Pt= αdt + β

dQt

Qt+ dXt

"Pair": two stocks with similar characteristicsα: negligible compared to dXtβ: determined by regression

Xt : key part of the portfolio

"Generalized Pair": a stock and an ETF

10

Page 11: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Main Idea of Pairs Trading

Basic model:

dPt

Pt= αdt + β

dQt

Qt+ dXt

"Pair": two stocks with similar characteristicsα: negligible compared to dXtβ: determined by regressionXt : key part of the portfolio

"Generalized Pair": a stock and an ETF

11

Page 12: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Main Idea of Pairs Trading

Basic model:

dPt

Pt= αdt + β

dQt

Qt+ dXt

"Pair": two stocks with similar characteristicsα: negligible compared to dXtβ: determined by regressionXt : key part of the portfolio

"Generalized Pair": a stock and an ETF

12

Page 13: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Determine Portfolio by Xt

Cointegration: Ornstein — Uhlenbeck modelling of Xt

dXt = κ(µ− Xt)dt + σdWt

some implicit assumptionsκ: mean reversion speedXt ∼ N (µ, σ

2

2κ) at equilibrium

s = Xt−E XtVar Xt

determines long/short position

13

Page 14: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Determine Portfolio by Xt

Cointegration: Ornstein — Uhlenbeck modelling of Xt

dXt = κ(µ− Xt)dt + σdWt

some implicit assumptions

κ: mean reversion speedXt ∼ N (µ, σ

2

2κ) at equilibrium

s = Xt−E XtVar Xt

determines long/short position

14

Page 15: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Determine Portfolio by Xt

Cointegration: Ornstein — Uhlenbeck modelling of Xt

dXt = κ(µ− Xt)dt + σdWt

some implicit assumptionsκ: mean reversion speed

Xt ∼ N (µ, σ2

2κ) at equilibrium

s = Xt−E XtVar Xt

determines long/short position

15

Page 16: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Determine Portfolio by Xt

Cointegration: Ornstein — Uhlenbeck modelling of Xt

dXt = κ(µ− Xt)dt + σdWt

some implicit assumptionsκ: mean reversion speedXt ∼ N (µ, σ

2

2κ) at equilibrium

s = Xt−E XtVar Xt

determines long/short position

16

Page 17: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Determine Portfolio by Xt

Cointegration: Ornstein — Uhlenbeck modelling of Xt

dXt = κ(µ− Xt)dt + σdWt

some implicit assumptionsκ: mean reversion speedXt ∼ N (µ, σ

2

2κ) at equilibrium

s = Xt−E XtVar Xt

determines long/short position

17

Page 18: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Figure: Apple-XLK pair opening and closing signals

18

Page 19: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Figure: Stock chart of United Airline, Delta Airline and XTN

19

Page 20: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Main idea and steps of cluster-based strategy

20

Page 21: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Motivation of our work

Pairs with different "qualities"

Dynamically adjusting pairs based on real time factorsNon-stationary time series and cointegration

Cointegration: Here ut is the stationary residual obtainedafter regressing non-stationary xt and yt which also havea unit root. We can use Dicky Fuller of two step EngelGranger to determine cointegration.

yt − βxt − α = ut

21

Page 22: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Motivation of our work

Pairs with different "qualities"Dynamically adjusting pairs based on real time factors

Non-stationary time series and cointegration

Cointegration: Here ut is the stationary residual obtainedafter regressing non-stationary xt and yt which also havea unit root. We can use Dicky Fuller of two step EngelGranger to determine cointegration.

yt − βxt − α = ut

22

Page 23: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Motivation of our work

Pairs with different "qualities"Dynamically adjusting pairs based on real time factorsNon-stationary time series and cointegration

Cointegration: Here ut is the stationary residual obtainedafter regressing non-stationary xt and yt which also havea unit root. We can use Dicky Fuller of two step EngelGranger to determine cointegration.

yt − βxt − α = ut

23

Page 24: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Motivation of our work

Pairs with different "qualities"Dynamically adjusting pairs based on real time factorsNon-stationary time series and cointegration

Cointegration: Here ut is the stationary residual obtainedafter regressing non-stationary xt and yt which also havea unit root. We can use Dicky Fuller of two step EngelGranger to determine cointegration.

yt − βxt − α = ut

24

Page 25: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Motivation of our work

Pairs with different "qualities"Dynamically adjusting pairs based on real time factorsNon-stationary time series and cointegration

Cointegration: Here ut is the stationary residual obtainedafter regressing non-stationary xt and yt which also havea unit root. We can use Dicky Fuller of two step EngelGranger to determine cointegration.

yt − βxt − α = ut

25

Page 26: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Steps of our work

Main idea: We run pairs trading on each stock-ETF pairgiving rize to 5 key factors per stock-ETF pair. We calcu-late these factors for a stock with respect to all 6 ETFs.

Key factors for pair:Residual st . (Non decorrelated st = Xt −m)Mean Reversion Time 1/κCointegration factors

Apply K-means clustering based on the key factors for astock with respect to all ETFs (30 dim vector).

26

Page 27: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Steps of our work

Main idea: We run pairs trading on each stock-ETF pairgiving rize to 5 key factors per stock-ETF pair. We calcu-late these factors for a stock with respect to all 6 ETFs.

Key factors for pair:Residual st . (Non decorrelated st = Xt −m)Mean Reversion Time 1/κCointegration factors

Apply K-means clustering based on the key factors for astock with respect to all ETFs (30 dim vector).

27

Page 28: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Steps of our work

Main idea: We run pairs trading on each stock-ETF pairgiving rize to 5 key factors per stock-ETF pair. We calcu-late these factors for a stock with respect to all 6 ETFs.

Key factors for pair:Residual st . (Non decorrelated st = Xt −m)Mean Reversion Time 1/κCointegration factors

Apply K-means clustering based on the key factors for astock with respect to all ETFs (30 dim vector).

28

Page 29: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Current Result and Future Work

29

Page 30: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Visualization of Current Result

Figure: PCA scores based on K-means clustering (500 stocks)30

Page 31: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Future work

General performance estimation and efficiencyimprovement

Trade-off between cointegration and residualAlternative Models

Mean-reverting models to better describe XtEM for clustering

Trade-off between trading frequency and transactioncost

31

Page 32: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Future work

General performance estimation and efficiencyimprovementTrade-off between cointegration and residual

Alternative ModelsMean-reverting models to better describe XtEM for clustering

Trade-off between trading frequency and transactioncost

32

Page 33: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Future work

General performance estimation and efficiencyimprovementTrade-off between cointegration and residualAlternative Models

Mean-reverting models to better describe XtEM for clustering

Trade-off between trading frequency and transactioncost

33

Page 34: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Future work

General performance estimation and efficiencyimprovementTrade-off between cointegration and residualAlternative Models

Mean-reverting models to better describe XtEM for clustering

Trade-off between trading frequency and transactioncost

34

Page 35: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Future work

General performance estimation and efficiencyimprovementTrade-off between cointegration and residualAlternative Models

Mean-reverting models to better describe XtEM for clustering

Trade-off between trading frequency and transactioncost

35

Page 36: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Selection of positions

36

Page 37: Cluster-based Strategy - Stanford University...Statistical Arbitrage Cluster-based Strategy Current Result and Future Work Steps of our work Main idea: We run pairs trading on each

Statistical Arbitrage Cluster-based Strategy Current Result and Future Work

Thank you!

37