Use and Limitations of Machine Learning in Portfolio … · 2018-05-24 · Prediction -...

Use and Limitations

of Machine Learning

in Portfolio Management

Overview

1. Brief Introduction to Learning

2. Prediction

- “Futurecasting”

- “Nowcasting”

- factor analysis

3. Similarity Measures

- recommendation system

4. Generating Synthetic Datasets

A Brief Introduction to Learning

Learning: Y|X

• Regression: E[Y|X=x]

• Classification: P(Y=y|X=x)

• Synthetic data generation:

To each problem its solution

• What we want to know from Y

• Dimensionality of the data (X and Y)

• Signal to noise of the data

• Risk function

• Stationarity

• Etc.

An Introduction to Statistical Learning

Great overview of classic

machine learning techniques

with examples of code in R

Prediction

Methods Used

• OLS Regression

• Lasso, Ridge, Elastic Net

• Kernel Regression

• Trees

• Neural Nets

• Random Forests

• SVMs

• Etc.

Prediction - Things to Consider

• Linear versus non-linear

• Dimensionality of the data

• Density of the data

• Signal to noise

• Risk function

• Interpretability

• Over-fitting

Prediction - “Futurecasting”

• No access to contemporaneous data

• Very difficult to do

• Markets tend to be efficient

• Signal to noise ratio is poor

• It is difficult to beat naïve predictors

• Boosted Trees is the leader at the moment

Big Data and AI Strategies Machine Learning and Alternative Data Approach to Investing

Quantitative and Derivatives Strategy

Marko Kolanovic, PhDAC

marko.kolanovic@jpmorgan.com

Rajesh T. Krishnamachari, PhD

rajesh.tk@jpmorgan.com

May 2017

See page 278 for analyst certification and important di sclosures, including non-US analyst disclosures.

Completed 18 May 2017 04:15 PM EDT

Disseminated 18 May 2017 04:15 PM EDT

This document is being provided for the exclusive use of LOGAN SCOTT at JPMorgan Chase & Co. and clients of J.P. Morgan.

Big Data and AI Strategies

Good overview of the

current use of machine learning

in alpha generation and more

Prediction - “Nowcasting”

• Access to contemporaneous data

• Important data that is published with a lag or a low frequency

• Generating replicating portfolios (Stat Arb)

• Live estimates of

- Macroeconomic indicators

- Etc.

Prediction - Factor Analysis

• p: number of predictors

• n: number of observation

• It used to be n>>p- OLS was useful

• It is now p>n (zoo of factors)- curse of dimension

▪ dimensionality reduction, PCA, clustering, etc.▪ best subset, Lasso, Ridge, etc.▪ K-fold cross validation

• Also useful for hedging

Similarity Measures

Useful For

• Manager selection

• Stock selection

• Style drift detection

Similarity Measures

Methods Used

• PCA

• Hierarchical Clustering

• K-means

• Supervised classifiers

• Etc.

Used For

• Alternative data

• Big data

• Improving analyst’s productivity

Similarity Measures - Things to Consider

• Supervised- labeling the target variable and letting the learner infer useful

predictors

• Unsupervised- choosing predictors where “closeness” is of interest and letting

the algorithm do the clustering

• Non stationarity of data

• Renormalization

• Availability of data for back testing

Generating Synthetic Data

Useful For

• Scenario analysis

• Stress testing

• Risk budgeting

• Option pricing

• OOS testing

Could be Useful For

• Training data for data intensive learners (deep learning, reinforcement learning, etc.)

• Testing systematic strategies

Generating Synthetic Data

Methods Used

• Fitting of parametric models

- distributions (poisson, normal, cauchy, etc.)- DGP (EWMA, GARCH, variance gamma process, etc.)

• Kernel density estimation

• Eigen vector decomposition

• Factor analysis

• Auto Encoders

• LSTM NN

Generating Synthetic Data - Things to Consider

• Single versus multivariate inputs

• Single versus multivariate outputs

• Conditional versus unconditional outputs

• Linear versus non-linear relationships

• Bulk versus tails of the distribution

• Interpretability

Use and Limitations of Machine Learning in Portfolio … · 2018-05-24 · Prediction -...

Documents

Airports, Air Pollution, and Contemporaneous Health

Futurecasting effects of sea level rise, climate change, and … · 2012-06-20 · U.S. Department of the Interior U.S. Geological Survey Futurecasting effects of sea level rise,

Futurecasting Seminar 1: VAPàIT - Amazon Web Services · 2016-08-18 · Schedule for the TeamWorks: Futurecasting seminars and MyWork devotions Use the template below to schedule

Achieving Contemporaneous Geomorphic Reclamation at El Segundo Mine, New Mexico

Drought of Opportunities: Contemporaneous and Long Term

The Use of Contemporaneous Circumstances and Legislative

The Contemporaneous Beasts View of Daniel 7versebyversebibleteaching.com/daniel/daniel 7.pdf · The Contemporaneous Beasts View of Daniel 7 By Chris White Daniel 7 is considered by

S6 _ Contemporaneous Art

The Puritan Bible and Other Contemporaneous Protestant Versions (1913)

Foossa Futurecasting Canvas

Futurecasting Do you need to predict the future of your career field? Looking Into The Future

A Temporary Contemporaneous Fluffy Tuesday Afternoon Keeps Nativistic Trumpets Green

Canada’s Transfer Pricing Rules and Contemporaneous ...cba.org/cba/cle/PDF/TAX11_Murray_Paper.pdf · Canada’s Transfer Pricing Rules and Contemporaneous Documentation Requirements

The contemporaneous movement between cash flow and … · 2016-12-20 · THE CONTEMPORANEOUS MOVEMENT BETWEEN CASH FLOW AND ACCRUALS-BASED ACCOUNTING NUMBERS: THE NEW ZEALAND EVIDENCE

Airports, Air Pollution, and Contemporaneous Healthfaculty.haas.berkeley.edu/rwalker/research/Schlenker...Airports, Air Pollution, and Contemporaneous Health Wolfram Schlenker and

Cahier 2000-11 Exact Tests for Contemporaneous Correlation ... · CAHIER 2000-11 EXACT TESTS FOR CONTEMPORANEOUS CORRELATION OF DISTURBANCES IN SEEMINGLY UNRELATED REGRESSIONS Jean-Marie

S11 contemporaneous art

Contemporaneous effects of diabetes mellitus and

Multivariate Contemporaneous Threshold Autoregressive Models · ear money-output Granger causality patterns ... poraneous value of x1t ... 3 Multivariate Contemporaneous Threshold

COVID-19 mortality and contemporaneous air pollution · 2 days ago · COVID-19 mortality and contemporaneous air pollution . Wes Austin, Stefano Carattini and John Gomez Mahecha