2
Workshop Advances in Statistical and Risk Modeling for Actuaries Prague, 25-26 June 2018 Background of regression and generalized linear models (GLMs) For background, we will review the basic concepts of regression and its extension to generalized linear models. These models have become popular actuarial tools for better understanding claims prediction, pricing, risk classification, and even claims reserving. Penalized regression and likelihood Traditional methods of estimating regression and generalized linear models are extended to introduce the concept of regularization by introducing a penalty function. This is typically accomplished because estimates from traditional methods of least squares or likelihood may be distorted or meaningless. Regression and classification trees We explore the usefulness of decision trees as a predictive model. In decision trees, the regions of the explanatory variables are repeatedly partitioned to create a tree-based structure that can be used to predict the outcome variable. This is an alternative to classical regression and GLMs that have been popularized for decades, and we will assess how trees perform relative to these classical models. Data clustering A form of unsupervised learning, data clustering is the process of dividing a group of observations into homogeneous groups or clusters. Observations in the same cluster share “similar” features while those from dierent clusters are “dissimilar”. This method has many uses in “big data” analytics but it has a large potential in actuarial applications: understanding and monitoring claims, valuation of large portfolios of variable annuities. Abstract This workshop provides a survey of the use of modern statistical techniques for actuarial applications. There has been an increased interest in the use of data science to deliver predictive models applicable in all areas of insurance and financial security programs: life and health, property and casualty, pensions. Such predictive models are generally used for improved decision making in: (1) delivering consumer products that are financially sound, (2) developing better relationships with consumers to meet their evolving needs, and (3) evaluating sucient funding for promised benefits from insurance and pension programs, to name a few. We will conduct this workshop by introducing a little bit of the theory, but we will also simultaneously incorporate practical case studies and how data and models can be analyzed using R. In particular, we cover: Principal component analysis (PCA) PCA is a technique to simplify a high dimensional dataset by transforming or reducing the dimension of the data while keeping to preserve its original features. Such high dimensional data presents computational challenges and PCA has been used to reduce the burden of such datasets. Here, we present the procedure used to calculate principal components and we demonstrate how this technique can be used for forecasting currency exchange rates. Copula regression models Until recently, the assumption of independence has been typically used and accepted in insurance models. However, we find that there are many possible reasons why dependencies are present in several insurance operations: multiple coverage within a single contract, bundled policies owned by a single policyholder, spatial dependencies, intertemporal dependencies, and dependent claim frequency and severitiy. We introduce the concept of copulas as a tool for modeling and understanding relationships of dependent outcomes. To accommodate the heterogeneity of insurance policyholders, we also introduce the concept of regression within a copula framework.

Workshop Advances in Statistical and Risk Modeling for A. Valdez.pdf · Statistical and Risk Modeling for Actuaries Prague, 25-26 June 2018 Background of regression and generalized

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Workshop Advances in Statistical and Risk Modeling for A. Valdez.pdf · Statistical and Risk Modeling for Actuaries Prague, 25-26 June 2018 Background of regression and generalized

Workshop Advances in Statistical and Risk Modeling for Actuaries

Prague, 25-26 June 2018

Background of regression and generalized linear models (GLMs)

For background, we will review the basic concepts of regression and its extension to generalized linear models. These models have become popular actuarial tools for better understanding claims prediction, pricing, risk classification, and even claims reserving.

Penalized regression and likelihood

Traditional methods of estimating regression and generalized linear models are extended to introduce the concept of regularization by introducing a penalty function. This is typically accomplished because estimates from traditional methods of least squares or likelihood may be distorted or meaningless.

Regression and classification trees

We explore the usefulness of decision trees as a predictive model. In decision trees, the regions of the explanatory variables are repeatedly partitioned to create a tree-based structure that can be used to predict the outcome variable. This is an alternative to classical regression and GLMs that have been popularized for decades, and we will assess how trees perform relative to these classical models.

Data clustering

A form of unsupervised learning, data clustering is the process of dividing a group of observations into homogeneous groups or clusters. Observations in the same cluster share “similar” features while those from different clusters are “dissimilar”. This method has many uses in “big data” analytics but it has a large potential in actuarial applications: understanding and monitoring claims, valuation of large portfolios of variable annuities.

Abstract This workshop provides a survey of the use of modern statistical techniques for actuarial applications. There has been an increased interest in the use of data science to deliver predictive models applicable in all areas of insurance and financial security programs: life and health, property and casualty, pensions. Such predictive models are generally used for improved decision making in: (1) delivering consumer products that are financially sound, (2) developing better relationships with consumers to meet their evolving needs, and (3) evaluating sufficient funding for promised benefits from insurance and pension programs, to name a few. We will conduct this workshop by introducing a little bit of the theory, but we will also simultaneously incorporate practical case studies and how data and models can be analyzed using R. In particular, we cover:

Principal component analysis (PCA)

PCA is a technique to simplify a high dimensional dataset by transforming or reducing the dimension of the data while keeping to preserve its original features. Such high dimensional data presents computational challenges and PCA has been used to reduce the burden of such datasets. Here, we present the procedure used to calculate principal components and we demonstrate how this technique can be used for forecasting currency exchange rates.

Copula regression models

Until recently, the assumption of independence has been typically used and accepted in insurance models. However, we find that there are many possible reasons why dependencies are present in several insurance operations: multiple coverage within a single contract, bundled policies owned by a single policyholder, spatial dependencies, intertemporal dependencies, and dependent claim frequency and severitiy. We introduce the concept of copulas as a tool for modeling and understanding relationships of dependent outcomes. To accommodate the heterogeneity of insurance policyholders, we also introduce the concept of regression within a copula framework.

Page 2: Workshop Advances in Statistical and Risk Modeling for A. Valdez.pdf · Statistical and Risk Modeling for Actuaries Prague, 25-26 June 2018 Background of regression and generalized

About the Speaker Emiliano (Emil) A. Valdez, Ph.D., FSA, is a Professor of Actuarial Science in the Department of Mathematics at the University of Connecticut, USA. He is a Fellow of the Society of Actuaries, holds a Ph.D. from the University of Wisconsin in Madison, and is also the interim director of the graduate programs in actuarial science at Connecticut. His academic experience includes several years of teaching and conducting research in actuarial science in three different continents: North America, Australia and Asia. His previous academic posts include working for the Nanyang Business School in Singapore and for the University of New South Wales in Sydney, Australia. From 2013 to 2015, he was the Director of the actuarial science program at Michigan State University. He has been awarded the Edward A. Lew Award, the Halmstad Memorial Prize, and recently in 2010, the Charles A. Hachemeister Prize, in recognition for his significant contributions to the actuarial literature. His current research interest includes copula models and dependencies, managing post-retirement assets, and risk measures and capital requirements related to enterprise risk management. In addition, he has several years of industry experience working as an actuary for Connecticut Mutual in Hartford and held summer actuarial positions at Price Waterhouse.

About the Workshop Date

25-26 June 2018: Monday & Tuesday 9am-5pm

Location

Sokolovská 49/83, 18675 Prague 8, Karlín, Room K1

References

Gan, G. and Valdez, E.A. (2018), Actuarial Statistics with R: Theory and Case Studies, ACTEX publications: New Hartford, Connecticut.

Frees, E.W., Derrig, R.A. and Meyers, G., ed. (2016), Predictive Modeling Applications in Actuarial Science, Vol. II: Case Studies in Insurance, Cambridge University Press: New York.

Frees, E.W. and Valdez, E.A. (1998), Understanding Relationships Using Copulas, North American Actuarial Journal, Vol. 2, No. 1, pp. 1-25.

Frees, E.W., Shi, P. and Valdez, E.A. (2009), Actuarial Applications of a Hierarchical Insurance Claims Model, ASTIN Bulletin, Vol. 39, No. 1, pp. 165-197.

Organizers

Czech Society of Actuaries and Charles University, Faculty of Mathematics and Physics

Sponsors

Česká pojišťovna, a.s.

Emiliano A. Valdez Professor of Actuarial Science University of Connecticut