A Unified Approach to Sparse Tweedie Modeling of Multisource Insurance Claim Data

Study of multi-source insurance data via group regularization

By Simon Fontaine, Yi Yang, Wei Qian, Yuwen Gu, and Bo Fan in Research

September 5, 2019

Fontaine, S., Yang, Y., Qian, W., Gu, Y., & Fan, B. (2019). A unified approach to sparse tweedie modeling of multisource insurance claim data. Technometrics, 62(3), 339–356.

Abstract

Actuarial practitioners now have access to multiple sources of insurance data corresponding to various situations: multiple business lines, umbrella coverage, multiple hazards, and so on. Despite the wide use and simple nature of single-target approaches,modeling these types of datamay benefit froman approach performing variable selection jointly across the sources. We propose a unified algorithm to perform sparse learning of such fused insurance data under the Tweedie (compound Poisson) model. By integrating ideas frommultitask sparse learning and sparse Tweedie modeling, our algorithm produces flexible regularization that balances predictor sparsity and between-sources sparsity. When applied to simulated and real data, our approach clearly outperforms single-target modeling in both prediction and selection accuracy, notably when the sources do not have exactly the same set of predictors. An efficient implementation of the proposed algorithm is provided in our R package MStweedie, which is available at https://github.com/fontaine618/MStweedie. Supplementary materials for this article are available online.

Histogram of Tweedie data
Posted on:
September 5, 2019
Length:
1 minute read, 178 words
Categories:
Research
Tags:
Insurance Tweedie data Regularization
See Also:
MSTweedie