Preprint / Version 1

On The Effect of Polygenic Risk Scores and other Non-Genetic Factors on Life Insurance Underwriting Decisions

##article.authors##

  • Emiliano Careaga Maine South High School
  • Jameson Augustin

DOI:

https://doi.org/10.58445/rars.3196

Keywords:

Life Insurance, Polygenic Risk Score, Premium, Logistic Regression, Random Forest, Feature Importance, Risk Prediction, Underwriting Decisions, Adverse Selection, Information Asymmetry

Abstract

This study investigates the effect of polygenic risk scores and other variables on underwriting decisions in life insurance using machine learning. Using simulated data of 10,000 individuals, we developed logistic regression and random forest models to analyze the impact of each variable. Traditional variables and their limitations were described in order to contextualize the original methods for life insurance underwriting. Other genetic methods and their limitations excluding polygenic risk scores were outlined with the intention of comparing an emerging concept with various traditional genetic analytical methods. Data was generated and processed using machine learning techniques to ensure reasonable results. However, our multicollinearity analysis revealed important limitations: the synthetic PRS showed multicollinearity with other variables (VIF > 10) and correlations with family history variables that were artifacts of the data generation process. These analyses demonstrated that polygenic risk scores have a significant effect on life insurance underwriting decisions by enhancing the accuracy of the risk-prediction models. Feature importance charts showed that the more accurate model (the random forest, as confirmed by comparison tables) gave greater weight to variables related to polygenic risk scores and their interactions. However, the model also considered several traditional variables with substantial weight, including premium cost, underlying condition, age, and health risk score. This research offers a valuable perspective into the relationship between polygenic risk score and individual characteristics, highlighting the value of integrating genetic variables with traditional variables, while acknowledging the challenges of implementing synthetic PRS. These findings provide evidence supporting the potential application of polygenic risk scores in risk-prediction models, though real-world implementation would require addressing the methodological limitations identified. By increasing the accuracy of risk-prediction models, current conflicts such as adverse selection and information asymmetry could potentially be resolved with the addition of certain policies and proper implementation of true genetic risk scores. 

 

References

Low, Lisa, et al. “Genetic Discrimination in Life Insurance: Empirical Evidence from a Cross Sectional Survey of Genetic Support Groups in the United Kingdom.” BMJ: British Medical Journal, vol. 317, no. 7173, 1998, pp. 1632–1635. https://doi.org/10.1136/bmj.317.7173.1632.

Lewis, Cathryn, et al. Epidemiological and Genetic Prediction of Major Morbidity and All Cause Mortality.UKBiobank,2021,www.ukbiobank.ac.uk/enable-your-research/approved-research/investigate-the-importance-of-genetic-data-over-and-above-detailed-epidemiological-data-in-multivariable-prognostic-models-that-predict-risk-of-i-major-morbidity-ii-all-cause-mortality. Accessed June 2025.

Born, Patricia. “Genetic Testing in Underwriting: Implications for Life Insurance Markets.” Journal of Insurance Regulation, vol. 38, no. 5, 2019.

Karlsson Linnér, Richard, and Philipp D. Koellinger. “Genetic Risk Scores in Life Insurance Underwriting.” Journal of Health Economics, vol. 81, 2022, p. 102556.

Maxwell, Jessye M., et al. “Multifactorial Disorders and Polygenic Risk Scores: Predicting Common Diseases and the Possibility of Adverse Selection in Life and Protection Insurance.” Annals of Actuarial Science, vol. 15, no. 3, 2021, pp. 488–503.

Lund, Heather, and Richard Russell. “Polygenic Risk Scores: A Useful Tool in Our Risk Prediction Toolkit?” Japanese Journal of Insurance Medicine, vol. 117, no. 1, 2019, pp. 1–7.

Chen, Zhihua, Xuchen Xu, and Hongbo Liu. “The Successive Approximation Genetic Algorithm (SAGA) for Optimization Problems with Single Constraint.” Mathematics, vol. 11, no. 8, 2023, p. 1928.

Downloads

Posted

2025-10-12

Categories