International Journal of Scientific & Technology Research

Scopus coverage:
Nov 2018 to May 2020


IJSTR >> Volume 9 - Issue 1, January 2020 Edition


Website: http://www.ijstr.org

ISSN 2277-8616

Risk Prediction Assessment In Life Insurance Company Through Dimensionality Reduction Method

Sandeep Kumar Dwivedi, Ashish Mishra, Rajeev Kumar Gupta



Keywords: Big data, PCA, RMSE, classification, backward elimination, random forest, feature selection



Risk assessment is one of the major components of a life insurance organization, and it is the process through which customers are grouped. Such organizations perform a range of operations so that they can make decisions on applications and maintain proper management. Data collection has expanded considerably because of the large number of customers and advances in the investigation process, which is why the analysis has been automated for faster processing. Automation allows many updates to be made and also helps introduce new plans through predictive analysis. Because real-world datasets contain a large number of features, dimensionality reduction is applied to select the attributes or features that increase the power of the model. Dimensionality reduction can be done with strategies such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and Correlation-Based Feature Selection (CFS). Several machine learning methods, including Artificial Neural Network, Multiple Linear Regression, Random Tree, and the proposed Random Forest, are applied to the dataset to predict the risk level of candidates. Backward elimination gave the most prominent result, with the lowest root mean square error (RMSE) of 0.384 using the random forest strategy. The paper also reports the training accuracy and testing accuracy of the random forest model.
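The two quantities the abstract relies on, RMSE and backward elimination, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The names `rmse`, `backward_elimination`, and the `score` callback are hypothetical, and the greedy stopping rule (stop when no single removal lowers the score) is an assumption, since the abstract does not describe the exact procedure. In the paper's setting, `score` would train a random forest on the given feature subset and return its validation RMSE.

```python
import math
from typing import Callable, List, Sequence, Tuple


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean square error between observed and predicted values."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def backward_elimination(
    features: Sequence[str],
    score: Callable[[List[str]], float],
) -> Tuple[List[str], float]:
    """Greedy backward elimination (assumed variant, lower score is better).

    Start from all features. At each step, try removing each remaining
    feature, keep the removal that gives the lowest score, and stop as
    soon as no removal improves on the current best score.
    """
    selected = list(features)
    best = score(selected)
    while len(selected) > 1:
        # Score every subset obtained by dropping exactly one feature.
        candidates = [
            (score([f for f in selected if f != drop]), drop)
            for drop in selected
        ]
        cand_score, drop = min(candidates, key=lambda c: c[0])
        if cand_score >= best:
            break  # no single removal helps, so keep the current subset
        selected.remove(drop)
        best = cand_score
    return selected, best
```

A caller would pass the full list of column names together with a `score` function that fits the chosen model on those columns and returns `rmse(y_validation, predictions)`. The function returns the retained features and the score they achieved.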


