J. Pesantez-Narvaez, M. Alcañiz, M. Guillen

Machine-learning algorithms are widely used for predicting different types of phenomena in all areas, and in particular XGBoost is recognized for its exceptional predictive capacity. To derive the determining factors associated with traffic accidents, dichotomous response models indicating the existence of accident claims versus no claim can be used. We aim to compare the performance of logistic regression and the new XGBoost approach when predicting the existence of accident claims using telematics data. The dataset contains information from an insurance company including individual driving patterns such as total annual distance driven and percent of distance driven in urban areas. Findings show that the logistic regression is a suitable model due to its interpretability and good predictive capacity. XGBoost demands numerous tuning-model procedures to improve the logistic regression model predictive performance and needs an additional effort for interpretability.

Keywords: binary response; predictive model; tree boosting; GLM

Scheduled

GT15-2 Risk Analysis
September 6, 2019  11:20 AM
I3L8. Georgina Blanes building


Other papers in the same session


Cookie policy

We use cookies in order to be able to identify and authenticate you on the website. They are necessary for the correct functioning of it, and therefore they can not be disabled. If you continue browsing the website, you are agreeing with their acceptance, as well as our Privacy Policy.

Additionally, we use Google Analytics in order to analyze the website traffic. They also use cookies and you can accept or refuse them with the buttons below.

You can read more details about our Cookie Policy and our Privacy Policy.