Best ML algorithms to predict motor insurance claims

Team Members

Peter Maila
Shamsuddeen Lawal
Kasavuli Mark
Rofhiwa Ntshagovhe
Sandisiwe Mtsha
Festus Godwin

Project description

The aim of this project is to evaluate various machine learning (ML) algorithms, based on several outcomes - the pros and cons, performance, accuracy, and interpretability, as examples - for the task of predicting motor insurance claims.
Here's a link to our notion link
Here's a link to the test and train datasets for PMD: PMD Datasets
Here's a link to the test and train datasets for Mobility: Mobility Datasets

Each dataset has it's own notebook containing the following contents:

We undertook a meticulous gathering of datasets from two notable insurance organizations, PMD and Mobility. Following this, we meticulously refined and organized the data to guarantee its accuracy and suitability for thorough analysis. This comprehensive data preparation procedure is designed to eradicate irregularities and confirm the dataset's appropriateness for modeling. The integration of varied datasets allows us to confirm the strength and adaptability of our models in different motor insurance scenarios.

2. Feature Engineering

Through the application of sophisticated techniques informed by domain expertise, we methodically identified and extracted relevant features from the dataset. This intricate procedure involved utilizing knowledge specific to the domain and employing methodologies to enhance the predictive capability of the chosen features. By enriching the dataset with meaningful features, we aimed to boost the performance and precision of our predictive models.

3. Statistical model, algorithms and gradient booster used:

Generalised linear model
XGBoost
Random forest
CatBoost
Explainable Boosting Machines (EBM)
LightGBM

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.vscode		.vscode
docs		docs
notebooks		notebooks
src		src
tests		tests
.env.template		.env.template
.gitattributes		.gitattributes
.gitignore		.gitignore
Final Presentation (Best ML Algorithms for Predicting Motor Insurance Claims).pdf		Final Presentation (Best ML Algorithms for Predicting Motor Insurance Claims).pdf
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Best ML algorithms to predict motor insurance claims

Team Members

Project description

Each dataset has it's own notebook containing the following contents:

Table of Contents

1. Data Preprocessing

2. Feature Engineering

3. Statistical model, algorithms and gradient booster used:

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Best ML algorithms to predict motor insurance claims

Team Members

Project description

Each dataset has it's own notebook containing the following contents:

Table of Contents

1. Data Preprocessing

2. Feature Engineering

3. Statistical model, algorithms and gradient booster used:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages