This repository contains the solution for the Shell.ai Hackathon 2025, focusing on predictive modeling for fuel blending properties.
Website : Predictive Fuel Blend Analysis
- model.py: The main script containing the Stacking Regressor model (LightGBM, CatBoost, XGBoost with RidgeCV meta-learner).
- corr_heatmap.py: Utility script for generating correlation heatmaps of the data/predictions.
- final.csv: The generated output file matches the target blend properties with high accuracy.
-
Ensure you have the required Python libraries installed (pandas, numpy, sklearn, lightgbm, catboost, xgboost).
-
Place
train.csvandtest.csvin the root directory. -
Run the model:
python model.py
This will process the data, train the stacked models, and generate
final.csv.
The solution employs a stacking ensemble approach:
- Feature Engineering: Weighted averages of component properties, outlier removal, and quantile transformation.
- Base Models: LightGBM, CatBoost, and XGBoost regressing on 10 target blend properties.
- Meta Learner: RidgeCV (Ridge Regression with cross-validation) to combine base model predictions.