Skip to content

Atharva-1512/AutoML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸš€ AutoML Project

An End-to-End Automated Machine Learning Pipeline with Dynamic Feature Engineering

πŸ“Œ Overview

This project implements a complete Automated Machine Learning (AutoML) system that simplifies the process of building, training, and evaluating machine learning models.

The system is designed to:

Automatically preprocess data

Perform intelligent feature engineering

Train multiple models

Optimize performance

Select the best model

It reduces manual effort and enables faster experimentation for real-world ML problems.

🎯 Key Features

βœ… Automated Data Preprocessing

Handling missing values

Encoding categorical variables

Feature scaling

βœ… Dynamic Feature Engineering

Automatic feature selection

Feature transformation

Handling noisy/anomalous data

βœ… Model Training & Selection

Multiple ML algorithms support

Hyperparameter tuning

Best model selection

βœ… Evaluation & Metrics

Accuracy, Precision, Recall, F1-score

Model comparison

βœ… Modular Pipeline Design

Clean and reusable architecture

Easy to extend for new datasets/models

πŸ› οΈ Tech Stack

Language: Python

Libraries:

scikit-learn

pandas

numpy

matplotlib / seaborn

Tools:

Jupyter Notebook / VS Code

Git & GitHub

πŸ“‚ Project Structure AutoML/ │── data/ # Dataset files │── notebooks/ # Experiments & analysis │── src/ β”‚ β”œβ”€β”€ preprocessing.py # Data cleaning & transformation β”‚ β”œβ”€β”€ feature_engineering.py β”‚ β”œβ”€β”€ model_training.py β”‚ β”œβ”€β”€ evaluation.py │── models/ # Saved trained models │── utils/ # Helper functions │── main.py # Entry point │── requirements.txt │── README.md βš™οΈ Installation

Clone the repository

git clone https://github.com/Atharva-1512/AutoML.git

Navigate to project

cd AutoML/project

Create virtual environment

python -m venv venv source venv/bin/activate # Windows: venv\Scripts\activate

Install dependencies

pip install -r requirements.txt ▢️ Usage python main.py

OR (if notebook-based):

jupyter notebook πŸ“Š Workflow

Load Dataset

Data Preprocessing

Feature Engineering

Model Training

Hyperparameter Optimization

Model Evaluation

Best Model Selection

πŸ“ˆ Example Output

Best Model: Random Forest / XGBoost (example)

Accuracy: ~85–95% (depends on dataset)

Feature Importance Visualization

πŸ”₯ Use Cases

Predictive analytics

Fraud detection

Customer churn prediction

Academic ML experiments

Rapid prototyping for startups

🚧 Future Improvements

Add deep learning models

Integrate AutoML libraries (AutoGluon / H2O)

Build a web dashboard (Streamlit)

Deploy as API (FastAPI)

Add experiment tracking (MLflow)

🀝 Contributing

Contributions are welcome!

Fork the repo

Create a new branch

git checkout -b feature-name

Commit changes

git commit -m "Added new feature"

Push

git push origin feature-name πŸ“œ License

This project is licensed under the MIT License.

πŸ‘¨β€πŸ’» Author

Atharva Gade

AI/ML Enthusiast

BE IT (SPPU)

Interested in AutoML, Data Science & AI Research

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors