Data Science Projects

This repository collects notebook-based machine learning and deep learning experiments across tabular data, image classification, natural language processing, recommendation systems, and time series forecasting.

Project Map

Area	Contents
`Classification/`	Credit decisions, customer segmentation labels, game win-rate prediction, AutoML classification, and survey-style classification notebooks.
`Clustering/`	Customer clustering and a small clustering walkthrough.
`Computer Vision/`	CNN and TensorFlow examples for CIFAR-10, Fashion MNIST, Kannada MNIST, digit recognition, parasite images, and traffic signs.
`Natural Language Processing (NLP)/`	Sentiment, review, disaster tweet, duplicate-question, and Amazon review classification notebooks.
`Recommendation System/`	Cosine-similarity examples for Netflix and TEDx-style recommendations plus a product recommendation notebook.
`Regression/`	House pricing and advertising regression examples.
`Time Series/`	Stock and ridership forecasting notebooks.
`zingat house pricing/`	Web scraping and regression notebooks for housing-price analysis.

The root defect-prediction-xgboost-83.ipynb notebook is kept at the top level because it is a standalone XGBoost defect prediction experiment.

Working With The Notebooks

Most projects are self-contained Jupyter notebooks. A typical local workflow is:

python -m venv .venv
.venv\Scripts\activate
python -m pip install jupyter pandas numpy scikit-learn matplotlib seaborn
jupyter notebook

Some notebooks use additional libraries such as TensorFlow, XGBoost, PySpark, or AutoML packages. Install those only for the notebook you plan to run so the base environment stays small.

Maintenance Notes

Keep new notebooks inside the closest topic folder instead of adding more top-level files.
Add a short markdown introduction near the top of each notebook that names the dataset, target variable, and evaluation metric.
Avoid committing generated datasets, model artifacts, or local notebook checkpoint folders.
Before committing notebook changes, restart the kernel and run all cells so outputs match the current code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Science Projects

Project Map

Working With The Notebooks

Maintenance Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Classification		Classification
Clustering		Clustering
Computer Vision		Computer Vision
Natural Language Processing (NLP)		Natural Language Processing (NLP)
Recommendation System		Recommendation System
Regression		Regression
Time Series		Time Series
zingat house pricing		zingat house pricing
LICENSE		LICENSE
README.md		README.md
defect-prediction-xgboost-83.ipynb		defect-prediction-xgboost-83.ipynb

Folders and files

Latest commit

History

Repository files navigation

Data Science Projects

Project Map

Working With The Notebooks

Maintenance Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages