Dog Breed Classification CNN

A deep learning project implementing a Convolutional Neural Network for classifying 120 dog breeds with 83.16% accuracy using transfer learning with ResNet-50.

📚 Quick Links:

🚀 Getting Started Guide - 5-minute setup
❓ FAQ - Common questions answered
📁 Project Structure - File organization explained
📊 View Presentation - Academic presentation

🎯 Overview

This project tackles fine-grained visual classification of 120 dog breeds from the Stanford Dogs dataset. The model uses transfer learning with a pre-trained ResNet-50 backbone whose weights stay frozen, and trains only a small classification head. This reaches 83.16% test accuracy in about 15 minutes of training on a Tesla T4 GPU.

Key Achievements:

✅ 83.16% test accuracy on 120-class classification
✅ Training time: ~15 minutes on Tesla T4 GPU
✅ 1.1M trainable parameters (frozen base model)
✅ Comprehensive evaluation with confusion matrices and classification reports

⚡ Quick Start

# Clone the repository
git clone https://github.com/BenFricker/dog-breed-cnn-classifier.git
cd dog-breed-cnn-classifier

# Install dependencies
pip install -r requirements.txt

# Download dataset (see Dataset section below)
# Update data_dir in Dog-Breed-CNN.py (line 99)

# Train the model
python Dog-Breed-CNN.py

That's it! The script will train the model, generate visualizations, and save the best model checkpoint.

✨ Features

Transfer Learning: Utilizes pre-trained ResNet-50 (ImageNet weights)
Custom Classification Head: Multi-layer Sequential classifier with dropout regularization
Data Augmentation Pipeline: Random rotation, horizontal flip, and color jittering
Model Checkpointing: Automatic saving of best performing model
Comprehensive Metrics: Classification reports, confusion matrices, training curves
Production Ready: Clean, documented code with proper error handling

🏗️ Architecture

Complete pipeline: Model Architecture → Data Pipeline → Training Strategy

Base Model

ResNet-50 (pre-trained on ImageNet)
23.5M frozen parameters for feature extraction

Custom Classifier

Linear(2048 → 512)
ReLU Activation
Dropout(p=0.3)
Linear(512 → 120)
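As a quick sanity check on the parameter figures, the sketch below counts the weights and biases of the two Linear layers listed above. The helper `linear_params` is a hypothetical name used only for illustration and is not part of Dog-Breed-CNN.py. ReLU and Dropout add no parameters.

```python
def linear_params(in_features: int, out_features: int) -> int:
    """Parameter count of a fully connected layer: weights plus biases."""
    return in_features * out_features + out_features


# Layer sizes taken from the classifier description above.
head_layers = [(2048, 512), (512, 120)]
trainable = sum(linear_params(i, o) for i, o in head_layers)

print(f"Trainable head parameters: {trainable:,}")  # 1,110,648, about 1.1M
```

The result lines up with the ~1.1M trainable parameters quoted in the overview.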

Training Configuration

Optimizer: Adam (lr=0.001)
Loss Function: CrossEntropyLoss
Scheduler: StepLR (step_size=5, gamma=0.1)
Batch Size: 32
Epochs: 10
Data Split: 70% Train / 15% Validation / 15% Test
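To see what the scheduler settings mean in practice, here is a minimal sketch of a step decay schedule. It assumes the scheduler is stepped once at the end of every epoch, which is the usual PyTorch pattern but is not confirmed here. The function `step_lr` is a hypothetical stand-in, not the PyTorch class itself.

```python
def step_lr(base_lr: float, epoch: int, step_size: int = 5, gamma: float = 0.1) -> float:
    """Learning rate used during a 1-indexed epoch under step decay."""
    completed_epochs = epoch - 1
    return base_lr * gamma ** (completed_epochs // step_size)


# Settings from the training configuration above: lr=0.001, 10 epochs.
schedule = {epoch: step_lr(0.001, epoch) for epoch in range(1, 11)}
for epoch, lr in schedule.items():
    print(f"epoch {epoch:2d}: lr = {lr:.4f}")
```

Under that assumption, epochs 1-5 train at 0.001 and epochs 6-10 at 0.0001, so the first epoch at the lower rate is epoch 6.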

📊 Results

Performance Metrics

Metric	Value
Test Accuracy	83.16%
Best Validation Accuracy	83.67% (Epoch 9)
Macro-Precision	83.1%
Macro-Recall	82.8%
Macro-F1 Score	82.5%
Training Time	15 minutes 23 seconds
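For readers unfamiliar with macro averaging, the sketch below computes macro precision, recall, and F1 as the unweighted mean of per-class scores. The labels are a toy example, not data from this project, and the function names are hypothetical.

```python
from typing import List, Tuple


def per_class_prf(y_true: List[int], y_pred: List[int], cls: int) -> Tuple[float, float, float]:
    """Precision, recall and F1 for one class, treating it as the positive label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == cls and p == cls)
    fp = sum(1 for t, p in pairs if t != cls and p == cls)
    fn = sum(1 for t, p in pairs if t == cls and p != cls)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


def macro_scores(y_true: List[int], y_pred: List[int]) -> Tuple[float, float, float]:
    """Unweighted mean of per-class precision, recall and F1."""
    classes = sorted(set(y_true) | set(y_pred))
    rows = [per_class_prf(y_true, y_pred, c) for c in classes]
    return tuple(sum(row[i] for row in rows) / len(rows) for i in range(3))


# Toy labels for three classes (illustrative only).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
macro_p, macro_r, macro_f1 = macro_scores(y_true, y_pred)
print(f"precision={macro_p:.3f} recall={macro_r:.3f} f1={macro_f1:.3f}")
```

Because macro F1 averages per-class F1 values rather than combining the macro precision and recall, it can come out lower than both, as it does in the table above.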

Training Progression

Key Observations:

Rapid initial learning: 47.82% → 75.74% validation accuracy in epoch 1
Learning rate drop impact: Accuracy jumped from 80.62% → 83.32% at epoch 6
Best model: Epoch 9 with 83.67% validation accuracy
Steady convergence: Final epochs show minimal fluctuation (±0.5%)

Detailed Performance Analysis

Top Performing Breeds (F1 > 0.97):

Afghan Hound, Keeshond, and Saint Bernard achieved perfect classification (F1 = 1.000)
These breeds are visually distinctive: unique coat patterns, clear size differences, and characteristic body shapes

Challenging Breeds (F1 < 0.60):

Poodle varieties (Miniature, Toy) and similar fluffy white breeds
Difficulty due to: inter-breed similarity, morphological overlap, size similarities

The model shows minimal overfitting: test accuracy (83.16%) closely tracks the best validation accuracy (83.67%). Per-class performance is uneven, though, ranging from F1 = 1.000 on the most distinctive breeds to below 0.60 on the hardest ones.

🚀 Installation

Prerequisites

Python 3.8+
CUDA-capable GPU (recommended)

Setup

# Clone the repository
git clone https://github.com/BenFricker/dog-breed-cnn-classifier.git
cd dog-breed-cnn-classifier

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

💻 Usage

Step 1: Download the Dataset

⚠️ Important: The dataset is NOT included in this repository due to its size (~750 MB, 20,580 images).

Option A: Download from Kaggle (Recommended - Easier)

Create a free account at Kaggle.com
Go to Stanford Dogs Dataset
Click "Download" button
Extract the downloaded ZIP file to a location on your computer

Option B: Download from Stanford (Original Source)

Visit: http://vision.stanford.edu/aditya86/ImageNetDogs/
Download: Images.tar
Extract to your preferred location

Expected folder structure after extraction:

your-chosen-location/
└── Images/
    ├── n02085620-Chihuahua/
    ├── n02085782-Japanese_spaniel/
    ├── n02085936-Maltese_dog/
    └── ... (117 more breed folders)
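Each folder name combines a WordNet ID with the breed name. If you want readable labels, something like the following would work. This is a sketch, and `breed_from_folder` is a hypothetical helper rather than a function from the project script.

```python
def breed_from_folder(folder_name: str) -> str:
    """Turn 'n02085782-Japanese_spaniel' into 'Japanese spaniel'."""
    # Split only at the first hyphen so hyphenated breed names stay intact.
    _wnid, sep, breed = folder_name.partition("-")
    if not sep:
        # No WordNet prefix found: fall back to the whole folder name.
        breed = folder_name
    return breed.replace("_", " ")


for name in ["n02085620-Chihuahua", "n02085782-Japanese_spaniel", "n02085936-Maltese_dog"]:
    print(breed_from_folder(name))
```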

Step 2: Configure the Data Path

Open Dog-Breed-CNN.py in any text editor and find line 99:

# BEFORE (line 99):
data_dir = r'C:\Users\benwf\OneDrive\Desktop\UOW\UOW\Foundations of Artificial Intelligence\Assessments\Group Project\images'

# AFTER (update to YOUR path):
data_dir = r'C:/Users/YourName/Downloads/Images'  # Windows
# OR
data_dir = '/Users/YourName/Downloads/Images'      # Mac/Linux

💡 Tip: On Windows, use forward slashes (/) or a raw string (r'...') so that backslashes in the path are not treated as escape sequences.
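Before starting a training run, it can help to confirm that the path really points at the Images folder. The check below is a minimal sketch using only the standard library. `count_breed_folders` is a hypothetical helper and is not part of Dog-Breed-CNN.py.

```python
from pathlib import Path


def count_breed_folders(data_dir: str) -> int:
    """Return the number of sub-folders directly inside data_dir."""
    root = Path(data_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"data_dir does not exist or is not a folder: {root}")
    # Each breed lives in its own sub-folder; loose files are ignored.
    return sum(1 for child in root.iterdir() if child.is_dir())


# Example usage (uncomment and point at your own path):
# print(count_breed_folders('/Users/YourName/Downloads/Images'))
```

With the full dataset extracted, the count should be 120. A smaller number usually means the path points one level too high or too low.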

Step 3: Run Training

python Dog-Breed-CNN.py

What happens during training:

✅ Loads and preprocesses 20,580 images
✅ Splits data (70% train, 15% validation, 15% test)
✅ Trains for 10 epochs (~70 seconds per epoch on GPU)
✅ Saves best model as best_dog_breed_model.pth
✅ Generates training curves and confusion matrices

Step 4: View Results

After training completes, you'll find these files in your directory:

File	Description
`best_dog_breed_model.pth`	Trained model checkpoint (best validation accuracy)
`training_curves.png`	Loss and accuracy plots over epochs
`confusion_matrix_part_1.png`	Confusion matrix (classes 0-29)
`confusion_matrix_part_2.png`	Confusion matrix (classes 30-59)
`confusion_matrix_part_3.png`	Confusion matrix (classes 60-89)
`confusion_matrix_part_4.png`	Confusion matrix (classes 90-119)
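The four confusion matrix images each cover 30 of the 120 classes. The sketch below shows one way such a split can be computed, assuming each image shows the square block of the full matrix for its own classes. That assumption is not confirmed by the script, and both function names are hypothetical.

```python
from typing import List, Sequence


def class_chunks(num_classes: int = 120, chunk_size: int = 30) -> List[range]:
    """Split class indices 0..num_classes-1 into consecutive chunks."""
    return [
        range(start, min(start + chunk_size, num_classes))
        for start in range(0, num_classes, chunk_size)
    ]


def sub_matrix(matrix: Sequence[Sequence[int]], chunk: range) -> List[List[int]]:
    """Rows and columns of a square confusion matrix restricted to one chunk."""
    return [[matrix[r][c] for c in chunk] for r in chunk]


chunks = class_chunks()
for part, chunk in enumerate(chunks, start=1):
    print(f"part {part}: classes {chunk[0]}-{chunk[-1]}")
```

One consequence of a block view like this is that confusions between classes in different parts do not appear in any of the four images.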

Console output will show:

Training/validation accuracy and loss per epoch
Final test accuracy (~83%)
Detailed classification report (precision, recall, F1-score per breed)

Troubleshooting

Problem: FileNotFoundError: [Errno 2] No such file or directory

Solution: Check that data_dir path is correct and points to the Images folder

Problem: CUDA out of memory

Solution: Reduce batch_size from 32 to 16 (line 163 in the code)

Problem: Training is very slow

Solution: Ensure you have a CUDA-capable GPU. CPU training will take 10-20x longer.

Problem: ModuleNotFoundError: No module named 'torch'

Solution: Run pip install -r requirements.txt

📁 Dataset

Stanford Dogs Dataset

Total Images: 20,580
Classes: 120 dog breeds
Split:
- Training: 14,405 images (70%)
- Validation: 3,087 images (15%)
- Testing: 3,088 images (15%)
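The split sizes can be approximated with the sketch below, which truncates the train and validation sizes and gives the remainder to the test set. This is an assumption about how the counts were derived, not a copy of the project code, and `split_sizes` is a hypothetical helper.

```python
from typing import Tuple


def split_sizes(total: int, train_frac: float = 0.70, val_frac: float = 0.15) -> Tuple[int, int, int]:
    """Train/validation/test sizes; the test set takes whatever is left over."""
    train = int(train_frac * total)
    val = int(val_frac * total)
    test = total - train - val
    return train, val, test


train_n, val_n, test_n = split_sizes(20_580)
print(train_n, val_n, test_n)
```

Giving the remainder to the last split guarantees the three sizes always add up to the full dataset, even when the fractions do not divide it evenly.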

Data Augmentation

Resize: 224×224
Random Horizontal Flip
Random Rotation: ±15°
Color Jitter: ±20% brightness/contrast
ImageNet Normalization
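The last step, ImageNet normalization, standardizes each color channel. The sketch below applies it to a single pixel using the per-channel mean and standard deviation conventionally used with torchvision's ImageNet-trained models. The exact values are not listed in this README, so treat them as an assumption. `normalize_pixel` is a hypothetical helper.

```python
from typing import Sequence, Tuple

# Conventional ImageNet statistics (assumed, see the note above).
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)


def normalize_pixel(rgb: Sequence[int]) -> Tuple[float, ...]:
    """Scale an 8-bit RGB pixel to [0, 1], then standardize each channel."""
    return tuple(
        (value / 255.0 - mean) / std
        for value, mean, std in zip(rgb, IMAGENET_MEAN, IMAGENET_STD)
    )


print(normalize_pixel((255, 255, 255)))
print(normalize_pixel((0, 0, 0)))
```

Matching the statistics used during pre-training matters because the frozen ResNet-50 layers expect inputs on that scale.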

🎓 Training Details

Key Design Decisions

Parameter Freezing: Reduces training time by 60% while maintaining transfer learning benefits
Dropout (0.3): Prevents overfitting without sacrificing learning capacity
Learning Rate Scheduling: Enables fine-grained convergence in later epochs
Model Checkpointing: Saves best model based on validation accuracy
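The checkpointing rule in the last point can be sketched as follows. `BestTracker` is a hypothetical class written for illustration, and the accuracies are made-up values, not this project's training log.

```python
from typing import Optional


class BestTracker:
    """Remember the best validation accuracy and the epoch that produced it."""

    def __init__(self) -> None:
        self.best_acc = float("-inf")
        self.best_epoch: Optional[int] = None

    def update(self, epoch: int, val_acc: float) -> bool:
        """Return True when val_acc beats the best so far (time to save)."""
        if val_acc > self.best_acc:
            self.best_acc = val_acc
            self.best_epoch = epoch
            return True
        return False


# Illustrative accuracies only.
tracker = BestTracker()
saved_epochs = [
    epoch
    for epoch, acc in enumerate([0.70, 0.78, 0.77, 0.81, 0.80], start=1)
    if tracker.update(epoch, acc)
]
print(saved_epochs, tracker.best_epoch, tracker.best_acc)
```

In a PyTorch training loop, the branch where `update` returns True is where a call such as `torch.save(model.state_dict(), 'best_dog_breed_model.pth')` would typically go.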

Performance Observations

Rapid initial learning due to pre-trained features
Learning rate drop at epoch 6 provided significant boost (80.62% → 83.32%)
Minimal overfitting - test accuracy closely matches validation
Strong performance on most of the 120 classes, with the weakest results on visually similar breeds such as Miniature and Toy Poodles

🔮 Future Improvements

Architecture Enhancements

Ensemble models (Expected: +2-3%)
Fine-tuning deeper layers (Expected: +2-3%)
Attention mechanisms (Expected: +3-5%)

Training Enhancements

Increase epochs to 20-30
Class-weighted loss for imbalanced breeds
Advanced augmentation (Mixup, CutMix)
Focal loss for hard examples

Target Accuracy: 86-90%

🎯 Real-World Applications

Veterinary Services: Automated breed identification for health screening
Pet Adoption Platforms: Intelligent breed tagging and matching
Lost Pet Recovery: Identification systems for reunification
Mobile Applications: Consumer-facing breed recognition apps

📖 Project Presentation

A comprehensive presentation covering the project's methodology, results, and analysis is included in this repository:

📄 View Presentation (PDF)

The presentation includes:

Literature review of current approaches
Detailed architecture and training strategy
Complete results analysis with visualizations
Real-world applications and future directions

📝 Project Context

This project was developed as part of a Foundations of Artificial Intelligence course. Despite being assigned as a group project, I completed all aspects independently:

✅ Data acquisition and preprocessing
✅ Model architecture design and implementation
✅ Training pipeline and optimization
✅ Comprehensive evaluation and visualization
✅ 15-minute technical presentation
✅ Complete documentation

Academic Note: I received 93%+ for this project. I was the only contributing member out of a 6-person team. Non-contributing members received 0% as documented by the course coordinator.

📚 Technical Skills Demonstrated

Deep Learning: CNN architectures, transfer learning, fine-tuning
PyTorch: Model implementation, training loops, data pipelines
Computer Vision: Image classification, data augmentation
Data Science: Performance metrics, visualization, statistical analysis
Software Engineering: Clean code, documentation, version control
Communication: Technical presentation and reporting

🤝 Contributing

While this is an academic project, I welcome feedback and suggestions! Feel free to:

Open an issue for bugs or questions
Suggest improvements or optimizations
Share your results if you use this code

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

👤 Contact

Benjamin Fricker

GitHub: @BenFricker
LinkedIn: Connect with me on LinkedIn

Current Focus: Double Major in Artificial Intelligence & Cybersecurity (Computer Science)

💼 Open to opportunities in AI/ML Engineering, Computer Vision, and Cybersecurity roles.

🙏 Acknowledgments

Stanford Dogs Dataset creators
PyTorch and torchvision teams
ResNet architecture authors (He et al., 2015)

⭐ If you find this project useful or interesting, please consider giving it a star!

Last Updated: October 2025
