Deep Learning - HW1: Image Classification

Author: Jay Liao (re6094028@gs.ncku.edu.tw)

This is an assignment project of Deep Learning, a course at Institute of Data Science, National Cheng Kung University. This project aims to extract features from images and to construct models to perform image classification.

Data

The raw data

Images: please go to https://drive.google.com/open?id=1kwYYWL67O0Dcbx3dvZIfbGg9NiHdyisr to download raw image files and put them under the folder ./images/. There are 64,225 files with 50 subfolders.
File name lists of images: ./data/train.txt, ./data/val.txt, and ./data/test.txt.

The processed data (extracted image features)

There were 3 methods of feature extraction utilized here. Since it cost much time to generate feature matrices, they have been produced in advance and placed in the folder ./data/.
1. Histogram: use the concept of the global color histogram. Given the number of ranges (bin cuts) of the histogram, count the no. of each range for 3 channels respectively. Thus 15 features would be obtained if set the no. of ranges as 5, for example. Since the input images are not in the same shape, we took the mean values.
2. Scale-invariant feature transform (SIFT): package OpenCV was utilized to take key points and descriptors with SIFT method. Since the input images are not in the same shape, we took column means as features for each image.
3. Speeded Up Robust Features (SURF): package OpenCV was utilized to take key points and descriptors with SURF method. Since the input images are not in the same shape, we took column means as features for each image.
If you want to see the process of feature extraction, just easily remove these feature matrices files from the folder.

Code

Source codes:
- utils.py: little tools
- feature_extraction.py: functions for feature extraction
- layers.py: layers for NN model construction, e.g., ReLU(), Sigmoid()
- models.py: construct the models, NaivePerceptron and TwoLayerNet
- trainers.py: class for training, predicting, and evaluating the models
- args.py: define the arguments parser
main.py: the main program with loading, training, and evaluating procedures.
exp_optimizers.py: experiment program to compare different optimizers.
exp_optimizers.py: experiment program to compare different no. of bars in the global color histogram.
experiments.ipynb: experiments results
requirements.txt: required packages

Folders

./images/ should contain raw image files (please go to download and put them with subfolders here).
./data/ contains .txt files of image lists and .npy files of extracted feature matrices.
./output/ will contain trained models, model performances, and experiments results after running.

Requirements

numpy==1.16.3
pandas==1.1.5
opencv_python==3.4.2.16
sklearn==0.24.1
xgboost==1.3.3
tqdm==4.50.0
matplotlib

Usage

Clone this repo.

git clone https://github.com/jayenliao/DL-image-classification.git

Set up the required packages.

cd DL-image-classification
pip3 install requirements.txt

Run the experiments.

python3 main.py
python3 exp_optimizers.py
python3 exp_n_Ranges.py

It may take much time to run the whole main.py. The arguments parser can be used to run several experiments only, such as:

python3 main.py --models 'TwoLayerNet' --savePATH './output/_TwoLayerNet/'
python3 main.py --models 'rf' 'xgb' --savePATH './output/_Baselines/'
python3 exp_n_Ranges.py --savePATH './output/exp_n_Ranges/' --n_Ranges_list 10 20 30 40 50 --epochs 100

You can also directly access the experiments results on experiments.ipynb.

Reference

Lowe, D. G. (1999, September). Object recognition from local scale-invariant features. In Proceedings of the seventh IEEE international conference on computer vision (Vol. 2, pp. 1150-1157). Ieee.
Bay, H., Ess, A., Tuytelaars, T., & Van Gool, L. (2008). Speeded-up robust features (SURF). Computer vision and image understanding, 110(3), 346-359.
斎藤康毅（吳嘉芳譯）（2017）。Deep Learning: 用Python進行深度學習的基礎理論實作。碁峰資訊股份有限公司。ISBN: 9789864764846。GitHub: https://github.com/oreilly-japan/deep-learning-from-scratch。
Watt, J., Borhani, R., & Katsaggelos, A. K. (2019). Machine learning refined. ISBN: 9781107123526. GitHub: https://github.com/jermwatt/machine_learning_refined.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Learning - HW1: Image Classification

Data

The raw data

The processed data (extracted image features)

Code

Folders

Requirements

Usage

Reference

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
data		data
README.md		README.md
args.py		args.py
exp_n_Ranges.py		exp_n_Ranges.py
exp_optimizers.py		exp_optimizers.py
feature_extraction.py		feature_extraction.py
layers.py		layers.py
main.py		main.py
models.py		models.py
optimizers.py		optimizers.py
requirements.txt		requirements.txt
trainers.py		trainers.py
utils.py		utils.py

Folders and files

Latest commit

History

Repository files navigation

Deep Learning - HW1: Image Classification

Data

The raw data

The processed data (extracted image features)

Code

Folders

Requirements

Usage

Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages