Image_Classification_Survey

Datasets Used

  1. CIFAR-10
  2. STL-10
  3. Chest X-ray
  4. ImageNet-Mini
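
CIFAR-10 and STL-10 ship with torchvision, while the Chest X-ray and ImageNet-Mini sets are usually distributed as image folders. The snippet below is a hypothetical loading sketch, not code from this repository; paths and transforms are illustrative assumptions.

```python
# Hypothetical loading sketch (not the repository's code) for the four datasets above.
import torchvision
import torchvision.transforms as T

transform = T.Compose([T.Resize((224, 224)), T.ToTensor()])

cifar10 = torchvision.datasets.CIFAR10(root="./data", train=True,
                                       download=True, transform=transform)
stl10 = torchvision.datasets.STL10(root="./data", split="train",
                                   download=True, transform=transform)

# Chest X-ray and ImageNet-Mini are assumed to be folders of images,
# so ImageFolder is used here; the directory names are placeholders.
chest_xray = torchvision.datasets.ImageFolder("./data/chest_xray/train",
                                              transform=transform)
imagenet_mini = torchvision.datasets.ImageFolder("./data/imagenet-mini/train",
                                                 transform=transform)
```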

Models trained

  1. DenseNet
  2. DLA
  3. DPN
  4. EfficientNet
  5. GoogLeNet
  6. PreAct ResNet
  7. ResNeXt

Early stopping criteria:

The following class was used to stop training of the models:
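
The class itself is not reproduced in this extract. Below is a minimal sketch of a typical patience-based early-stopping helper of the kind described; the names `EarlyStopping`, `patience`, and `min_delta` are illustrative assumptions, not taken from the repository.

```python
# Illustrative sketch of a patience-based early-stopping helper (not the
# repository's exact class): training stops once the monitored validation
# metric fails to improve by at least `min_delta` for `patience` epochs.
class EarlyStopping:
    def __init__(self, patience=5, min_delta=0.0):
        self.patience = patience
        self.min_delta = min_delta
        self.best = None
        self.counter = 0
        self.should_stop = False

    def step(self, val_accuracy):
        if self.best is None or val_accuracy > self.best + self.min_delta:
            self.best = val_accuracy      # improvement: reset the counter
            self.counter = 0
        else:
            self.counter += 1             # no improvement this epoch
            if self.counter >= self.patience:
                self.should_stop = True
        return self.should_stop


# Usage inside a training loop (hypothetical):
# stopper = EarlyStopping(patience=5)
# for epoch in range(max_epochs):
#     val_acc = validate(model, val_loader)   # placeholder validation function
#     if stopper.step(val_acc):
#         break
```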

Comparison (Validation Accuracy Percentage)

| Model         | CIFAR-10 | STL-10 | Chest X-ray | ImageNet-Mini |
| ------------- | -------- | ------ | ----------- | ------------- |
| DenseNet      | 86.56    | 63.6   | 99.65       | 44.63         |
| DLA           | 78.99    | 62.3   | 99.3        | 43.69         |
| DPN           | 84.41    | 58.6   | 99.65       | 48.17         |
| EfficientNet  | 85.11    | 57.6   | 99.99       | 44.08         |
| GoogLeNet     | 83.55    | 58.8   | 99.99       | 33.94         |
| PreAct ResNet | 83.97    | 59.78  | 99.65       | 42.72         |
| ResNeXt       | 81.87    | 61.12  | 98.94       | 46.28         |

Why DenseNet is the SOTA model

As the table above shows, DenseNet is the highest-scoring model on CIFAR-10 and STL-10 while also performing very well on the Chest X-ray dataset. Only on ImageNet-Mini does DPN perform considerably better than DenseNet, which we examine in more detail below.

a) DenseNet is a CNN-based architecture like many of the other models, but the main reason it performs so well compared to them is its defining feature, the dense block. To explain what that is, consider the following examples:

  1. A standard ConvNet:

It consists of stacked convolutional layers that extract increasingly high-level features from the images in order to classify them.

  2. A ResNet architecture:

In ResNet models, identity mappings (skip connections) are used to promote gradient propagation: the output of a block is combined with its input by element-wise addition. The network can be viewed as passing a state from one ResNet module to the next.

  3. And finally, the DenseNet architecture:

In DenseNet, each layer receives additional inputs from all preceding layers and passes its own feature maps on to all subsequent layers; the feature maps are combined by concatenation rather than addition, so each layer receives the "collective knowledge" of all preceding layers.

Since each layer receives feature maps from all preceding layers, the network can be thinner and more compact, i.e., the number of channels per layer can be smaller. This brings several advantages, such as faster training and less overfitting, which in turn lead to better validation accuracy.
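
As a rough illustration of the difference, here is a minimal PyTorch sketch (not code from this repository) contrasting the element-wise addition used in a ResNet block with the concatenation used in a dense block; layer sizes and the growth rate are arbitrary.

```python
# Minimal sketch: ResNet combines features by element-wise addition,
# DenseNet concatenates the outputs of all preceding layers.
import torch
import torch.nn as nn


class ResidualBlock(nn.Module):
    """ResNet-style block: output = relu(x + F(x)) (element-wise addition)."""

    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
        )

    def forward(self, x):
        return torch.relu(x + self.conv(x))  # addition: channel count stays the same


class DenseBlock(nn.Module):
    """DenseNet-style block: each layer sees the concatenation of all previous outputs."""

    def __init__(self, in_channels, growth_rate, num_layers):
        super().__init__()
        self.layers = nn.ModuleList()
        for i in range(num_layers):
            self.layers.append(nn.Sequential(
                nn.BatchNorm2d(in_channels + i * growth_rate),
                nn.ReLU(inplace=True),
                nn.Conv2d(in_channels + i * growth_rate, growth_rate,
                          kernel_size=3, padding=1),
            ))

    def forward(self, x):
        features = [x]
        for layer in self.layers:
            out = layer(torch.cat(features, dim=1))  # concatenation: "collective knowledge"
            features.append(out)
        return torch.cat(features, dim=1)


x = torch.randn(1, 16, 32, 32)
print(ResidualBlock(16)(x).shape)                             # torch.Size([1, 16, 32, 32])
print(DenseBlock(16, growth_rate=12, num_layers=4)(x).shape)  # torch.Size([1, 64, 32, 32])
```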

b) The reason DenseNet outperforms the Inception-style model (GoogLeNet) and DPN on most of these datasets is that both of those models have a very high number of parameters, which often leads to inefficient training and overfitting. The same property, however, is also why DPN outperforms DenseNet on ImageNet-Mini: although DPN's dual-path design has more branches and slower inference than DenseNet, on a highly complex dataset like ImageNet this higher-capacity architecture is able to learn more than a leaner model such as DenseNet.

For these reasons, the DenseNet paper won the Best Paper Award at CVPR 2017.

After 2017, however, things changed. That year the paper "Attention Is All You Need" was published, introducing the Transformer architecture, and since then transformer-based models have achieved the highest scores on almost all tasks and datasets. Discussing transformers, though, is out of scope for this report.
