#76 Machine Learning & Data Science Challenge 76

What is VGG16? Explain the architecture of VGG16.

VGG-16 is a relatively simple architecture because it relies on a small, uniform set of hyperparameters.

  • It always uses 3 x 3 convolution filters with a stride of 1 and "same" padding, together with 2 x 2 max-pooling layers with a stride of 2.

This architecture comes from the Visual Geometry Group (VGG) at Oxford. It improves on AlexNet by replacing large kernel-sized filters with multiple 3 x 3 kernels stacked one after another. For a given receptive field (the effective area of the input image that an output depends on), a stack of smaller kernels is better than a single larger kernel: the extra non-linear layers increase the depth of the network, which enables it to learn more complex features, and at a lower parameter cost, as the sketch below illustrates.
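A minimal sketch of that trade-off, assuming PyTorch and a hypothetical channel count of 256: three stacked 3 x 3 convolutions cover the same 7 x 7 receptive field as one large kernel, but with fewer weights and two extra non-linearities.

```python
import torch.nn as nn

channels = 256  # hypothetical channel count, for illustration only

# Single 7x7 convolution: 7*7*C*C weights.
large = nn.Conv2d(channels, channels, kernel_size=7, padding=3, bias=False)

# Three stacked 3x3 convolutions with ReLUs in between:
# 3 * (3*3*C*C) weights for the same 7x7 receptive field.
stacked = nn.Sequential(
    nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False), nn.ReLU(inplace=True),
    nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False), nn.ReLU(inplace=True),
    nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False), nn.ReLU(inplace=True),
)

count = lambda m: sum(p.numel() for p in m.parameters())
print(count(large))    # 7*7*256*256   = 3,211,264
print(count(stacked))  # 3*(3*3*256*256) = 1,769,472  (~45% fewer parameters)
```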

  • Three fully connected layers follow the VGG convolutional layers.

  • The width of the network starts at 64 channels and doubles after every sub-sampling/pooling layer, up to 512. VGG-16 achieves a top-5 accuracy of 92.3% on ImageNet. The full layer layout is sketched below.
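A minimal sketch of the VGG-16 layer layout, assuming PyTorch; the `VGG16_CFG` list and `make_vgg16` helper below are illustrative names, not the torchvision implementation.

```python
import torch
import torch.nn as nn

# Numbers are output channels of 3x3 convolutions; 'M' marks a 2x2 max-pool with stride 2.
# Width starts at 64 and doubles after each pooling stage, capping at 512.
# 13 convolutional layers + 3 fully connected layers = 16 weight layers.
VGG16_CFG = [64, 64, 'M', 128, 128, 'M', 256, 256, 256, 'M',
             512, 512, 512, 'M', 512, 512, 512, 'M']

def make_vgg16(num_classes: int = 1000) -> nn.Module:
    layers, in_ch = [], 3
    for v in VGG16_CFG:
        if v == 'M':
            layers.append(nn.MaxPool2d(kernel_size=2, stride=2))
        else:
            # 3x3 convolution, stride 1, "same" padding, followed by ReLU.
            layers += [nn.Conv2d(in_ch, v, kernel_size=3, padding=1), nn.ReLU(inplace=True)]
            in_ch = v
    features = nn.Sequential(*layers)

    # Three fully connected layers follow the convolutional stack.
    classifier = nn.Sequential(
        nn.Flatten(),
        nn.Linear(512 * 7 * 7, 4096), nn.ReLU(inplace=True), nn.Dropout(),
        nn.Linear(4096, 4096), nn.ReLU(inplace=True), nn.Dropout(),
        nn.Linear(4096, num_classes),
    )
    return nn.Sequential(features, classifier)

model = make_vgg16()
out = model(torch.randn(1, 3, 224, 224))  # ImageNet-sized input -> (1, 1000) logits
```

With a 224 x 224 input, the five pooling stages reduce the spatial size to 7 x 7 at 512 channels, which is why the first fully connected layer takes 512 * 7 * 7 inputs.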