Untitled Document

CMSC 373 Artificial Intelligence
Fall 2023

Assignment 4: Hands-on with ConvNets
(Due by start of class on Monday, November 21, 2023)

In this assignment, you will build a ConvNet model to train it to classify handwritten digits. The dataset we will use is the MNIST Digits Dataset. That is available in Keras.
Make sure you have a Google account and are able to navigate to the Google Colab page: https://colab.research.google.com

Part 1: Building and Training the Model

Go to the link: https://colab.research.google.com/drive/1vrpQMN7t59bbDHzlnTATeIPg-VFyEVfI?usp=sharing

First, create a copy in your own Drive by going to the File menu and selecting: Save a copy in drive.
Work through the entire workbook. Try some things of your own (this is why you saved a copy above).
Once done, try the following at the end:

Run the model as defined:

Input Layer: 28x28 image, 1 channel
A Convolution Layer with 32 filters (activation maps), kernel/patch size of 3x3, stride 1, and Relu activations
Maxpool Layer of 2x2 window, stride 2 (reduces previous layer to 13x13)
A Convolution Layer with 64 filters (activation maps), kernel/patch size of 3x3, stride 1, and Relu activations
Maxpool Layer of 2x2 window, stride 2 (reduces previous layer to 5x5)
A Convolution Layer with 128 filters (activation maps), kernel/patch size of 3x3, stride 1, and Relu activations
A Flattened layer (3x3x128=1152 units)
Output Layer of 10 units with Softmax activation

Answer the following questions (provide and use the data obtained by you to answer these):

1. How well does the model fit (accuracy).
2. How does the accuracy you obtained copare with the accuracy you obtained with the backpropagartion network in Assignment 3? Was it more/less/same accurate?
3. How well does the network generalize?

Part 2: Try another model - The CHIFAR 10 Dataset

Go to link: https://colab.research.google.com/drive/1eBZn_xLHuNKG3IjNm7imYja6jBdxUmf0?usp=sharing

The CIFAR-10 and CIFAR-100 are labeled subsets of the 80 million tiny images dataset. They were collected by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton. The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test images.

The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5000 images from each class.

Here are the classes in the dataset, as well as 10 random images from each (you can see some images at the above link):

airplane
automobile
bird
cat
deer
dog
frog
horse
ship
truck

The classes are completely mutually exclusive. There is no overlap between automobiles and trucks. "Automobile" includes sedans, SUVs, things of that sort. "Truck" includes only big trucks. Neither includes pickup trucks.

1. Run the model as defined.

Input: each input image is 32x32 pixels with RGB channels. So the shape of input is 32x32x3.
Convolution Layer1: Has 32 filters with kernel size 3x3 (Relu)
Maxpool Layer 1: pool size 2x2
Convolution Layer 2: Has 64 filters with kernel size 3x3 (Relu)
Maxpool Layer 2: pool size 2x2
Flatten to 1024 units
Dense Layer with 64 units (Relu)
Output Layer with 10 units (sofmax)

Answer the following questions (provide and use the data obtained by you to answer these):

1. How well does the model fit (accuracy).
2. How well does the network generalize? Use test and validation accuracy plots to answer this.
3. Would it help if we trained the model for 20 epochs? Again, use test and validation accuracy plots to answer this.
4. What happens if we you decrease the number of units in Dense layer from 64 to 8 units? Or, increase to 128 units?

Finally, if you have time and inclination, can you do better by using a different network architecture?

What to Hand In
Hand in a report containing the following:

Your observations on 1-3 of Part 1, and 1-4 on Part 2, and if you tried to improve the model architecture. Write a an analysis and reflection based on your results (1-2 pages)
An overall reflection on the exercise.

Hand in a PDF by e-mail (named: LastFirstA4.pdf) by the due date.

Back to CMSC 373 Page.