W4 Assignment - Success with Minimal Architecture

Thanks for the advice. I wasn’t paying attention to the loss at all, partly because I don’t know what a good loss value might be. I’m guessing it counts as good when it gets so small that it’s displayed in scientific notation (e.g. 1.3167e-06) :stuck_out_tongue:
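
In case it helps anyone else keeping an eye on the numbers: the per-epoch loss is available from the History object that `fit()` returns. Here’s a minimal sketch; the tiny model and random data are hypothetical stand-ins for the assignment’s own:

```python
import numpy as np
import tensorflow as tf

# Hypothetical stand-in data and model, just to show where the loss numbers live.
x = np.random.rand(200, 4).astype("float32")
y = (x.sum(axis=1) > 2.0).astype("int32")

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(4,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")

# fit() returns a History object; history.history["loss"] holds the loss per epoch.
history = model.fit(x, y, epochs=5, verbose=0)
for epoch, loss in enumerate(history.history["loss"], start=1):
    print(f"epoch {epoch}: loss = {loss:.4e}")  # prints in scientific notation
```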

I briefly tried a few variations. Here are my observations (a rough sketch of the setups follows the list):

  • Letting the minimal model train for more epochs can sometimes get me to that point (outcomes vary from run to run)
  • Adding a convolution layer brings the loss down further in the same number of epochs, with similar accuracy, but it slows training considerably
  • Adding multiple convolution layers does not noticeably improve the metrics, but it slows training even more
  • If, instead of convolution layers, I add a dense layer before the output layer, the loss drops to very small values in very few epochs, and training is fast
  • If I increase the size of the dense layer, there’s a point beyond which training converges more slowly
  • If I use both a convolution layer and a dense layer, the loss sometimes falls a bit faster per epoch, but training is slower than with the dense layer alone
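
For reference, here’s roughly the kind of harness I used to compare the variants. It’s just a sketch: I’m assuming a Keras Sequential classifier on 28x28 grayscale images with 10 classes, so the shapes and layer sizes are stand-ins rather than the assignment’s exact setup:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_model(conv_layers=0, dense_units=0,
                input_shape=(28, 28, 1), num_classes=10):
    """Small classifier with an optional conv stack and an optional dense layer."""
    model = models.Sequential()
    model.add(layers.Input(shape=input_shape))
    # Each extra Conv2D block lowered my loss a little but slowed training a lot.
    for _ in range(conv_layers):
        model.add(layers.Conv2D(32, (3, 3), activation="relu"))
        model.add(layers.MaxPooling2D((2, 2)))
    model.add(layers.Flatten())
    # A dense layer before the output was the fastest way to shrink the loss.
    if dense_units:
        model.add(layers.Dense(dense_units, activation="relu"))
    model.add(layers.Dense(num_classes, activation="softmax"))
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

minimal    = build_model()                                # baseline
with_conv  = build_model(conv_layers=1)                   # slower, similar accuracy
with_dense = build_model(dense_units=128)                 # fast, loss drops quickly
with_both  = build_model(conv_layers=1, dense_units=128)  # in-between for me
```

Training each of these for the same number of epochs and comparing the loss curves is what produced the observations above.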

It feels like a bit of a balancing act, trying to trade off model size against training speed. And a bigger model doesn’t always mean faster convergence.
