The model trained for two epochs reports a loss of 0.48 and an accuracy of 0.86. I thought loss was 1 - accuracy. How can the loss and the accuracy both be fairly large?
In the expected output section it talks about training the model on CPU and then training on GPU. Is the code running off my machine's CPU or GPU, or on Coursera's servers in the cloud? Is there some way to control whether the model runs on CPU or GPU (whether on my machine or on Coursera's servers) by specifying it in code, or does that require a much more sophisticated understanding of hardware? See below:
Based on the summary output, the model we run has ~23M parameters? That seems quite large considering how expensive compute is and the fact that we're using either our computer's CPU or GPU or Coursera's cloud resources. I guess a 23M-parameter model does require a lot of compute to train then? I'm trying to develop an intuition for how expensive compute might be based on parameter count and dataset size.
Loss is not equal to 1 - accuracy. The loss (or cost) is the value of the mathematical formula that gets minimized during training, for example the average cross-entropy between the predicted probabilities and the true labels. Accuracy is computed from a completely different formula, the fraction of examples the model classifies correctly, and is used to evaluate how well your model performs.
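As a rough illustration (a sketch with made-up numbers, not the assignment's code), here is how cross-entropy loss and accuracy can be computed from the same predictions and come out as unrelated numbers:

```python
import numpy as np

# Hypothetical predicted probabilities for 4 examples and 2 classes (made-up values).
probs = np.array([[0.7, 0.3],
                  [0.4, 0.6],
                  [0.9, 0.1],
                  [0.2, 0.8]])
labels = np.array([0, 1, 0, 0])  # true class indices

# Cross-entropy loss: average negative log-probability assigned to the true class.
loss = -np.mean(np.log(probs[np.arange(len(labels)), labels]))

# Accuracy: fraction of examples where the highest-probability class matches the label.
accuracy = np.mean(np.argmax(probs, axis=1) == labels)

print(f"loss = {loss:.3f}, accuracy = {accuracy:.2f}")  # loss = 0.646, accuracy = 0.75
```

Only the last example is misclassified, so accuracy is 0.75, while the loss is driven by how confident the predicted probabilities are, not just by whether the top prediction is right.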
If you haven't downloaded the assignment files to run them offline, then the assignment is running on Coursera's servers. I believe that most assignments run on CPUs rather than GPUs.
There are indeed some things to note when training on a GPU, and it is important to know the specs of the GPU hardware. One of the main concerns is whether the model (and a batch of data) will fit in GPU memory. In the code, you need to "put" both your model and input data onto the GPU, and the way to do this differs between frameworks.
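For example, in PyTorch (assuming that framework; the model and tensor below are placeholders, not the assignment's), moving things to the GPU looks roughly like this:

```python
import torch
import torch.nn as nn

# Use the GPU if one is available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# A small placeholder model, not the assignment's architecture.
model = nn.Linear(128, 10).to(device)   # moves the weights onto the device

# Input batches must live on the same device as the model.
batch = torch.randn(32, 128).to(device)
output = model(batch)

print(output.device)  # cuda:0 if a GPU was found, otherwise cpu
```

TensorFlow/Keras handles device placement largely automatically, which is part of why the "how" differs between implementations.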
Running inference on 23M parameters shouldn't take that many resources. Training is what can get expensive, so for the assignments we're usually just fine-tuning a pre-trained model (fine-tuning requires significantly fewer resources than training from scratch).
Many of the LLMs have billions upon billions of parameters, and those may be harder to run locally on your machine, but 23M isn't really that high and should be fine. Most modern computers can easily perform billions of arithmetic operations per second.
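For a back-of-envelope sense of scale (rough assumptions, not numbers measured from the assignment):

```python
# Rough memory estimate for storing the weights alone, assuming float32 parameters.
num_params = 23_000_000
bytes_per_param = 4  # float32

weight_memory_mb = num_params * bytes_per_param / 1e6
print(f"~{weight_memory_mb:.0f} MB just for the weights")  # ~92 MB

# Training needs additional memory for gradients, optimizer state (e.g. Adam keeps
# two extra buffers per parameter), and activations, so a small multiple of the
# weight memory is a common rule of thumb.
```

So a 23M-parameter model comfortably fits in ordinary RAM or a modest GPU, which is part of why fine-tuning it in the assignment environment is feasible.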