In my opinion, the explanation given in C1_W1_Lab_1_hello_world_nn.ipynb for why the model prediction is slightly off is not correct.
In more complex, real-world cases, I can follow the probabilistic argument that is given. However, this toy example performs a linear regression on a noise-free data set with an inherent linear relationship. The lab states that 6 data points do not suffice to find the exact relationship, while in this case only 2 data points would actually do, since two noise-free points determine a line exactly (see the short sketch below).
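As a quick illustration, here is a minimal sketch; the underlying relationship y = 2x - 1 is my assumption about the lab's data:

# Two noise-free points from y = 2x - 1 (assumed lab relationship)
x1, y1 = 0.0, -1.0
x2, y2 = 1.0, 1.0

slope = (y2 - y1) / (x2 - x1)    # 2.0
intercept = y1 - slope * x1      # -1.0
print(slope * 10.0 + intercept)  # 19.0, the exact prediction for x = 10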
The reason why the prediction is somewhat off is that the solver has not yet fully converged. Though the problem is very simple, the neural-network solver takes its usual baby steps towards some minimum. Increasing the learning rate improves the prediction significantly. This can be done by changing the code as follows:
# Compile the model with a larger learning rate for SGD
optimizer = tf.keras.optimizers.SGD(learning_rate=0.05)
model.compile(optimizer=optimizer, loss='mean_squared_error')
I then get the following output for the prediction:
1/1 [==============================] - 0s 154ms/step
[[18.999998]]
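For completeness, here is a self-contained sketch of the whole experiment. The training data and model definition are my reconstruction of the lab (six noise-free samples of y = 2x - 1, a single dense unit), so treat those details as assumptions:

import numpy as np
import tensorflow as tf

# Assumed lab data: six noise-free samples of y = 2x - 1
xs = np.array([-1.0, 0.0, 1.0, 2.0, 3.0, 4.0]).reshape(-1, 1)
ys = np.array([-3.0, -1.0, 1.0, 3.0, 5.0, 7.0]).reshape(-1, 1)

# Single dense unit, as in the lab
model = tf.keras.Sequential([
    tf.keras.Input(shape=(1,)),
    tf.keras.layers.Dense(units=1),
])

# Larger learning rate so plain SGD converges within the lab's 500 epochs
optimizer = tf.keras.optimizers.SGD(learning_rate=0.05)
model.compile(optimizer=optimizer, loss='mean_squared_error')
model.fit(xs, ys, epochs=500, verbose=0)

print(model.predict(np.array([[10.0]]), verbose=0))  # approximately [[19.]]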
In fact, there exists an analytical solution for the optimum, as explained here as well as in Andrew Ng’s machine learning course. Applying it would yield the exact prediction of 19 (effects of finite machine precision set aside).
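A minimal numpy sketch of that closed-form (normal-equation) solution, again assuming the six lab data points from above:

import numpy as np

# Assumed lab data: six noise-free samples of y = 2x - 1
xs = np.array([-1.0, 0.0, 1.0, 2.0, 3.0, 4.0])
ys = np.array([-3.0, -1.0, 1.0, 3.0, 5.0, 7.0])

# Normal equation: theta = (X^T X)^(-1) X^T y, with a bias column of ones
X = np.column_stack([xs, np.ones_like(xs)])
theta = np.linalg.solve(X.T @ X, X.T @ ys)  # [slope, intercept] = [2., -1.]
print(theta[0] * 10.0 + theta[1])           # 19.0, the exact prediction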