Week 3 Programming assignment section 7

ldz · May 21, 2024, 6:03pm

I submitted the programming assignment for week 3 and the autograder gave me 100%, so I decided to go back and play with the NN on other datasets, in section 7.

I noticed that my outputs are non-deterministic. For example, when I set n_h = 5 and train a model on the gaussian_quantiles dataset, the first execution of this block gives me the following costs:

Cost after iteration 0: 0.693124
Cost after iteration 1000: 0.068776
Cost after iteration 2000: 0.038353
Cost after iteration 3000: 0.031598
Cost after iteration 4000: 0.027778
Cost after iteration 5000: 0.025106
Cost after iteration 6000: 0.023074
Cost after iteration 7000: 0.021457
Cost after iteration 8000: 0.020128
Cost after iteration 9000: 0.019009

and the second execution (with no changes to the code) outputs different costs:

Cost after iteration 0: 0.693122
Cost after iteration 1000: 0.080631
Cost after iteration 2000: 0.058538
Cost after iteration 3000: 0.039226
Cost after iteration 4000: 0.031690
Cost after iteration 5000: 0.028178
Cost after iteration 6000: 0.026181
Cost after iteration 7000: 0.024730
Cost after iteration 8000: nan
Cost after iteration 9000: nan

Any idea what might be the issue?

Here is my code:

# Datasets
noisy_circles, noisy_moons, blobs, gaussian_quantiles, no_structure = load_extra_datasets()

datasets = {"noisy_circles": noisy_circles,
            "noisy_moons": noisy_moons,
            "blobs": blobs,
            "gaussian_quantiles": gaussian_quantiles}

### START CODE HERE ### (choose your dataset)
dataset = "gaussian_quantiles"
visualize_orig = 0
### END CODE HERE ###

X, Y = datasets[dataset]
X, Y = X.T, Y.reshape(1, Y.shape[0])

# make blobs binary
if dataset == "blobs":
    Y = Y%2

if visualize_orig:
    # Visualize the data
    plt.scatter(X[0, :], X[1, :], c=Y, s=40, cmap=plt.cm.Spectral);
else:  
    # set the hidden layer size (number of hidden units)
    hidden_nodes = 5
    
    # Build a model with a n_h-dimensional hidden layer
    parameters = nn_model(X, Y, hidden_nodes, num_iterations = 10000, print_cost=True)

    # Plot the decision boundary
    plot_decision_boundary(lambda x: predict(parameters, x.T), X, Y)
    plt.title("Decision Boundary for hidden layer size " + str(hidden_nodes))

Kic · May 21, 2024, 7:54pm

Hi @ldz ,

Adding a print statement right after the cost print statement in nn_model(), to show the values of A2, might give some insight.
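One possible reason to look at A2 is sketched below, as an assumption rather than a confirmed diagnosis of this run. If the cost is the usual binary cross-entropy and the sigmoid output saturates to exactly 1.0 in floating point, the cost contains 0 * log(0), which evaluates to nan. The names `sigmoid` and `cross_entropy_cost` are illustrative helpers, not the notebook's own functions.

```python
import numpy as np

def sigmoid(z):
    # Standard logistic function.
    return 1.0 / (1.0 + np.exp(-z))

def cross_entropy_cost(A2, Y):
    # Binary cross-entropy averaged over m examples (assumed form of the cost).
    m = Y.shape[1]
    with np.errstate(divide="ignore", invalid="ignore"):
        logprobs = Y * np.log(A2) + (1 - Y) * np.log(1 - A2)
    return float(-np.sum(logprobs) / m)

Y = np.array([[1.0, 0.0, 1.0]])

# Moderate activations: the cost is finite.
A2_ok = np.array([[0.9, 0.1, 0.8]])
print("finite cost:", cross_entropy_cost(A2_ok, Y))

# A large pre-activation rounds to exactly 1.0 in float64.
A2_saturated = np.array([[sigmoid(40.0), 0.1, 0.8]])
print("A2 min/max:", A2_saturated.min(), A2_saturated.max())

# With Y = 1 and A2 = 1.0, the term (1 - Y) * log(1 - A2) is 0 * log(0) = nan.
print("saturated cost:", cross_entropy_cost(A2_saturated, Y))
```

If the printed A2 values in your run reach exactly 0 or 1 around the iteration where the cost turns into nan, this is a likely explanation. If they do not, the cause is something else.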

TMosh · May 21, 2024, 8:12pm

When you ran the notebook the second time, did you restart the kernel first?

If you just run the training portion of the notebook a second time, the initial weights of the NN layers may not be re-initialized. In that case, you’d just be fine-tuning the solution you got the first time, rather than starting over.
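As a minimal sketch of how the starting point can be made repeatable without restarting the kernel: the hypothetical helper `init_weights` below draws the initial weights from a generator with an explicit seed. This is not the notebook's own initialization code (which is not shown in this thread), and the 0.01 scale factor is an assumption.

```python
import numpy as np

def init_weights(n_h, n_x, seed=None):
    # Hypothetical helper: small random weights for a hidden layer with
    # n_h units and n_x inputs. A fixed seed gives the same weights each call.
    rng = np.random.default_rng(seed)
    return rng.standard_normal((n_h, n_x)) * 0.01

seeded_first = init_weights(5, 2, seed=2)
seeded_second = init_weights(5, 2, seed=2)
other_seed = init_weights(5, 2, seed=3)

# Same seed: identical starting weights, so training starts from the same point.
print(np.array_equal(seeded_first, seeded_second))

# Different seed: a different starting point.
print(np.array_equal(seeded_first, other_seed))
```

With the starting weights pinned this way, two runs of the same training loop on the same data begin from the same point, which makes it easier to tell whether a difference comes from leftover state or from the code itself.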

Also:
Welcome to the wonderful world of cost functions that have local minima.

The NN cost function isn’t convex, so you could get different results each time you train the model.
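To make the local-minima point concrete, here is a toy illustration. It is a one-dimensional non-convex function chosen only for this example, not the NN cost from the assignment. Gradient descent with identical settings ends in a different minimum, with a different final cost, depending only on where it starts.

```python
def cost(w):
    # Toy non-convex cost with two local minima (illustrative only).
    return w**4 - 3 * w**2 + w

def grad(w):
    # Derivative of the toy cost.
    return 4 * w**3 - 6 * w + 1

def gradient_descent(w0, learning_rate=0.01, num_iterations=2000):
    # Plain gradient descent from the starting point w0.
    w = w0
    for _ in range(num_iterations):
        w -= learning_rate * grad(w)
    return w

w_left = gradient_descent(-2.0)   # starts left of the hump
w_right = gradient_descent(2.0)   # starts right of the hump

print("from -2.0: w =", round(w_left, 4), "cost =", round(cost(w_left), 4))
print("from +2.0: w =", round(w_right, 4), "cost =", round(cost(w_right), 4))
```

The same thing can happen with the NN: a different random initialization can lead gradient descent to a different solution and a different final cost.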

ldz · May 22, 2024, 8:37pm

Very clear, thank you! I had not restarted the kernel. Once I did, the results became much more deterministic. Thanks!
