Course 2 Week 3 Lesson 13 tape.gradient returns W=NaN

After watching the TensorFlow lessons I tried the first example on my own, but with a loop over range(10) I get W=nan instead of 5.000001. Even a single backward-propagation step gives me a NaN numpy value. I'm confused; what am I missing?

Environment: Google Colab

import numpy as np  # 1.19.5
import tensorflow as tf  # 2.6.0

W = tf.Variable(0, dtype=tf.float32)
optimizer = tf.keras.optimizers.Adam(0,1)

def train_step():
    with tf.GradientTape() as tape:
        cost = W**2 - 10*W + 25
    trainable_variable = [W]
    grads = tape.gradient(cost, trainable_variable)
    optimizer.apply_gradients(zip(grads, trainable_variable))

print(W)  # <tf.Variable 'Variable:0' shape=() dtype=float32, numpy=0.0>

train_step()
print(W)  # <tf.Variable 'Variable:0' shape=() dtype=float32, numpy=nan>

for i in range(10):
    train_step()
print(W)  # <tf.Variable 'Variable:0' shape=() dtype=float32, numpy=nan>

Hi, @daizem.

The problem is the way the optimizer was initialized. I think you meant to write:

optimizer = tf.keras.optimizers.Adam(0.1)

Instead, you passed 0 and 1 as positional arguments, which set learning_rate to 0 and beta_1 to 1. With beta_1 = 1, Adam's bias-correction denominator 1 - beta_1^t is zero on the very first step, and that division is where the NaN comes from.
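To see this without TensorFlow, here is a minimal pure-NumPy sketch of the standard Adam update (the function names, the step counts, and the convergence run are my own illustration, not from the lesson). With beta_1 = 1 the first-moment estimate stays 0 and the bias correction divides 0 by (1 - 1**t) = 0, so the parameter is NaN after a single step; with the intended Adam(0.1) settings, the same loop converges to the minimum at W = 5:

```python
import numpy as np

def adam_step(w, grad, m, v, t, lr, beta1=0.9, beta2=0.999, eps=1e-7):
    """One Adam update for a scalar parameter (standard formulation)."""
    m = beta1 * m + (1 - beta1) * grad       # first-moment (momentum) estimate
    v = beta2 * v + (1 - beta2) * grad**2    # second-moment estimate
    m_hat = m / (1 - beta1**t)               # bias correction: 0/0 when beta1 == 1
    v_hat = v / (1 - beta2**t)
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v

def train(lr, beta1, steps):
    w = m = v = np.float64(0.0)
    for t in range(1, steps + 1):
        grad = 2 * w - 10                    # d/dW of W**2 - 10*W + 25
        w, m, v = adam_step(w, grad, m, v, t, lr, beta1)
    return w

with np.errstate(invalid="ignore"):
    print(train(lr=0.0, beta1=1.0, steps=1))   # nan, like Adam(0, 1)
print(train(lr=0.1, beta1=0.9, steps=1000))    # close to 5.0, like Adam(0.1)
```

Once the NaN from the bias correction enters the update, it propagates into W and every later step, which is why all subsequent prints show nan.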

Have fun experimenting :slight_smile:


Omg I can’t believe I didn’t see this typo. :smiley:

Thank you so much @nramon. You are my hero!
