TensorFlow function train_step()

Hi,

I’m following the video on TensorFlow and trying to implement the steps of the train_step() function on my own, but my gradients stay at zero and never change. What am I missing here? Thanks.

import numpy as np

import tensorflow as tf


w = tf.Variable(0,dtype=tf.float32)

optimizer = tf.keras.optimizers.Adam(0.1)

def train_step():

    with tf.GradientTape() as tape:

        cost = w ** 2 - 10 + 25

    trainable_variables = [w]

    grads = tape.gradient(cost,trainable_variables)

    optimizer.apply_gradients(zip(grads,trainable_variables))


for i in range(1000):

    train_step()

print(w)

The result is:

<tf.Variable 'Variable:0' shape=() dtype=float32, numpy=0.0>

Just took a quick glance, but are you confident in that indentation? It looks like only the cost = line is inside the with tf.GradientTape() as tape: block.

It also looks like cost is always 0 ** 2 - 10 + 25 = 15 as long as w stays at 0, which means cost is a constant with that value.

Haven’t looked back at that notebook for a while, but that makes my spidey sense tingle.
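A quick sanity check on that (minimal sketch of just the gradient computation, same variable names as the original): with w at its initial value 0, the gradient of w ** 2 - 10 + 25 is 2 * w = 0, so Adam has nothing to apply.

```python
import tensorflow as tf

w = tf.Variable(0, dtype=tf.float32)

with tf.GradientTape() as tape:
    cost = w ** 2 - 10 + 25  # equivalent to w**2 + 15; d(cost)/dw = 2*w

grad = tape.gradient(cost, [w])[0]
print(grad.numpy())  # 0.0 at w = 0, so every update is a no-op
```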

Good point about the indentation. But once you fix that bug, notice that the minimum of that cost function occurs at w = 0, which is exactly the initial value you specified for w. So by all rights the gradients should all be zero, right? Perhaps you intended the formula to be:

J = w^2 - 10w + 25
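With that formula, a sketch of the fixed train_step() (the original code with only the cost line changed, keeping Adam with learning rate 0.1) should drive w toward the minimum at w = 5:

```python
import tensorflow as tf

w = tf.Variable(0, dtype=tf.float32)
optimizer = tf.keras.optimizers.Adam(0.1)

def train_step():
    with tf.GradientTape() as tape:
        cost = w ** 2 - 10 * w + 25  # (w - 5)**2, minimized at w = 5
    grads = tape.gradient(cost, [w])
    optimizer.apply_gradients(zip(grads, [w]))

for i in range(1000):
    train_step()

print(w.numpy())  # converges toward 5.0
```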

Oh yes, right, I forgot the w in 10w. That was the bug! Thanks.