What's the difference between "optimizer.minimize" and "tape.gradient" + "optimizer.apply_gradients"?

Hi friends,

In my TensorFlow class, the professor showed two examples. In the first example, he wrote:

import tensorflow as tf

w = tf.Variable(0.0, dtype=tf.float32)
optimizer = tf.keras.optimizers.Adam(0.1)   # any tf.keras optimizer works here

def train_step():
    with tf.GradientTape() as tape:
        cost = w ** 2 - 10 * w + 25          # forward pass is recorded on the tape
    trainable_variables = [w]
    grads = tape.gradient(cost, trainable_variables)
    optimizer.apply_gradients(zip(grads, trainable_variables))
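
He then called it in a loop, something like this (my reconstruction; the exact iteration count is my guess):

for _ in range(1000):
    train_step()
print(w)   # approaches 5.0, the minimum of w**2 - 10*w + 25 = (w - 5)**2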

In the second example, he wrote:

def training(x, w, optimizer):
    def cost_fn():                           # a callable, so minimize() can re-evaluate the cost
        return x[0] * w ** 2 + x[1] * w + x[2]
    for _ in range(1000):
        optimizer.minimize(cost_fn, [w])     # one gradient step per call
    return w
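
For context, the cost coefficients were passed in as data, something like this (a sketch by me; the exact values and optimizer are assumptions):

import numpy as np

x = np.array([1.0, -10.0, 25.0], dtype=np.float32)   # coefficients of x[0]*w**2 + x[1]*w + x[2]
w = tf.Variable(0.0, dtype=tf.float32)
optimizer = tf.keras.optimizers.Adam(0.1)
w = training(x, w, optimizer)
print(w)   # again approaches 5.0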

So optimizer.minimize(cost_fn, [w]) simply did everything in one call. My question is: what does optimizer.minimize actually do? Why don't we always just use

    grads = tape.gradient(cost, trainable_variables)
    optimizer.apply_gradients(zip(grads, trainable_variables))
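
From what I can tell, minimize seems to be roughly shorthand for the tape pattern above. Here is my own sketch of what I imagine it does internally (my_minimize is just a made-up name, not the real implementation):

def my_minimize(optimizer, cost_fn, var_list):
    # my guess: record the cost on a tape, compute gradients, then apply them
    with tf.GradientTape() as tape:
        cost = cost_fn()
    grads = tape.gradient(cost, var_list)
    optimizer.apply_gradients(zip(grads, var_list))

Is that about right, or does minimize do more than this?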

I know I need to study TensorFlow more, but can someone give me a little "preview" of what is going on here? Thank you!
