I am confused about the “activation cache” part. The cache contains two tuples, `linear_cache` and `activation_cache`. `linear_cache` is pretty straightforward; it’s basically `(A_prev, W, b)`. But what does `activation_cache` contain?
In Exercise 4 of the first assignment, in which you complete the `linear_activation_forward()` function, the line before the return statement combines two separate caches into a bigger cache: `cache = (linear_cache, activation_cache)`. Formally, `cache` is a tuple with two elements (which themselves can have multiple elements).
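To make that concrete, here is a minimal sketch (with made-up toy shapes) of how the pieces could fit together. In this assignment, `activation_cache` is typically just the pre-activation value `Z`, which is what the `sigmoid()`/`relu()` helpers store:

```python
import numpy as np

# Toy shapes: 3 units in the previous layer, 2 in this layer, batch of 4.
A_prev = np.random.randn(3, 4)
W = np.random.randn(2, 3)
b = np.zeros((2, 1))

Z = W @ A_prev + b              # linear step

linear_cache = (A_prev, W, b)   # what the linear backward step needs
activation_cache = Z            # typically just Z (used by the activation backward step)

cache = (linear_cache, activation_cache)

# Later, e.g. in linear_activation_backward(), you unpack it again:
linear_cache, activation_cache = cache
```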
The values assigned to `activation_cache` depend on whether a `relu` activation or a `sigmoid` activation is used, as set by the `activation` argument of `linear_activation_forward()`. The mathematics behind this is explained in the prelude to Exercise 4. These values are “cached” so that they can be used to evaluate the gradients in the backward propagation step.
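As an illustration of why caching `Z` is useful, here is a sketch of what the backward helpers could look like (the assignment provides `relu_backward()` and `sigmoid_backward()` for you, so treat this as illustrative, not as the exact course code). Both compute `dZ` from `dA` using only the `Z` stored in `activation_cache`:

```python
import numpy as np

def relu_backward(dA, activation_cache):
    # ReLU derivative: 1 where Z > 0, else 0.
    Z = activation_cache
    return dA * (Z > 0)

def sigmoid_backward(dA, activation_cache):
    # Sigmoid derivative: s * (1 - s), recomputed from the cached Z.
    Z = activation_cache
    s = 1 / (1 + np.exp(-Z))
    return dA * s * (1 - s)
```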
I hope that this helps! @kenb