About 'Learning the state-value function' video in the Reinforcement Learning section

Francisco_Espejo_Gar · October 2, 2022, 8:10pm

Hello!

My name is Francisco, and I am almost finishing the Machine Learning course specialization.
I have seen the video called ‘Learning the state-value function’ where a Neural Network is created to input state and action as an X vector to output Q(s,a) as the target Y.
In that case, which is the reason why the first 2 hidden layers (before the output layer) contain 64 units instead of another number?

Many thanks!

rmwkwok · October 3, 2022, 1:10am

Hello Francisco @Francisco_Espejo_Gar,

Great question, and I think we are talking about this slide:

Like designing the architecture of neural network for any problem, the appropriate number of layers and number of units for the layers depend on the problem and it is not known unless by experiment. This means that we need to try. If we begin with a very small NN, say one hidden layer with only 4 units, then, as explained course 2 week 3, we may end up seeing an under-performed model. However, we can progressively increase the size of the NN by adding neurons and layers, and test what size is good enough.

The proposed NN in the slide may or may not be the best option, but it should be just good enough for serving the objective.

Cheers,
Raymond

TMosh · October 4, 2022, 6:35am

Moved to Course 3 Week 3.

Topic		Replies	Views
Choosing Neural Network architecture Advanced Learning Algorithms week-1	3	388	November 13, 2023
Number of units per hidden layer Unsupervised Learning, Recommenders, Reinforcement week-2	4	546	February 26, 2023
Could somebody help me understand why the dimensions 25 and 15 were chosen? AI For Everyone ai-discussions	2	28	December 22, 2024
Input to DQN in reinforcement learning Unsupervised Learning, Recommenders, Reinforcement week-2	2	476	May 16, 2023
Advanced Learning Algorithms: Neural Network Concept Question Advanced Learning Algorithms week-1	6	289	February 5, 2024

About 'Learning the state-value function' video in the Reinforcement Learning section

Related topics