I may have missed it in week 1. I understand that model parameters relate to the size of the model, but is there a simple explanation of what a model parameter is? When you look at something like BloombergGPT, which has 50B parameters, how is that number derived, and what are some examples of specific parameters in a model?

A model's parameters are the values it learns during training — chiefly its weights (and biases). Things like the learning rate, the number of layers and neurons (the architecture), and the choice of optimizer are hyperparameters: settings you pick, not values the model learns. So when you build a model and finish training it, you save its parameters (its blueprint, let's say) so it can be used for predictions.

The Deep Learning Specialization will give you a good intro into neural networks, check it out.

Another way to explain what a PARAMETER is: think of it as a WEIGHT or COEFFICIENT of a FEATURE (i.e., a VARIABLE) in the model.

Let me illustrate with an example. You are trying to predict housing prices (call the price P). The prediction MODEL (e.g., a linear regression) has 3 FEATURES (number of bedrooms, number of bathrooms, average income in the neighborhood): x, y, z. The model, after training on the data, gives you this equation:

P = 3x + 2y + 4z

In this example, the coefficients 3, 2, and 4 are the PARAMETERS.
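To make "training finds the parameters" concrete, here is a small sketch using NumPy and made-up synthetic data (the feature values and the 3/2/4 rule are just for illustration). We generate prices from the equation above and then recover the parameters with ordinary least squares:

```python
import numpy as np

# Synthetic housing data: 100 examples, 3 features
# (bedrooms, bathrooms, neighborhood income), values made up for illustration
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 3))

# Generate prices from the "true" rule P = 3x + 2y + 4z (no noise, for clarity)
true_params = np.array([3.0, 2.0, 4.0])
P = X @ true_params

# "Training" here = fitting via ordinary least squares
learned_params, *_ = np.linalg.lstsq(X, P, rcond=None)
print(learned_params)  # approximately [3. 2. 4.]
```

Those three recovered numbers ARE the model's parameters — everything the model needs to make a prediction.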

In more complex models, such as deep neural networks, there are many more parameters; a figure like BloombergGPT's 50B means the model has roughly 50 billion of these learned weights.
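Counting parameters in a neural network is just arithmetic over the layer sizes. Here is a sketch for a small, hypothetical fully connected network (the layer sizes are made up, not from any real model): each layer contributes a weight matrix of size in_dim × out_dim plus one bias per output unit.

```python
# Hypothetical dense network: 3 inputs -> 16 hidden -> 16 hidden -> 1 output
layer_sizes = [3, 16, 16, 1]

total = 0
for in_dim, out_dim in zip(layer_sizes, layer_sizes[1:]):
    weights = in_dim * out_dim  # one weight per input-output connection
    biases = out_dim            # one bias per output unit
    total += weights + biases

print(total)  # 353
```

The same bookkeeping, applied to the much larger weight matrices of a transformer, is how counts like 50B are derived.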

If you need a good book recommendation, “Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow” by Aurélien Géron is an excellent one. Pages 11 and 24 of the 3rd edition give good explanations of concepts like FEATURE, PARAMETER, etc.

TY for the clarifications