C4W4 Assignment - model architecture

balaji.ambresh · August 21, 2022, 5:02pm

The closest thing to reproducibility in tensorflow 2.7 is to set the seed before building the model. Here’s an example:

tf.random.set_seed(2022)
model = tf.keras.Sequential([...])
model.compile(...)
model.fit(...)

So, when the passing criteria is 6 for MSE on validation set, your NN has to have a much lower error to ensure that the randomness doesn’t hurt your model performance by much. It’s a good idea of use an adaptive optimizer like adam instead of tuning learning rate from scratch for SGD.

Here’s a suggestion based on your notebook. It’s valid to specify a list of metrics to your model.compile function like this to give you a better picture of model performance over time:

model.compile(loss=None,
		metrics=['mse', 'mae'],
		optimizer=None)

You can also max pooling 1d layer to better summarize the inputs to lstm layer. Consider using bidirectional lstm to see if performance improves over lstm layers. Other than output layer, keep number of units a power of 2.

Consider using a custom callback to ensure that when the model achieves the MSE / MAE on the training set, you stop training any further. See tf.keras.callbacks.Callback.

With these hints, you should see MSE ~ 5.35 and MAE ~ 1.80 in the validation dataset with less than 40 epochs of training. 100 gives a lot more room to better understand training data.

Topic		Replies	Views
C4W4:Assignment 4 Sequences, Time Series and Prediction week-4	6	79	July 14, 2024
C4W4 help (incompatible shapes) Sequences, Time Series and Prediction week-4	15	676	November 14, 2022
C3W4 Assignment create_model Problem with the network architecture Natural Language Processing in TensorFlow week-4	4	212	August 12, 2023
Week4 assignment - Lambda layer Sequences, Time Series and Prediction week-4	9	625	June 13, 2022
C4W4 Assignment: Cannot achieve MSE below 6, MAE below 2 Sequences, Time Series and Prediction week-4	31	973	April 15, 2024

C4W4 Assignment - model architecture

Related topics