After training the model for 5 epochs with these values, I must have made a mess of DistilBERT’s original parameters.
Question: What is the right way to recover the model’s original parameters and start training again with a new value for layers_to_train and learning_rate? It seems to me that if I just change layers_to_train and learning_rate and try training again, I am NOT starting with the model’s original parameters but with some corrupted parameters from my 1st attempt.
That is a brute force approach, but I am looking for a way to reset the model’s weights back to the original values. Does it work to just reload the model?
My objective is to try different config values in a loop, but I need to restore the original weights each time at the start of the loop.
In that case, make a copy of the model’s original weights outside the loop, and reset the model’s weights using those.
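A minimal sketch of that idea, assuming a PyTorch model (a small stand-in layer is used here in place of the actual DistilBERT instance): snapshot the state dict once before the loop, then restore it at the top of each iteration.

```python
import copy

import torch
import torch.nn as nn

# Stand-in model; in practice this would be your DistilBERT instance.
model = nn.Linear(4, 2)

# Snapshot the original weights ONCE, outside the loop.
original_state = copy.deepcopy(model.state_dict())

for lr in [0.1, 0.01, 0.001]:
    # Restore the pristine weights before each experiment.
    model.load_state_dict(original_state)
    # ... train with this learning rate ...
```

The `copy.deepcopy` matters: `state_dict()` returns references to the live tensors, so without the deep copy the snapshot would change along with the model during training.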
But each time you call the “load_bert” function, it loads the model with fresh weights from disk, so a snapshot is not strictly necessary. However, if you reload a few times without freeing the old models, you will eventually run out of memory in the workspace.
The better approach would be to do something like this:
import gc

import torch

configs = [0.1, 0.01, 0.001]  # Example learning rates to try

for cfg in configs:
    # Load a fresh model and tokenizer each iteration
    model, tokenizer = load_bert()

    # Run your experiment
    # train_model(model, lr=cfg)

    # Clean up RAM/VRAM before the next iteration
    del model, tokenizer
    gc.collect()
    torch.cuda.empty_cache()
Another recommendation would be to experiment only after you have passed the assignment, so that your experiments don’t interfere with your grade.