Week 3, Question about architecture used in Programming Assignment: Trigger Word Detection

Regarding the programming assignment, we are instructed to build a very specific network, but no explanation or intuition is given for why we would expect that architecture to work. In particular, why use two consecutive GRU layers? Why isn't one good enough? Why not three or four of them?

The architecture is based on experimentation. You are welcome to try different numbers of layers on your own (see the sketch below), but to pass the grader you should follow the instructions as given.
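To make "two consecutive GRU layers" concrete, here is a rough Keras sketch of that style of model: a Conv1D front end followed by a stack of GRU layers and a per-timestep sigmoid output. The layer sizes, dropout rates, and input shape below are placeholders I picked for illustration, not necessarily the exact values the assignment specifies, and `n_gru_layers` is just a hypothetical knob for experimenting with depth:

```python
# Rough sketch only -- hyperparameters are placeholders, not the assignment's solution.
from tensorflow.keras.layers import (Input, Conv1D, BatchNormalization, Activation,
                                     Dropout, GRU, TimeDistributed, Dense)
from tensorflow.keras.models import Model

def build_trigger_word_model(input_shape, n_gru_layers=2, gru_units=128):
    """Conv1D front end, a stack of GRUs, then a per-timestep sigmoid output."""
    x_in = Input(shape=input_shape)                            # (time steps, spectrogram freqs)
    x = Conv1D(filters=196, kernel_size=15, strides=4)(x_in)   # shortens the time dimension
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Dropout(0.2)(x)

    # Stacked GRUs: return_sequences=True outputs a full sequence so the
    # next GRU (and the per-timestep Dense head) can consume every timestep.
    for _ in range(n_gru_layers):
        x = GRU(gru_units, return_sequences=True)(x)
        x = Dropout(0.2)(x)
        x = BatchNormalization()(x)

    # One sigmoid unit per output timestep: "was the trigger word just said?"
    y = TimeDistributed(Dense(1, activation='sigmoid'))(x)
    return Model(inputs=x_in, outputs=y)

# Illustrative input shape (the assignment uses a spectrogram of roughly this size).
model = build_trigger_word_model(input_shape=(5511, 101), n_gru_layers=2)
model.summary()
```

With a sketch like this you can change `n_gru_layers` and compare training/validation behavior yourself, which is essentially what "based on experimentation" means in practice. But again, the grader expects the exact architecture described in the notebook.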


Thank you for your reply. I am still wondering if there is a more theoretical answer in addition to “this is based on experimentation”.

It's possible that there is, but I don't recall Prof Ng ever mentioning anything about that. Please note that the course authors and Prof Ng are not really listening here; it's just your fellow students. The mentors are fellow students who have completed the course in question successfully, but that doesn't mean we are academic-level experts in the field. We are also volunteers, meaning we don't get paid to do this, so you are not guaranteed an answer to any given question.

As an example of how theory and practice work in ML, there is the Universal Approximation Theorem (a common informal statement is below). It only applies to fully connected feed-forward networks, not Sequence Models, but I hope we can use it as an analogy. What it tells us is that feed-forward neural networks can approximate any function from a very large class. The problem is that it gives you exactly zero guidance on how to actually construct such networks in practice. It's very useful to know that we're not fundamentally wasting our time by trying to use NNs to approximate complex functions, but beyond that the UAT is not really all that helpful from a practical standpoint.
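For reference, here is a common informal, one-hidden-layer statement of the theorem (my paraphrase, not something from the course):

```latex
% For a suitable nonlinearity \sigma, any continuous f on a compact set
% K \subset \mathbb{R}^n, and any \varepsilon > 0, there exist N and
% parameters w_i \in \mathbb{R}^n, b_i, v_i \in \mathbb{R} such that
\left|\, f(x) - \sum_{i=1}^{N} v_i \, \sigma\!\left( w_i^{\top} x + b_i \right) \right| < \varepsilon
\quad \text{for all } x \in K .
```

Note that the statement gives no bound on how large N must be and no procedure for finding the weights, which is exactly the practical gap described above.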

Maybe this is yet another instance in which the famous A. Einstein quote applies: “In theory, theory and practice are the same. In practice, they’re not.” :nerd_face:


Thank you for your very informative message.