Why does a Convnet not work for sequence modeling?

electro · June 9, 2021, 1:45pm

Based on the above slide, I understand that inputs/outputs may be of varying size, but assuming we ignore this issue, I don’t understand the second bullet point…

Could we not make a convolutional network with input dimensions of (m x Tx_max x 10000 x 1)? If so, what prevents this Convnet from sharing features across different positions, and in that case why do normal Convnets succeed in sharing features across positions of an image?

TMosh · June 27, 2021, 6:39am

The standard NN uses the linear combination of the examples and features in each layer - so the order of the examples does not matter.

For a sequence model, the order of the examples is critical.

Topic		Replies	Views
Discussion on CNN Vs RNN Sequence Models coursera-platform	1	528	April 20, 2023
Lecture Question - Convolutional Implementation of Sliding Windows Convolutional Neural Networks coursera-platform	3	545	June 7, 2022
RNN Shapes Clarification Sequence Models coursera-platform	2	522	July 4, 2022
Why RNNs over Simple Neural Networks? Sequence Models week-module-1 , coursera-platform	1	133	April 21, 2024
Why LSTM only accept fixed length input NLP with Sequence Models week-module-3	1	554	July 30, 2022

Why does a Convnet not work for sequence modeling?

Related topics