Lip Reading Neural Network - Same Training and Validation Accuracy

Rohan_Jai · January 12, 2024, 5:41pm

I am currently working on a project focused on lip reading using neural networks and have encountered a peculiar issue that has left me scratching my head. Despite my best efforts, I am consistently getting the same training and validation accuracy. I believe there might be something wrong with my model architecture or training process.

I have designed a neural network for lip reading, and the architecture is as follows:

import tensorflow as tf
from tensorflow.keras import regularizers
from tensorflow.keras.optimizers import Adam

# Each sample: 22 frames of 80x112 RGB images
input_shape = (22, 80, 112, 3)
model = tf.keras.models.Sequential()
# 3D convolution blocks; output after the second pooling layer is (4, 18, 26, 32)
model.add(tf.keras.layers.Conv3D(8, (3, 3, 3), activation='relu', input_shape=input_shape, kernel_regularizer=regularizers.l2(0.001)))
model.add(tf.keras.layers.MaxPooling3D((2, 2, 2)))
model.add(tf.keras.layers.Conv3D(32, (3, 3, 3), activation='relu', kernel_regularizer=regularizers.l2(0.001)))
model.add(tf.keras.layers.MaxPooling3D((2, 2, 2)))
# 4 * 18 * 26 * 32 = 59904 values, regrouped as 16 steps of 3744 features
model.add(tf.keras.layers.Reshape((16, 3744)))
model.add(tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(256, return_sequences=True)))
model.add(tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(128)))
# The last LSTM already returns a flat vector of size 256, so Flatten changes nothing here
model.add(tf.keras.layers.Flatten())
# Dense classifier head with dropout after each hidden layer
model.add(tf.keras.layers.Dense(1024, activation='relu'))
model.add(tf.keras.layers.Dropout(0.5))
model.add(tf.keras.layers.Dense(256, activation='relu'))
model.add(tf.keras.layers.Dropout(0.5))
model.add(tf.keras.layers.Dense(64, activation='relu'))
model.add(tf.keras.layers.Dropout(0.5))
# One softmax output per label
model.add(tf.keras.layers.Dense(50, activation='softmax'))

model.compile(optimizer=Adam(learning_rate=0.001), loss='categorical_crossentropy', metrics=['accuracy'])

The issue I am facing is that both training and validation accuracy plateau at the same level after the 6th epoch, which suggests a problem. I have tried tweaking hyperparameters, adjusting layers, and even modifying the architecture, but the problem persists. Both training and validation accuracy are low, at 30%.

Questions:

Is there anything wrong with my model architecture?
Are there specific hyperparameters that I should focus on adjusting for better convergence?

I appreciate your time and expertise in advance. Looking forward to your responses!

TMosh · January 12, 2024, 7:45pm

That’s a very complicated model.
How many output categories do you have (I’m guessing it’s 50?).
Is that also how many labels are in your training data?
How much data do you have for training and validation?

Rohan_Jai · January 13, 2024, 2:30am

Yes, I have 50 unique labels in the entire dataset, and the dataset has 2323 samples. I am using 1858 for training and 465 for testing.

TMosh · January 13, 2024, 3:13am

That’s not a lot of examples of each label, considering how many weights you have to learn.
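To put rough numbers on this point, here is a small sketch that counts the trainable weights of the model posted above and compares them with the amount of training data. It assumes the standard Keras parameter counts for Conv3D, LSTM, and Dense layers with default settings, and it assumes the 1858 training samples are spread roughly evenly over the 50 labels, which the thread does not state. The helper function names are made up for this illustration.

```python
# Back-of-the-envelope parameter count for the posted architecture.
# Helper names are hypothetical; formulas assume default Keras layer settings.

def conv3d_params(in_channels, filters, kernel=(3, 3, 3)):
    kd, kh, kw = kernel
    # One weight per kernel position, input channel, and filter, plus one bias per filter.
    return kd * kh * kw * in_channels * filters + filters

def lstm_params(input_dim, units):
    # Four gates, each with input weights, recurrent weights, and a bias.
    return 4 * (units * (input_dim + units) + units)

def dense_params(input_dim, units):
    return input_dim * units + units

layer_params = {
    "conv3d_1": conv3d_params(3, 8),
    "conv3d_2": conv3d_params(8, 32),
    # Bidirectional wraps two LSTMs, so the count doubles.
    "bilstm_1": 2 * lstm_params(3744, 256),
    "bilstm_2": 2 * lstm_params(2 * 256, 128),
    "dense_1024": dense_params(2 * 128, 1024),
    "dense_256": dense_params(1024, 256),
    "dense_64": dense_params(256, 64),
    "dense_50": dense_params(64, 50),
}
total_params = sum(layer_params.values())

# Figures from the thread: 1858 training samples, 50 labels.
train_examples, num_labels = 1858, 50
examples_per_label = train_examples / num_labels  # assumes balanced labels

for name, count in layer_params.items():
    print(f"{name:>10}: {count:>10,}")
print(f"total trainable parameters: {total_params:,}")
print(f"average training examples per label: {examples_per_label:.1f}")
print(f"parameters per training example: {total_params / train_examples:,.0f}")
```

Under these assumptions the model has roughly 9.4 million weights against about 37 training examples per label, and most of those weights sit in the first bidirectional LSTM because of its 3744-wide input.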

Rohan_Jai · January 13, 2024, 3:35am

I am not able to collect more data, though. Are there any other ways to improve my model's accuracy?

TMosh · January 13, 2024, 3:55am

You can try augmenting the data set.

Is your training set made up of video clips, or separate images?
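As a rough illustration of the augmentation idea, here is a minimal NumPy sketch. It assumes each sample is a stack of 22 frames shaped (22, 80, 112, 3) with pixel values scaled to [0, 1], matching the input shape in the first post. The function name `augment_clip`, the choice of transforms (horizontal flip, small shift, brightness change), and their ranges are assumptions for this example, not something taken from the thread. The same random transform is applied to every frame of a sample so the mouth movement stays consistent across frames.

```python
import numpy as np

def augment_clip(clip, rng, max_shift=4, max_brightness=0.1):
    """Return a randomly augmented copy of one sample.

    clip: float array of shape (frames, height, width, channels) in [0, 1].
    rng:  a numpy.random.Generator.
    """
    out = clip.copy()

    # Horizontal flip with probability 0.5 (axis 2 is the width axis).
    if rng.random() < 0.5:
        out = out[:, :, ::-1, :]

    # Small translation: pad by repeating edge pixels, then crop back to the original size.
    dy, dx = rng.integers(-max_shift, max_shift + 1, size=2)
    pad = ((0, 0), (max_shift, max_shift), (max_shift, max_shift), (0, 0))
    padded = np.pad(out, pad, mode="edge")
    height, width = clip.shape[1], clip.shape[2]
    top, left = max_shift + dy, max_shift + dx
    out = padded[:, top:top + height, left:left + width, :]

    # Brightness change: one offset for the whole sample, clipped to the valid range.
    out = out + rng.uniform(-max_brightness, max_brightness)
    return np.clip(out, 0.0, 1.0).astype(clip.dtype)

# Usage on a random stand-in for one sample.
rng = np.random.default_rng(0)
clip = rng.random((22, 80, 112, 3)).astype(np.float32)
augmented = augment_clip(clip, rng)
print(augmented.shape, augmented.dtype)
```

Augmented copies should only be generated from the training split, so that the test samples stay untouched.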

Rohan_Jai · January 13, 2024, 4:36am

Okay, I will try that. It’s made up of images.

TMosh · January 13, 2024, 6:23am

You might also try a lot simpler model, get a baseline, and then only add as much complexity as you need to improve it.
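One possible starting point for such a baseline is sketched below. It keeps the input shape and the 50-way softmax output from the first post, but the layer sizes and the use of global average pooling are arbitrary choices made for this example, not a recommendation from the thread. The function name `build_baseline` is hypothetical.

```python
import tensorflow as tf

def build_baseline(input_shape=(22, 80, 112, 3), num_classes=50):
    """A deliberately small 3D-CNN baseline with a few thousand weights."""
    model = tf.keras.Sequential([
        tf.keras.Input(shape=input_shape),
        tf.keras.layers.Conv3D(8, (3, 3, 3), activation="relu"),
        # Pool only over height and width so the frame axis is preserved.
        tf.keras.layers.MaxPooling3D((1, 2, 2)),
        tf.keras.layers.Conv3D(16, (3, 3, 3), activation="relu"),
        # Average over frames and space instead of using large dense layers.
        tf.keras.layers.GlobalAveragePooling3D(),
        tf.keras.layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
        loss="categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model

baseline = build_baseline()
baseline.summary()
```

If this small model reaches a similar accuracy to the large one, that suggests the extra layers are not contributing much yet. Layers can then be added one at a time while watching whether validation accuracy improves.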

Aryan365 · July 9, 2024, 7:36am

Start with a larger Conv3D layer, for example 128 filters, and a larger kernel size. Also consider reducing the dropout rate and the size of the test set.
