Which loss/accuracy curve to choose for sentiment analysis?

bluetail · April 5, 2022, 9:29pm

I have the following 3 curves and am looking for advice on which one is best. They all give a similar accuracy of about 50%-53% (yes, I know it is not very high).

My guess is that no. 4 is the best because the loss seems to be decreasing.

I have tried changing dimensions, adding and dropping dense and dropout layers, and changing maxlen and embeddings; however, I don't seem to get any improvement.

As a beginner, I would appreciate any suggestions! Thank you.

I have used an LSTM architecture, as follows, for a movie reviews dataset from here (Sentiment Analysis).

# Parameters

EMBEDDING_DIM = 100  # from 100d.glove.6B.100d.txt
MAXLEN = 500
VOCAB_SIZE = 33713

DENSE1_DIM = 64
DENSE2_DIM = 32

LSTM1_DIM = 64  # or 32
LSTM2_DIM = 32  # or 64

FILTERS = 64     # not used in the LSTM model below
KERNEL_SIZE = 5  # not used in the LSTM model below

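# Model Definition
import tensorflow as tf

# EMBEDDINGS_MATRIX is the pre-built GloVe weight matrix,
# shape (VOCAB_SIZE + 1, EMBEDDING_DIM); it is defined elsewhere.
model_lstm = tf.keras.Sequential([
    # Frozen pre-trained embeddings
    tf.keras.layers.Embedding(VOCAB_SIZE + 1, EMBEDDING_DIM, input_length=MAXLEN,
                              weights=[EMBEDDINGS_MATRIX], trainable=False),
    # Two stacked bidirectional LSTMs; the first returns the full sequence
    tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(LSTM1_DIM, dropout=0.2, return_sequences=True)),
    tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(LSTM2_DIM, dropout=0.2)),
    tf.keras.layers.Dense(DENSE1_DIM, activation='relu'),
    tf.keras.layers.Dense(DENSE2_DIM, activation='relu'),
    # Single sigmoid unit for binary sentiment
    tf.keras.layers.Dense(1, activation='sigmoid')
])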

# Set the training parameters
model_lstm.compile(loss='binary_crossentropy',
                   optimizer=tf.keras.optimizers.Adam(), 
                   metrics=[tf.keras.metrics.BinaryAccuracy()])

# Print the model summary
model_lstm.summary()

 Layer (type)                       Output Shape         Param #
=================================================================
 embedding_37 (Embedding)           (None, 500, 100)     3371400
 bidirectional_61 (Bidirectional)   (None, 500, 128)     84480
 bidirectional_62 (Bidirectional)   (None, 64)           41216
 dense_109 (Dense)                  (None, 64)           4160
 dense_110 (Dense)                  (None, 32)           2080
 dense_111 (Dense)                  (None, 1)            33
=================================================================
Total params: 3,503,369
Trainable params: 131,969
Non-trainable params: 3,371,400
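
As a sanity check on the summary above, the parameter counts can be reproduced by hand. This is a minimal sketch using the standard formulas for Embedding, LSTM and Dense layers; the helper function names are made up for illustration and are not part of Keras.

```python
# Hypothetical helpers (not Keras APIs) that reproduce the counts in the summary.

def embedding_params(vocab_rows, dim):
    # One dim-sized vector per vocabulary row
    return vocab_rows * dim

def lstm_params(input_dim, units):
    # Four gates, each with an input kernel, a recurrent kernel and a bias
    return 4 * (units * (input_dim + units) + units)

def bidirectional_lstm_params(input_dim, units):
    # A forward and a backward LSTM with separate weights
    return 2 * lstm_params(input_dim, units)

def dense_params(input_dim, units):
    # Weight matrix plus one bias per unit
    return input_dim * units + units

VOCAB_SIZE, EMBEDDING_DIM = 33713, 100
LSTM1_DIM, LSTM2_DIM = 64, 32
DENSE1_DIM, DENSE2_DIM = 64, 32

counts = {
    "embedding": embedding_params(VOCAB_SIZE + 1, EMBEDDING_DIM),
    # A bidirectional layer outputs 2 * units features, hence the 2 * below
    "bilstm_1": bidirectional_lstm_params(EMBEDDING_DIM, LSTM1_DIM),
    "bilstm_2": bidirectional_lstm_params(2 * LSTM1_DIM, LSTM2_DIM),
    "dense_1": dense_params(2 * LSTM2_DIM, DENSE1_DIM),
    "dense_2": dense_params(DENSE1_DIM, DENSE2_DIM),
    "output": dense_params(DENSE2_DIM, 1),
}

# The embedding is frozen (trainable=False), so it is excluded here
trainable = sum(v for k, v in counts.items() if k != "embedding")
total = sum(counts.values())

for name, value in counts.items():
    print(f"{name:10s} {value:>10,}")
print(f"trainable  {trainable:>10,}")
print(f"total      {total:>10,}")
```

The numbers agree with the summary, so the architecture is wired as intended; only about 132k of the 3.5M parameters are actually being trained.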

P.S. A simple Naive Bayes gives 84% accuracy on this dataset!

vsnupoudel · April 6, 2022, 3:20pm

The 4th one, but you can stop at 15 epochs. Looking at the loss curves is more important than looking at accuracy. For the 4th, the accuracies are roughly similar.
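
Rather than picking the stopping epoch by eye, it can be derived from the recorded validation loss. Below is a minimal pure-Python sketch of patience-based early stopping; the function name, the `patience` value and the loss values are all made up for illustration and are not taken from the curves in this thread.

```python
def early_stopping_epoch(val_losses, patience=3, min_delta=0.0):
    """Return (stop_epoch, best_epoch), both 1-indexed.

    Training stops once the validation loss has failed to improve by more
    than min_delta for `patience` consecutive epochs.
    """
    best_loss = float("inf")
    best_epoch = 0
    wait = 0
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best_loss - min_delta:
            # New best validation loss: remember it and reset the counter
            best_loss = loss
            best_epoch = epoch
            wait = 0
        else:
            wait += 1
            if wait >= patience:
                return epoch, best_epoch
    # Never ran out of patience: trained for every recorded epoch
    return len(val_losses), best_epoch

# Illustrative (made-up) validation losses, one per epoch
val_losses = [0.70, 0.69, 0.68, 0.675, 0.68, 0.69, 0.70, 0.71]
stop_epoch, best_epoch = early_stopping_epoch(val_losses, patience=3)
print(f"stop after epoch {stop_epoch}, best epoch was {best_epoch}")
```

In Keras the same idea is available as a callback, e.g. `tf.keras.callbacks.EarlyStopping(monitor='val_loss', patience=3, restore_best_weights=True)` passed to `model.fit(..., callbacks=[...])`.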

You could try GloVe embeddings with 300 dimensions.
Also, dropout in the LSTM does not make much sense to me. Could you try without it?
Could you share the results after you try the above? Then we can discuss better algorithms for the same problem.
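
For the 300-dimension suggestion, the embedding matrix has to be rebuilt and `EMBEDDING_DIM` set to 300. Here is a rough sketch of how that matrix could be built. It assumes a GloVe-style text file (one word followed by its vector per line, such as `glove.6B.300d.txt`) and a 1-based `word_index` dict like the one a Keras `Tokenizer` produces; the function name and the tiny fake data are hypothetical and only for illustration.

```python
import io

def build_embedding_matrix(glove_lines, word_index, dim):
    """Build a (len(word_index) + 1) x dim matrix as a list of lists.

    Row 0 stays all zeros for padding; words missing from the GloVe file
    also keep a zero row. Returns (matrix, coverage).
    """
    vectors = {}
    for line in glove_lines:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # malformed line, or file dimension does not match dim
        word = parts[0]
        if word in word_index:  # keep only the words we actually need
            vectors[word] = [float(x) for x in parts[1:]]

    matrix = [[0.0] * dim for _ in range(len(word_index) + 1)]
    for word, idx in word_index.items():
        if word in vectors:
            matrix[idx] = vectors[word]

    # Fraction of the vocabulary that received a pre-trained vector
    coverage = len(vectors) / len(word_index) if word_index else 0.0
    return matrix, coverage

# Tiny fake 3-d "GloVe" file, standing in for the real 300-d one
fake_glove = io.StringIO(
    "good 0.1 0.2 0.3\n"
    "bad -0.1 -0.2 -0.3\n"
    "film 0.0 0.5 0.5\n"
)
word_index = {"good": 1, "bad": 2, "movie": 3}
matrix, coverage = build_embedding_matrix(fake_glove, word_index, dim=3)
print(f"rows: {len(matrix)}, coverage: {coverage:.2f}")
```

With the real file, the result would be converted with `np.array(matrix)` before being passed as `weights=[...]` to the Embedding layer. The coverage figure is worth checking too: if it is very low, most inputs map to zero vectors and the frozen embedding carries little signal.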
