I am training a modified vision transformer on imaging data with ~1000 channels.
I am limited in the number of samples: only 10 images (~200x200x1000) are available to me. I have converted these into roughly 15k patches, each with an associated label, and the resulting dataset is balanced. I have also performed PCA on the channels to reduce the dimensionality. The current split is 6 training images, 2 validation, and 2 test.
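For context, here is a simplified sketch of the preprocessing, using the 8x8x25 patches with 50% overlap mentioned below (function names and the per-image PCA fitting are illustrative, not my exact code):

```python
import numpy as np
from sklearn.decomposition import PCA

def reduce_channels(image, n_components=25):
    """image: (H, W, C) array -> (H, W, n_components) via PCA over the channel axis."""
    h, w, c = image.shape
    flat = image.reshape(-1, c)                       # each pixel becomes one sample
    reduced = PCA(n_components=n_components).fit_transform(flat)
    return reduced.reshape(h, w, n_components)

def extract_patches(image, patch_size=8, overlap=0.5):
    """Slide a patch_size x patch_size window with the given overlap (stride = 4 here)."""
    stride = int(patch_size * (1 - overlap))
    h, w, _ = image.shape
    patches = []
    for y in range(0, h - patch_size + 1, stride):
        for x in range(0, w - patch_size + 1, stride):
            patches.append(image[y:y + patch_size, x:x + patch_size, :])
    return np.stack(patches)                          # (N, patch_size, patch_size, channels)

# One image of the stated shape -> patches of shape (N, 8, 8, 25)
img = np.random.rand(200, 200, 1000).astype(np.float32)   # stand-in for a real image
patches = extract_patches(reduce_channels(img))
print(patches.shape)
```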
These are the training and validation curves for my best results so far. The patches for these results were sized 8x8x25 and generated with 50% overlap:

[plot: training and validation loss curves]
My problem is understanding how to move forward with these results. Based on the validation metrics, the model does seem to be training and learning; however, the validation curve fluctuates a lot and I am not sure how to mitigate that.
What I have tried:
- Different patch sizes (4x4, 8x8, 16x16, etc.)
- Different numbers of retained channels (8, 16, 32, etc.)
- Different overlaps when generating patches (20%, 50%, etc.)
- Lowering learning rate
- Lowering weight decay
- Balancing dataset
These are the ways I tried to mitigate the fluctuation and improve the overall accuracy. However, they instead resulted in poorer performance, such as plateauing early in training and even more extreme fluctuation.
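For reference, the optimizer-side changes amount to something roughly like the sketch below (AdamW and the specific values are just examples to show which knobs I have been lowering; the model here is a placeholder, not my modified ViT):

```python
import torch
from torch import nn

model = nn.Linear(8 * 8 * 25, 2)        # placeholder for the patch classifier
optimizer = torch.optim.AdamW(
    model.parameters(),
    lr=1e-4,                            # lowered from my original learning rate
    weight_decay=1e-2,                  # lowered alongside it
)
criterion = nn.CrossEntropyLoss()

def train_step(patches, labels):
    """One optimization step on a batch of flattened patches."""
    optimizer.zero_grad()
    loss = criterion(model(patches), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```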