Test Accuracy Higher than Train accuracy?

akshata_bm · August 9, 2024, 2:11pm

Noted. I’ll try the MANOVA approach.

Another question, Does not having all the categorical values in all features across train, validation and test set cause an issue or bias?

For example, say after spliting the data, train set has all features with all categorical values in each of them as mentioned below:
features A with categories (0,1,2), B with categories (0,1,2,3,4) and C with categories
(0,1,2,3).

However, xval is having A (0,1,2), B(0,2,3), C(0,1,2).
xtest is having A(0,1), B(0,1,2,3), C(0,2).

Would this lead to poor performance of the model? or inaccurate performance?

Deepti_Prasad · August 9, 2024, 2:52pm

I am not stating to include all the features, first divide your features into independent and dependent variables based on the understanding of the disorder or disease you are addressing, this could be done using the p-value hypothesis like as you mentioned chi-square test.

Then based on hypothesis results nto your and which scored was more relative to your disorder would be feature to select for which type of MANOVA analysis approach you want to do.

A total other approach would be surely K-fold cross validation too.

Regards
DP

TMosh · August 9, 2024, 5:54pm

Yes, this is a big problem. It goes back to my original reply - your train, val, and test sets don’t have the same statistics.

Topic		Replies	Views
Test Accuracy higher than training for brain images AI for Medical Diagnosis week-1	5	415	August 18, 2023
Regularization Dropout Programming Assignment: How to intepretate when test accuracy is higher than training accuracy Improving Deep Neural Networks: Hyperparameter tun	2	636	November 13, 2022
Course 2 Week 1 Programming Assignment Regularization Improving Deep Neural Networks: Hyperparameter tun	7	693	September 10, 2021
Why validation_accuracy is higher than training_accuracy in my model? Convolutional Neural Networks in TensorFlow week-2	5	575	March 22, 2022
Course 4 Week 2 Project 2: Why Training Loss Is Higher than Validation Loss? Convolutional Neural Networks	2	425	August 2, 2023

Test Accuracy Higher than Train accuracy?

Related topics