Hi there, thanks for your question!
Without looking at the actual pictures, it's hard to judge remotely. But if you have many false negatives, you want your model to learn more about what an anomaly looks like, meaning:
- a more complex model could make sense for learning the "anomalies". You might want to try less dropout or less regularization. See also: Training set error?
- the threshold can be tuned to reduce false negatives. This is a trade-off, though: the false positives will generally get worse in return…
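To make the threshold trade-off concrete, here is a minimal sketch. It assumes your model outputs one anomaly score per picture, with higher meaning "more anomalous"; the scores and labels below are made up purely for illustration, and `count_errors` is just a hypothetical helper name.

```python
import numpy as np

def count_errors(scores, labels, threshold):
    """Count false negatives and false positives at a given threshold.

    Assumption: higher score = more anomalous; label 1 = anomaly, 0 = normal.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    predicted = scores >= threshold  # True = flagged as anomaly
    fn = int(np.sum(~predicted & (labels == 1)))  # missed anomalies
    fp = int(np.sum(predicted & (labels == 0)))   # false alarms
    return fn, fp

# Made-up scores and labels, only for illustration.
scores = [0.10, 0.20, 0.35, 0.40, 0.55, 0.60, 0.80, 0.90]
labels = [0, 0, 0, 1, 0, 1, 1, 1]

for t in (0.3, 0.5, 0.7):
    fn, fp = count_errors(scores, labels, t)
    print(f"threshold={t:.1f}  FN={fn}  FP={fp}")
```

Lowering the threshold catches more anomalies but raises more false alarms, so the "right" value depends on which error is more expensive for you.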
There are several strategies, e.g. you could try to get more, better and more realistic normal labels. You could also think of applying some artificial noise (e.g. white noise) to make the model more robust overall.
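As a rough sketch of the noise idea, assuming your pictures are float arrays scaled to [0, 1]: the function name `add_white_noise` and the value of `sigma` are my own placeholders, and you would need to tune the noise level for your data.

```python
import numpy as np

def add_white_noise(images, sigma=0.05, seed=None):
    """Add zero-mean Gaussian (white) noise and clip back to [0, 1].

    Assumption: `images` is a float array with values in [0, 1].
    """
    rng = np.random.default_rng(seed)
    noisy = images + rng.normal(0.0, sigma, size=images.shape)
    return np.clip(noisy, 0.0, 1.0)

# Dummy batch of 4 grey 32x32 pictures, only for illustration.
batch = np.full((4, 32, 32), 0.5)
noisy_batch = add_white_noise(batch, sigma=0.05, seed=0)
```

You would typically apply this only to the training data and keep validation and test pictures untouched.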
There are cases where tilting conflicts with your business problem (e.g. think of tilting a 9 or a 6 too much if you do digit classification). But if tilting or flipping pictures is OK from your domain point of view, it might be worth a try to help the model generalise and learn what the defects are actually about. Here are some other techniques for augmentation:
- https://neptune.ai/blog/data-augmentation-in-python/amp
- Data Augmentation Techniques in CNN using Tensorflow | by Prasad Pai | YML Innovation Lab | Medium
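If flips and rotations are fine in your domain, a very small augmentation sketch could look like this. It only uses NumPy, so it is limited to flips and 90° steps; for small-angle tilts you would need something like `scipy.ndimage.rotate` or the augmentation layers of your framework. The helper name `flip_and_rotate` is hypothetical.

```python
import numpy as np

def flip_and_rotate(image, rng):
    """Randomly flip an image and rotate it by a multiple of 90 degrees.

    Assumption: `image` has shape (H, W) or (H, W, C).
    """
    if rng.random() < 0.5:
        image = np.fliplr(image)   # mirror left/right
    if rng.random() < 0.5:
        image = np.flipud(image)   # mirror up/down
    k = int(rng.integers(0, 4))    # 0, 1, 2 or 3 quarter turns
    return np.rot90(image, k)

rng = np.random.default_rng(0)
image = np.arange(64, dtype=float).reshape(8, 8)  # dummy picture
augmented = flip_and_rotate(image, rng)
```

The pixel values are only rearranged, never changed, so a defect stays a defect; it just shows up in a different position or orientation.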
This part of your question was not clear to me:
What does "being detected as anomaly" actually mean here? As I understood it, your problem is too many false negatives (FNs), i.e. the algorithm did not correctly identify the anomalies as such… What about the false positives (FPs)?
I am asking because I would be interested in how you chose your threshold.
Also: did you run an ROC analysis with AUC?
I would highly recommend taking a look at it. It might also help you choose your threshold well, considering both FPs and FNs.
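In case it helps, here is a minimal NumPy-only sketch of such an analysis, again on made-up scores and labels (higher score = more anomalous is an assumption). Picking the threshold via Youden's J (TPR minus FPR) is just one common heuristic that I am using for illustration, not the only way to do it; libraries such as scikit-learn offer ready-made ROC functions as well.

```python
import numpy as np

def roc_points(scores, labels):
    """Return FPR, TPR and the thresholds used, from strict to lenient."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    thresholds = np.unique(scores)[::-1]  # descending
    n_pos = np.sum(labels == 1)
    n_neg = np.sum(labels == 0)
    fpr, tpr = [0.0], [0.0]  # start at "flag nothing"
    for t in thresholds:
        predicted = scores >= t
        tpr.append(np.sum(predicted & (labels == 1)) / n_pos)
        fpr.append(np.sum(predicted & (labels == 0)) / n_neg)
    return np.array(fpr), np.array(tpr), thresholds

# Made-up scores and labels, only for illustration.
scores = [0.10, 0.20, 0.35, 0.40, 0.55, 0.60, 0.80, 0.90]
labels = [0, 0, 0, 1, 0, 1, 1, 1]

fpr, tpr, thresholds = roc_points(scores, labels)

# Area under the ROC curve via the trapezoidal rule.
auc = float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2))

# Youden's J: one heuristic for picking a threshold (first maximum wins).
j = tpr[1:] - fpr[1:]
best_threshold = float(thresholds[np.argmax(j)])

print(f"AUC={auc:.4f}  suggested threshold={best_threshold:.2f}")
```

If missing a defect is much more costly than a false alarm, you would pick a point further along the curve (higher TPR at the price of higher FPR) rather than the maximum of J.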
Note: a good friend of mine did his PhD in defect detection and reconstruction. Feel free to take a look at these repos:
- GitHub - AbdullahKaratas/Reconstruction-with-PCA: Reconstruction using PCA
- GitHub - AbdullahKaratas/Generate-Cutting-Edge-and-CNN-Reconstruction: Generate Cutting Edges And CNN Reconstruction
Best
Christian