Deep Neural Network Construction for similar dataset

I’m building a CNN that is aimed to distinguish very similar images. Should I make the network deep using ResNet or should I use other architectures? Please give me some suggestions or reading materials!

Try out different things. Just pick anyone and start and see if it works. Then do some changes accordingly.