In lesson #4 the lecturer nicely presented the “original” form of the contrastive loss function:
\mathcal{L} = \sum_{i,j} \left[
    y_{ij} \cdot \left( 1 - \textrm{sim}(u_i, v_j) \right)^2
    + (1 - y_{ij}) \cdot \max\left(0,\; \textrm{sim}(u_i, v_j) - m \right)^2
\right]
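To check my understanding, here is a quick sketch of this loss in PyTorch (my own translation, not from the lecture; I'm assuming the positives $y_{ij}$ sit on the diagonal of an $N \times N$ similarity matrix $S$, and that $m$ is a margin hyperparameter):

```python
import torch

def pairwise_contrastive_loss(S: torch.Tensor, m: float = 0.5) -> torch.Tensor:
    # S[i, j] = sim(u_i, v_j); positives assumed on the diagonal (y_ij = 1 iff i == j)
    N = S.size(0)
    y = torch.eye(N, device=S.device)
    pos = y * (1.0 - S).pow(2)                            # pull matched pairs toward sim = 1
    neg = (1.0 - y) * torch.clamp(S - m, min=0.0).pow(2)  # push mismatched pairs below margin m
    return (pos + neg).sum()
```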
But the actual loss that is minimized is based on the cross-entropy, where $S_{ij} = \textrm{sim}(u_i, v_j)$ (note the leading minus sign, since it is a negative log-likelihood):
-\frac{1}{N} \sum_{i=1}^{N} \log \left(
    \frac{\exp(S_{ii})}{\sum_{j=1}^{N} \exp(S_{ij})}
\right)
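Written as code, this second loss is just a softmax cross-entropy over each row of $S$, with the diagonal entry as the target class (again a sketch in PyTorch; I'm assuming $S$ already includes any temperature scaling):

```python
import torch
import torch.nn.functional as F

def cross_entropy_contrastive_loss(S: torch.Tensor) -> torch.Tensor:
    # Each row i of S is a vector of logits; the correct "class" is column i.
    N = S.size(0)
    targets = torch.arange(N, device=S.device)
    # F.cross_entropy averages -log(exp(S_ii) / sum_j exp(S_ij)) over the batch
    return F.cross_entropy(S, targets)
```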
I understand that these two functions are possibly surrogates for each other, since both push the similarity matrix $S$ toward the identity matrix (diagonal entries high, off-diagonal entries low). But I am not sure how to choose between the two. I think the CLIP paper also used the cross-entropy based loss.
Is it true that the cross-entropy based loss is more widely used in recent work?