
loss computation: mean and not sum #135

Open
CassNot opened this issue Aug 3, 2023 · 4 comments
Comments

CassNot commented Aug 3, 2023

Dear authors,

Thank you for your code!

We had a question about the loss implementation. We noticed that for each minibatch the loss is averaged rather than summed, as in the paper (https://arxiv.org/pdf/2004.11362.pdf, Equation 2):

loss = loss.view(anchor_count, batch_size).mean()

We were wondering if there was a reason for this choice.

Thank you

@HobbitLong (Owner)
Good catch! I think Eq. 2 in the paper omits the 1/(2N) normalization factor.
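To make the relationship concrete, here is a minimal sketch (with hypothetical per-anchor loss values, not the repository's actual tensors) showing that the implementation's mean is just the paper's sum rescaled by the constant 1/(2N):

```python
# Hedged sketch: suppose N = 4 samples, each with 2 augmented views,
# giving 2N = 8 anchors. The per-anchor loss values below are made up.
N = 4
per_anchor_loss = [0.5, 1.0, 0.25, 0.75, 2.0, 1.5, 0.1, 0.9]  # length 2N

paper_sum = sum(per_anchor_loss)                         # Eq. 2 as written (no prefactor)
code_mean = sum(per_anchor_loss) / len(per_anchor_loss)  # what .mean() computes

# The mean is the sum divided by the constant 2N, so the two losses
# differ only by a constant scale, which folds into the learning rate.
assert abs(code_mean - paper_sum / (2 * N)) < 1e-12
print(code_mean)  # 0.875
```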

@dave4422
Hi,

I've been reviewing the implementation, and I noticed the line loss = loss.view(anchor_count, batch_size).mean(). Given the computations, it seems that the result would be equivalent to simply using loss.mean(). Could you kindly explain the rationale behind the reshaping here?
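For what it's worth, a quick sketch (pure Python standing in for the tensors, with made-up values) confirms the equivalence: reshaping to (anchor_count, batch_size) before averaging yields the same number as averaging the flat tensor, since the mean reduces over all elements regardless of shape.

```python
# Hypothetical flat per-anchor losses for anchor_count = 2, batch_size = 4.
anchor_count, batch_size = 2, 4
loss = [0.5, 1.0, 0.25, 0.75, 2.0, 1.5, 0.1, 0.9]

# Mimic loss.view(anchor_count, batch_size): reshape into a 2x4 grid...
grid = [loss[i * batch_size:(i + 1) * batch_size] for i in range(anchor_count)]
reshaped_mean = sum(sum(row) for row in grid) / (anchor_count * batch_size)

# ...and compare with loss.mean() on the flat tensor.
flat_mean = sum(loss) / len(loss)
assert abs(reshaped_mean - flat_mean) < 1e-12  # identical: mean ignores shape
```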

@dave4422
I assume it's just for readability?

@HobbitLong (Owner)

Yeah, it's just there to make the shape explicit (it may help readers understand what's going on).
