You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It seems that the predicted features (p) are not in the negatives, which is different from what's suggested in the paper (appendix B). I understand that you switch p and z here (for a symmetric loss?)
but there is still no comparisons between different samples in p.
In the paper the distillation loss is applied to the two views independently. Based on the code above, does it mean that we should use them jointly to reproduce the result?
The four lines of code here seem to make logit_mask an all-ones matrix. In my understanding we should assign the diagonals to False. Am I missing something?
TIA
The text was updated successfully, but these errors were encountered:
Hi,
I have a few questions about the simclr code.
cassle/cassle/losses/simclr.py
Line 21 in b5b0929
It seems that the predicted features (p) are not in the negatives, which is different from what's suggested in the paper (appendix B). I understand that you switch p and z here (for a symmetric loss?)
cassle/cassle/distillers/contrastive.py
Lines 65 to 68 in b5b0929
but there is still no comparisons between different samples in p.
In the paper the distillation loss is applied to the two views independently. Based on the code above, does it mean that we should use them jointly to reproduce the result?
cassle/cassle/losses/simclr.py
Lines 30 to 33 in b5b0929
The four lines of code here seem to make logit_mask an all-ones matrix. In my understanding we should assign the diagonals to False. Am I missing something?
TIA
The text was updated successfully, but these errors were encountered: