Models not set to eval mode #17

Open
bemi127 opened this issue Jan 15, 2024 · 3 comments

@bemi127

bemi127 commented Jan 15, 2024

Why are the models not set to eval() mode before running inference? If my understanding is correct, this means that dropout, etc. will be applied when perturbations are being generated and log likelihood is estimated.

@joegenius98

joegenius98 commented Feb 4, 2024

I'm no expert, but since none of the models are actually trained in DetectGPT's experiments, I believe there isn't strictly a need to put them in eval mode.

But I agree it is still good practice to call .eval() explicitly, even when you're just loading pre-trained weights.
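
For reference, a minimal sketch of what that could look like with a Hugging Face causal LM (the model name and scoring snippet are illustrative, not taken from this repo):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative model choice; DetectGPT's experiments use several base models.
model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()  # explicitly disable dropout and other train-time behavior

with torch.no_grad():  # no gradients needed when just scoring text
    inputs = tokenizer("Some passage to score.", return_tensors="pt")
    outputs = model(**inputs, labels=inputs["input_ids"])
    # outputs.loss is the mean per-token negative log-likelihood
    log_likelihood = -outputs.loss.item()
```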

@bemi127

bemi127 commented Feb 4, 2024

It is correct that the base models are not explicitly retrained or fine-tuned. However, the perplexities computed in eval mode will be more accurate, since each attention layer sees the full context and is not affected by dropout.

@joegenius98

The good thing is that we can try it out and see what happens.
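
For what it's worth, here is a rough (hypothetical) way one might check the effect, assuming GPT-2's default config, which keeps dropout enabled in train mode:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
inputs = tokenizer("The quick brown fox jumps over the lazy dog.", return_tensors="pt")

def mean_nll(m):
    # Mean per-token negative log-likelihood of the fixed input.
    with torch.no_grad():
        return m(**inputs, labels=inputs["input_ids"]).loss.item()

model.train()   # dropout active: scores are noisy and typically worse
nll_train = mean_nll(model)
model.eval()    # dropout disabled: deterministic scores
nll_eval = mean_nll(model)
print(f"train-mode NLL: {nll_train:.4f}   eval-mode NLL: {nll_eval:.4f}")
```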
