Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RF Artifacts after long training times #37

Open
alex404 opened this issue Oct 22, 2024 · 2 comments
Open

RF Artifacts after long training times #37

alex404 opened this issue Oct 22, 2024 · 2 comments
Labels
Simulation Hypotheses/problems to be resolved with simulations

Comments

@alex404
Copy link
Contributor

alex404 commented Oct 22, 2024

In certain regimes the receptive fields across all layers decohere and exhibit weird artifacts:

  • Green/red 45 degree gabor overlayed on all RFs, with frequencies that are fairly independent of layer.
  • General movement upward of RFs, and decoherence into colour splashes in RFs that began as gabors.

This seems to emerge over training, and became more acute when I changed the defaults of the image transformations to be less extreme (mostly not quite so dark). When I switched back to the older config this "fixed" the artifacts. It seems like it might be some kind of overfitting thing, but weight_decay doesn't help. I'll keep running tests. This seems like less of a bug or more like a training issue at this point, so I think we're ready to merge.

@alex404 alex404 added the Simulation Hypotheses/problems to be resolved with simulations label Oct 22, 2024
@alex404
Copy link
Contributor Author

alex404 commented Oct 23, 2024

This still happens on master before circuit-extensions was merged. I could run simulations going back even more merges, but I have a filling this has always been there and we just haven't tripped over it/noticed.

@alex404
Copy link
Contributor Author

alex404 commented Oct 25, 2024

Current hypothesis that I'm testing:

  • Some kind of sparse+noisy autoencoder might might help
    • This in turn might call for post FC decoding
  • Might also be helped by managing depth. It'll be okay if very deep layers decohere, we just want a certain number of initial layers not to decohere.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Simulation Hypotheses/problems to be resolved with simulations
Projects
None yet
Development

No branches or pull requests

1 participant