intermittent NaNs #30

Open
umeshksingla opened this issue Aug 25, 2023 · 1 comment
Comments

@umeshksingla

Fitting the model intermittently terminates with NaNs, and we haven't been able to locate the cause. For example, if we fit the same dataset 8 times in separate jobs, 5 runs finish successfully but the other 3 produce NaNs at the 47th or 48th SLDS iteration. We initialize the parameters from the preceding ARHMM fits. In case it helps, we are using latent_dim=10 (10 PCA components explain 90% of the variance) and nlags=3.

We have verified that single precision is not the cause. I have also tried different dataset sizes (150k and 2 million frames) and different samples at the same size to rule out insufficient data, but we still get NaNs about 30-35% of the time. This is with keypoint-moseq v0.1.5 and jax-moseq v0.0.3. We also tested newer releases of jax-moseq with parallel LGSSM support, but those produced NaNs earlier in training (around the 20th SLDS iteration) almost every time, regardless of dataset size, so we reverted.

Have you encountered a similar issue before? What could be a good way to assess the quality of the fits at each SLDS iteration or, in general, debug this issue?
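
One way to narrow this down might be to check every array in the model state for non-finite values after each iteration, so that the first quantity to go bad can be identified. The snippet below is only a minimal sketch of that idea: `find_nonfinite` is a hypothetical helper, not part of keypoint-moseq or jax-moseq, and the nested `state` dictionary is a made-up placeholder for whatever the fitting loop actually returns. JAX also has a `jax_debug_nans` config flag (`jax.config.update("jax_debug_nans", True)`) that raises an error at the operation that first produces a NaN, which could complement a check like this.

```python
import numpy as np


def find_nonfinite(tree, path=""):
    """Return the paths of all leaves in a nested dict/list/tuple
    whose values contain NaN or +/-inf."""
    bad = []
    if tree is None:
        return bad  # nothing to check
    if isinstance(tree, dict):
        for key, value in tree.items():
            bad += find_nonfinite(value, f"{path}/{key}")
    elif isinstance(tree, (list, tuple)):
        for i, value in enumerate(tree):
            bad += find_nonfinite(value, f"{path}/{i}")
    else:
        # np.asarray also accepts JAX arrays, copying them to host memory
        arr = np.asarray(tree, dtype=float)
        if not np.all(np.isfinite(arr)):
            bad.append(path)
    return bad


# Hypothetical placeholder for the model state after one iteration;
# the key names here are illustrative only.
state = {
    "params": {
        "dynamics": np.eye(3),
        "noise_cov": np.array([[1.0, np.nan], [0.0, 1.0]]),
    },
    "latents": np.zeros((5, 2)),
}

bad_leaves = find_nonfinite(state)
print(bad_leaves)  # ['/params/noise_cov']
```

Calling such a helper at the end of each iteration, and saving a checkpoint of the last fully finite state, would make it possible to resume from just before the failure and inspect which update introduces the NaNs.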

cc @r-shruthi11

@calebweinreb
Contributor

Hello,

I'm sorry to hear that. Probably the easiest thing would be to send me a minimal example of a dataset that produces NaNs, along with the notebook you used for fitting it. Any file sharing method is good. You can use my email ([email protected])
