Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AssertionError: dimensionality mismatch #59

Open
wjddyd66 opened this issue Sep 29, 2020 · 1 comment
Open

AssertionError: dimensionality mismatch #59

wjddyd66 opened this issue Sep 29, 2020 · 1 comment

Comments

@wjddyd66
Copy link

My Dataset is Like this...

SNP
image

Gene Expression
image

Other Dataset
image

I have two question to mofapy/run/python_template.py

First

ep.set_data(data)

When the above code is executed, the following result window appears.

...  
Warning: columns seem to be the shared axis, transposing the data...  
...  

Is this warning code to group each sample together with the same samples based on the Pandas Index?

Second

ep.train_model()

When the above code is executed, the following result window appears.

...  
AssertionError: dimensionality mismatch
...  

image

Which dimension should be matched?

For reference, in the case of other datasets (complete data, the number of samples is the same in all modalities), no error occurred.

In the case of Incomplete Data, what additional work should be done?

@rargelaguet
Copy link
Contributor

rargelaguet commented Sep 29, 2020

HI @wjddyd66 ,
I am not sure what is going on, but please update to MOFA2 (https://github.com/bioFAM/MOFA2), as stated in the README file. The second version is much improved and this repository is no longer mantained.

See the "Getting started vignette" for Python. In addition, we now have a downstream analysis package coded in Python (https://github.com/gtca/mofax). It is less comprehensive though than the R package

Regarding SNPs, they are unlikely to be succesful in MOFA. This is because the genome does not show display many sources of variation between the features, except for population structure and related variables. This is unlike the transcriptome, which works in the form of a large number of gene regulatory networks that can vary massively across cell states, tissues, etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants