-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The ncc file #1
Comments
How many contacts (NCC lines) were input for the large file, what was the smallest scale resolution for the calculation (-s option) and what was the last reported output? Also, is this single-cell data? The nuc_dynamics program has been tested, without issue, to about 500,000 single-cell contacts at 100 kb resolution on a machine with 16 Gb RAM. This calculation completed overnight on a 3.5 GHz Xeon E5 CPU. |
There were 130 million contacts. The commods was as follows: |
In essence, this software is not designed for traditional, population Hi-C data. As described in the README, it is really only for single-cell Hi-C data. I suspect that 130 million contacts would give a memory problem with the restraint list. In any case, compared to single-cell data, the structure calculation will not work well with population Hi-C, where each bead would be restrained to a large proportion of others. I will investigate the memory allocation for the restraints and impose a limit (with a user warning) so that the code at least doesn't break. Nonetheless, in order to get some kind of averaged structure from population data I recommend binning the contacts into regions corresponding to the finest resolution (so 0.5 Mb in this case) and then selecting the most significant pairs (e.g. largest fold change, comparing observed count with expected at each sequence separation - the basic Hi-C normalization) and limiting to a count of 500,000. An NCC format input file would have to be made, but it need only reflect the chromosomal regions for the binned sections. Also, the "-upper" distance limit for the restraints may have to be increased to accommodate the highly antagonistic contacts. |
The program nuc_dynamics had been run for a large file. It took several days to finish the program, but it didn't produce the result file without any error message. The head 10000 lines took for a test, and it works with the pdb result. I don't know how to solve it.
The text was updated successfully, but these errors were encountered: