Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What are the ordering files? #1

Open
Astahlke opened this issue Apr 3, 2021 · 1 comment
Open

What are the ordering files? #1

Astahlke opened this issue Apr 3, 2021 · 1 comment

Comments

@Astahlke
Copy link

Astahlke commented Apr 3, 2021

Huge thanks for making this tool - I'm hoping to avoid re-running my repeat annotation. Can you provide some details on what the ordering files should look like? How are they generated upstream of this? Would they be produced via juicebox?

Amanda

@shawnpg
Copy link
Collaborator

shawnpg commented Apr 8, 2021

Hi Amanda,

These are ordering files of the kind produced by LACHESIS (https://github.com/shendurelab/LACHESIS), but they are a pretty simple file format and should be fairly easy to generate from scaffold metadata you already have somewhere. I believe a .assembly file, which Juicebox can produce, would be able to be converted into a .ordering file fairly easily.

.ordering files are a a 5-column, tab-delimited text file format. These are what the columns represent:

contig_ID(local) contig_name contig_rc orientation_Q_score gap_size_after_contig

Where:

  • contig_ID is an ID number for the contig (it should work fine if you just use the ID of the contig in the .assembly file)
  • contig_name is the name of the contig
  • contig_rc is a bit which specifies whether the contig should be in reverse complement (1) or forward (0) orientation in the scaffold, relative to the original assembly FASTA
  • orientation_Q_score is a quality score for the orientation. For purposes of this script, it doesn't matter, so putting all 0.0's should be fine
  • gap_size_after_contig is the number of gap ("N") base pairs to place after the contig. If the gap size is unknown, put "."

Here is an example of a .ordering file to show what they look like too. Let us know if this works out for you, or if you see any strange behavior. It's possible using hand-crafted .ordering files will turn over some bug that real ones don't hit.

Thanks,

Shawn

group0.ordering.txt

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants