Generate partitioned data #30

Closed
tomwhite opened this issue Apr 16, 2015 · 3 comments

Comments

@tomwhite
Member

This is called locuspart in the spec.

@tomwhite
Member Author

PR at #31.

Not ready for commit yet as it depends on bigdatagenomics/adam#651, which itself has some dependencies on Kite changes.

@tomwhite
Member Author

I updated #33 to use a new partitioning job (from https://github.com/tomwhite/adam-partitioning; see bigdatagenomics/adam#651 for discussion of the Spark-based one).

This is not quite ready, as the number of partitions should be set based on the cluster size.
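
For illustration only, here is a minimal sketch of one common heuristic for deriving a partition count from cluster size: give every available core a small, fixed number of tasks. The function name, its parameters, and the default multiplier of 2 are all assumptions made for this example; none of them come from the partitioning job or from ADAM.

```python
def suggest_num_partitions(num_executors: int,
                           cores_per_executor: int,
                           tasks_per_core: int = 2) -> int:
    """Hypothetical helper: scale the partition count with the cluster size.

    Assumes each core should receive `tasks_per_core` partitions so that
    no core sits idle while slower tasks finish. The multiplier is a
    tunable guess, not a value taken from the job discussed here.
    """
    if num_executors < 1 or cores_per_executor < 1 or tasks_per_core < 1:
        raise ValueError("all arguments must be positive integers")
    return num_executors * cores_per_executor * tasks_per_core


# Example: 10 executors with 4 cores each, 2 tasks per core.
print(suggest_num_partitions(10, 4))  # 80
```

A real job would probably also need to account for input size and skew across loci, which this sketch ignores.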

@laserson
Contributor

Closing for cleanup/refactor. This may be taken care of by Kite/Hive once we're on a Director cluster.
