-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dockerize supersmart #52
Comments
This move will also rid us of this trouble: hashicorp/vagrant#3341 |
So, what would be the advantages of Docker:
Disadvantages:
|
We might be able to do something like this, to re-use our puppet manifest: https://puppetlabs.com/blog/docker-and-puppet-for-application-management |
It would make sense, once we make this move, to do so in compliance with the recommendations of bioboxes (http://bioboxes.org/guide/developer/) |
@bioboxes/core-team would be happy to help with creating a biobox for |
The inputs and outputs are a bit debatable. In the current version, supersmart is an entire pipeline (in a VM) that starts out with a list of species names and fossil calibration points and results in an annotated tree. In the process, quite a few different tools are used:
It would probably be most useful if a bioboxed version of the pipeline would be composed of multiple containers so that some of these tools can be re-used independently by the community (for example: treePL is quite a hassle to compile, so would be nice if people had bioboxed access to it). Consequently, there might be a little ecosystem of boxes with their own inputs and outputs. At least, that's kind of how we're thinking about this right now - but I would be keen to hear your advice. |
If you mean a Docker container comprising a pipeline of smaller Docker Alternatively if you are suggesting decomposing your pipeline into Docker
I believe an effort like nucleotid.es for MSAs or tree builders would also be |
I am now actually discussing technology that I don't have enough hands-on experience with (yet) to really know what I'm talking about, but what I had in mind is docker compose. Are we talking about the same thing?
Yup, something like that. For the pipeline we have done the work to define how to compile all the required tools (and have recorded that in our puppet manifest). I think it would be most useful if that groundwork was available to as many people as possible. Which probably means it should be broken up into different boxes that people can mix and match as needed. |
I am now actually discussing technology that I don't have enough hands-on
experience with (yet) to really know what I'm talking about, but what I
had in mind is [docker compose](https://docs.docker.com/compose/). Are we
talking about the same thing?
As I understand it, Docker compose is used to create linked applications of
multiple long running containers such as web servers and so forth. I do not
think it has the ability to construct workflows where the output of one
container is passed as the input to another. I believe the common workflow
language, nextflow.io and galaxy are both however working on this.
Yup, something like that. For the pipeline we have done the work to
define how to compile all the required tools (and have recorded that in
our [puppet manifest](https://github.com/naturalis/supersmart/blob/master/conf/manifests/default.pp)).
I think it would be most useful if that groundwork was available to as
many people as possible. Which probably means it should be broken up into
different boxes that people can mix and match as needed.
I agree, in particular I think bioboxes is most useful where there are many
different implementations of essentially the same task. This then makes it
useful to be able to swap one container for another of the same type
without having to change anything else in the pipeline.
|
+1.... We're considering at Supersmart for http://science.kew.org/strategic-output/plant-and-fungal-trees-life and a Dockerized build (better yet an AMI) could be really useful. |
Should be self-explanatory: with the rise of docker it makes a lot of sense to release a dockerized SUPERSMART. Deferred for now.
The text was updated successfully, but these errors were encountered: