Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write a call-for-contributions commentary article for gigascience. #87

Closed
michaelbarton opened this issue Mar 12, 2015 · 51 comments
Closed

Comments

@michaelbarton
Copy link
Contributor

We will publish an commentary in GigaScience (discussed in #45 and #47). This
commentary will be a call for feedback and implementations of bioboxes. The word
limit for the editorial is 1200 words. The first step is to plan the outline of
this article.

The resolution of this issue should be a text plan as a bioboxes github repo.
Once this is created, other members of @bioboxes/core-team should provide
feedback through PRs until we reach consensus.

@abremges
Copy link
Contributor

Instructions for authors (an editorial is the second form listed AFAIK):
http://www.gigasciencejournal.com/authors/instructions/commentary

@ScottBGI
Copy link

FYI, unless we as the editors wrote this ourselves we'd probably take this as a commentary rather than editorial. Its essentially the same type of article though, and the word count and instructions for authors @abremges has posted are correct. The other thing to watch out for is it has to have 10 or less references. You can see a few examples of similar "call to arms"/project announcement papers here:

http://www.gigasciencejournal.com/content/2/1/2
http://www.gigasciencejournal.com/content/1/1/5 (this one was too long so we counted it as a short review instead)

If you want any feedback before you submit feel free to pass a draft to me. Looking forward to seeing the results soon!

@prvst
Copy link

prvst commented Mar 13, 2015

sorry if this is offtopic, but I think it may interest everyone; just saw this today:
http://arxiv.org/abs/1410.0846

@asczyrba
Copy link
Contributor

this is actually published already

http://dl.acm.org/citation.cfm?id=2723882 http://dl.acm.org/citation.cfm?id=2723882

On 13 Mar 2015, at 16:24, Felipe Leprevost [email protected] wrote:

sorry if this is offtopic, but I think it may interest everyone; just saw this today:
http://arxiv.org/abs/1410.0846 http://arxiv.org/abs/1410.0846

Reply to this email directly or view it on GitHub #87 (comment).

@avilella
Copy link

I think we have Carl Boettiger in the list of stakeholders for bioboxes,
haven't we?

@michaelbarton , did we contact him?

On Fri, Mar 13, 2015 at 3:31 PM, asczyrba [email protected] wrote:

this is actually published already

http://dl.acm.org/citation.cfm?id=2723882 <
http://dl.acm.org/citation.cfm?id=2723882>

On 13 Mar 2015, at 16:24, Felipe Leprevost [email protected]
wrote:

sorry if this is offtopic, but I think it may interest everyone; just
saw this today:
http://arxiv.org/abs/1410.0846 http://arxiv.org/abs/1410.0846

Reply to this email directly or view it on GitHub <
https://github.com/bioboxes/rfc/issues/87#issuecomment-79043179>.


Reply to this email directly or view it on GitHub
#87 (comment).

@avilella
Copy link

I emailed Carl just now. Let's see if he gets back to us.

On Fri, Mar 13, 2015 at 3:33 PM, Albert Vilella [email protected] wrote:

I think we have Carl Boettiger in the list of stakeholders for bioboxes,
haven't we?

@michaelbarton , did we contact him?

On Fri, Mar 13, 2015 at 3:31 PM, asczyrba [email protected]
wrote:

this is actually published already

http://dl.acm.org/citation.cfm?id=2723882 <
http://dl.acm.org/citation.cfm?id=2723882>

On 13 Mar 2015, at 16:24, Felipe Leprevost [email protected]
wrote:

sorry if this is offtopic, but I think it may interest everyone; just
saw this today:
http://arxiv.org/abs/1410.0846 http://arxiv.org/abs/1410.0846

Reply to this email directly or view it on GitHub <
https://github.com/bioboxes/rfc/issues/87#issuecomment-79043179>.


Reply to this email directly or view it on GitHub
#87 (comment).

@ypriverol
Copy link

@leprevost Both examples are fine but they are more like introduction to Docker. I read both of them and they basically propouse the idea of reproduce science using docker, but this idea is not new, some other references before propose the idea of store the machines that were used for the bioinfo analysis (http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003285).

@michaelbarton I don't know how feasible could be to write this manuscript on Bioinformatics or in Plos Computational Biology. But is just an idea

@avilella
Copy link

Thanks @ypriverol .

Let me just briefly summarise where we are now, and please
everybody/anybody correct me if I am wrong. I think there are two stages
here:

One is the short editorial/comment publication, which is this github ticket
#87, and it seems gigascience are keen on us publishing there.

Then there is something bigger, which is the first draft of the
specification, which in Summer time we will present at BOSC, and just after
BOSC, given the feedback from the community at large, we will turn into a
manuscript (similar to MIAME or MIAPE), and submit to a scientific journal.

Cheers

On Mon, Mar 16, 2015 at 11:53 AM, Yasset Perez-Riverol <
[email protected]> wrote:

@leprevost https://github.com/Leprevost Both examples are fine but they
are more like introduction to Docker. I read both of them and they
basically propouse the idea of reproduce science using docker, but this
idea is not new, some other references before propose the idea of store the
machines that were used for the bioinfo analysis (
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003285).

@michaelbarton https://github.com/michaelbarton I don't know how
feasible could be to write this manuscript on Bioinformatics or in Plos
Computational Biology. But is just an idea


Reply to this email directly or view it on GitHub
#87 (comment).

@ypriverol
Copy link

@avilella thanks for clarification.

@michaelbarton michaelbarton changed the title Create a planned outline for a bioboxes editorial publication. Create a planned outline for a bioboxes commentary. Mar 16, 2015
@michaelbarton michaelbarton changed the title Create a planned outline for a bioboxes commentary. Create a planned outline for a bioboxes commentary article. Mar 16, 2015
@michaelbarton
Copy link
Contributor Author

Thanks for the clarification @ScottBGI. I've updated the issue title to reflect this is a commentary.

@michaelbarton
Copy link
Contributor Author

I've created a repository for the article here - https://github.com/bioboxes/commentary-article

I encourage anyone to who would like to start planning this article to go ahead and submit a pull request. The BOSC 2015 abstract deadline is approaching so I will also ask that someone consider taking a look at this issue too - #65

@michaelbarton
Copy link
Contributor Author

@ypriverol What @avilella wrote is correct. I think we've agreed that a larger second article be written when the standard is more mature and we are seeing bioboxes being regularly used.

@michaelbarton
Copy link
Contributor Author

I have written an initial draft of this article.

https://github.com/bioboxes/commentary-article/tree/initial-draft

@michaelbarton
Copy link
Contributor Author

@bioboxes/core-team @ScottBGI I have written an initial draft within the 1200 word limit. What do you think?

https://github.com/bioboxes/commentary-article

@abremges
Copy link
Contributor

Great starting point (and more), @michaelbarton

  1. As bioboxes gained some momentum by now, I believe we should include bioboxes in in the title.
  2. You motivate bioboxes in much detail, but ignore its history (joining forces between your nucleotid.es and our CAMI initiatives). I think this should be included, as without either of these bioboxes would not exist.
  3. I'd emphasize the work done within the bioboxes initiative a bit more (i.e. which bbx standards are there already, how many bbx implementations do we have ready to use)
  4. Make the call to action more concrete: Ask developers to actively form the bioboxes standards, and package their tools as bioboxes.

Let me know if you want me to get actively involved, or if this is reserved to the @bioboxes/core-team

@michaelbarton michaelbarton changed the title Create a planned outline for a bioboxes commentary article. Write a call-for-contributions commentary article for gigascience. Jul 16, 2015
@michaelbarton
Copy link
Contributor Author

I've updated the issue title to reflect this more than a discussion of the outline. I closed related issue #45 in favor of this one.

@ScottBGI
Copy link

Looks a good start, but agree with @abremges points, especially the need for an explicit call for action - as this is sort of the point of the announcement paper. Plugging the website, GitHub, RFC could be useful, plus you could even link out to some interactive examples. Can probably squeeze in another few hundred words, but the 10 reference limit is more fixed (including URLs). You can also fit more in if you include figures or links to dockerised examples. Your Balti & Bioinformatics talk had lots of nice graphics, and if there was ways to explain the rationale with flow diagrams or infographics that is all bonus wordcount.

@michaelbarton
Copy link
Contributor Author

@rvosa opened an issue on the commentary article, I will paste his comment here as it is related to the discussion.

I think the bioboxes initiative is great. At Naturalis we are developing applications that are just complicated enough to be a huge hassle for users to install, so for now we are opting for (Vagrant-managed) virtual machines.

However, VMs have a big overhead so we are planning to move to docker containers. I strongly agree that a bit of sensible community planning to adopt bioboxes would be a great boon, and so we are planning to make such a move (e.g. naturalis/supersmart#52).

As a watcher of bioboxes on github I noticed you have started a commentary article. I am curious about your intentions with this. Presumably you are looking for community "buy in". Does that also mean that this commentary article is intended to be collaborative with a whole bunch of people signing off on this? I.e. can I help?

@rvosa
Copy link

rvosa commented Jul 16, 2015

Yup, that was me. I've been watching bioboxes on the sidelines for a while now because I think it's such a good initiative and something that we would like to adopt. I basically had the same question as @abremges, i.e.:

Let me know if you want me to get actively involved, or if this is reserved to the @bioboxes/core-team

@michaelbarton
Copy link
Contributor Author

Let me know if you want me to get actively involved, or if this is
reserved to the @bioboxes/core-team

The article is open to everyone. We would like bioboxes to be a community
effort and so we welcome suggestions and pull requests.

@michaelbarton
Copy link
Contributor Author

michaelbarton commented Jul 17, 2015 via email

@rvosa
Copy link

rvosa commented Jul 17, 2015

I guess my basic point was that the provenance of bioinformatics research results is often hard to reconstruct or reproduce: often times, some paper gets published with significant results, but others trying to replicate the study don't arrive at the same results.

One of the factors that contributes to this is that people don't explicitly record which exact version of what tool they used (not to mention all the turtles that are stacked underneath that tool in terms of OS libraries, bit depth, and so on).

If we dockerize our tools, then some of the ambiguity with respect to the turtles that are stacked underneath the tools that we use is removed. Hence, this should contribute to reproducibility, more or less along the lines of Rule 3 in this paper: http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003285 (note also the studies that are cited in the introduction of that paper, which substantiate my point about this "crisis" in reproducibility).

Today is unfortunately my last day before I go on vacation so right now I can't really contribute text to the article. Once I'm back (3 weeks from now) I'd be able to contribute. Last time I contributed to a manuscript on github, the approach we took was to fork the ms, make lots of little edits and commits, then send a pull request. This gives you the option of deciding quite granularly what you want to merge back in. Would that work for you?

@michaelbarton
Copy link
Contributor Author

Additional changes on this github branch - https://github.com/bioboxes/commentary-article/tree/second-draft

@michaelbarton
Copy link
Contributor Author

bioboxes

Here's an initial draft of the figure for article. Any feedback or improvements are welcome, the eps file is on the 'second-draft' branch in the git repository.

@michaelbarton
Copy link
Contributor Author

I have revised and updated the article. I will aim to submit this by the end of
the week. The latest version is the master branch if there are further
suggestions and feedback.

@prvst
Copy link

prvst commented Jul 27, 2015

hey Michael, I just forked the repo. I was thinking about adding a small paragraph to talk about possible problems or issues about the idea of using containers, like license infringements for example. I will go on the paper and add some points if you don't mind. Feel free to accept/reject them.

@michaelbarton
Copy link
Contributor Author

Thank you for your PR @leprevost. I have a few comments:

  • I think it's good that you hightlighted this shouldn't just be a genomics
    focused article and include all of the bioinformatics community.
  • The article has a word limit of ~1200, although Scott mentioned we could
    include a couple more hundred. I think the additional paragraphs you added
    push this upwards of 1600. I think licensing is important to mention, could
    you reduce this to a sentence or two?
  • Reproducibility is mentioned earlier in the article, is this similar to
    what you intended to include?

@prvst
Copy link

prvst commented Jul 29, 2015

That is OK, I presumed that the article was on its limit. I will re
organize that licensing phrase and remove the reproducibility part.

On Tue, Jul 28, 2015 at 4:38 PM Michael Barton [email protected]
wrote:

Thank you for your PR @leprevost https://github.com/Leprevost. I have a
few comments:

I think it's good that you hightlighted this shouldn't just be a
genomics
focused article and include all of the bioinformatics community.

The article has a word limit of ~1200, although Scott mentioned we
could
include a couple more hundred. I think the additional paragraphs you
added
push this upwards of 1600. I think licensing is important to mention,
could
you reduce this to a sentence or two?

Reproducibility is mentioned earlier in the article, is this similar to
what you intended to include?


Reply to this email directly or view it on GitHub
#87 (comment).

@michaelbarton
Copy link
Contributor Author

@leprevost thanks that would be useful.

@michaelbarton
Copy link
Contributor Author

Thanks for the PR by @fungs, @pbelmann and @abremges. I have merged these
together and updated the main document. I will submit this on Friday. Please
take a look, all authors need to approve the document before I submit it. Minor
fixes are welcome but I don't want to make anymore large changes.

@abremges
Copy link
Contributor

abremges commented Aug 3, 2015

Thanks, @michaelbarton - I really like the manuscript as it is now. Will re-read it one last time, extra-carefully, before we submit.

FYI, Bielefeld University can cover all publication costs through their institutional BMC account if e.g. @pbelmann handles the submission process (and we add one sentence to the acknowledgements section).

@abremges
Copy link
Contributor

abremges commented Aug 4, 2015

I'd emphasize the work done within the bioboxes initiative a bit more (i.e.
which bbx standards are there already, how many bbx implementations do we
have ready to use)

Could you expand on this point more?

So far, we have bbx specs for assembly, binning, and profiling tools. We mention the assemblers in the article (plus read mapping, for which we do not have a spec, yet), but ignore binning and profiling.

We might even go one step further, and explicitly list how many bbx implementations we currently have (e.g. how many assemblers did we implement), but this of course this info will be outdated very soon. Something to list on the website in my opinion.

@abremges
Copy link
Contributor

abremges commented Aug 4, 2015

I just talked to @alicemchardy, she read and approved the article. She is on parental leave at the moment, and won't comment in here.

We have @asczyrba's okay to submit, too. He is on vacation until Monday, @pbelmann and I already incorporated changes he suggested.

@michaelbarton
Copy link
Contributor Author

poster

Attached is the updated figure.

@michaelbarton
Copy link
Contributor Author

Thanks for the suggestions and edits everyone. I've integrated all of the these
and proof read it once more. I have also updated the figure, including @fungs
suggestions.

If everyone is satisfied, I will submit this.

@fungs
Copy link
Member

fungs commented Aug 5, 2015

@leprevost If not for the commentary I would welcome details about the differences/limitations using containers on the bioboxes website. This would also inform users before they run into problems building bioboxes.

michaelbarton added a commit to bioboxes/commentary-article that referenced this issue Aug 9, 2015
@michaelbarton
Copy link
Contributor Author

I have submitted the manuscript to gigascience. Thank you to everyone who
participated. I read through the article once more before submitting it and I
felt that everyone's constructive feedback has created an article much, much
better than my original draft.

Publications are important for building an academic CV, and if we continue
working on improving bioboxes and it's adoption in the bioinformatics
community, I think this will not be the last article we produce together.

@michaelbarton
Copy link
Contributor Author

I've received the copy editor changes from gigascience and integrated them into
the article. These are mainly just minor changes except where Figure 2 is now
Box 1. These changes are in the latest version of the master branch. As these
changes are limited I will submit on Monday unless there any problems.

@abremges
Copy link
Contributor

Thank you, @michaelbarton. I've created bioboxes/commentary-article#17 with some minor enhancements. Other than that, looking great & ready for re-submission!

@michaelbarton
Copy link
Contributor Author

I received notification this morning that the article has gone to production.

@abremges
Copy link
Contributor

Awesome, thanks & congratulations!

@michaelbarton
Copy link
Contributor Author

I received the article proofs today so hopefully it should appear in press soon.

@accopeland
Copy link

Any idea what 's up with cami ?

On Fri, Oct 2, 2015 at 6:55 PM, Michael Barton [email protected]
wrote:

I received the article proofs today so hopefully it should appear in press
soon.


Reply to this email directly or view it on GitHub
#87 (comment).

Alex Copeland
Joint Genome Institute, Walnut Creek, CA 94598

@abremges
Copy link
Contributor

abremges commented Oct 3, 2015

Finally, thanks @michaelbarton! Took quite a while, nice to see progress.

@accopeland We're on it! In depth evaluations take time and effort...
If you want to help/get involved, please get in touch with @asczyrba.

@accopeland
Copy link

Yes, of course, no criticism intended.

On Sat, Oct 3, 2015 at 2:19 PM, Andreas Bremges [email protected]
wrote:

Finally, thanks @michaelbarton https://github.com/michaelbarton! Took
quite a while, nice to see progress.

@accopeland https://github.com/accopeland We're on it! In depth
evaluations take time and effort...
If you want to help/get involved, please get in touch with @asczyrba
https://github.com/asczyrba.


Reply to this email directly or view it on GitHub
#87 (comment).

Alex Copeland
Joint Genome Institute, Walnut Creek, CA 94598

@abremges
Copy link
Contributor

abremges commented Oct 4, 2015

Don't worry, no offense taken -- we also wish we had more to show by now, and are aware of the community's curiosity. At least for the assembly part, we plan to make first results available soon!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

10 participants