diff --git a/1-2-hurdles.qmd b/1-2-hurdles.qmd new file mode 100644 index 0000000..4b46099 --- /dev/null +++ b/1-2-hurdles.qmd @@ -0,0 +1,12 @@ +## Hurdles: reproducible research is hard + +From the definition above, reproducible research sounds really easy. It is a minimum standard. But still it is hard! + +Common hurdles that researchers face are: + +- Missing knowledge (What am I allowed to do? How to do it?) +- Shame and fear +- Time pressure +- Data/code that cannot be shared openly + +What are your personal hurdles? What can you do to address them? diff --git a/1-3-fair.qmd b/1-3-fair.qmd new file mode 100644 index 0000000..8a60a36 --- /dev/null +++ b/1-3-fair.qmd @@ -0,0 +1,65 @@ +## FAIR principles for reproducible research + +TODO: see also https://data.research.cornell.edu/content/preparing-fair-data-reuse-and-reproducibility + +FAIR stands for Findable, Accessible, Interoperable and Reusable. Implementing the FAIR principles for data can be a challenge though. Let's talk about how to do it in practice. + +The FAIR principles are at the core of many current initiatives in research and beyond. For example the German National Research Data Infrastructure (NFDI) consortia are working on making research data (and sometimes software) FAIR. There is even a special FAIR Data Spaces project. But what does FAIR mean for you? + + +::: captioned-image-container +![FAIR prinicples](images/fair.jpg){fig-alt="F=findable, A=accessible, I=interoperable, R=reusable"} +::: + +This chapter is based on ressources from the University of Mannheim, that I +recommend to check out: https://github.com/UB-Mannheim/FAIR-Data-Week. +In the following I will use these and my own experience to give you a bit of a +guidance on how to get started. + +### Generally for all FAIR principles + +The FAIR principles try to help you answer the most common questions people have about data. + +::: captioned-image-container +![FAIR questions](images/fair-questions.jpg){fig-alt="Where is the data? How can I access it ? How can I read and work with it? Yow can I reuse it?"} +::: + +By the way, "data" in this context can mean many different things. Of course things like regular tabular data sets, but also images and other research materials. + +#### How to get started {.unnumbered} + +Store your data somewhere that makes sense. If you can make your data openly available regular data platforms such as [Zenodo](https://zenodo.org/) or the [Open Science Framework (OSF)](https://osf.io) will do. Of course also field specific or institutional platforms/repositories are good options. + + +[![View the images of this chapter on Zenodo](https://zenodo.org/badge/DOI/10.5281/zenodo.8070861.svg)](https://doi.org/10.5281/zenodo.8070861) + +If you cannot make them openly available, you can usually still make the metadata available. Metadata is information about your data such as the author(s), how to cite it, what the data set contains, and so on. + +Making your data known in the community increases not only your chance of creating an impact with your work but also your work's FAIRness. You can do so by publishing a data paper or otherwise sharing more info with the community (social media, podcasts, conferences, ...). + +And then additionally... + +#### F for Findable + +(Three ingredients: data, metadata and infrastructure) + +- Attach a DOI to your data. Many data platforms (e.g. Zenodo) make that really easy for you. +- Provide rich machine-readable metadata. If you upload your data to a good data platform, the most relevant metadata will be asked from you anyhow. So it's easy to do things right. + +#### A for Accessible + +(FAIR is not the same as Open 👉 the point is to provide the exact conditions of accessibility) + +- Explain how someone can access your data. May that be via accessing it through a data platform or through an application that is evaluated by a data-use-and-access committee. + +#### I for Interoperable + +- Use common data formats. For tabular data that could for example be csv, for images jpeg. What's best in your community might be decided through a community standard. +- Use words that others will understand or define them. For example if the column names in your table are not self explanatory, explain them. +- Provide context for your data. Is it connected with other data or papers? You can also add your metadata to public knowledge graphs, e.g. [Wikidata](https://www.wikidata.org/wiki/Wikidata:Main_Page). + +#### R for Reusable + +- Include rich machine-readable metadata according to the community standards. +- Attach a license to your data (license is part of the metadata) that makes it clear, what others can do with your data. You can for example use the Creative Commons license [CC-BY](https://creativecommons.org/licenses/by/4.0/) (more on [how to choose a license](https://creativecommons.org/choose/)). + diff --git a/1-4-team-work.qmd b/1-4-team-work.qmd new file mode 100644 index 0000000..ba11b60 --- /dev/null +++ b/1-4-team-work.qmd @@ -0,0 +1,3 @@ +## Team work that fosters reproducibility + +TODO: ... \ No newline at end of file diff --git a/1-reproducible-research.qmd b/1-reproducible-research.qmd index 154f9b2..117d510 100644 --- a/1-reproducible-research.qmd +++ b/1-reproducible-research.qmd @@ -21,17 +21,10 @@ A popular definition of reproducibility comes from [The Turing Way Book](https:/ ![Matrix defining reproducible research (from The Turing Way)](images/reproducible-matrix.jpg){fig-alt="Reproducibility scale: a scale showing **not reproducible** on one end and **reproducible** on the other end."} ::: -## Hurdles: reproducible research is hard -From the definition above, reproducible research sounds really easy. It is a minimum standard. But still it is hard! -TODO: more on hurdles -## FAIR principles for reproducible research -TODO: see also https://data.research.cornell.edu/content/preparing-fair-data-reuse-and-reproducibility -## Teams work that fosters reproducibility -TODO: ... diff --git a/_quarto.yml b/_quarto.yml index 2d36f16..6afbb04 100644 --- a/_quarto.yml +++ b/_quarto.yml @@ -5,13 +5,19 @@ book: title: "BERD Course: Make Your Research Reproducible" author: "Heidi Seibold" chapters: - - index.qmd - - intro.qmd - - 1-reproducible-research.qmd - - 2-project-organization.qmd - - 3-computational-workflows.qmd - - 4-publishing-research.qmd - - summary.qmd + - part: index.qmd + chapters: + - intro.qmd + - part: 1-reproducible-research.qmd + chapters: + - 1-2-hurdles.qmd + - 1-3-fair.qmd + - 1-4-team-work.qmd + - part: 2-project-organization.qmd + - part: 3-computational-workflows.qmd + - part: 4-publishing-research.qmd + - part: summary.qmd + chapters: - references.qmd bibliography: references.bib @@ -19,6 +25,6 @@ bibliography: references.bib format: html: theme: flatly - - + +number-depth: 2 diff --git a/images/fair-questions.jpg b/images/fair-questions.jpg new file mode 100644 index 0000000..22d9335 Binary files /dev/null and b/images/fair-questions.jpg differ diff --git a/images/fair.jpg b/images/fair.jpg new file mode 100644 index 0000000..34dc74d Binary files /dev/null and b/images/fair.jpg differ diff --git a/intro.qmd b/intro.qmd index ed336fc..63d37f4 100644 --- a/intro.qmd +++ b/intro.qmd @@ -10,12 +10,12 @@ This is an online course. Each week you will.. - watch the weekly videos (~45 min), - review one of the booklet chapters, -- have a short session with your accountability buddy* (15-20 minutes), +- have a short session with your accountability buddy (15-20 minutes), - implement the tasks of the week, and - discuss your progress in the weekly online meeting with the instructor and fellow course participants (1-1.5 hours). -* You will choose your accountability buddy during the course. You and your +You will choose your accountability buddy during the course. You and your buddy will help each other in implementing the tasks of each week. Weekly Meetings: