We developed a workflow which builds your TeachBook in your repository for all branches and releases them online via GitHub Pages. In simplified terms, it automatically builds the book website based on updates to your repository, creates multiple instances of your book (defined by each branch) and provides the ability to customize the URL's at which each instance of the book can be accessed. This tool is designed specifically for educational contexts, for example, when you may want to preserve book versions from multiple academic years so that students are able to access it later. The TeachBooks Template uses this functionality for example.
The workflow call-deploy-book.yml
calls the deploy-book.yml
workflow, which builds a Jupyter Book at the calling repository for all branches, and deploys them via GitHub Pages. It is currently configured to create a Jupyter Book using the TeachBooks Python package (i.e., teachbooks build book
), although this may be adapted in the future to make it easier to use in other applications (e.g., to build books with other software or any static website in general).
The workflow has the following features:
- Releasing of your TeachBook-repository (built with Jupyter Book) to GitHub Pages
- Ability to release both private (GitHub Pro, GitHub Team, GitHub Enterprise Cloud, or GitHub Enterprise Server required) and public (GitHub Free is enough) repositories.
GitHub Teams is free for teachers as described in the GitHub documentation.
If you have an organization for your TeachBook on GitHub, link your GitHub team rights to your organization as described on the GitHub website. - Releasing all or a selection of branches, allowing to build a draft version of the TeachBook online which reduces the need for local builds of the book
- Provides a summary describing where the TeachBook is released, errors in the build process per branch and how the release step is configured
- Caching of already built books so that it can be partially reused when another branch is released or the next build contains critical errors
- Caching of python environment to speed up the workflow
- Allowing use of submodules within your book
- Customizable trigger for the workflow itself
- Optionally preprocess branches using the
teachbooks
package (e.g., Draft-Release Worklflow). - Converting branch-names to well-defined URLs
- Customizable settings on where the books should be deployed including alias for branch-names and selection of one branch to be deployed on root. The workflow will gives warnings if these settings are ill-defined or conflicting. Although aliases are generally not allowed by GitHub Pages, it seems you can use one alias, but not more.
- Customizable behavior of book URL root directory, either by redirecting the root to one of the branches or by copying or moving one of the branches to root.
- Adds an 'archived'-banner to old branches / branches of previous years.
As previously mentioned, this workflow is used in TeachBooks/template
. Feel free to use it for your TeachBook as well:
- Add
teachbooks
to yourrequirements.txt
file in your root folder - Move all the TeachBook files (including
_config.yml
and_toc.yml
) to a subdirectorybook/
. - Copy the
call-deploy-book.yml
workflow to the/.github/workflows
folder in your repository. - Set source for GitHub pages to GitHub Actions under
Settings
-Pages
-Build and deployment
-Source
-GitHub Actions
(as long as you don't do this the workflow deploys all branches which build successfully). - Trigger the workflow by making an edit to the TeachBook by editing and committing changes to one of the files in the
book/
subdirectory (available underCode
) or manually activating the workflow underActions
-All workflows
-call-deploy-book
-Run workflow
-Use workflow from branch: <the branch you did step 1, 2 and 3 in>
-Run workflow
(this workflow). - Now checkout the progress and summary of the releasing workflow under
Actions
-All workflows
-call-deploy-book
-<the most recent workflow run>
.
You can adapt the behaviour by setting repository variables as explained here or using the VS Code Extension GitHub Actions. Define the following repository variables:
PRIMARY_BRANCH
which is set tomain
whenever it's not defined in the repository variables.- This sets the branch or alias (when using 'redirect' for
BEHAVIOR_PRIMARY
) which is shown on root. - It is advised to show either your default branch on root, or the branch shared with your primary/active book audience (e.g., your current students).
- This sets the branch or alias (when using 'redirect' for
BEHAVIOR_PRIMARY
which is set toredirect
whenever it's not defined in the repository variables.- This indicates whether to copy the PRIMARY_BRANCH to root ('copy'), move the PRIMARY_BRANCH to root ('move') or redirect from root to the PRIMARY_BRANCH ('redirect')
- Advised to use 'redirect' if you expect to archive a version in the future so that the URL doesn't change for the reader (e.g., to preserve URL containing the current academic year shared with students).
- Use copy or move when you only expect readers to use the root URL. Move is useful to remove unnecessary build artifacts and if you don't need to visit the URL containing the branch or alias name.
BRANCH_ALIASES
which is set to- This defines an alias (custom URL) for a branch
- Variables should be a space-separated list of branch names, e.g. 'alias:really-long-branch-name`
- If no alias is wanted,
BRANCH_ALIASES
may be set to
BRANCHES_TO_DEPLOY
which is set to*
(all branches) whenever it's not defined in the repository variables.- This defines the branches to deploy.
- It should be a space-separated list of branch names, e.g. 'main second third'.
BRANCHES_TO_PREPROCESS
which is to to- This defines the branches to preprocess with the
TeachBooks
package, which removed book-pages and config lines defined with# START REMOVE FROM RELEASE
and# END REMOVE FROM RELEASE
- It should be a space-separated list of branch names, e.g. 'main second third'.
- If no preprocessing is required,
BRANCH_TO_PREPROCESS
may be set to ' ' (space).
- This defines the branches to preprocess with the
BRANCHES_ARCHIVED
which is set to- This adds a banner to the released branch: You are viewing an archived version of the book. Click here for the latest version.
- It should be a space-separate list of branch names, e.g. 'main second third'.
In call-deploy-book.yml
itself you can specify the trigger for this workflow. By default, a push to any branch triggers the workflow. You can limit the branches or subdirectories.
Relevant use cases are explained here, along with an explanation for how to set up the workflow accordingly. Note that it is not required to set your PRIMARY_BRANCH
to the default branch of your GitHub repository; this is a choice that is determined by what version of the source code you want visitors to see (i.e., work in progress, or the most recent "complete" or "released" version of a book).
Consider a case where each academic year you would like to create a new book for your students. However, you need to ensure that students from previous years can still access "their" version of the book.
Assume for this example that we are working in the "my_organization" GitHub Organization in repository "my_book" and that the current academic year is 2025, and that we have one or more books in our repository from previous years. The desired URL structure is thus:
- students from this year use the book at
my_organization.github.io/my_book/2025/
- students from last year use the book at
my_organization.github.io/my_book/2024/
- visitors to
my_organization.github.io/my_book/
will automatically be redirected to the book from the current year
To create this behavior, do the following:
- Create a branch for each year: a logical name would have format
YYYY
, (technically it can be anything, as the URL can be set toYYYY
withBRANCH_ALIASES
). - Set
PRIMARY_BRANCH
to the branch for the current academic year (e.g.,2025
) - Set
BEHAVIOR_PRIMARY
toredirect
(default)
One possible modification to this setup would be if you are progressively releasing content to your current students (e.g., 2025
) but you wanted visitors to my_organization.github.io/my_book/
to see a complete version of your book. Assuming that the ideal version for such visitors is the completed book from the previous year (2024
), you could do this:
- Set
PRIMARY_BRANCH
to2024
- Set
BEHAVIOR_PRIMARY
tocopy
As your current students may then accidentally go to the older version of the book, we recommend you use a banner to indicate to readers that there is a (partial) new version available, and advise them where to find it. You can add a banner using this workflow (BRANCHES_ARCHIVED
) or the standard Jupyter Book banner feature.
Consider a case where several authors are working on material for a book and you would like to be able to see the work of any author at any time. You also have a branch that is used to share finalized material with your readers (we call this a "release" branch) and a branch that is used to collect and review the work of all authors before releasing it (we call this a "draft" branch).
Assume for this example that we are working in the "my_organization" GitHub Organization in repository "my_book." The released book is on branch release
the draft book is on branch draft
; author branches are author_1
, author_2
, etc. The desired URL structure is thus:
- official (released) version of the book is at
my_organization.github.io/my_book/
- draft version of the book is at
my_organization.github.io/my_book/draft/
- the work of each author can be found at
my_organization.github.io/my_book/author_1/
, etc.
To create this behavior, do the following:
- Create a branch for each author as well as
release
anddraft
branches - Set
draft
to the default branch - Set
PRIMARY_BRANCH
torelease
- Set
BEHAVIOR_PRIMARY
tocopy
Note that in this situation we strongly recommend using the Draft-Release workflow that is incorporated in the TeachBooks Python package, which allows you to restrict specific content from appearing in the release
book that is already incorporated in the draft
book. Find out more in the TeachBooks Manual. This is useful, for example, if you would like to review many chapters, but release only one at a time, or use a banner in the draft book (e.g., via _config.yml
) but not the released book.
What if you want to make sure the source code for the released book is visible in the repository, rather than the draft version (e.g., you want to make sure external visitors see this version of the material)? You can do this by making release
your default branch instead of draft
. However, this also means that Pull Requests will be default be made into the release
branch instead of draft
, which may alter the working method of your team in an undesirable way (e.g., not being able to use a draft
branch, accidentally releasing material before checking on draft
, or having to manually adjust every PR that is created by GitHub to merge into draft
instead of release
).
In case your book includes private submodules, you'll need to create a Personal Access Token (classic) with at least the scope repo
as described in the github documentation, and add this token with the name GH_PAT
as a Repository secret
or Organization secret
( Settings
> Secrets and variables
> Actions
> Repository secrets
or Organization secrets
.)
We advise you to enable two options in the general repository setting regarding pull requests in GitHub:
- Enable
Always suggest updating pull request branches
, suggesting a merge from the default branch into any separate branch before merging into main. - Enable
Automatically delete head branches
to delete branches after they are merged (you'll still be able to restore those).
Whenever the workflow is triggered, its progress and a summary can be seen under the Actions
- All workflows
- call-deploy-book
in GitHub! It shows you a descriptive summary:
- Ill-defined repository configuration variables (in
Annotations
) - Which branches are released and where (
https://<username/organization_name>.github.io/<repository_name>
(case sensitive)) including which branch is released on the website root and the applied alias - Errors in the build process
- How the repository variables are defined during the build.
Here's an example for a summary for the template book:
Branch π Link π Build status βοΈ main https://teachbooks.github.io/template/main β Released
version2 https://teachbooks.github.io/template/version2 π΄ Build failed [1]
version3 https://teachbooks.github.io/template/version3 β Build failed [2]
β
Released
- build success, new version released.π΄
Build failed [1]
- build failure, previous version of the book reused.β
Build failed [2]
- build failure, no previous version reused.The book at the website root https://teachbooks.github.io/template/ redirects to the primary branch
main
(status: βReleased
).
Alias β‘οΈ Target π― Link π Build status βοΈ draft main https://teachbooks.github.io/template/draft β Released
For more details please see the corresponding
build-books
jobs in the left pane.On branch
version2
:οΏ½[91m/home/runner/work/template/template/book/some_content/overview.md:5: WARNING: Non-consecutive header level increase; H1 to H3 [myst.header]οΏ½[39;49;00m
On branch
version3
:/home/runner/work/_temp/ff8c8325-8d8b-4c0b-a2b2-32d2169c55bc.sh: line 8: teachbooks: command not found
Variables can be set at https://github.com/TeachBooks/template/settings/variables/actions
PRIMARY_BRANCH=main (default value used) BRANCH_ALIASES=draft:main BRANCHES_TO_DEPLOY=* (default value used) BRANCHES_TO_PREPROCESS=main BEHAVIOR_PRIMARY=redirect (default value used) BRANCHES_ARCHIVED= (default value used)
WIP
The GitHub Action Cache is used in the Deploy Book Workflow (DBW) to save time when building the book. This is accompliched by caching two sets of files for each branch: 1) the Python virtual environment, and 2) the build artifact (the HTML files forming each book website).
The reason for doing this stems from the primary purpose of the Deploy Book Workflow, which builds many versions of a book based on each branch, as well as creating a Python virtual environment from the requirements.txt
file in the repository. A unique environment for each branch is necessary to test changes to book dependencies and configuration in an isolated way, but can dramatically increase the time required to build all versions of the book when the DBW runs. In addition, it also takes time to build the book itself. For this reason, both of these sets of files are cached (although at the moment, most of the time savings is primarily caching the build artifact).
This works by hashing a specific set of files and including that in the filename of the cache. Every time the workflow is triggered, a new hashed filename is created and compared to the list of existing caches to see if one with the same name exists; if so, the cache is reused. If not, a new environment and/or build artifact is constructed and a new cache is made.
Cached files can be found in GitHub under the Actions tab, then looking for the "Caches" section under the "Management" pane on the left hand side of the screen. For example, this link shows the caches for the TeachBooks Manual. If needed, caches can be deleted manually from this page. A cache will expire if unused for longer than 1 week or if the maximum allowed disk space for all cached files is exceeded (both are GitHub policies).
Files created during a GitHub Action are called artifacts. Although this is not the same thing as a cache, the files in a specific cache or artifact are identical. Artifacts have their own expiration period, which is possible to customize in the repository settings (default is 90 days). Unexpired artifacts can be viewed at the bottom of any workflow run summary that is available on the Actions tab of a GitHub repository.
To add even more confusion to the situation, if an artifact built during an Action is a set of HTML files (a website) that are used to create a GitHub Pages website, the files served when visiting the URL for that are stored on a webserver; they are not the artifact files, but are identical. Fortunately webserver files are preserved indefinitely (or until GitHub changes this policy), meaning that even though your cache or artifact may be deleted, the website will still remain active.
The cache is a key component of the DBW and leads to several considerations for the building and maintenance of a book, which are explained below, after the criteria for each cache type are described.
The DBW uses Python virtual environments (python -m venv .venv
) and python -m pip install -r requirements.txt
to install packages and create the book building environment. The entire .venv
directory is preserved in the cache with a file name beginning with venv-...
. The week number is also included in the file hash, which ensures that a new hash is created at least once per week.
Once created during a successful build action, an existing cached environment will be found and reused unless two specific criteria are met: 1) a book build is required (replacing the cached build artifact), and 2) the requirements.txt
file is changed. Requirements for a "book build" are described in the next section.
Note that any change to the requirements.txt
file will trigger creation a new environment, not necessarily one that specifies the version number of a package.
Build artifacts are the HTML files that define the website (i.e., the book) and are typically located in the subdirectory book/_build/html
of a repository once the book is built. Note that these files are typically only visible if you are building the book locally, as they are not committed to the repository by default (i.e., .gitignore
). For users only using the DBW and the GitHub browswer editor, downloading the build cache is the easiest way to view these files, which have filenames beginnig with html-build-<branch-name>-...
.
Once created during a successful build action, an existing cached build artifact will be found and reused unless any of three specific criteria are met: 1) a change is made to a file in book/
, 2) the requirements.txt
file is changed, or 3) the status of a branch as archive or preprocess has changed (BRANCHES_TO_PREPROCESS
of BRANCHES_ARCHIVED
).
Immediately after a successful build action, the book website files exist in three places: cache, artifact and on a GitHub webserver. However, as described at the beginning of this section, the cache will be automatically deleted after 1 week, if unused, and the artifact will be deleted based on the setting in your repository (90 days by default).
Implementation of the DBW has several characteristics that should be noted which could help understand certain behaviors of the book build and website deployment process. These points may be useful to consider if you are experiencing undesired behavior in your Actions builds and/or your actual book websites.
- Remember that the DBW action will only run if a commit is made that changes
requirements.txt
or contents ofbook/
- When first creating a repository, the action may not run, or may run and fail; occasionally it is needed to make a new commit to get the workflow to run for the first time (remember to modify something in
book/
) or you may try to re-run the job from the Actions tab - when using multiple branches, only the branch that is edited will be updated
- the website for each branch may be built with a different Python environments (different package versions)
Unlike the virtual environments (next subsection), caches of the book build do not influence subsequent builds of the book, as this artifact is replaced whenever a commit to a specific branch triggers a new build process. As described above, old cached build artifacts are used if the build process from a new commit fails, ensuring that the URL of a website does not return a 404 page.
In particular, note that older branches may have been built with cached environments that are different than those in a newer branch, leading to (undesired) differences in book appearance or functionality---often without the author being aware! This occurs if a package in requirements.txt
is updated in the time between the creation of environments on different branches, as pip
will use the newest version when downloading a package (the first time a venv cache is created in the DBW), but it will not always automatically update a package if pip install -r requirements.txt
is used on an existing environment (this happens every time an existing cache is used!). This means that, when compared to the venv on a newer branch, a frequently used branch (where the cache has not expired) may retain old and out of date packages long after an updated version is available; however, since the week number is included in the venv file hash, this will last for a maximum of 1 week. A best practice to avoid this situation is to pin version numbers explicitly (e.g., teachbooks==0.2.0
) if you want book builds to remain consistent. In addition, a feature like dependabot can be used to automatically notify you when an update is made; this is described in the TeachBooks Manual.
This tool's repository is stored on GitHub. The README.md
of the branch manual_docs
is also part of the TeachBooks manual as a submodule. If you'd like to contribute, you can create a fork and open a pull request on the GitHub repository. To update the README.md
shown in the TeachBooks manual, create a fork and open a merge request for the GitHub repository of the manual. If you intent to clone the manual including its submodules, clone using: git clone [email protected]:TeachBooks/manual.git
.
Future improvements to the DBW may address the ability for a user to specify additional aspects of the build process, for example, the Python version or the specific book build commands (e.g., teachbooks build
or jupyter-book build
). If this is something that interests you, please create an Issue in the repository (perhaps with "feature request" in the title).