
How should/would the cache be used remotely? #9

Open
chrisjsewell opened this issue Feb 24, 2020 · 8 comments

Comments

@chrisjsewell
Member

Originally posted by @choldgraf in #6 (comment)

Maybe a use-case to consider here.

A team has a really big book that takes 2 hours to build. An author forks the book, clones it locally, and edits one page. They want to contribute the page back. A few questions:

  • Do they need to run the entire 2-hour build process locally before seeing what the page looks like? --> this seems like it could be handled by letting the cache-execution step be configured for specific files only (see the sketch after this list)
  • When they make a PR, does the entire book need to be re-built top to bottom in the CI/CD job? --> here the cache could probably be stored as a build artifact in a CI/CD job, independent of the .git repository
  • Is there any way for a "master cache" to be bundled with the book?
    • If so, then is that a pattern we want to encourage?
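
A minimal sketch of the first bullet's idea, assuming a simple include-pattern config; the names here (`EXECUTE_PATTERNS`, `should_execute`) are hypothetical illustrations, not an existing jupyter-cache API:

```python
# Hypothetical include-pattern filter: execute only the page being
# edited, and fall back to cached outputs for everything else.
from fnmatch import fnmatch
from pathlib import Path

EXECUTE_PATTERNS = ["edited-page*.ipynb"]  # opt-in list (hypothetical)

def should_execute(path: Path) -> bool:
    return any(fnmatch(path.name, pattern) for pattern in EXECUTE_PATTERNS)

for nb_path in Path("book").rglob("*.ipynb"):
    if should_execute(nb_path):
        print(f"executing {nb_path}")           # run it and refresh the cache
    else:
        print(f"cached outputs for {nb_path}")  # skip the 2-hour rebuild
```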

I could see a benefit of committing the cache, in the sense that then git would keep track of changes to the cache and diffs to the pages would propagate through github, clones, etc. However, I worry about a few things:

  • The cache would probably become gigantic for non-trivial projects, unless it could be incrementally-updated and have some kind of "shallow clone" behavior.
  • It would require sub-moduling the book repository, so I think it would only work for fairly advanced power-users.
  • The cache diffs themselves would be binary (I think?), so they wouldn't make any sense on GitHub, which would make it hard to know what has changed in the cache.
@jstac
Member

jstac commented Feb 24, 2020

Our Python lectures take around 1.5 hours to build from scratch, so this is our scenario.

For 99% of our PRs, we just make the edits in RST, generate the ipynb for that one page and then run it manually to see if it looks OK. This is fine for most edits, which typically adjust language or tweak code.

If we're concerned about how this looks in the PDF, say, we generate that one page locally. Sometimes RAs will include an image in the PR to show that the PDF looks fine.

These are imperfect systems but they work OK for the most part. So my vote would be for us to favor simplicity, at least initially, and not commit the cache. (Plus, I'm a reasonably sophisticated user, but submodules still confuse me. My instinct is to fear and distrust them.)

@choldgraf
Member

@jstac you can never trust two things: politicians, and sub-modules.

I wonder if one potential way to address this would involve meeting another use-case: building single-page documents. If we make it easy to build the HTML or PDF of one page from the CLI, letting users quickly preview what it looks like, the same machinery could be re-used by people who only want to build a single page and not an entire book...
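
As a hedged sketch of that machinery, assuming plain nbclient + nbconvert rather than any existing jupyter-book command (`preview_page` is a hypothetical name):

```python
# Execute a single notebook and export it to HTML, skipping the full
# book build entirely.
from pathlib import Path

import nbformat
from nbclient import NotebookClient
from nbconvert import HTMLExporter

def preview_page(path: str) -> Path:
    nb = nbformat.read(path, as_version=4)
    NotebookClient(nb).execute()                       # run only this page
    body, _resources = HTMLExporter().from_notebook_node(nb)
    out = Path(path).with_suffix(".html")
    out.write_text(body, encoding="utf-8")
    return out

# e.g. preview_page("chapters/my-page.ipynb") -> chapters/my-page.html
```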

@jstac
Member

jstac commented Feb 24, 2020

Yep, that seems like a good idea. Two birds with one stone, etc. And the single-page use case is certainly important.

Such tools are available in jupinx for reviewing edits to QE lectures. I suppose cross-references involving other pages won't work, but for 99% of cases it's perfectly fine.

@mmcky
Member

mmcky commented Feb 25, 2020

Glad you have mentioned this @jstac. It will be really important to support rendering of single pages for usability. We currently do this using the environment variable FILES= and passing that through to Sphinx. I agree the CLI tool needs to cater to this and make it easier :-)

@choldgraf
Member

An approach I was playing around with for the jupyter book CLI was to use jupyter-book page: https://jupyterbook.org/features/page.html

perhaps we could use the same pattern, but also allow for PDF output with a kwarg or something?

@chrisjsewell
Member Author

As discussed with @mmcky, jupinx currently uses a static cache, housed in the Sphinx _build folder on an Amazon server. The build is persisted for all execution triggers (a cron job every hour), each of which runs a git pull then a sphinx-build. For this use case, the (just merged) hash implementation of jupyter-cache should work fine.

@mmcky also noted that their current (Sphinx-based) cache implementation doesn't work on Travis CI, presumably because the cache is compressed/un-compressed, changing the file mtimes that Sphinx uses to determine re-builds (matching against a dictionary stored in the pickled environment object). This wouldn't be an issue for jupyter-cache, since it is hash based.
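
A minimal sketch of why hash-based keys sidestep this, assuming the key is derived from the notebook's executable content (an illustration of the idea, not jupyter-cache's actual implementation):

```python
# The cache key depends only on what would be executed (kernel + code
# cells), so compressing/un-compressing the cache on CI -- which
# changes mtimes -- cannot invalidate it.
import hashlib

import nbformat

def notebook_cache_key(path: str) -> str:
    nb = nbformat.read(path, as_version=4)
    hasher = hashlib.sha256()
    hasher.update(nb.metadata.get("kernelspec", {}).get("name", "").encode())
    for cell in nb.cells:
        if cell.cell_type == "code":
            hasher.update(cell.source.encode())
    return hasher.hexdigest()

# A cached result is reused iff notebook_cache_key(path) matches the
# stored key, regardless of file timestamps.
```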

It would also be interesting to think how it might work with GitHub actions, CircleCI and ReadTheDocs builds.

@chrisjsewell
Member Author

Just a note to self, in case this issue is encountered (SQLite on NFS): jupyter/notebook#1782

@choldgraf
Member

Another related note: for jupyter book I was starting to collect a repository with several CI/CD patterns that could be used to deploy books: https://github.com/choldgraf/jupyter-book-deploy-demo

I think it'd be helpful if we replicated that repository for the new build system, ideally with the multiple levels of complexity that users may want (e.g. vanilla build w/o execute then host online, execute and build, and execute + cache and build).
