Contributing to coreax

Getting started

If you would like to contribute to the development of coreax, you can do so in a number of ways:

Highlight any bugs you encounter during usage, or any feature requests that would improve coreax by raising appropriate issues.
Develop solutions to open issues and create pull requests (PRs) for the development team to review.
Implement optimisations in the codebase to improve performance.
Contribute example usages & documentation improvements.
Increase awareness of coreax with other potential users.

All contributors must sign the GCHQ Contributor Licence Agreement.

Developers should install additional packages required for development using pip install -e .[dev]. Then, set up pre-commit hooks using pre-commit install.

Reporting issues

Search existing issues (both open and closed).
Make sure you are using the latest version of coreax.
Open a new issue:
- For bugs use the bug report issue template.
- For features use the feature request issue template.
- This will make the issue a candidate for inclusion in future sprints, as well as open to the community to address.
If you are able to fix the bug or implement the feature, create a pull request with the relevant changes.

Pull requests

Currently, we are using a GitHub Flow development approach.

To avoid duplicate work, search existing pull requests.
All pull requests should relate to an existing issue.
- If the pull request addresses something not currently covered by an issue, create a new issue first.
Make changes on a feature branch instead of the main branch.
Branch names should take one of the following forms:
- feature/<feature-name>: for adding, removing or refactoring a feature.
- bugfix/<bug-name>: for bug fixes.
Avoid changes to unrelated files in the same commit.
Changes must conform to the code guidelines.
Changes must have sufficient test coverage.
Ensure your changes are recorded in CHANGELOG.md, and ensure that the changelog entry links to each issue that is closed by your PR. (When the next version is released, these links will be replaced by a link to your PR.)
Delete your branch once it has been merged.

Pull request process

Create a Draft pull request while you are working on the changes to allow others to monitor progress and see the issue is being worked on.
Pull in changes from upstream often to minimise merge conflicts.
Make any required changes.
Resolve any conflicts with the target branch.
Mark your PR as ready for review once the changes are complete. You can convert back to Draft at any time.

Do not add labels like [RFC] or [WIP] to the title of your PR to indicate its state. Non-Draft PRs are assumed to be open for comments; if you want feedback from specific people, @-mention them in a comment.

PRs should be merged by the reviewer on approval.

Pull request commenting process

Use a comment thread for each required change.
Reviewer closes the thread once the comment has been resolved.
Only the reviewer may mark a thread they opened as resolved.

Commit messages

Follow the conventional commits guidelines to make reviews easier and to make the git logs more valuable. An example commit, including reference to some GitHub issue #123, might take the form:

feat: add gpu support for matrix multiplication

If a gpu is available on the system, it is automatically used when performing matrix
multiplication within the code.

BREAKING CHANGE: numpy 1.0.2 no longer supported

Refs: #123
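The header line of such a message (type, optional scope, optional "!" for a breaking change, then a description) can be checked mechanically. The following is a minimal sketch only: the pattern and the parse_header helper are hypothetical, are not part of coreax or its tooling, and cover only the header line rather than the full conventional commits specification.

```python
import re
from typing import Optional

# Hypothetical pattern for a conventional commit header: a lowercase type, an
# optional scope in brackets, an optional "!" and then a description.
HEADER_PATTERN = re.compile(
    r"^(?P<type>[a-z]+)(\((?P<scope>[^)]+)\))?(?P<breaking>!)?"
    r": (?P<description>.+)$"
)


def parse_header(header: str) -> Optional[dict[str, Optional[str]]]:
    """
    Split a commit header into its named parts.

    :param header: First line of a commit message
    :return: Mapping of part names to values, or :data:`None` if the header does
        not match the pattern
    """
    match = HEADER_PATTERN.match(header)
    return match.groupdict() if match else None


parsed = parse_header("feat: add gpu support for matrix multiplication")
rejected = parse_header("Added gpu support")
```

Here parsed holds the type feat and the description, while rejected is None because the header has no type prefix.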

Breaking changes and deprecation

Since we are still pre-1.0, SemVer states that any release may contain breaking changes. However, breaking changes should not be made without warning.

Any breaking changes must have a deprecation period of at least one minor release, or one month (whichever is longer), before the breaking change is made. If the change is one that may require significant changes to client code, such as removing a function or class entirely, the deprecation period must instead be at least two minor releases, or two months (whichever is longer).

Ensure that during the deprecation period, the old behaviour still works, but raises a DeprecationWarning with an appropriate message (which should include which version the behaviour is deprecated since, and which version the deprecated behaviour is expected to be removed in). If at all possible, ensure that there is straightforward signposting for how users should change their code to use non-deprecated parts of the codebase instead.

As an example, this is what the deprecation period for renaming my_old_function to my_new_function would look like:

# v0.1.0:
def my_old_function(x: int) -> int:
    return x + x + x + x

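# v0.2.0:
# Assumes typing_extensions (or warnings.deprecated on Python 3.13+)
from typing_extensions import deprecated

def my_new_function(x: int) -> int:
    return x*4

@deprecated(
    "Renamed to my_new_function."
    + " Deprecated since v0.2.0; will be removed in v0.3.0."
)
def my_old_function(x: int) -> int:
    return my_new_function(x)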

# v0.3.0:
def my_new_function(x: int) -> int:
    return x*4
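During the deprecation period it is worth checking both halves of the requirement: the old behaviour still works, and a DeprecationWarning is raised. The following is a minimal sketch using only the standard library warnings module rather than a decorator; it reuses the illustrative function names from the example above and is not taken from the coreax codebase.

```python
import warnings


def my_new_function(x: int) -> int:
    """
    Return four times the input.
    """
    return x * 4


def my_old_function(x: int) -> int:
    """
    Return four times the input; kept only for the deprecation period.
    """
    warnings.warn(
        "my_old_function has been renamed to my_new_function."
        " Deprecated since v0.2.0; will be removed in v0.3.0.",
        DeprecationWarning,
        stacklevel=2,
    )
    return my_new_function(x)


# Record warnings so both the result and the warning can be inspected
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = my_old_function(3)
```

Setting stacklevel=2 attributes the warning to the caller's line, which makes it easier for users to find the code they need to change.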

Code

Code must be documented, adequately tested and compliant with the style guidelines prior to merging into the main branch. To facilitate code review, code should meet these standards prior to creating a pull request.

Some of the following points are checked by pre-commit hooks, although others require manual implementation by authors and reviewers. Conversely, further style points that are not documented here are enforced by pre-commit hooks; it is unlikely that authors need to be aware of them.

Style

A high level overview of the expected style is:

Follow PEP 8 style where possible.
Use clear naming of variables rather than mathematical shorthand (e.g. kernel instead of k).
Black will be applied by the pre-commit hook but will not reformat strings, comments or docstrings. These must be manually checked and limited to 88 characters per line starting from the left margin and including any indentation.
Avoid using inline comments.
Type annotations must be used for all function or method parameters.

Spelling and grammar

This project uses British English. Spelling is checked automatically by cspell. When a word is missing from the dictionary, double check that it is a real word spelled correctly. Contractions in object or reference names should be avoided unless the meaning is obvious; consider inserting an underscore to effectively split into two words. If you need to add a word to the dictionary, use the appropriate dictionary inside the .cspell folder:

library-terms.txt for object names in third-party libraries,
people.txt for the names of people,
custom-misc.txt for anything that does not fit into the above categories.

If the word fragment only makes sense as part of a longer phrase, add the longer phrase to avoid inadvertently permitting spelling errors elsewhere, e.g. add Blu-Tack instead of Blu.

External dependencies

Use the standard library and existing well-maintained external libraries where possible. New external libraries should be permissively licensed (e.g. MIT) or weak copyleft (e.g. LGPL).

Testing

All tests are run via the following Pytest command:

  pytest tests/

Either Pytest or Unittest can be used to write tests for coreax. Pytest is recommended where it would simplify code, such as for parameterised tests. As much effort should be put into developing tests as is put into developing the code. Tests should verify functionality and ensure that exceptions and warnings are raised or managed appropriately. This includes:

Unit testing of new functions added to the codebase
Verifying all existing tests pass with the integrated changes

Keep in mind the impact on runtime when writing your tests. Favour more tests that are smaller rather than a few large tests with many assert statements unless it would significantly affect run time, e.g. due to excess set up or duplicated function calls.

Use the form: (actual, expected) in asserts, e.g.

assertEqual(actualValue, expectedValue)
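The points above can be sketched with a few small Unittest tests, each making a single assertion in the (actual, expected) order and one checking that an exception is raised. This is an illustration only: the double function and the TestDouble class are hypothetical and do not exist in coreax.

```python
import unittest


def double(value: int) -> int:
    """
    Return twice the input (hypothetical function under test).

    :param value: Integer to double
    :raises TypeError: If ``value`` is not an integer
    :return: Twice the input
    """
    if not isinstance(value, int):
        raise TypeError("value must be an integer")
    return 2 * value


class TestDouble(unittest.TestCase):
    """
    Run small, focused tests of the hypothetical double function.
    """

    def test_positive_input(self) -> None:
        """
        Check a typical input gives the expected output.
        """
        self.assertEqual(double(2), 4)

    def test_zero_input(self) -> None:
        """
        Check the zero edge case.
        """
        self.assertEqual(double(0), 0)

    def test_invalid_input_raises(self) -> None:
        """
        Check a non-integer input raises a TypeError.
        """
        with self.assertRaises(TypeError):
            double("2")


# Run the tests programmatically and collect the outcome
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestDouble)
outcome = unittest.TestResult()
suite.run(outcome)
```

In normal use these tests would live under tests/ and be collected by the pytest command given above; running the suite by hand here only keeps the sketch self-contained.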

Testing before releases to PyPI

Before a release is issued on PyPI, all tests for coreax will be run on a GPU machine. This avoids having to incorporate GPU runners into the CI/CD. However, note that code pushed to main may not necessarily have been tested on a GPU machine until a release to PyPI is made. If you observe any issues on GPU machines using the code, please raise an issue detailing the behaviour, and create a PR with the relevant fix if possible.

In addition, the "Release" GitHub action will test the built package, to ensure that the build process has included all the correct files (to avoid a repeat of gchq#843!)

Abstract functions

Abstract methods, functions and properties should only contain a docstring. They should not contain a pass statement.
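As a minimal sketch of this rule, the abstract method below has a docstring as its entire body, which is valid Python without a pass statement. The Shape and Square classes are hypothetical and serve only as an illustration.

```python
from abc import ABC, abstractmethod


class Shape(ABC):
    """
    Represent a two-dimensional shape (hypothetical base class).
    """

    @abstractmethod
    def area(self) -> float:
        """
        Compute the area of the shape.

        :return: Area of the shape
        """


class Square(Shape):
    """
    Represent a square.

    :param side_length: Length of each side of the square
    """

    def __init__(self, side_length: float):
        self.side_length = side_length

    def area(self) -> float:
        """
        Compute the area of the square.

        :return: Area of the square
        """
        return self.side_length**2


square_area = Square(3.0).area()
```

Attempting to instantiate Shape directly raises a TypeError, because area has no concrete implementation.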

Exceptions and error messages

Custom exceptions should be derived from the most specific relevant Exception class. Custom messages should be succinct and, where easy to implement, offer suggestions to the user on how to rectify the exception.

Avoid stating how the program will handle the error, e.g. avoid Aborting, since it will be evident that the program has terminated. This enables the exception to be caught and the program to continue in the future.
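A minimal sketch of these guidelines follows. The exception derives from ValueError rather than Exception, and the message states the problem and suggests a fix without saying what the program will do next. The names InvalidBatchSizeError and validate_batch_size are hypothetical and are not part of coreax.

```python
class InvalidBatchSizeError(ValueError):
    """
    Signal that a batch size is not a positive integer.
    """


def validate_batch_size(batch_size: int) -> int:
    """
    Check that a batch size is positive.

    :param batch_size: Number of data points in each batch
    :raises InvalidBatchSizeError: If ``batch_size`` is zero or negative
    :return: The validated batch size
    """
    if batch_size <= 0:
        raise InvalidBatchSizeError(
            f"batch_size must be positive, got {batch_size}; try batch_size=1"
        )
    return batch_size


# Callers can catch the more general ValueError and continue
try:
    validate_batch_size(0)
except ValueError as error:
    message = str(error)
```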

Docstrings

Docstrings must:

Be written for private functions, methods and classes where their purpose or usage is not immediately obvious.
Be written in reStructured Text ready to be compiled into documentation via Sphinx.
Follow the PEP 257 style guide.
Not have a blank line inserted after a function or method docstring unless the following statement is a function, method or class definition.
Start with a capital letter unless referring to the name of an object, in which case match that case sensitively.
Have a full stop at the end of the one-line descriptive sentence.
Use full stops in extended paragraphs of text.
Not have full stops at the end of parameter definitions.
If a :param: or similar line requires more than the max line length, use multiple lines. Each additional line should be indented by a further 4 spaces.
Class __init__ methods should not have docstrings. All constructor parameters should be listed at the end of the class docstring. __init__ docstrings will not be rendered by Sphinx. Any developer comments should be contained in a regular comment.

Each docstring for a public object should take the following structure:

"""
Write a one-line descriptive sentence as an active command.

As many paragraphs as is required to document the object.

:param a: Description of parameter a
:param b: Description of parameter b
:raises SyntaxError: Description of why a SyntaxError might be raised
:return: Description of return from function
"""

If the function does not return anything, the return line above can be omitted.
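Applied to a complete function, the structure above might look like the following sketch. The scale_values function is hypothetical; it is included only to show the layout, including a parameter description that continues onto a second line with a further 4 spaces of indentation.

```python
def scale_values(values: list[float], factor: float = 2.0) -> list[float]:
    """
    Multiply every value in a list by a constant factor.

    This function is hypothetical and exists only to illustrate the layout.

    :param values: Values to scale
    :param factor: Constant that each value is multiplied by; this description is
        long enough to continue on a second line
    :raises ValueError: If ``values`` is empty
    :return: Scaled values, in the same order as the input
    """
    if not values:
        raise ValueError("values must not be empty; pass at least one number")
    return [factor * value for value in values]


scaled = scale_values([1.0, 2.5])
```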

Comments

Comments must:

Start with a capital letter unless referring to the name of an object, in which case match that case sensitively.
Not end in a full stop for single-line comments in code.
End with a full stop for multi-line comments.

Maths overflow

Prioritise overfull lines for mathematical expressions over artificially splitting them into multiple equations in both comments and docstrings.

Thousands separators

For hardcoded integers >= 1000, an underscore should be written to separate the thousands, e.g. 10_000 instead of 10000.
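Underscores in numeric literals do not change the value, so this is purely a readability aid. A short sketch, with hypothetical constant names:

```python
# Underscores in numeric literals are ignored by the interpreter
NUM_SAMPLES = 10_000
MAX_ITERATIONS = 1_000_000

same_value = NUM_SAMPLES == 10000

# The "_" format option writes the separators back out
formatted = f"{MAX_ITERATIONS:_}"
```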

Documentation and references

The coreax documentation should reference papers and mathematical descriptions as appropriate. New references should be placed in the references.bib file. An entry with key word RefYY can then be referenced within a docstring anywhere with :cite:`RefYY`.

Generating docs with Sphinx

You can generate Sphinx documentation with:

documentation/make html

Releases

Releases are made on an ad-hoc basis, not on every merge into main. When the maintainers decide the codebase is ready for another release:

Create an issue for the release.
Run additional release tests on main including GPU testing, as described in Testing before releases to PyPI.
Fix any issues and merge into main, iterating until we have a commit on main that is ready for release, except for housekeeping that does not affect the functionality of the code.
Create a branch release/#.#.# off the identified commit, populating with the target version number.
Tidy CHANGELOG.md including:
- Move the content under Unreleased to a section under the target version number.
- Create a new unpopulated Unreleased section at the top.
- Update the hyperlinks to Git diffs at the bottom of the file so that they compare the relevant versions.
- Replace all issue links in the new version's sections with links to the PRs that closed them.
Update the version number in coreax/__init__.py.
Create and review a pull request.
Once approved, merge the release branch into main as soon as possible. Do not delete the release branch when you merge the PR - only delete it once the full release process has been completed.
Manually trigger the "Release" action, inputting the name of the release branch.
Create a release in GitHub pointing at the final commit on the release branch (that is, the commit before merging into main). Add the wheel file produced by the Release action to the GitHub release as an artifact.
Publish to PyPI. Ensure that the wheel and sdist uploaded are those produced by the Release action.
Publish to ReadTheDocs, ensuring that the documentation is built from the same commit as the GitHub release (the final commit on the release branch).
