Skip to content

Commit

Permalink
Merge pull request #76 from graphcore-research/add-umup-slides
Browse files Browse the repository at this point in the history
fix paper link in docs
  • Loading branch information
thecharlieblake authored Nov 4, 2024
2 parents 7d94472 + e691999 commit 28962ad
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions docs/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,9 @@ Unit Scaling
Welcome to the :code:`unit-scaling` library. This library is designed to facilitate
the use of the *unit scaling* and *u-µP* methods, as outlined in the papers
`Unit Scaling: Out-of-the-Box Low-Precision Training (ICML, 2023)
<https://arxiv.org/abs/2303.11257>`_ and `u-μP: The Unit-Scaled Maximal Update Parametrization
<https://arxiv.org/abs/2407.17465>`_.
<https://arxiv.org/abs/2303.11257>`_ and
`u-μP: The Unit-Scaled Maximal Update Parametrization
<https://arxiv.org/abs/2407.17465>`_

For a demonstration of the library, see `u-μP using the unit_scaling library
<https://github.com/graphcore-research/unit-scaling/blob/main/examples/demo.ipynb>`_ — a notebook showing the definition and training of a u-µP language model, comparing against Standard Parametrization (SP).
Expand Down

0 comments on commit 28962ad

Please sign in to comment.