Implement (as faithfully as I can) the GPT algorithms discussed in the DeepMind paper - Formal Algorithms for Transformers.
This notebook borrows inspiration from Karpathy's Let's build GPT and code from Gabriel's implementations.
Implement (as faithfully as I can) the GPT algorithms discussed in the DeepMind paper - Formal Algorithms for Transformers.
This notebook borrows inspiration from Karpathy's Let's build GPT and code from Gabriel's implementations.