GPT Brick by Brick Implement (as faithfully as I can) the GPT algorithms discussed in the DeepMind paper - Formal Algorithms for Transformers. This notebook borrows inspiration from Karpathy's Let's build GPT and code from Gabriel's implementations.