This file lists different approaches for setting up your machine and using the code in this repository. I recommend browsing through the different sections from top to bottom and then deciding which approach best suits your needs.
If you already have Python installed on your machine, the quickest way to get started is to install the package requirements from the ../requirements.txt file by executing the following pip installation command from the root directory of this code repository:
pip install -r requirements.txt
The code provided in repository is designed to run on conventional laptops within a reasonable timeframe and does not require specialized hardware. I tested all code on an M2 MacBook Pro Max laptop with 32 GB RAM. Additionally, if your laptop or desktop computer has an NVIDIA GPU, the code will automatically take advantage of it.
If you don't have Python set up on your machine yet, I have written about my personal Python setup preferences in the following directories:
There are many good options for code editors. My choice is the popular open-source Visual Studio Code (VSCode) editor, which can be easily enhanced with many useful plugins and extensions (see the VSCode Extensions section below for more information). Download instructions for macOS, Linux, and Windows can be found on the main VSCode website.
If you are using Visual Studio Code (VSCode) as your primary code editor, you can find recommended extensions in the .vscode
subfolder. These extensions provide enhanced functionality and tools helpful for this repositoy.
To install these, open this "setup" folder in VSCode (File -> Open Folder...) and then click the "Install" button in the pop-up menu on the lower right.
Alternatively, you can move the .vscode
extension folder into the root directory of this GitHub repository:
mv setup/.vscode ./
Then, VSCode automatically checks if the recommended extensions are already installed on your system every time you open the Building-an-LLM-From-Scratch
main folder.
This section describes cloud alternatives for running the code presented in this course.
While the code can run on conventional laptops and desktop computers without a dedicated GPU, cloud platforms with NVIDIA GPUs can substantially improve the runtime of the code, especially in parts 4 to 6.
For a better development experience in the cloud, I recommend the Lightning AI Studio platform, which allows users to set up a persistent environment and use both VSCode and Jupyter Lab on cloud CPUs and GPUs.
Once you start a new Studio, you can open the terminal and execute the following setup steps to clone the repository and install the dependencies:
git clone https://github.com/Sangwan70/Building-an-LLM-From-Scratch.git
cd Building-an-LLM-From-Scratch
pip install -r requirements.txt
Then, navigate to the Python script or Jupyter Notebook you want to run. Optionally, you can also easily connect a GPU to accelerate the code's runtime, for example, when you are pretraining the LLM in part 4 or finetuning it in part 5 and 6.
If you have any questions, reach out to me via the Discussions forum in the GitHub repository.