lipogramGPT

A simple implementation based on Karpathy's (2023) nanoGPT, fine-tuned with PPO to generate lipogram text (avoiding the letter "u").

Data

To begin, first download the Tiny Shakespeare dataset:

wget https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt

Then make sure you have the file saved as input.txt in the same folder as the rest of the code files.
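Since baseGPT is a character-level model, a quick sanity check after downloading is to read input.txt and build the character vocabulary, roughly as the training code does (a minimal sketch; the exact preprocessing in this repo may differ):

# Illustrative sanity check for input.txt (not part of the repo's code).
with open('input.txt', 'r', encoding='utf-8') as f:
    text = f.read()

chars = sorted(set(text))  # character vocabulary
print(f"dataset length: {len(text):,} characters")
print(f"vocab size: {len(chars)}")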

Train

To train baseGPT (a character-level GPT trained on Shakespeare), run train.py.

In train.py you can adjust the hyperparameter settings, including eval_interval. This determines how often the training and evaluation loss are printed to the console. More importantly, it also determines how often the model's weights are saved: a checkpoint is written at every evaluation interval, which is overkill given how large the files are. After training, keep only the checkpoint you need.

  • Any suggestions on how to improve checkpointing are welcome; simply open an issue! One possible alternative is sketched below.
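For example, a minimal sketch of improvement-only checkpointing (the names below are illustrative, not the ones used in train.py):

import torch

# Save a checkpoint only when the evaluation loss improves, overwriting a single file.
# (Hypothetical helper; train.py currently saves at every eval_interval instead.)
def maybe_save_checkpoint(model, val_loss, best_val_loss, path='baseGPT_best.pt'):
    if val_loss < best_val_loss:
        torch.save(model.state_dict(), path)
        return val_loss   # new best loss
    return best_val_loss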

langmodel.py

This is the GPT implementation, based on Karpathy's (2023) "Let's Build GPT" lecture, which can be accessed here.

  • In this file you can adjust the hyperparameter settings to suit the compute available to you. If a GPU is available, the model will automatically run on it (see the device-selection sketch below).
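The device selection follows the usual PyTorch pattern, roughly:

import torch

# Run on the GPU when one is available, otherwise fall back to the CPU.
device = 'cuda' if torch.cuda.is_available() else 'cpu'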

PPO_main.py

This implements the PPO fine-tuning that eliminates occurrences of the letter "u". Note that if you want to adjust the reward threshold (u_threshold), this needs to be done manually in GenEnv.py.

You can run PPO_main.py in the terminal to train. This will be very slow unless you have access to a GPU or have significantly reduced the size of the GPT in langmodel.py.
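For reference, the core of PPO is the clipped surrogate objective; a standard formulation looks like the sketch below (the details in PPO_main.py, such as the clip range, may differ):

import torch

# Standard PPO clipped surrogate loss (illustrative, not copied from PPO_main.py).
def ppo_clip_loss(log_probs, old_log_probs, advantages, clip_eps=0.2):
    ratio = torch.exp(log_probs - old_log_probs)   # pi_new(a|s) / pi_old(a|s)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps) * advantages
    return -torch.min(unclipped, clipped).mean()   # negated because we minimize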

print_outputs.py

Does what it says on the tin. Run this with the appropriate weights file (the path might need to be changed in the file) to generate text from your trained model.
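The flow is roughly: load the saved weights into the model from langmodel.py, then sample tokens and decode them back to characters. A hedged sketch (the class, method, and checkpoint names below are assumptions based on the lecture the model follows; check the actual files):

import torch
from langmodel import GPTLanguageModel  # class name assumed; see langmodel.py

# Illustrative loading-and-generation flow; print_outputs.py may differ in detail.
model = GPTLanguageModel()
model.load_state_dict(torch.load('baseGPT_best.pt', map_location='cpu'))  # hypothetical checkpoint name
model.eval()

context = torch.zeros((1, 1), dtype=torch.long)  # start generation from an empty context
tokens = model.generate(context, max_new_tokens=500)[0].tolist()
# Map the token ids back to characters with the repo's decode function before printing.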

GenEnv.py

This implements the environment used for PPO.
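The reward penalizes occurrences of the letter "u" in the generated text relative to the u_threshold setting. A sketch of that idea (the actual reward formula in GenEnv.py may differ):

# Illustrative reward: penalize text with more 'u' characters than the threshold allows.
# (Not the exact formula used in GenEnv.py.)
def u_reward(text, u_threshold=0):
    u_count = text.lower().count('u')
    return 1.0 if u_count <= u_threshold else -float(u_count - u_threshold)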

ValueNetwork.py

Implements the critic for PPO, which provides state-value estimates.
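A minimal critic is just a small network mapping a state representation to a scalar value estimate; a sketch along those lines (layer sizes and names are illustrative; see ValueNetwork.py for the actual architecture):

import torch.nn as nn

# Minimal critic sketch: maps a state representation to a scalar value estimate V(s).
# (Illustrative only; the real ValueNetwork.py may be structured differently.)
class ValueNetwork(nn.Module):
    def __init__(self, state_dim, hidden_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, state):
        return self.net(state)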

Sorry in Advance

This is quite an ugly implementation; to recreate it you will have to tweak the files a bit, for example adjusting the u_threshold in GenEnv.py. If there are issues, please reach out!
