This repository aims to provide minimal code for generative models for educational purposes. The code depends only on PyTorch 2.0, with no Hugging Face Transformers dependency.
To begin with, I included code to train a 51M-parameter language model. I will add image generation and more features in the future.
This repository is tested on:
- Python 3.10.12
- Poetry 1.6.1
- NVIDIA V100 GPU
- CUDA 11.8
For the Python packages, please refer to pyproject.toml.
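Assuming the standard Poetry workflow, the dependencies listed there can be installed with:

```sh
poetry install
```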
I trained a 51M-parameter language model on 1B tokens from BookCorpus. Training took around 20 hours on a single V100 GPU, which cost around $50. The final model achieved a perplexity of 0.83.
To create a tokenizer, run:

```sh
poetry run python generative_ai/scripts/create_tokenizer.py
```
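As a sketch of what a tokenizer-creation script might do internally, here is a minimal BPE tokenizer built with the Hugging Face `tokenizers` package (a standalone library, distinct from Transformers); the file paths, vocabulary size, and special tokens are illustrative assumptions, not necessarily the repository's actual settings:

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import BpeTrainer

# Train a byte-pair-encoding tokenizer on a raw text corpus.
# Paths, vocab size, and special tokens below are placeholders.
tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()
trainer = BpeTrainer(vocab_size=8192, special_tokens=["[UNK]", "[EOS]"])
tokenizer.train(files=["data/bookcorpus.txt"], trainer=trainer)
tokenizer.save("generative_ai/artifacts/tokenizer.json")
```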
To launch training, run:

```sh
poetry run python generative_ai/scripts/train.py
```
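For orientation, the core of a next-token-prediction training loop in PyTorch looks roughly like the sketch below; `model`, `loader`, and the optimizer settings are illustrative assumptions, not the script's actual configuration:

```python
import torch
import torch.nn.functional as F

def train_epoch(model, loader, device="cuda"):
    # Assumes a decoder-only model mapping token ids (batch, seq)
    # to logits (batch, seq, vocab). Hyperparameters are placeholders.
    optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4, weight_decay=0.1)
    model.train()
    for batch in loader:  # batch: (batch_size, seq_len + 1) token ids
        batch = batch.to(device)
        # Shift by one position: predict token t+1 from tokens up to t.
        inputs, targets = batch[:, :-1], batch[:, 1:]
        logits = model(inputs)
        loss = F.cross_entropy(
            logits.reshape(-1, logits.size(-1)), targets.reshape(-1)
        )
        optimizer.zero_grad(set_to_none=True)
        loss.backward()
        optimizer.step()
```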
To generate sentences with the pretrained model, run:

```sh
$ poetry run python generative_ai/scripts/generate.py --model generative_ai/artifacts/model.pt --prompt "life is about"
number of parameters: 50.98M
life is about romance , and love and adrenaline , at the same time .
```
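Under the hood, generation is plain autoregressive sampling. Below is a minimal sketch, assuming a model that returns logits of shape (batch, seq, vocab) and a `tokenizers`-style tokenizer; the actual script's interface may differ:

```python
import torch

@torch.no_grad()
def generate(model, tokenizer, prompt, max_new_tokens=50, temperature=1.0):
    # Encode the prompt and sample one token at a time.
    ids = torch.tensor([tokenizer.encode(prompt).ids], device="cuda")
    for _ in range(max_new_tokens):
        logits = model(ids)[:, -1, :] / temperature  # last-position logits
        probs = torch.softmax(logits, dim=-1)
        next_id = torch.multinomial(probs, num_samples=1)
        ids = torch.cat([ids, next_id], dim=1)
    return tokenizer.decode(ids[0].tolist())
```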
`model.pt` can be obtained at Hugging Face Models.
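To fetch the checkpoint programmatically, one option is the `huggingface_hub` package; the repo id below is a placeholder, since the actual model repository is not named here:

```python
from huggingface_hub import hf_hub_download

# Placeholder repo id; substitute the actual Hugging Face Models repository.
path = hf_hub_download(repo_id="<user>/<repo>", filename="model.pt")
print(path)  # local path to the downloaded checkpoint
```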