RedPajama-Calculator

This is an LLM, trained specifically to be able to add big numbers. Mainly based on RedPajama3B, but also contains other experiments on this topic.

Local Setup

git clone https://github.com/xufana/4B_LLM_Calculator.git 
cd 4B_LLM_Calculator
pip install -r requirements.txt

You can follow the ('experiments.ipynb') notebook to repeat the whole experiment.

Dataset (`dataset_generator.py`)

Run the cell in the notebook or download the dataset on HuggingFace https://huggingface.co/datasets/xufana/RedPajama-INCITE-Instruct-3B-Addition.

Training (`lora_training.py`)

! git clone https://github.com/xufana/4B_LLM_Calculator.git
%cd /content/4B_LLM_Calculator
! pip install -r requirements.txt
! python3 dataset_generator.py --add_volume 100 --sub_volume 100
! python3 lora_training.py

Inference (`lora_inference.py`)

TBD

Acknowledgements

My implementation is mainly based on GOAT by Liu T., Hsiang B. 2023.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
templates		templates
utils		utils
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
dataset_generator.py		dataset_generator.py
experiments.ipynb		experiments.ipynb
lora_inference.py		lora_inference.py
lora_training.py		lora_training.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RedPajama-Calculator

Local Setup

Dataset (`dataset_generator.py`)

Training (`lora_training.py`)

Inference (`lora_inference.py`)

Acknowledgements

About

Releases

Packages

Languages

xufana/4B_LLM_Calculator

Folders and files

Latest commit

History

Repository files navigation

RedPajama-Calculator

Local Setup

Dataset (dataset_generator.py)

Training (lora_training.py)

Inference (lora_inference.py)

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Dataset (`dataset_generator.py`)

Training (`lora_training.py`)

Inference (`lora_inference.py`)

Packages