This is an LLM, trained specifically to be able to add big numbers. Mainly based on RedPajama3B, but also contains other experiments on this topic.
git clone https://github.com/xufana/4B_LLM_Calculator.git
cd 4B_LLM_Calculator
pip install -r requirements.txt
You can follow the ('experiments.ipynb') notebook to repeat the whole experiment.
Run the cell in the notebook or download the dataset on HuggingFace https://huggingface.co/datasets/xufana/RedPajama-INCITE-Instruct-3B-Addition.
! git clone https://github.com/xufana/4B_LLM_Calculator.git
%cd /content/4B_LLM_Calculator
! pip install -r requirements.txt
! python3 dataset_generator.py --add_volume 100 --sub_volume 100
! python3 lora_training.py
TBD
My implementation is mainly based on GOAT by Liu T., Hsiang B. 2023.