Notice:
This code built on llama2.c for Inference deployment of the llama3, where convert-tokenizer-llama3.py is distribution-llama. The code explanation can be found on this page.
For Chinese version, see 知乎
First time open source, please point out any errors.
For CN,点它试试
git clone https://github.com/guoguo1314/llama3_learn.c.git
cd llama3_learn.c
python convert-tokenizer-llama3.py tokenizer.model
python convert-llama3.py Meta-Llama-3-8B/original llama3_8.bin
make run
./run llama3_8.bin
Finally, thank you karpathy, b4rtaz open source, I don't know if you can see.