GitHub - guoguo1314/llama3_learn.c: Inference deployment of the llama3

Notice: This code is built on llama2.c for inference deployment of Llama 3; convert-tokenizer-llama3.py comes from distributed-llama. The code explanation can be found on this page.
For the Chinese version, see Zhihu (知乎).
This is my first open-source project, so please point out any errors.

1. download model

For users in China, try this link.
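
Before converting, it can help to confirm the download is complete. The sketch below is only an illustration and is not part of this repository: the helper name `missing_model_files` is hypothetical, and the file list assumes the usual layout of the `Meta-Llama-3-8B/original` folder (one checkpoint shard, `params.json`, and `tokenizer.model`). Adjust the names if your download differs.

```python
from pathlib import Path

# Assumed contents of Meta-Llama-3-8B/original; change if your download differs.
EXPECTED_FILES = ("consolidated.00.pth", "params.json", "tokenizer.model")


def missing_model_files(model_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are absent from model_dir (hypothetical helper)."""
    root = Path(model_dir)
    return [name for name in expected if not (root / name).is_file()]
```

Calling `missing_model_files("Meta-Llama-3-8B/original")` should return an empty list when every assumed file is present.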

2. clone code

git clone https://github.com/guoguo1314/llama3_learn.c.git
cd llama3_learn.c

3. convert tokenizer

python convert-tokenizer-llama3.py tokenizer.model
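
To get a feel for what the tokenizer step reads, here is a minimal sketch. It assumes that the Llama 3 `tokenizer.model` is a tiktoken-style text file where each line holds a base64-encoded token and its integer rank. The function `load_tiktoken_ranks` is a hypothetical name for illustration and does not reproduce the logic of convert-tokenizer-llama3.py.

```python
import base64


def load_tiktoken_ranks(path):
    """Parse a tiktoken-style file: one 'base64(token_bytes) rank' pair per line.

    Assumption: this matches the Llama 3 tokenizer.model layout.
    """
    ranks = {}
    with open(path, "rb") as f:
        for line in f:
            if not line.strip():
                continue  # skip blank lines
            token_b64, rank = line.split()
            ranks[base64.b64decode(token_b64)] = int(rank)
    return ranks
```

With this, `len(load_tiktoken_ranks("tokenizer.model"))` gives the number of entries listed in the file under the assumption above.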

4. convert model

python convert-llama3.py Meta-Llama-3-8B/original llama3_8.bin
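
A quick way to sanity-check the converted file is to look at its header. The sketch below assumes the output keeps the legacy llama2.c layout, where the file starts with seven little-endian int32 values in the order of the `Config` struct in llama2.c's run.c. Whether convert-llama3.py writes exactly this header is an assumption, and `read_header` is a hypothetical helper name.

```python
import struct

# Field order assumed from the Config struct in llama2.c's run.c.
HEADER_FIELDS = ("dim", "hidden_dim", "n_layers", "n_heads",
                 "n_kv_heads", "vocab_size", "seq_len")


def read_header(path):
    """Read seven little-endian int32 values from the start of a model file."""
    size = 4 * len(HEADER_FIELDS)
    with open(path, "rb") as f:
        raw = f.read(size)
    if len(raw) < size:
        raise ValueError("file too short to contain a header")
    values = struct.unpack("<7i", raw)
    # In llama2.c a negative vocab_size signals unshared classifier weights.
    return dict(zip(HEADER_FIELDS, values))
```

For example, `read_header("llama3_8.bin")` returns a dict of the seven fields, which can be compared against the values in `params.json`.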

5. run

make run
./run llama3_8.bin
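
Steps 3 to 5 can also be scripted in one place. This is a small sketch rather than something shipped with the repository: `pipeline_commands` and `run_pipeline` are hypothetical names, and the default paths are simply the ones used in the commands above.

```python
import shlex
import subprocess


def pipeline_commands(model_dir="Meta-Llama-3-8B/original",
                      out_bin="llama3_8.bin",
                      tokenizer="tokenizer.model"):
    """Return the commands from steps 3 to 5 as argument lists."""
    return [
        ["python", "convert-tokenizer-llama3.py", tokenizer],
        ["python", "convert-llama3.py", model_dir, out_bin],
        ["make", "run"],
        ["./run", out_bin],
    ]


def run_pipeline(commands):
    """Run each command in order and stop at the first failure."""
    for cmd in commands:
        print("+", shlex.join(cmd))
        subprocess.run(cmd, check=True)
```

Run `run_pipeline(pipeline_commands())` from inside the cloned llama3_learn.c directory. Because of `check=True`, a failed conversion stops the script before `./run` is attempted.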

6. appreciate

Finally, thanks to karpathy and b4rtaz for their open-source work, though I don't know whether they will ever see this.

