- Support for q4 mode only
- Single-thread mode only
- Use Vitis 2023.2 to compile the `hw/blas` project and export the HLS IP.
- Use Vivado 2023.2 to compile the `hw/project` project.
- Copy `llama.cpp` to the Ubuntu system on the KR26 development board and run the following commands in order:
  - `make llama-cli -j $(nproc)`
  - `cd firmware`
  - `./load.sh`
  - `cd ..`
  - `sudo ./llama-cli -m some_q4.gguf -t 1`
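
The on-board commands can also be wrapped in a small script so that a failed build or firmware load stops the run early. The following is only a sketch, not part of the project: the `run` and `build_and_run` helpers and the `DRY_RUN` variable are hypothetical names introduced here, and the script assumes it is started from the root of the copied `llama.cpp` directory.

```shell
#!/usr/bin/env bash
# Sketch: build llama-cli, load the firmware, and start inference on the board.
# Assumes the current directory is the root of the copied llama.cpp tree.
set -euo pipefail

# Hypothetical helper: print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Hypothetical wrapper around the same five commands, in the same order.
build_and_run() {
    # Model path; some_q4.gguf is the placeholder name used in the steps.
    local model="${1:-some_q4.gguf}"

    run make llama-cli -j "$(nproc)"
    run cd firmware
    run ./load.sh
    run cd ..
    # -t 1 because only single-thread mode is supported.
    run sudo ./llama-cli -m "$model" -t 1
}
```

After sourcing the script, `DRY_RUN=1 build_and_run some_q4.gguf` prints the commands without executing them, and `build_and_run some_q4.gguf` runs them for real.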