
feat: sync llama.cpp #79

Merged · 7 commits merged into mybigday:main on Nov 2, 2024

Conversation

@a-ghorbani (Contributor) commented Oct 30, 2024

Sync llama.cpp with the b3972 release.

This will close #77.

@a-ghorbani (Contributor, Author)

I’m almost done, just running the final tests.
This is mostly based on @Vali-98's work: https://github.com/Vali-98/cui-llama.rn

@Vali-98, if you have a moment, a quick review would be much appreciated to catch anything I might have missed!

@Vali-98 (Contributor) commented Oct 30, 2024

It looks okay at a glance, though I can't confirm the iOS code.

Just to confirm: I applied the changes for cui-llama.rn from my own fork of llama.cpp (https://github.com/Vali-98/llama.cpp/tree/cui-llama.rn). @a-ghorbani, did you swap to this fork, or did you translate my changes into the relevant patch files?

The current method of patching changes into llama.rn is the main reason I wanted my own fork instead, as applying the patch files tends to fail spectacularly when syncing llama.cpp. After 2-3 syncs, I decided it was a major maintainability issue. I assume this is also the reason the CI is failing.

Ideally, release builds would include prebuilt binaries instead of redundantly building rn-llama.cpp each time.

@a-ghorbani (Contributor, Author)

> It looks okay at a glance, though I can't confirm the iOS code.

Cool! For the iOS part, I took some ideas from the Android-related changes.

> Just to confirm: I applied the changes for cui-llama.rn from my own fork of llama.cpp (https://github.com/Vali-98/llama.cpp/tree/cui-llama.rn). @a-ghorbani, did you swap to this fork, or did you translate my changes into the relevant patch files?

I translated the changes into the relevant patch files to stay consistent with the approach used in this repo.

> The current method of patching changes into llama.rn is the main reason I wanted my own fork instead, as applying the patch files tends to fail spectacularly when syncing llama.cpp. After 2-3 syncs, I decided it was a major maintainability issue. I assume this is also the reason the CI is failing.

Yep, maintaining these patches is a challenge. Usually I revert the changes and recreate the patches from scratch.
(The CI failure was actually due to the submodule update having an extra --remote flag, which fetched the latest llama.cpp rather than the commit pinned here; it should be fixed now.)

@Vali-98 (Contributor) commented Oct 31, 2024

You might want to mention that this adds the vocab_only field, as it isn't actually part of gpt_params in llama.cpp.
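
For readers unfamiliar with the flag, here is a minimal, hypothetical sketch of how a bindings-level vocab_only option can be forwarded to llama.cpp. The InitParams struct and load_model function are placeholder names, not llama.rn's actual code; the only details taken from llama.cpp itself are that llama_model_params exposes a vocab_only member and that llama_load_model_from_file accepts those params.

```cpp
// Hypothetical sketch: InitParams and load_model are illustrative names, not
// llama.rn's real API. It only shows how a bindings-level vocab_only flag can
// be forwarded to llama.cpp, whose llama_model_params struct does expose
// vocab_only ("only load the vocabulary, no weights").
#include <string>
#include "llama.h"

struct InitParams {           // hypothetical bindings-side parameters
    std::string model_path;
    bool vocab_only = false;  // load the tokenizer/vocab only, skip the weights
};

static llama_model * load_model(const InitParams & ip) {
    llama_model_params mp = llama_model_default_params();
    mp.vocab_only = ip.vocab_only;  // not in gpt_params, but in llama_model_params
    return llama_load_model_from_file(ip.model_path.c_str(), mp);
}
```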

a-ghorbani marked this pull request as ready for review on October 31, 2024 at 08:16
@jhen0409 (Member) commented Nov 2, 2024

This is awesome, thanks for the contribution!

It looks like n_gpu_layers = 0 for disabling the GPU is broken (see ggerganov/llama.cpp#10089); I'll check that out before merging this.
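
For context on that setting, a minimal sketch (assuming the public llama.cpp C API of that period, not this PR's code) of what n_gpu_layers = 0 is meant to do, i.e. run entirely on the CPU; ggerganov/llama.cpp#10089 tracked a regression in exactly this path.

```cpp
// Illustrative only: n_gpu_layers controls how many layers are offloaded to
// the GPU backend; 0 is the conventional "CPU only" setting whose breakage
// is tracked upstream in ggerganov/llama.cpp#10089.
#include "llama.h"

static llama_model * load_cpu_only(const char * model_path) {
    llama_model_params mp = llama_model_default_params();
    mp.n_gpu_layers = 0;  // offload nothing; keep every layer on the CPU
    return llama_load_model_from_file(model_path, mp);
}
```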

jhen0409 merged commit 1ca3044 into mybigday:main on Nov 2, 2024 · 4 checks passed
Successfully merging this pull request may close: RWKV support (sync llama.cpp)