
embedjs llama-cpp integration fails to run #180

Open
GhostDog98 opened this issue Dec 3, 2024 · 8 comments
Labels: bug (Something isn't working), stale

Comments

@GhostDog98

🐛 Describe the bug

To reproduce:

import { RAGApplicationBuilder, TextLoader } from '@llm-tools/embedjs'
import { LlamaCppEmbeddings, LlamaCpp } from '@llm-tools/embedjs-llama-cpp';
import { HNSWDb } from '@llm-tools/embedjs-hnswlib'

const app = await new RAGApplicationBuilder()
    .setModel(new LlamaCpp({modelPath:"./models/Llama-3.2-3B-Instruct-f16.gguf"}))
    .setEmbeddingModel(new LlamaCppEmbeddings({modelPath: "./models/dragon-yi-1-5-9.gguf"}))
    .setVectorDatabase(new HNSWDb())
    .build();

This code works if run with Ollama for both setModel and setEmbeddingModel.
It appears to fail because the file models/embedjs-llama-cpp/src/llama-cpp-embeddings.ts does not import the requisite function getEmbeddingFor, which is called on line 24, resulting in the error:

TypeError: Cannot read properties of undefined (reading 'getEmbeddingFor')
    at file:///home/ghostdog/private-site/models/embedjs-llama-cpp/src/llama-cpp-embeddings.ts:24:50
    at Array.map (<anonymous>)
    at LlamaCppEmbeddings.embedDocuments (file:///home/ghostdog/private-site/models/embedjs-llama-cpp/src/llama-cpp-embeddings.ts:23:33)
    at LlamaCppEmbeddings.getDimensions (file:///home/ghostdog/private-site/models/embedjs-llama-cpp/src/llama-cpp-embeddings.ts:17:35)
    at RAGApplication.init (file:///home/ghostdog/private-site/core/embedjs/src/core/rag-application.ts:71:88)
    at async RAGApplicationBuilder.build (file:///home/ghostdog/private-site/core/embedjs/src/core/rag-application-builder.ts:46:9)
    at async file:///home/ghostdog/private-site/index.js:40:13

Am I missing any imports or making any obvious errors?
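(Note for readers hitting the same trace: this TypeError is the classic signature of reading an async-initialized field before its initialization has completed. A minimal, self-contained sketch of the pattern, using hypothetical names rather than the actual embedjs source:)

```typescript
// Hypothetical reproduction of the failure pattern: the embedding
// context is created inside an async init() that was never awaited,
// so `this.context` is still undefined when embedDocuments() runs.
class FakeEmbeddings {
    private context?: { getEmbeddingFor(text: string): number[] };

    async init(): Promise<void> {
        // Simulated model load; only after this resolves is the
        // context safe to use.
        this.context = { getEmbeddingFor: (text) => [text.length] };
    }

    embedDocuments(texts: string[]): number[][] {
        // Throws "Cannot read properties of undefined (reading
        // 'getEmbeddingFor')" if init() has not finished yet.
        return texts.map((t) => this.context!.getEmbeddingFor(t));
    }
}

(async () => {
    const e = new FakeEmbeddings();
    try {
        e.embedDocuments(['hello']); // init() never awaited: throws
    } catch (err) {
        console.log((err as Error).message); // same 'getEmbeddingFor' TypeError
    }
    await e.init();
    console.log(e.embedDocuments(['hello'])); // prints: [ [ 5 ] ]
})();
```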

@GhostDog98 (Author)

Here is clearer sample code:

import { RAGApplicationBuilder, TextLoader } from '@llm-tools/embedjs'
import { LlamaCppEmbeddings, LlamaCpp } from '@llm-tools/embedjs-llama-cpp';
import { HNSWDb } from '@llm-tools/embedjs-hnswlib'
import { OllamaEmbeddings, Ollama } from '@llm-tools/embedjs-ollama';

async function load_app_llamacpp(){
    const app = await new RAGApplicationBuilder()
    .setModel(new LlamaCpp({modelPath:"models/Llama-3.2-3B-Instruct-f16.gguf"}))
    .setEmbeddingModel(new LlamaCppEmbeddings({modelPath: "models/dragon-yi-1-5-9.gguf"}))
    .setVectorDatabase(new HNSWDb())
    .build();

    return app;
}

async function load_app_ollama(){
    const app = await new RAGApplicationBuilder()
    .setModel(new Ollama({modelName: "llama3.2", baseUrl: 'http://localhost:11434'}))
    .setEmbeddingModel(new OllamaEmbeddings({modelName: "nomic-embed-text", baseUrl: 'http://localhost:11434'}))
    .setVectorDatabase(new HNSWDb())
    .build();

    return app;
}

//let app = await load_app_llamacpp(); // Always fails

//let app = await load_app_ollama(); // Works as expected

@GhostDog98 (Author)

GhostDog98 commented Dec 3, 2024

Update: actually, I think OllamaEmbeddings doesn't set the modelName correctly, as it still defaults to mxbai-embed-large and thus errors out. The default is defined in @langchain/ollama/dist/embeddings, so a hack would be to change that value, but preferably it would be set correctly... Shall I create another issue for this?
Edit: it appears this is solely a documentation issue; while the quickstart has the correct option, the embeddings page does not.
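(Note for readers: this is why an option key copied from a stale docs page fails silently rather than erroring. A generic sketch; the key names are illustrative, not embedjs's actual API:)

```typescript
// Unknown option keys are simply ignored, so the built-in default wins
// without any warning.
function makeEmbeddings(options: Record<string, string> = {}): string {
    // Only `model` is consulted; any other key falls through to the default.
    return options.model ?? 'mxbai-embed-large';
}

// Correct key, as in the quickstart:
console.log(makeEmbeddings({ model: 'nomic-embed-text' })); // nomic-embed-text
// Stale key from an outdated docs page: silently uses the default.
console.log(makeEmbeddings({ modelName: 'nomic-embed-text' })); // mxbai-embed-large
```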

@adhityan (Collaborator)

adhityan commented Dec 3, 2024

Yes please. There was another issue that I found in your original post. The way the initialization was happening allowed for a race condition - this has been addressed in the version just published (0.1.23). For the model name, please create a separate issue.
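(Note for readers: a common way to close this kind of initialization race — sketched here with my own names, not the actual 0.1.23 patch — is to cache a single init promise and await it before every use, so concurrent callers all wait on the same initialization:)

```typescript
// Hypothetical race-free variant: the context is never touched before
// a shared init promise has resolved.
class SafeEmbeddings {
    private context?: { getEmbeddingFor(text: string): number[] };
    private ready?: Promise<void>;

    private ensureReady(): Promise<void> {
        if (!this.ready) {
            this.ready = (async () => {
                // Simulated model load.
                this.context = { getEmbeddingFor: (t) => [t.length] };
            })();
        }
        return this.ready; // every caller awaits the same promise
    }

    async embedDocuments(texts: string[]): Promise<number[][]> {
        await this.ensureReady(); // guarantees context is initialized
        return texts.map((t) => this.context!.getEmbeddingFor(t));
    }
}

(async () => {
    const e = new SafeEmbeddings();
    // Safe even with no explicit init() call beforehand.
    console.log(await e.embedDocuments(['hi', 'there'])); // prints: [ [ 2 ], [ 5 ] ]
})();
```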

@adhityan adhityan closed this as completed Dec 3, 2024
@GhostDog98 (Author)

> Yes please. There was another issue that I found in your original post. The way the initialization was happening allowed for a race condition - this has been addressed in the version just published (0.1.23). For the model name, please create a separate issue.

I'm still seeing this issue unfortunately. npm list outputs:

├── @llm-tools/[email protected]
├── @llm-tools/[email protected]
├── @llm-tools/[email protected]
├── @llm-tools/[email protected]
└── @llm-tools/[email protected]

So I am up to date. However, it seems to be getting slightly further, as the error log is now:

ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 CUDA devices:
  Device 0: NVIDIA GeForce RTX 3060, compute capability 8.6, VMM: yes
TypeError: Cannot read properties of undefined (reading 'getEmbeddingFor')
    at file:///home/private-site/models/embedjs-llama-cpp/src/llama-cpp-embeddings.ts:32:54
    at Array.map (<anonymous>)
    at LlamaCppEmbeddings.embedDocuments (file:///home/private-site/models/embedjs-llama-cpp/src/llama-cpp-embeddings.ts:31:19)
    at LlamaCppEmbeddings.getDimensions (file:///home/private-site/models/embedjs-llama-cpp/src/llama-cpp-embeddings.ts:24:35)
    at RAGApplication.init (file:///home/private-site/core/embedjs/src/core/rag-application.ts:71:88)
    at async RAGApplicationBuilder.build (file:///home/private-site/core/embedjs/src/core/rag-application-builder.ts:46:9)
    at async load_app_llamacpp (file:///home/private-site/mre.js:7:17)
    at async file:///home/private-site/mre.js:16:11

@GhostDog98 (Author)

@adhityan if you could reopen this issue that would be fantastic, as I'm still having issues 👍

@adhityan adhityan reopened this Dec 12, 2024

This issue is stale because it has been open for 14 days with no activity.

@github-actions github-actions bot added the stale label Dec 27, 2024
@adhityan (Collaborator)

Sorry, testing this is a little difficult for me. I do most of my programming on a MacBook. I have a Windows laptop with an Nvidia GPU, but it has limitations in running llama-cpp.

@adhityan adhityan added the bug Something isn't working label Dec 27, 2024
@github-actions github-actions bot removed the stale label Dec 28, 2024

This issue is stale because it has been open for 14 days with no activity.

@github-actions github-actions bot added the stale label Jan 11, 2025