
Which LLM models are supported? #341

Closed
yyyhainan opened this issue Jul 3, 2024 · 15 comments

Comments

@yyyhainan

Are other LLM models supported, such as ChatGLM and Qwen?

@Lbaiall commented Jul 3, 2024

I have the same question. Where can I see the main source code?

@dinobot22

+1

@andysingal

+1

@young169 commented Jul 4, 2024

Also, can we use locally deployed LLMs rather than only via API keys?

@zzk2021 commented Jul 4, 2024

Same question.

@gallypette

+1

@AlonsoGuevara (Contributor)

Hi!
During our research we got the highest quality out of gpt-4, gpt-4-turbo, and gpt-4o, which is why we include out-of-the-box support for these in both OpenAI and Azure environments.

Regarding local hosting, there's a very interesting conversation going on in thread #339.

@bmaltais commented Jul 4, 2024

I have tested gemma2 and llama3 with success. The only thing that does not work locally is the embeddings. There needs to be a fix to accept the style of response coming from Ollama when querying embeddings. Once that is fixed, you will be able to run this 100% locally on a personal computer, but you will probably need an NVIDIA card with 24 GB of VRAM, like a 3090, or an M-series Mac with 32 GB of RAM.
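Until such a fix lands, one workaround is a small translation shim between GraphRAG and Ollama. Below is a minimal sketch, not part of either project: it assumes Ollama's mid-2024 embeddings API (POST /api/embeddings with {"model", "prompt"}, returning {"embedding": [...]}) on the default port 11434, and re-wraps each result in an OpenAI-style envelope. The endpoint paths, the port, and the OLLAMA_URL name are assumptions to verify locally.

```python
# Minimal sketch (not the project's code): an OpenAI-compatible
# /v1/embeddings endpoint that forwards to a local Ollama server.
# Assumes Ollama's mid-2024 API: POST /api/embeddings with
# {"model": ..., "prompt": ...} -> {"embedding": [...]}.
from fastapi import FastAPI
from pydantic import BaseModel
import httpx

OLLAMA_URL = "http://127.0.0.1:11434/api/embeddings"  # default Ollama port (assumption)

app = FastAPI()

class EmbeddingRequest(BaseModel):
    model: str
    input: str | list[str]  # OpenAI clients may send one string or a batch

@app.post("/v1/embeddings")
async def embeddings(req: EmbeddingRequest):
    texts = [req.input] if isinstance(req.input, str) else req.input
    data = []
    async with httpx.AsyncClient() as client:
        for i, text in enumerate(texts):
            # Ollama embeds one prompt per request
            resp = await client.post(
                OLLAMA_URL, json={"model": req.model, "prompt": text}, timeout=120
            )
            data.append({
                "object": "embedding",
                "index": i,
                "embedding": resp.json()["embedding"],
            })
    # Re-wrap in the OpenAI-style response shape GraphRAG expects
    return {
        "object": "list",
        "model": req.model,
        "data": data,
        "usage": {"prompt_tokens": 0, "total_tokens": 0},
    }
```

Run it with, e.g., `uvicorn shim:app --port 9997` and point GRAPHRAG_EMBEDDING_API_BASE at it.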

@zzk2021 commented Jul 5, 2024

> I have tested gemma2 and llama3 with success. The only thing that does not work locally is the embeddings. There needs to be a fix to accept the style of response coming from Ollama when querying embeddings. Once that is fixed, you will be able to run this 100% locally on a personal computer, but you will probably need an NVIDIA card with 24 GB of VRAM, like a 3090, or an M-series Mac with 32 GB of RAM.

Can we use local embeddings?

@vamshi-rvk

> I have tested gemma2 and llama3 with success. […]

Can you help me with running llama3 locally, please?

@ishotoli commented Jul 7, 2024

> Can you help me with running llama3 locally, please?

Here's my .env file; put it under the ./ragtest dir. I hope this helps:

```
GRAPHRAG_LLM_API_KEY=DEFAULTS
GRAPHRAG_LLM_TYPE=openai_chat
GRAPHRAG_LLM_API_BASE=http://127.0.0.1:5081/v1
GRAPHRAG_LLM_MODEL=Hermes-2-Pro-Llama-3-Instruct-Merged-DPO
GRAPHRAG_LLM_REQUEST_TIMEOUT=700
GRAPHRAG_LLM_MODEL_SUPPORTS_JSON=True
GRAPHRAG_LLM_THREAD_COUNT=16
GRAPHRAG_LLM_CONCURRENT_REQUESTS=16
GRAPHRAG_EMBEDDING_TYPE=openai_embedding
GRAPHRAG_EMBEDDING_API_BASE=http://127.0.0.1:9997/v1
GRAPHRAG_EMBEDDING_MODEL=bce-embedding-base_v1
GRAPHRAG_EMBEDDING_BATCH_SIZE=64
GRAPHRAG_EMBEDDING_BATCH_MAX_TOKENS=512
GRAPHRAG_EMBEDDING_THREAD_COUNT=16
GRAPHRAG_EMBEDDING_CONCURRENT_REQUESTS=16
GRAPHRAG_INPUT_FILE_PATTERN=".*.txt$"
```
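For anyone following along: with a config like this, the usual flow at the time of this thread was `python -m graphrag.index --init --root ./ragtest` to scaffold the project and `python -m graphrag.index --root ./ragtest` to build the index; any OpenAI-compatible server (vLLM, Xinference, LM Studio, and the like) can sit behind GRAPHRAG_LLM_API_BASE and GRAPHRAG_EMBEDDING_API_BASE. Check the flags against your installed GraphRAG version, as the CLI has changed across releases.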


@AlonsoGuevara (Contributor)

Hi! We are centralizing other LLM discussions in these threads:
Other LLM/API bases: #339
Ollama: #345
Local embeddings: #370

I'll resolve this issue so we can keep the focus on those threads.

@xpdd123 commented Jul 10, 2024

I tested gemma2 successfully, but glm4 failed. I guess it's because of the input length limit of the LLM.
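If glm4 is failing on context length, shrinking the chunks GraphRAG feeds the model may help. A hedged sketch of the relevant settings, assuming the GRAPHRAG_CHUNK_* environment variable names from the GraphRAG docs of that era (verify against your version's configuration reference):

```
# Smaller chunks so prompts fit a shorter context window (names assumed)
GRAPHRAG_CHUNK_SIZE=512
GRAPHRAG_CHUNK_OVERLAP=64
```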

@whisper-bye

qwen2:7b fails.
