MultiModal MultiLingual (3ML) ChatBot

This model is 4bit quantized of lm-4v-9b Model (Less than 9G).

It excels in document, image, chart questioning answering and delivers superior performance over GPT-4-turbo-2024-04-09, Gemini 1.0 Pro, Qwen-VL-Max, and Claude 3 Opus.

Some part of the original Model changed and It can excute on free version of google colab.

Try it with gradio support:

demo

Note: For optimal performance with document and image understanding, please use English or Chinese docs. The model can still handle chat in any supported language.

Quick Start

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from PIL import Image

device = "cuda"

modelPath="nikravan/glm-4vq"
tokenizer = AutoTokenizer.from_pretrained(modelPath, trust_remote_code=True)

model = AutoModelForCausalLM.from_pretrained(
    modelPath,
    torch_dtype=torch.bfloat16,
    low_cpu_mem_usage=True,
    trust_remote_code=True,
    device_map="auto"
)

query ='explain all the details in this picture'
image = Image.open("a3.png").convert('RGB')
#image=""
inputs = tokenizer.apply_chat_template([{"role": "user", "image": image, "content": query}],
                                       add_generation_prompt=True, tokenize=True, return_tensors="pt",
                                       return_dict=True)  # chat with image mode

inputs = inputs.to(device)

gen_kwargs = {"max_length": 2500, "do_sample": True, "top_k": 1}
with torch.no_grad():
    outputs = model.generate(**inputs, **gen_kwargs)
    outputs = outputs[:, inputs['input_ids'].shape[1]:]
    print(tokenizer.decode(outputs[0]))

##Samples:

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
3ML.ipynb		3ML.ipynb
README.md		README.md
nature.jpg		nature.jpg
receipt1.png		receipt1.png
sales.png		sales.png
test.jpg		test.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MultiModal MultiLingual (3ML) ChatBot

Try it with gradio support:

Quick Start

About

Releases

Packages

Languages

nikravan1/3ML

Folders and files

Latest commit

History

Repository files navigation

MultiModal MultiLingual (3ML) ChatBot

Try it with gradio support:

Quick Start

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages