Issue with integrating with lm-eval harness #97
Comments
Hi @sriyachakravarthy, would you mind providing the scripts to reproduce?
You may also want to check out https://github.com/LeiWang1999/vllm-bitblas/tree/bitblas-intg with:

    import torch
    from vllm import SamplingParams

    # VllmRunner, in_seq_len, out_seq_len and batch_size are not defined in
    # this snippet; they are assumed to come from the surrounding benchmark script.
    with VllmRunner(
        "BitBLASModel/open_llama_3b_1.58bits_bitblas",
        dtype="half",
        quantization="bitblas",
        enforce_eager=False,
    ) as bitnet_model:
        # Build a dummy prompt of in_seq_len repeated tokens.
        prompt = ""
        for i in range(0, in_seq_len):
            prompt += "a "
        prompts = [prompt] * batch_size

        sampling_params = SamplingParams(max_tokens=out_seq_len)
        # Profile only the generation call.
        torch.cuda.profiler.start()
        bitbnet_outputs = bitnet_model.generate(
            prompts, sampling_params=sampling_params
        )
        torch.cuda.profiler.stop()

This is much faster than the naive integration implementation.
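The prompt construction in the snippet above can be tried out without a GPU. Below is a minimal sketch; the helper names `build_prompts` and `tokens_per_second` are hypothetical and not part of vLLM or BitBLAS, and the throughput figure assumes every sequence generates the full `out_seq_len` tokens, so it is an upper bound.

```python
from typing import List


def build_prompts(in_seq_len: int, batch_size: int) -> List[str]:
    """Build batch_size identical prompts of in_seq_len repeated "a " pieces."""
    prompt = "a " * in_seq_len  # same result as the += loop in the snippet
    return [prompt] * batch_size


def tokens_per_second(batch_size: int, out_seq_len: int, elapsed_s: float) -> float:
    """Upper-bound throughput: assumes no sequence stops before out_seq_len."""
    if elapsed_s <= 0:
        raise ValueError("elapsed_s must be positive")
    return batch_size * out_seq_len / elapsed_s


if __name__ == "__main__":
    prompts = build_prompts(in_seq_len=4, batch_size=2)
    print(prompts)
    # elapsed_s would come from timing the generate() call, e.g. time.perf_counter()
    print(tokens_per_second(batch_size=2, out_seq_len=128, elapsed_s=0.5))
```

Timing the `generate` call with `time.perf_counter()` and passing the elapsed time to `tokens_per_second` gives a rough number for comparing the two integrations.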
Sure, here is the script:
%pip install git+https://github.com/EleutherAI/lm-evaluation-harness.git@big-refactor
!lm_eval --model hf
And when I try to use the instructions from the model card (https://huggingface.co/1bitLLM/bitnet_b1_58-3B), I get the following:
Sure, will do.
The code there was not provided by BitBLAS; check out the integration under
Btw, some benchmark numbers of 1.58-bit vLLM
Thanks! Also, is the transformers Trainer package compatible with fine-tuning the model?
@sriyachakravarthy Sorry, I have no experience with that.
Hi! I tried evaluating 1bitLLM/bitnet_b1_58-3B from Hugging Face. I am getting the error ValueError: Tokenizer class BitnetTokenizer does not exist or is not currently imported.
Kindly help!
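One way to narrow this error down is to look at what the model repository's tokenizer_config.json declares. The following is a minimal sketch, assuming the error comes from the `tokenizer_class` field naming a class that the installed transformers version does not ship. The helper `diagnose_tokenizer_config` and the sample config string are hypothetical, not taken from the actual repository.

```python
import json


def diagnose_tokenizer_config(config_text: str) -> str:
    """Classify a tokenizer_config.json payload (passed as a JSON string)."""
    config = json.loads(config_text)
    tokenizer_class = config.get("tokenizer_class")
    auto_map = config.get("auto_map") or {}
    if tokenizer_class is None:
        return "no tokenizer_class declared"
    if "AutoTokenizer" in auto_map:
        # The repository points at its own tokenizer code.
        return f"{tokenizer_class}: custom code declared via auto_map"
    # No auto_map entry: the class has to exist in the installed library.
    return f"{tokenizer_class}: must be importable from the installed library"


if __name__ == "__main__":
    # Hypothetical config contents, for illustration only.
    sample = '{"tokenizer_class": "BitnetTokenizer"}'
    print(diagnose_tokenizer_config(sample))
```

If the class is custom and the repository ships no loadable tokenizer code for it, the tokenizer implementation from the model's own files would likely have to be imported manually before evaluation.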