Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chatting with mistral generates answer with no spaces #1822

Open
micrem73 opened this issue Nov 4, 2024 · 1 comment
Open

chatting with mistral generates answer with no spaces #1822

micrem73 opened this issue Nov 4, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@micrem73
Copy link

micrem73 commented Nov 4, 2024

Bug description

steps to reproduce the issue:

  1. litgpt chat checkpoints/mistralai/Mistral-7B-Instruct-v0.3 --max_new_tokens 2048
  2. enter any prompt

I get answer with no space between words, i.e.: "Hello!I'mjustacomputerprogram,soIdon'thavefeelingslikeahumandoes.ButI'mheretohelpyouwithanyquestionsortasksyoumighthave!HowcanIassistyoutoday?"

Here is the full log:

⚡ ~ litgpt chat checkpoints/mistralai/Mistral-7B-Instruct-v0.3 --max_new_tokens 2048
{'access_token': None,
'checkpoint_dir': PosixPath('checkpoints/mistralai/Mistral-7B-Instruct-v0.3'),
'compile': False,
'max_new_tokens': 2048,
'multiline': False,
'precision': None,
'quantize': None,
'temperature': 0.8,
'top_k': 50,
'top_p': 1.0}
Now chatting with Mistral-7B-Instruct-v0.3.
To exit, press 'Enter' on an empty prompt.

Seed set to 1234

Prompt: hi, how are you?
Reply: Hello!I'mjustacomputerprogram,soIdon'thavefeelingslikeahumandoes.ButI'mheretohelpyouwithanyquestionsortasksyoumighthave!HowcanIassistyoutoday?
Time for inference: 3.55 sec total, 12.96 tokens/sec, 46 tokens

What operating system are you using?

Unknown

LitGPT Version




⚡ ~ pip show litgpt | grep Version
Version: 0.5.2
Version 2.0, January 2004
Licensed under the Apache License, Version 2.0 (the "License");

@micrem73 micrem73 added the bug Something isn't working label Nov 4, 2024
@rasbt
Copy link
Collaborator

rasbt commented Nov 4, 2024

Thanks for flagging this. I know Mistral is using their own tokenizer, but I could swear this worked before. Something to look into some time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants