feat: offline support for tiktoken #7588
Open
+10
−2
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Title
Implement pre-download of tiktoken tokenizer file for the non_root offline image
Relevant issues
N/A
Type
🆕 New Feature
Changes
CUSTOM_TIKTOKEN_CACHE_DIR
environment value in the final image so litellm will set the correct offline cache dir[REQUIRED] Testing - Attach a screenshot of any new tests passing locally
Tested by running openai / non recognized model, which defaults to tiktoken
Future work
Pre-download llama-2 and llama-3 tokenizers. Cohere and Anthropic ones are probably not required, as these are not available for self hosting without internet connection anyways.