Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Llama3 weight analysis and visualization using safetensors #33

Open
wants to merge 6 commits into
base: main
Choose a base branch
from

Conversation

pythonLoader
Copy link
Collaborator

SUMMARY:
Weight analysis and visualization for llama3 models loaded from safetensors.

TEST PLAN:
No automatic tests written yet. It's a work in progress. Will add test after review.

@markurtz
Copy link
Collaborator

Thanks for he contribution @pythonLoader, I'm going to see if there's a bit more we can do to generalize this further over the next few weeks once I can find sometime.

I'm worried about landing specific scripts under utils or the code base.

If you'd like to get this in, then we can package it as an example but we'll need some more things around it such as a readme fo rusage.

@markurtz markurtz self-assigned this Oct 18, 2024
@pythonLoader
Copy link
Collaborator Author

Hi @markurtz, sure!

I can add an argparser and a README for this. Is there anything else that I need to do for making it more generalized?

@markurtz
Copy link
Collaborator

That should be fine for now provided we can add I as a dedicated directory under /examples!

markmc pushed a commit to markmc/llm-compressor that referenced this pull request Nov 13, 2024
* Compressed lifecycle implementation (INT8 only)

* Apply suggestions from code review

* small fixes for runtime

* Quantization Compressor Support (vllm-project#45)

* add classes

* WIP

* moving around classes

* code complete

* tests passing

* unit test bugs

* fill out int decompression

* docstrings

* allow repeat frozens

* int compressor unit tests

* PR comments

* fix device issue

* fixing leaf checker

* initial commit

* Revert "Merge branch 'main' into compressed-lifecycle"

This reverts commit 8dcdde51bb153036b551a5a0252208f1ff9275e1, reversing
changes made to bb369367bd74f7b18ddd3bfdc32db2902b604f92.

* update version

* fix test

---------

Co-authored-by: Sara Adkins <[email protected]>
Co-authored-by: dbogunowicz <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants