[PT2] weight_compression #3293

Open
wants to merge 6 commits into
base: develop

AlexanderDokuchaev
Collaborator

@AlexanderDokuchaev AlexanderDokuchaev commented Feb 18, 2025

Changes

Implement weight compression algorithms for experimental PT tracing

Related tickets

152996

@AlexanderDokuchaev AlexanderDokuchaev changed the title from "[PT2} weight_compression" to "[PT2] weight_compression" Feb 18, 2025
@github-actions github-actions bot added the labels NNCF PT (Pull requests that updates NNCF PyTorch) and NNCF PTQ (Pull requests that updates NNCF PTQ) Feb 18, 2025
@AlexanderDokuchaev
Collaborator Author

| Model | Metric value | Metric diff | Num int8 | Num int4 | Compr. time | Stat. collection time | Mixed-Precision search time | Apply Compression time | Total time | RAM MiB | RAM MiB System | Status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| tinyllama_data_aware | 0.85767 | -0.14233 | 123 | 94 | 0:01:00 | 0:00:51 | 0:00:04 | 0:00:02 | 0:03:09 | 5099 | 1618 | XFAIL: num_compressed_xfail_reason - Regression: The number of int8 ops is different than reference 124 != 123 |
| tinyllama_int8_data_free | 0.95624 | -0.04376 | 311 | 0 | 0:00:03 | | | 0:00:02 | 0:02:25 | 5336 | 2499 | XFAIL: num_compressed_xfail_reason - Regression: The number of int8 ops is different than reference 312 != 311 |
| tinyllama_int4_data_free | 0.73873 | -0.26127 | 83 | 114 | 0:00:08 | | 0:00:04 | 0:00:02 | 0:02:16 | 5201 | 1990 | XFAIL: num_compressed_xfail_reason - Regression: The number of int8 ops is different than reference 84 != 83 |
| tinyllama_scale_estimation_per_channel | 0.80799 | -0.19201 | 123 | 188 | 0:02:31 | 0:00:55 | 0:00:04 | 0:00:01 | 0:04:40 | 5280 | 4012 | XFAIL: num_compressed_xfail_reason - Regression: The number of int8 ops is different than reference 124 != 123 |
| tinyllama_scale_estimation_group_size_64 | 0.8566 | -0.1434 | 123 | 94 | 0:03:21 | 0:00:57 | 0:00:04 | 0:00:02 | 0:05:29 | 5461 | 4287 | XFAIL: num_compressed_xfail_reason - Regression: The number of int8 ops is different than reference 124 != 123 |
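
For anyone cross-checking the results above, here is a minimal sketch that re-derives two patterns from the reported numbers. It assumes the status text reads `reference != measured` (the right-hand number matches the Num int8 column) and that Metric diff is measured against a reference value of 1.0. Both are inferences from the figures, not something stated in this PR, and the helper name `check_row` is hypothetical.

```python
# Values copied from the results table:
# (model, metric value, metric diff, num int8, mismatch text from the status)
rows = [
    ("tinyllama_data_aware", 0.85767, -0.14233, 123, "124 != 123"),
    ("tinyllama_int8_data_free", 0.95624, -0.04376, 311, "312 != 311"),
    ("tinyllama_int4_data_free", 0.73873, -0.26127, 83, "84 != 83"),
    ("tinyllama_scale_estimation_per_channel", 0.80799, -0.19201, 123, "124 != 123"),
    ("tinyllama_scale_estimation_group_size_64", 0.8566, -0.1434, 123, "124 != 123"),
]


def check_row(metric_value, metric_diff, num_int8, mismatch):
    """Return (diff matches value - 1.0, measured matches Num int8, reference - measured)."""
    # Assumption: the status message is formatted as "<reference> != <measured>".
    reference, measured = (int(part) for part in mismatch.split(" != "))
    # Assumption: the metric diff is taken against a reference metric of 1.0.
    diff_ok = abs((metric_value - 1.0) - metric_diff) < 1e-6
    return diff_ok, measured == num_int8, reference - measured


results = {name: check_row(v, d, n, m) for name, v, d, n, m in rows}
for name, outcome in results.items():
    print(name, outcome)
```

Under those assumptions every configuration is exactly one int8 op short of its reference count, which suggests a single shared cause rather than five unrelated regressions.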

@AlexanderDokuchaev AlexanderDokuchaev marked this pull request as ready for review February 21, 2025 01:35
@AlexanderDokuchaev AlexanderDokuchaev requested a review from a team as a code owner February 21, 2025 01:35