Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Python-level Linear Modularization. #24

Open
Artiprocher opened this issue Nov 14, 2024 · 1 comment
Open

Python-level Linear Modularization. #24

Artiprocher opened this issue Nov 14, 2024 · 1 comment

Comments

@Artiprocher
Copy link

Hello, I am the developer of DiffSynth-Studio, another diffusion engine. We are very surprised by the amazing speedup achieved by SVDQuant. We hope to bring this technology to more tech enthusiasts.

We noticed that many model structures in the Nunchaku engine are written in C++, which makes it very difficult for us to integrate them into other projects. Based on our understanding of the SVDQuant paper, the key acceleration happens in the Linear layer. Could you provide a pre-compiled quantized Linear layer module? This would make it easier for developers to integrate it with existing technologies at the Python level.

We look forward to your reply. If you can provide such a module, we are willing to promote it to more open-source projects.

Zhongjie Duan
DiffSynth Team, ModelScope
Best regards.

@lmxyy
Copy link
Collaborator

lmxyy commented Nov 16, 2024

Of course. We are still cleaning and improving the codebase and modularization is in our TODO list. I will let you know when getting it done.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants