Support for 4-bit quantization from the transformers library #1798
Comments
Honestly, it's updating to transformers 4.30, adding one other dependency package, and about eight changes in the code, if I recall correctly. Plus it works with multiple GPUs. Unfortunately I lost my changes from my running copy when I updated for the API changes, but I think most of the work is already done in my fork.
Contributions are welcome.
@merrymercy is this issue still open for contribution?
@02shanks absolutely!
@surak as this is my first code contribution, could you please guide me through the process? Where should I start?
Well, the usual: fork the repository, make your changes, and open a pull request. Nothing special, really!
@surak @merrymercy I have just created the PR. Can you please review it?
Loading Vicuna-13B with 4-bit quantization is possible through the transformers library's load_in_4bit option. How difficult would it be for FastChat to support it?
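For context, a minimal sketch of what such loading looks like in recent transformers (>= 4.30, with the bitsandbytes package installed); the model id and dtype choice below are assumptions for illustration, not part of any FastChat patch:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Sketch only: requires transformers >= 4.30 and bitsandbytes.
# The model id is an assumption for illustration.
model_id = "lmsys/vicuna-13b-v1.3"

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # store weights in 4-bit
    bnb_4bit_compute_dtype=torch.float16,  # run matmuls in fp16
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",  # shards the model across available GPUs
)
```

Passing the quantization config through from_pretrained like this is also what makes the multi-GPU case work, since device_map="auto" handles the sharding.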