[docs] Quantization tip #10249

stevhliu · 2024-12-16T22:59:07Z

Follows up on the discussion about adding a quantization section for big models. Instead of adding it to both the individual model doc (for example, MochiTransformer3DModel) and the pipeline doc, I think it's cleaner and more discoverable to add it only to the pipeline doc (for example, MochiPipeline).

Let me know if this works for you, and then I can add it to the other big models!

HuggingFaceDocBuilderDev · 2024-12-16T23:05:47Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul

I left two broad comments and your plan looks perfect to me!

@a-r-r-o-w WDYT?

docs/source/en/api/pipelines/mochi.md

sayakpaul · 2024-12-20T03:31:27Z

I think the following classes are remaining:

We could either open this up to the community or tackle it ourselves in this PR.

sayakpaul

Thanks for the changes!

Two broad comments. Once they are addressed I think we should be good to ship this 🚀

docs/source/en/api/pipelines/aura_flow.md

docs/source/en/api/pipelines/cogvideox.md

stevhliu · 2024-12-20T16:29:51Z

Thanks @sayakpaul! I included the latest batch of pipelines (Allegro, Latte, LTX, etc.) in this PR as well 🙂

stevhliu requested a review from sayakpaul December 16, 2024 23:16

sayakpaul reviewed Dec 17, 2024

docs/source/en/api/pipelines/mochi.md (outdated, resolved)

docs/source/en/api/pipelines/mochi.md (resolved)

stevhliu force-pushed the quant-tip branch from 1e699e0 to 3d646a1 Compare December 18, 2024 18:21

stevhliu requested a review from sayakpaul December 19, 2024 16:41

sayakpaul reviewed Dec 20, 2024

docs/source/en/api/pipelines/aura_flow.md (outdated, resolved)

docs/source/en/api/pipelines/cogvideox.md (outdated, resolved)

stevhliu added 4 commits December 20, 2024 08:28

quantization (704dbb4)

add other vid models (f6cb65c)

typo (b76aba1)

more pipelines (c6f4016)

stevhliu force-pushed the quant-tip branch from 2ac0b7f to c6f4016 Compare December 20, 2024 16:28
