[docs] Video generation update #10272

stevhliu · 2024-12-18T00:09:40Z

Updates video generation guide to showcase newer/modern models (CogVideoX, HunyuanVideo, LTX, Mochi-1).

- Add example generated outputs
- Link to the Quantization section (bnb, torchao, gguf) in the optimization section

HuggingFaceDocBuilderDev · 2024-12-18T00:16:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul

Thanks for starting this, Steven! I made a pass through the changes and left some comments.

Here are some higher-level comments:

- I think it might make sense to actually keep Stable Video Diffusion because of its quality.
- Show examples of video-to-video?
- Keep AnimateDiff as it's widely used and popular, perhaps under a new section "animation"?
- Add a note saying "users can explore other video generation models on the Hub" (and provide some links like this).

docs/source/en/using-diffusers/text-img2vid.md

sayakpaul · 2024-12-19T05:13:37Z



```diff
 ```py
 import torch
 from diffusers import CogVideoXImageToVideoPipeline
 from diffusers.utils import export_to_video, load_image

 prompt = "A vast, shimmering ocean flows gracefully under a twilight sky, its waves undulating in a mesmerizing dance of blues and greens. The surface glints with the last rays of the setting sun, casting golden highlights that ripple across the water. Seagulls soar above, their cries blending with the gentle roar of the waves. The horizon stretches infinitely, where the ocean meets the sky in a seamless blend of hues. Close-ups reveal the intricate patterns of the waves, capturing the fluidity and dynamic beauty of the sea in motion."
-image = load_image(image="cogvideox_rocket.png")
+image = load_image(image="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/cogvideox/cogvideox_rocket.png")
 pipe = CogVideoXImageToVideoPipeline.from_pretrained(
    "THUDM/CogVideoX-5b-I2V",
```


TODO: if we change to the latest checkpoint, this will need to be updated, too.

I don't think I have enough juice to generate a video from this latest checkpoint, would someone mind updating it for me?
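For anyone picking this up, here is a minimal sketch of what the full image-to-video call could look like on a memory-constrained machine. It reuses the checkpoint and image URL from the diff above; the function name `generate_video`, the output path, the seed, and the step, frame, guidance, and fps values are assumptions for illustration, not settings taken from this PR. The offloading and VAE tiling calls are there only to lower peak memory and can be dropped on a larger GPU.

```python
# Sketch only: argument values are illustrative assumptions, not settings from this PR.
MODEL_ID = "THUDM/CogVideoX-5b-I2V"  # checkpoint named in the diff above
IMAGE_URL = (
    "https://huggingface.co/datasets/huggingface/documentation-images/"
    "resolve/main/diffusers/cogvideox/cogvideox_rocket.png"
)


def generate_video(prompt, image_url=IMAGE_URL, output_path="output.mp4", seed=0):
    """Generate a short clip from one conditioning image and a text prompt.

    `generate_video` is a hypothetical helper, not part of diffusers.
    """
    # Heavy imports live inside the function so this module imports without torch installed.
    import torch
    from diffusers import CogVideoXImageToVideoPipeline
    from diffusers.utils import export_to_video, load_image

    image = load_image(image=image_url)
    pipe = CogVideoXImageToVideoPipeline.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16
    )

    # Reduce peak GPU memory: keep idle submodels on the CPU and decode the VAE in tiles.
    pipe.enable_model_cpu_offload()
    pipe.vae.enable_tiling()

    # A seeded generator makes the output reproducible across runs.
    generator = torch.Generator(device="cpu").manual_seed(seed)
    frames = pipe(
        prompt=prompt,
        image=image,
        num_inference_steps=50,  # assumed value
        num_frames=49,           # assumed value
        guidance_scale=6.0,      # assumed value
        generator=generator,
    ).frames[0]

    export_to_video(frames, output_path, fps=8)  # fps is an assumed value
    return output_path


# Example call (downloads the checkpoint and needs a GPU with enough memory):
# generate_video("A rocket lifting off into a twilight sky.")
```

If the guide switches to a newer checkpoint, `MODEL_ID` is the only value here that would have to change, although the example output would need regenerating as well.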


stevhliu · 2024-12-20T17:13:50Z

> I think it might make sense to actually keep Stable Video Diffusion because of its quality.

Done!

> Show examples of video-to-video?

I think it's okay to leave out video-to-video for now. The guide already mentions that CogVideoX supports image-to-video and video-to-video, and users can refer to the API doc example for it. I'm worried that showing too much would be overwhelming, especially given all the other models we also cover here.

> Keep AnimateDiff as it's widely used and popular, perhaps under a new section "animation"?

Done, but I kept it alongside the other video models. I don't think we need an entirely new section just for "animation", and the distinction probably doesn't matter much to users.

> Add a note saying "users can explore other video generation models on the Hub" (and provide some links like this).

Done!

sayakpaul · 2024-12-25T03:11:41Z

@DN6 @a-r-r-o-w could you give this a look?

stevhliu force-pushed the video-gen branch from bc76e02 to b5af8bb Compare December 18, 2024 19:31

stevhliu marked this pull request as ready for review December 18, 2024 19:40

stevhliu requested a review from sayakpaul December 18, 2024 19:40

sayakpaul reviewed Dec 19, 2024


sayakpaul requested review from DN6 and a-r-r-o-w December 19, 2024 05:23

stevhliu added 3 commits December 20, 2024 08:32

update

440580d

update

ac6939f

feedback

73c640f

stevhliu force-pushed the video-gen branch from aa641af to 73c640f Compare December 20, 2024 17:13

stevhliu and others added 2 commits December 20, 2024 09:25

fix videos

4addf37

Merge branch 'main' into video-gen

3854917
