Standardise outputs for video pipelines #6626

DN6 · 2024-01-18T09:07:56Z

What does this PR do?

This PR

Standardises the output for the following pipelines by making them all use the same tensor2vid function to postprocess the 3D tensor output with shape (batch size, channels, num frames, height, width).

TextToVideoSDPipeline
AnimateDiff
StableVideoDiffusionPipeline

Updates TextToVideoSDPipeline tests so that shapes of expected numpy array match the output from new postprocessing.

TextToVideoZeroPipeline and TextToVideoZeroSDXLPipeline are excluded because they make use of 2D UNets, and produce an output with shape (batch size, channels, height, width) where the batch size == number of frames generated.

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

src/diffusers/pipelines/animatediff/pipeline_animatediff.py

patrickvonplaten · 2024-01-19T09:24:22Z

src/diffusers/pipelines/stable_video_diffusion/pipeline_stable_video_diffusion.py

@@ -40,10 +40,8 @@ def _append_dims(x, target_dims):
    return x[(...,) + (None,) * dims_to_append]


+# Copied from diffusers.pipelines.animatediff.pipeline_animatediff.tensor2vid


patrickvonplaten

Nice clean-up!

* begin animatediff img2video and video2video * revert animatediff to original implementation * add img2video as pipeline * update * add vid2vid pipeline * update imports * update * remove copied from line for check_inputs * update * update examples * add multi-batch support * fix __init__.py files * move img2vid to community * update community readme and examples * fix * make fix-copies * add vid2vid batch params * apply suggestions from review Co-Authored-By: Dhruv Nair <dhruv.nair@gmail.com> * add test for animatediff vid2vid * torch.stack -> torch.cat Co-Authored-By: Dhruv Nair <dhruv.nair@gmail.com> * make style * docs for vid2vid * update * fix prepare_latents * fix docs * remove img2vid * update README to :main * remove slow test * refactor pipeline output * update docs * update docs * merge community readme from :main * final fix i promise * add support for url in animatediff example * update example * update callbacks to latest implementation * Update src/diffusers/pipelines/animatediff/pipeline_animatediff_video2video.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/animatediff/pipeline_animatediff_video2video.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix merge * Apply suggestions from code review * remove callback and callback_steps as suggested in review * Update tests/pipelines/animatediff/test_animatediff_video2video.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix import error caused due to unet refactor in #6630 * fix numpy import error after tensor2vid refactor in #6626 * make fix-copies * fix numpy error * fix progress bar test --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update * update * update * update * update * update * update * clean up * clean up

* begin animatediff img2video and video2video * revert animatediff to original implementation * add img2video as pipeline * update * add vid2vid pipeline * update imports * update * remove copied from line for check_inputs * update * update examples * add multi-batch support * fix __init__.py files * move img2vid to community * update community readme and examples * fix * make fix-copies * add vid2vid batch params * apply suggestions from review Co-Authored-By: Dhruv Nair <dhruv.nair@gmail.com> * add test for animatediff vid2vid * torch.stack -> torch.cat Co-Authored-By: Dhruv Nair <dhruv.nair@gmail.com> * make style * docs for vid2vid * update * fix prepare_latents * fix docs * remove img2vid * update README to :main * remove slow test * refactor pipeline output * update docs * update docs * merge community readme from :main * final fix i promise * add support for url in animatediff example * update example * update callbacks to latest implementation * Update src/diffusers/pipelines/animatediff/pipeline_animatediff_video2video.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/animatediff/pipeline_animatediff_video2video.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix merge * Apply suggestions from code review * remove callback and callback_steps as suggested in review * Update tests/pipelines/animatediff/test_animatediff_video2video.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix import error caused due to unet refactor in huggingface#6630 * fix numpy import error after tensor2vid refactor in huggingface#6626 * make fix-copies * fix numpy error * fix progress bar test --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

DN6 added 7 commits January 17, 2024 12:39

update

f3420ed

Merge branch 'main' into vid-pipe-output

2f91d45

update

f2f58d1

update

e601a99

update

bd1ac23

update

1d27c52

update

b4616b0

DN6 requested a review from patrickvonplaten January 18, 2024 12:48

patrickvonplaten reviewed Jan 19, 2024

View reviewed changes

src/diffusers/pipelines/animatediff/pipeline_animatediff.py Outdated Show resolved Hide resolved

patrickvonplaten reviewed Jan 19, 2024

View reviewed changes

src/diffusers/pipelines/animatediff/pipeline_animatediff.py Outdated Show resolved Hide resolved

patrickvonplaten reviewed Jan 19, 2024

View reviewed changes

patrickvonplaten approved these changes Jan 19, 2024

View reviewed changes

DN6 added 3 commits January 22, 2024 08:22

update

c765e86

clean up

623d448

clean up

341f48d

DN6 merged commit 6620eda into main Jan 23, 2024
16 checks passed

a-r-r-o-w added a commit to a-r-r-o-w/diffusers that referenced this pull request Jan 23, 2024

fix numpy import error after tensor2vid refactor in huggingface#6626

032c24f

DN6 mentioned this pull request Jan 23, 2024

export_to_video issue #6681

Closed

DN6 mentioned this pull request Jan 26, 2024

Update export to video to support new tensor_to_vid function in video pipelines #6715

Merged

6 tasks

AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024

Standardise outputs for video pipelines (huggingface#6626)

c49d8b9

* update * update * update * update * update * update * update * clean up * clean up

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standardise outputs for video pipelines #6626

Standardise outputs for video pipelines #6626

DN6 commented Jan 18, 2024

patrickvonplaten Jan 19, 2024

patrickvonplaten left a comment

		@@ -40,10 +40,8 @@ def _append_dims(x, target_dims):
		return x[(...,) + (None,) * dims_to_append]


		# Copied from diffusers.pipelines.animatediff.pipeline_animatediff.tensor2vid

Standardise outputs for video pipelines #6626

Standardise outputs for video pipelines #6626

Conversation

DN6 commented Jan 18, 2024

What does this PR do?

Before submitting

Who can review?

patrickvonplaten Jan 19, 2024

Choose a reason for hiding this comment

patrickvonplaten left a comment

Choose a reason for hiding this comment