
One of our main innovations is a new conditional generation method for unconditional diffusion models. Our new conditioning method, which we refer to as the gradient method, modifies the sampling procedure of the model to improve a conditioning loss on denoised data using gradient-based optimization. We find that the gradient method is better than existing methods at ensuring that the generated samples are consistent with the conditioning information.
We use the gradient method to autoregressively extend our models to more timesteps and higher resolutions.
Frames from our gradient method (left) and a baseline "replacement" method (right) for autoregressive extension. Videos sampled using the gradient method attain superior temporal coherence compared to the baseline method.
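To make the idea behind the gradient method concrete, here is a minimal sketch in NumPy. It is not the authors' exact update rule: the linear `toy_denoiser`, the matrix `W`, the index list `cond_idx` and the `step_size` parameter are all hypothetical stand-ins chosen so that the gradient can be written by hand. The sketch only illustrates the general pattern described above: measure a conditioning loss on the denoised estimate, then move the noisy sample along the negative gradient of that loss.

```python
import numpy as np

def toy_denoiser(z, W):
    """Hypothetical stand-in for a trained denoising network: x_hat = W @ z."""
    return W @ z

def conditioning_loss(z, W, cond_idx, x_cond):
    """Squared error between the known entries and the denoised estimate."""
    residual = x_cond - toy_denoiser(z, W)[cond_idx]
    return float(residual @ residual)

def conditioning_grad(z, W, cond_idx, x_cond):
    """Analytic gradient of conditioning_loss with respect to z.

    This closed form holds only because the toy denoiser is linear; a real
    network would need automatic differentiation here.
    """
    residual = x_cond - toy_denoiser(z, W)[cond_idx]
    return -2.0 * W[cond_idx].T @ residual

def gradient_guided_step(z, W, cond_idx, x_cond, step_size):
    """One gradient-descent step on z that lowers the conditioning loss."""
    return z - step_size * conditioning_grad(z, W, cond_idx, x_cond)
```

Because the toy denoiser mixes all coordinates of `z`, the gradient also changes the entries of `z` that are not directly conditioned on. In a video setting, those entries would correspond to the frames being generated, which is how the known frames influence them. In an actual sampler, a step like this would be interleaved with the usual reverse-diffusion updates.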
We show that high quality videos can be generated by essentially the standard formulation of the Gaussian diffusion model, with little modification other than straightforward architectural changes to accommodate video data within memory constraints of deep learning accelerators. We train models that generate a block of a fixed number of frames of a video, and to generate videos longer than that number of frames, we additionally show how to repurpose a trained model to act as a model which is block-autoregressive over frames. We test our methods on an unconditional video generation benchmark, where we achieve state-of-the-art sample quality scores, and we also show promising results on text-conditioned video generation.
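The block-autoregressive extension described above can be sketched as a simple loop. This is an illustration under stated assumptions, not the authors' implementation: `sample_block` is a hypothetical placeholder (a random walk) standing in for conditional sampling from a trained fixed-length video model, and `n_cond`, the number of trailing frames reused as conditioning, is an assumed parameter.

```python
import numpy as np

def sample_block(cond_frames, block_len, rng):
    """Hypothetical conditional sampler returning block_len frames.

    The first len(cond_frames) frames are the conditioning frames themselves;
    the rest are a toy random-walk continuation of the last conditioning
    frame. A real system would sample these from the diffusion model.
    """
    n_new = block_len - len(cond_frames)
    steps = rng.normal(scale=0.1, size=(n_new,) + cond_frames.shape[1:])
    new_frames = cond_frames[-1] + np.cumsum(steps, axis=0)
    return np.concatenate([cond_frames, new_frames], axis=0)

def extend_video(first_block, total_frames, n_cond, rng):
    """Extend a fixed-length block to total_frames, one block at a time."""
    block_len = len(first_block)
    if not 1 <= n_cond < block_len:
        raise ValueError("n_cond must satisfy 1 <= n_cond < block length")
    video = first_block
    while len(video) < total_frames:
        # Condition on the most recent frames, keep only the newly sampled ones.
        block = sample_block(video[-n_cond:], block_len, rng)
        video = np.concatenate([video, block[n_cond:]], axis=0)
    return video[:total_frames]
```

For example, calling `extend_video(first_block, total_frames=10, n_cond=2, rng=np.random.default_rng(0))` on a 4-frame block appends two new frames per iteration until ten frames exist. The choice of `n_cond` trades off how much context each new block sees against how many new frames each sampling pass produces.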





