Diffusion model-based inverse problem solvers (DIS) have emerged as state-of-the-art approaches for addressing inverse problems.
However, their application to video inverse problems arising from spatio-temporal degradation remains largely unexplored.
In response, we introduce an innovative video inverse solver using only image diffusion models.
Specifically, our method treats the time dimension of a video as the batch dimension of image diffusion models,
thereby solving spatio-temporal optimization problems within denoised spatio-temporal batches derived from each image diffusion model.
We address the batch inconsistency issue in diffusion models by controlling batch-stochasticity,
thereby enabling batch-consistent sampling.
|
Ours 😁 |
Video reconstruction |
Better Reconstruction Quality & Batch-Consistent Samples |
Memory effieciency |
Requires 13GB VRAM for 16-frame videos, maximum 32-frame in 24 GB VRAM. |
Accessibility |
Using open-sourced image diffusion model (ADM) |
Experimental results demonstrate that our method effectively addresses various temporal and spatial degradations in video inverse problems,
achieving state-of-the-art reconstructions.