We propose SSD-Edit, the first instruction-based image and video editing framework built on Mamba-2 diffusion. No prior work has applied any Mamba variant to instruction-following editing. SSD-Edit introduces Source-Instruction Dual-Stream SSD, Selective Edit Masking via State Divergence for automatic edit localization, and Temporal Edit Propagation for consistent video editing.
Hiroki Abe (Thu,) studied this question.