Key points are not available for this paper at this time.
Ultrasound video segmentation gains increasing attention in clinical practice due to the redundant dynamic references in video frames. However, traditional convolutional neural networks have a limited receptive field and transformer-based networks are unsatisfactory in constructing long-term dependency from the perspective of computational complexity. This bottleneck poses a significant challenge when processing longer sequences in medical video analysis tasks using available devices with limited memory. Recently, state space models (SSMs), famous by Mamba, have exhibited linear complexity and impressive achievements in efficient long sequence modeling, which have developed deep neural networks by expanding the receptive field on many vision tasks significantly. Unfortunately, vanilla SSMs failed to simultaneously capture causal temporal cues and preserve non-casual spatial information. To this end, this paper presents a Video Vision Mamba-based framework, dubbed as Vivim, for ultrasound video segmentation tasks. Our Vivim can effectively compress the long-term spatiotemporal representation into sequences at varying scales with our designed Temporal Mamba Block. We also introduce an improved boundary-aware affine constraint across frames to enhance the discriminative ability of Vivim on ambiguous lesions. Extensive experiments on thyroid segmentation in ultrasound videos, breast lesion segmentation in ultrasound videos, and polyp segmentation in colonoscopy videos demonstrate the effectiveness and efficiency of our Vivim, superior to existing methods. The code and dataset are available at: https://github.com/scott-yjyang/Vivim.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yijun Yang
Hong Kong University of Science and Technology
Zhaohu Xing
Hong Kong University of Science and Technology
Lequan Yu
University of Hong Kong
IEEE Transactions on Circuits and Systems for Video Technology
University of Hong Kong
Chinese University of Hong Kong
Hong Kong University of Science and Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Yang et al. (Tue,) studied this question.
synapsesocial.com/papers/6a0fdd74d13714ec96feb7ec — DOI: https://doi.org/10.1109/tcsvt.2025.3563411
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: