Key points are not available for this paper at this time.
Video object segmentation uses the first annotated video mask to achieve consistent and precise segmentation in subsequent frames. Recently, memory-based methods have received significant attention owing to their substantial performance enhancements. However, these approaches rely on a fixed global memory strategy, which poses a challenge to segmentation accuracy and speed in the context of longer videos. To alleviate this limitation, we propose a novel semi-supervised video object segmentation model, founded on the principles of the adaptive memory network. Our proposed model adaptively extracts object features by focusing on the object area while effectively filtering out extraneous background noise. An identification mechanism is also thoughtfully applied to discern each object in multiobject scenarios. To further reduce storage consumption without compromising the saliency of object information, the outdated features residing in the memory pool are compressed into salient features through the employment of a self-attention mechanism. Furthermore, we introduce a local matching module, specifically devised to refine object features by fusing the contextual information from historical frames. We demonstrate the efficiency of our approach through experiments, substantially augmenting both the speed and precision of segmentation for long-term videos, while maintaining comparable performance for short videos. The source code is available at https://github.com/GQLi-cong/AMN .
Building similarity graph...
Analyzing shared references across papers
Loading...
Shan Zhong
Tongji University
Guoqiang Li
Henan Cancer Hospital
Wenhao Ying
Fudan University
IEEE Transactions on Cognitive and Developmental Systems
Soochow University
Jilin University
Jiangxi Normal University
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhong et al. (Mon,) studied this question.
synapsesocial.com/papers/68e6fffcb6db64358767a0bd — DOI: https://doi.org/10.1109/tcds.2024.3385849