Los puntos clave no están disponibles para este artículo en este momento.
Spatial pooling has been proven highly effective to capture long-range contextual information for pixel-wise prediction tasks, such as scene parsing. In this paper, beyond conventional spatial pooling that usually has a regular shape of NxN, we rethink the formulation of spatial pooling by introducing a new pooling strategy, called strip pooling, which considers a long but narrow kernel, i.e., 1xN or Nx1. Based on strip pooling, we further investigate spatial pooling architecture design by 1) introducing a new strip pooling module that enables backbone networks to efficiently model long-range dependencies; 2) presenting a novel building block with diverse spatial pooling as a core; and 3) systematically comparing the performance of the proposed strip pooling and conventional spatial pooling techniques. Both novel pooling-based designs are lightweight and can serve as an efficient plug-and-play modules in existing scene parsing networks. Extensive experiments on Cityscapes and ADE20K benchmarks demonstrate that our simple approach establishes new state-of-the-art results. Code is available at https://github.com/Andrew-Qibin/SPNet.
Building similarity graph...
Analyzing shared references across papers
Loading...
Qibin Hou
Nankai University
Li Zhang
The University of Texas Rio Grande Valley
Ming–Ming Cheng
Oxford Brookes University
University of Oxford
National University of Singapore
Nankai University
Building similarity graph...
Analyzing shared references across papers
Loading...
Hou et al. (Mon,) studied this question.
synapsesocial.com/papers/69d80a0405ee2ba81dbeec85 — DOI: https://doi.org/10.1109/cvpr42600.2020.00406
Synapse has enriched 4 closely related papers on similar clinical questions. Consider them for comparative context: