Key points are not available for this paper at this time.
This paper considers the problem of action localization, where the objective is to determine when and where certain actions appear. We introduce a sampling strategy to produce 2D+t sequences of bounding boxes, called tubelets. Compared to state-of-the-art alternatives, this drastically reduces the number of hypotheses that are likely to include the action of interest. Our method is inspired by a recent technique introduced in the context of image localization. Beyond considering this technique for the first time for videos, we revisit this strategy for 2D+t sequences obtained from super-voxels. Our sampling strategy advantageously exploits a criterion that reflects how action related motion deviates from background motion. We demonstrate the interest of our approach by extensive experiments on two public datasets: UCF Sports and MSR-II. Our approach significantly outperforms the state-of-the-art on both datasets, while restricting the search of actions to a fraction of possible bounding box sequences.
Building similarity graph...
Analyzing shared references across papers
Loading...
Mihir Jain
Institut national de recherche en sciences et technologies du numérique
Jan van Gemert
Delft University of Technology
Hervé Jeǵou
Institut polytechnique de Grenoble
University of Amsterdam
Institut national de recherche en sciences et technologies du numérique
Building similarity graph...
Analyzing shared references across papers
Loading...
Jain et al. (Sun,) studied this question.
synapsesocial.com/papers/6a11c69381e48c4370dcd1b5 — DOI: https://doi.org/10.1109/cvpr.2014.100
Synapse has enriched 3 closely related papers on similar clinical questions. Consider them for comparative context: