Improving bag-of-features action recognition with non-local cues

Key Points

Key points are not available for this paper at this time.

Abstract

Local space-time features have recently shown promising results within Bag-of-Features (BoF) approach to action recognition in video. Pure local features and descriptors, however, provide only limited discriminative power implying ambiguity among features and sub-optimal classification performance. In this work, we propose to disambiguate local space-time features and to improve action recognition by integrating additional nonlocal cues with BoF representation. For this purpose, we decompose video into region classes and augment local features with corresponding region-class labels. In particular, we investigate unsupervised and supervised video segmentation using (i) motion-based foreground segmentation, (ii) person detection, (iii) static action detection and (iv) object detection. While such segmentation methods might be imperfect, they provide complementary region-level information to local features. We demonstrate how this information can be integrated with BoF representations in a kernel-combination framework. We evaluate our method on the recent and challenging Hollywood-2 action dataset and demonstrate significant improvements.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Muhammad Muneeb Ullah

University of the Sciences

Sobhan Naderi Parizi

Google (United States)

Ivan Laptev

Mohamed bin Zayed University of Artificial Intelligence

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Ullah et al. (Fri,) studied this question.

synapsesocial.com/papers/6a1589765347fbb1739fee72 — DOI: https://doi.org/10.5244/c.24.95

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Actions in context· 2009 · 1,192 citations
Recognizing human actions: A local SVM approach· 2004 · 2,946 citations
On feature combination for multiclass object classification· 2009 · 794 citations
Rapid object detection using a boosted cascade of simple features· 2005 · 18,220 citations
Video Google: a text retrieval approach to object matching in videos· 2003 · 6,434 citations

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Actions in context· 2009 · 1,192 citations
Recognizing human actions: A local SVM approach· 2004 · 2,946 citations
On feature combination for multiclass object classification· 2009 · 794 citations
Rapid object detection using a boosted cascade of simple features· 2005 · 18,220 citations
Video Google: a text retrieval approach to object matching in videos· 2003 · 6,434 citations

Improving bag-of-features action recognition with non-local cues

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider