Key points are not available for this paper at this time.
Automated segmentation and recognition of fine-grained activities is important for enabling new applications in industrial automation, human-robot collaboration, and surgical training. Many existing approaches to activity recognition assume that a video has already been segmented and perform classification using an abstract representation based on spatio-temporal features. While some approaches perform joint activity segmentation and recognition, they typically suffer from a poor modeling of the transitions between actions and a representation that does not incorporate contextual information about the scene. In this paper, we propose a model for action segmentation and recognition that improves upon existing work in two directions. First, we develop a variation of the Skip-Chain Conditional Random Field that captures long-range state transitions between actions by using higher-order temporal relationships. Second, we argue that in constrained environments, where the relevant set of objects is known, it is better to develop features using high-level object relationships that have semantic meaning instead of relying on abstract features. We apply our approach to a set of tasks common for training in robotic surgery: suturing, knot tying, and needle passing, and show that our method increases micro and macro accuracy by 18.46% and 44.13% relative to the state of the art on a widely used robotic surgery dataset.
Building similarity graph...
Analyzing shared references across papers
Loading...
Colin Lea
Apple (United States)
Gregory D. Hager
Johns Hopkins University
René Vidal
California University of Pennsylvania
Johns Hopkins University
Building similarity graph...
Analyzing shared references across papers
Loading...
Lea et al. (Thu,) studied this question.
synapsesocial.com/papers/6a0fe50bd8c5cf602efd4f6b — DOI: https://doi.org/10.1109/wacv.2015.154
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: