January 1, 2015

An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks

CLColin LeaApple (United States)GHGregory D. HagerJohns Hopkins University RVRené VidalCalifornia University of Pennsylvania

Key Points

Key points are not available for this paper at this time.

Abstract

Automated segmentation and recognition of fine-grained activities is important for enabling new applications in industrial automation, human-robot collaboration, and surgical training. Many existing approaches to activity recognition assume that a video has already been segmented and perform classification using an abstract representation based on spatio-temporal features. While some approaches perform joint activity segmentation and recognition, they typically suffer from a poor modeling of the transitions between actions and a representation that does not incorporate contextual information about the scene. In this paper, we propose a model for action segmentation and recognition that improves upon existing work in two directions. First, we develop a variation of the Skip-Chain Conditional Random Field that captures long-range state transitions between actions by using higher-order temporal relationships. Second, we argue that in constrained environments, where the relevant set of objects is known, it is better to develop features using high-level object relationships that have semantic meaning instead of relying on abstract features. We apply our approach to a set of tasks common for training in robotic surgery: suturing, knot tying, and needle passing, and show that our method increases micro and macro accuracy by 18.46% and 44.13% relative to the state of the art on a widely used robotic surgery dataset.

KI fragen

Bookmark

View Full Paper

Cite This Study

Lea et al. (Thu,) studied this question.

synapsesocial.com/papers/6a0fe50bd8c5cf602efd4f6b https://doi.org/https://doi.org/10.1109/wacv.2015.154

KI fragen

Bookmark

View Full Paper