☆ 4.7 Article Proceedings Paper

Surgical gesture classification from video and kinematic data

MEDICAL IMAGE ANALYSIS (2013)

Journal

MEDICAL IMAGE ANALYSIS

Volume 17, Issue 7, Pages 732-745

Publisher

ELSEVIER

DOI: 10.1016/j.media.2013.04.007

Keywords

Surgical gesture classification; Time series classification; Dynamical system classification; Bag of features; Multiple kernel learning

Funding

Division Of Computer and Network Systems
Direct For Computer & Info Scie & Enginr [0931805] Funding Source: National Science Foundation
Office Of The Director
Office of Integrative Activities [0941362] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Much of the existing work on automatic classification of gestures and skill in robotic surgery is based on dynamic cues (e.g., time to completion, speed, forces, torque) or kinematic data (e.g., robot trajectories and velocities). While videos could be equally or more discriminative (e.g., videos contain semantic information not present in kinematic data), they are typically not used because of the difficulties associated with automatic video interpretation. In this paper, we propose several methods for automatic surgical gesture classification from video data. We assume that the video of a surgical task (e.g., suturing) has been segmented into video clips corresponding to a single gesture (e.g., grabbing the needle, passing the needle) and propose three methods to classify the gesture of each video clip. In the first one, we model each video clip as the output of a linear dynamical system (LDS) and use metrics in the space of LDSs to classify new video clips. In the second one, we use spatio-temporal features extracted from each video clip to learn a dictionary of spatio-temporal words, and use a bag-of-features (BoF) approach to classify new video clips. In the third one, we use multiple kernel learning (MKL) to combine the LDS and BoF approaches. Since the LDS approach is also applicable to kinematic data, we also use MKL to combine both types of data in order to exploit their complementarity. Our experiments on a typical surgical training setup show that methods based on video data perform equally well, if not better, than state-of-the-art approaches based on kinematic data. In turn, the combination of both kinematic and video data outperforms any other algorithm based on one type of data alone. (C) 2013 Elsevier B.V. All rights reserved.

Surgical gesture classification from video and kinematic data

Journal

MEDICAL IMAGE ANALYSIS

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Surgical gesture classification from video and kinematic data

Journal

MEDICAL IMAGE ANALYSIS

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper