3.8 Article

Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade

期刊

HEALTHCARE TECHNOLOGY LETTERS
卷 6, 期 6, 页码 275-279

出版社

WILEY
DOI: 10.1049/htl.2019.0064

关键词

object detection; medical image processing; image colour analysis; medical robotics; regression analysis; surgery; learning (artificial intelligence); convolutional neural nets; robot vision; convolutional neural network cascade; robot-assisted surgery videos; vision component; single-tool detection; cascading convolutional neural network; CNN; real-time multitool detection; hourglass network; modified VGG network; detection heatmaps; tool tip areas; bounding-box regression; authors; mainstream detection methods; RGB image frames; frame-by-frame detection method; deep learning methods; EndoVis Challenge dataset; ATLAS Dione dataset; real-time surgical instrument detection; real-time multi-tool detection

资金

  1. Specialised Research Fund for the Doctoral Program of Higher Education of China [20130131120036]
  2. Promotive Research Fund for Excellent Young and Middle-aged Scientists of Shandong Province [BS2013DX027]
  3. National Natural Science Foundation of China [81401543, 61273277]
  4. French National Research Agency (ANR) through TecSan Program (DEPORRA)

向作者/读者索取更多资源

Surgical instrument detection in robot-assisted surgery videos is an import vision component for these systems. Most of the current deep learning methods focus on single-tool detection and suffer from low detection speed. To address this, the authors propose a novel frame-by-frame detection method using a cascading convolutional neural network (CNN) which consists of two different CNNs for real-time multi-tool detection. An hourglass network and a modified visual geometry group (VGG) network are applied to jointly predict the localisation. The former CNN outputs detection heatmaps representing the location of tool tip areas, and the latter performs bounding-box regression for tool tip areas on these heatmaps stacked with input RGB image frames. The authors' method is tested on the publicly available EndoVis Challenge dataset and the ATLAS Dione dataset. The experimental results show that their method achieves better performance than mainstream detection methods in terms of detection accuracy and speed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据