☆ 4.7 Article

A geometry-aware deep network for depth estimation in monocular endoscopy

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

Journal

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Volume 122, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.engappai.2023.105989

Keywords

Geometry-aware; Deep learning; Depth estimation; Endoscopy

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Monocular depth estimation is critical for spatial perception and 3D navigation in surgery. Existing methods often neglect geometric structural consistency, resulting in performance degradation and distorted 3D reconstruction. To address this, we propose a gradient loss, a normal loss, and a geometric consistency loss to improve depth estimation and anatomical structure reconstruction.

Monocular depth estimation is critical for endoscopists to perform spatial perception and 3D navigation of surgical sites. However, most of the existing methods ignore the important geometric structural consistency, which inevitably leads to performance degradation and distortion of 3D reconstruction. To address this issue, we introduce a gradient loss to penalize edge fluctuations ambiguous around stepped edge structures and a normal loss to explicitly express the sensitivity to frequently small structures, and propose a geometric consistency loss to spreads the spatial information across the sample grids to constrain the global geometric anatomy structures. In addition, we develop a synthetic RGB-Depth dataset that captures the anatomical structures under reflections and illumination variations. The proposed method is extensively validated across different datasets and clinical images and achieves mean RMSE values of 0.066 (stomach), 0.029 (small intestine), and 0.139 (colon) on the EndoSLAM dataset. The generalizability of the proposed method achieves mean RMSE values of 12.604 (T1-L1), 9.930 (T2-L2), and 13.893 (T3-L3) on the ColonDepth dataset. The experimental results show that our method exceeds previous state-of-the-art competitors and generates more consistent depth maps and reasonable anatomical structures. The quality of intraoperative 3D structure perception from endoscopic videos of the proposed method meets the accuracy requirements of video-CT registration algorithms for endoscopic navigation. The dataset and the source code will be available at https://github.com/YYM-SIA/LINGMI-MR.

A geometry-aware deep network for depth estimation in monocular endoscopy

Journal

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A geometry-aware deep network for depth estimation in monocular endoscopy

Journal

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper