4.7 Article Proceedings Paper

Long-Range Augmented Reality with Dynamic Occlusion Rendering

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TVCG.2021.3106434

Keywords

Rendering (computer graphics); Cognition; Cameras; Real-time systems; Image segmentation; Estimation; Navigation; Augmented reality; occlusion reasoning; depth inference; object tracking

Funding

  1. Office of Naval Research (ONR) [N00014-19-C-2025]
  2. ONR

Ask authors/readers for more resources

This paper addresses the challenge of fast and accurate dynamic occlusion reasoning for large scale outdoor AR applications. The proposed method involves utilizing instance segmentation and depth estimation to infer the metric distance of real objects in the scene, minimizing latency in occlusion rendering. The solution is implemented in a low latency real-time framework and can be extended to optical-see-through AR.
Proper occlusion based rendering is very important to achieve realism in all indoor and outdoor Augmented Reality (AR) applications. This paper addresses the problem of fast and accurate dynamic occlusion reasoning by real objects in the scene for large scale outdoor AR applications. Conceptually, proper occlusion reasoning requires an estimate of depth for every point in augmented scene which is technically hard to achieve for outdoor scenarios, especially in the presence of moving objects. We propose a method to detect and automatically infer the depth for real objects in the scene without explicit detailed scene modeling and depth sensing (e.g. without using sensors such as 3D-LiDAR). Specifically, we employ instance segmentation of color image data to detect real dynamic objects in the scene and use either a top-down terrain elevation model or deep learning based monocular depth estimation model to infer their metric distance from the camera for proper occlusion reasoning in real time. The realized solution is implemented in a low latency real-time framework for video-see-though AR and is directly extendable to optical-see-through AR. We minimize latency in depth reasoning and occlusion rendering by doing semantic object tracking and prediction in video frames.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available