4.4 Article

Virtual Reality vs Dry Laboratory Models: Comparing Automated Performance Metrics and Cognitive Workload During Robotic Simulation Training

期刊

JOURNAL OF ENDOUROLOGY
卷 35, 期 10, 页码 1571-1576

出版社

MARY ANN LIEBERT, INC
DOI: 10.1089/end.2020.1037

关键词

robotics; education; simulation; performance metrics; virtual reality; cognitive workload

向作者/读者索取更多资源

This study compared surgical performance in VR and DL environments for VUA tasks and found moderate to strong correlations between performance metrics in the two platforms, indicating transferability of skills. Automated performance metrics in DL tasks were better at distinguishing expertise compared to VR-generated metrics.
Background: This study compares surgical performance during analogous vesico-urethral anastomosis (VUA) tasks in two robotic training environments, virtual reality (VR) and dry laboratory (DL), to investigate transferability of skill assessment across the two platforms. Utilizing computer-generated performance metrics and pupillary data, we evaluated the two environments to distinguish surgical expertise and ultimately whether performance in the VR simulation correlates with performance in live robotic surgery in the DL. Materials and Methods: Experts (>= 300 cases) and trainees (<300 cases) performed analogous VUAs during VR and DL sessions on a da Vinci robotic console following an Institutional Review Board (IRB) approved protocol (HS-16-00318). Twenty-two metrics were generated in each environment (kinematic metrics, tissue metrics, and biometrics). The DL included 18 previously validated automated performance metrics (APMs) (kinematics and event metrics) captured by an Intuitive system data recorder. In both settings, Tobii Pro Glasses 2 recorded the task-evoked pupillary response (reported as Index of Cognitive Activity [ICA]) to indicate cognitive workload, analyzed by EyeTracking cognitive workload software. Pearson correlation, Mann-Whitney, and independent t-tests were used for the comparative analyses. Results: Our study included six experts (median caseload 1300 [interquartile range 400-3000]) and 11 trainees (25 [0-250]). A total of 8/9 metrics directly comparable between VR and DL showed significant positive correlation (r0.554, p <= 0.032); 5/22 VR metrics distinguished expertise, including task time (p=0.031), clutch usage (p=0.040), unnecessary needle piercing (p=0.026), and suspected injury to the endopelvic fascia (p=0.040). This contrasts with 14/22 APMs in DL (p <= 0.038), including linear velocities of all three instruments (p <= 0.038) and dominant-hand instrument wrist articulation (p=0.013). Trainees experienced higher cognitive workload (ICA) in both environments when compared with experts (p<0.036). Conclusions: Most performance metrics between VR and DL exhibited moderate to strong correlations, showing transferability of skills across the platforms. Comparing training environments, APMs during DL tasks are better able to distinguish expertise than VR-generated metrics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据