☆ 4.5 Article

Crowd-Sourced Assessment of Technical Skills: a novel method to evaluate surgical performance

JOURNAL OF SURGICAL RESEARCH (2014)

Journal

JOURNAL OF SURGICAL RESEARCH

Volume 187, Issue 1, Pages 65-71

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.jss.2013.09.024

Keywords

Crowdsourcing; Robotic surgery; OSATS; GEARS; Education; Training

Funding

NSF Graduate Research Fellowship in Computer Science [DGE-0718124]
Clinician Scientist Award from the National Institute of Health Research, UK [NIHR/CS/099/001]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Background: Validated methods of objective assessments of surgical skills are resource intensive. We sought to test a web-based grading tool using crowdsourcing called Crowd-Sourced Assessment of Technical Skill. Materials and methods: Institutional Review Board approval was granted to test the accuracy of Amazon.com's Mechanical Turk and Facebook crowdworkers compared with experienced surgical faculty grading a recorded dry-laboratory robotic surgical suturing performance using three performance domains from a validated assessment tool. Assessor free-text comments describing their rating rationale were used to explore a relationship between the language used by the crowd and grading accuracy. Results: Of a total possible global performance score of 3-15, 10 experienced surgeons graded the suturing video at a mean score of 12.11 (95% confidence interval [CI], 11.11 -13.11). Mechanical Turk and Facebook graders rated the video at mean scores of 12.21 (95% CI, 11.98-12.43) and 12.06 (95% CI, 11.57-12.55), respectively. It took 24 h to obtain responses from 501 Mechanical Turk subjects, whereas it took 24 d for 10 faculty surgeons to complete the 3-min survey. Facebook subjects (110) responded within 25 d. Language analysis indicated that crowdworkers who used negation words (i.e., but, although, and so forth) scored the performance more equivalently to experienced surgeons than crowdworkers who did not (P < 0.00001). Conclusions: For a robotic suturing performance, we have shown that surgery-naive crowdworkers can rapidly assess skill equivalent to experienced faculty surgeons using Crowd-Sourced Assessment of Technical Skill. It remains to be seen whether crowds can discriminate different levels of skill and can accurately assess human surgery performances. (C) 2014 Elsevier Inc. All rights reserved.

Crowd-Sourced Assessment of Technical Skills: a novel method to evaluate surgical performance

Journal

JOURNAL OF SURGICAL RESEARCH

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Crowd-Sourced Assessment of Technical Skills: a novel method to evaluate surgical performance

Journal

JOURNAL OF SURGICAL RESEARCH

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper