3.8 Review

Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing

Journal

JCO CLINICAL CANCER INFORMATICS
Volume 6

Publisher

LIPPINCOTT WILLIAMS & WILKINS
DOI: 10.1200/CCI.22.00006

Funding

  1. National Institutes of Health (NIH) [1U01TR002062-01, U24CA194215-01A1]

This review assesses the use of natural language processing (NLP) in electronic health records (EHRs) for cancer research and patient care. The findings highlight the need for additional data elements beyond the Minimal Common Oncology Data Elements (mCODE) for comprehensive analysis and evaluation. The review also identifies challenges and barriers in the adoption of NLP methods for cancer research and patient care.

PURPOSE: The advancement of natural language processing (NLP) has promoted the use of detailed textual data in electronic health records (EHRs) to support cancer research and to facilitate patient care. In this review, we aim to assess EHRs for cancer research and patient care by using the Minimal Common Oncology Data Elements (mCODE), a community-driven effort to define a minimal set of data elements for cancer research and practice. Specifically, we aim to assess the alignment of NLP-extracted data elements with mCODE and to review existing NLP methodologies for extracting those data elements.

METHODS: The main literature databases were searched for cancer-related NLP articles written in English and published between January 2010 and September 2020. After retrieval, articles with EHRs as the data source were manually identified. A charting form was developed to analyze the relevant studies and was used to categorize data under four main topics: metadata, EHR data and targeted cancer types, NLP methodology, and oncology data elements and standards.

RESULTS: A total of 123 publications were ultimately selected and included in our analysis. As expected, we found that cancer research and patient care require some data elements beyond mCODE. Transparency and reproducibility of NLP methods are insufficient, and NLP evaluation is inconsistent.

CONCLUSION: We conducted a comprehensive review of cancer NLP for research and patient care using EHR data. Issues and barriers to the wide adoption of cancer NLP were identified and discussed.
