4.8 Review

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

Journal

CANCER RESEARCH
Volume 79, Issue 21, Pages 5463-5470

Publisher

AMER ASSOC CANCER RESEARCH
DOI: 10.1158/0008-5472.CAN-19-0579

Keywords

-

Categories

Funding

  1. NCI [U24CA184407, U01CA231840]
  2. LM [LM 10090]
  3. NIGMS [R01GM114355]
  4. Joint Design of Advanced Computing Solutions for Cancer (JDACS4C) program
  5. National Cancer Institute (NCI) of National Institutes of Health
  6. U.S. Department of Energy by Argonne National Laboratory [DE-AC02-06-CH11357]
  7. U.S. Department of Energy by Lawrence Livermore National Laboratory [DE-AC52-07NA27344]
  8. U.S. Department of Energy by Los Alamos National Laboratory [DE-AC5206NA25396]
  9. U.S. Department of Energy by Oak Ridge National Laboratory [DE-AC05-00OR22725]
  10. U.S. Department of Energy [DE-AC05-00OR22725]

Ask authors/readers for more resources

Current models for correlating electronic medical records with -omics data largely ignore clinical text, which is an important source of phenotype information for patients with cancer. This data convergence has the potential to reveal new insights about cancer initiation, progression, metastasis, and response to treatment. Insights from this real-world data will catalyze clinical care, research, and regulatory activities. Natural language processing (NLP) methods are needed to extract these rich cancer phenotypes from clinical text. Here, we review the advances of NLP and information extraction methods relevant to oncology based on publications from PubMed as well as NLP and machine learning conference proceedings in the last 3 years. Given the interdisciplinary nature of the fields of oncology and information extraction, this analysis serves as a critical trail marker on the path to higher fidelity oncology phenotypes from real-world data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available