4.5 Article

Creating rare epilepsy cohorts using keyword search in electronic health records

Journal

EPILEPSIA
Volume -, Issue -, Pages -

Publisher

WILEY
DOI: 10.1111/epi.17725

Keywords

automated; clinical data; cohort creation; genetic epilepsy; natural language processing

Ask authors/readers for more resources

This study evaluated the use of keyword search as an alternative method for identifying individuals with rare epilepsies in electronic health records. The results showed that keyword search was effective in identifying rare epilepsy cases and promoting their access to specialized care, clinical research, and support groups.
Objective Administrative codes to identify people with rare epilepsies in electronic health records are limited. The current study evaluated the use of keyword search as an alternative method for rare epilepsy cohort creation using electronic health records data.Methods Data included clinical notes from encounters with International Classification of Diseases, Ninth Revision (ICD-9) codes for seizures, epilepsy, and/or convulsions during 2010-2014, across six health care systems in New York City. We identified cases with rare epilepsies by searching clinical notes for keywords associated with 33 rare epilepsies. We validated cases via manual chart review. We compared the performance of keyword search to manual chart review using positive predictive value (PPV), sensitivity, and F-score. We selected an initial combination of keywords using the highest F-scores.Results Data included clinical notes from 77 924 cases with ICD-9 codes for seizures, epilepsy, and/or convulsions. The all-keyword search method identified 6095 candidates, and manual chart review confirmed that 2068 (34%) had a rare epilepsy. The initial combination method identified 1862 cases with a rare epilepsy, and this method performed as follows: PPV median = .64 (interquartile range [IQR] = .50-.81, range = .20-1.00), sensitivity median = .93 (IQR = .76-1.00, range = .10-1.00), and F-score median = .71 (IQR = .63-.85, range = .18-1.00). Using this method, we identified four cohorts of rare epilepsies with over 100 individuals, including infantile spasms, Lennox-Gastaut syndrome, Rett syndrome, and tuberous sclerosis complex. We identified over 50 individuals with two rare epilepsies that do not have specific ICD-10 codes for cohort creation (epilepsy with myoclonic atonic seizures, Sturge-Weber syndrome).Significance Keyword search is an effective method for cohort creation. These findings can improve identification and surveillance of individuals with rare epilepsies and promote their referral to specialty clinics, clinical research, and support groups.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available