4.6 Article

Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations

Journal

BMJ OPEN
Volume 11, Issue 12, Pages -

Publisher

BMJ PUBLISHING GROUP
DOI: 10.1136/bmjopen-2021-049249

Keywords

mental health; psychiatry; health informatics

Funding

  1. National Institute for Health Research (NIHR) Biomedical Research Centre at the South London and Maudsley NHS Foundation Trust
  2. King's College London
  3. Medical Research Council (MRC) Mental Health Data Pathfinder Award
  4. NIHR Senior Investigator Award
  5. National Institute for Health Research (NIHR) Applied Research Collaboration South London (NIHR ARC South London) at King's College Hospital NHS Foundation Trust

Ask authors/readers for more resources

The study aimed to investigate the distribution and content of quoted text in electronic health records (EHRs), revealing that patients receiving mental healthcare who have been hospitalized or are of black ethnicity are more likely to have quoted text. Word embeddings trained on early psychosis intervention records proved useful in categorising small subsets of clinical records represented by one-word quotations, indicating the potential for systematic bias in clinical attention.
Objective To investigate the distribution and content of quoted text within the electronic health records (EHRs) using a previously developed natural language processing tool to generate a database of quotations. Design chi(2) and logistic regression were used to assess the profile of patients receiving mental healthcare for whom quotations exist. K-means clustering using pre-trained word embeddings developed on general discharge summaries and psychosis specific mental health records were used to group one-word quotations into semantically similar groups and labelled by human subjective judgement. Setting EHRs from a large mental healthcare provider serving a geographic catchment area of 1.3 million residents in South London. Participants For analysis of distribution, 33 499 individuals receiving mental healthcare on 30 June 2019 in South London and Maudsley. For analysis of content, 1587 unique lemmatised words, appearing a minimum of 20 times on the database of quotations created on 16 January 2020. Results The strongest individual indicator of quoted text is inpatient care in the preceding 12 months (OR 9.79, 95% CI 7.84 to 12.23). Next highest indicator is ethnicity with those with a black background more likely to have quoted text in comparison to white background (OR 2.20, 95% CI 2.08 to 2.33). Both are attenuated slightly in the adjusted model. Early psychosis intervention word embeddings subjectively produced categories pertaining to: mental illness, verbs, negative sentiment, people/relationships, mixed sentiment, aggression/violence and negative connotation. Conclusions The findings that inpatients and those from a black ethnic background more commonly have quoted text raise important questions around where clinical attention is focused and whether this may point to any systematic bias. Our study also shows that word embeddings trained on early psychosis intervention records are useful in categorising even small subsets of the clinical records represented by one-word quotations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available