4.7 Article

Web-Based Risk Prediction Tool for an Individual's Risk of HIV and Sexually Transmitted Infections Using Machine Learning Algorithms: Development and External Validation Study

Journal

JOURNAL OF MEDICAL INTERNET RESEARCH
Volume 24, Issue 8, Pages -

Publisher

JMIR PUBLICATIONS, INC
DOI: 10.2196/37850

Keywords

HIV; sexually transmitted infections; syphilis; gonorrhea; chlamydia; sexual health; sexual transmission; sexually transmitted; prediction; web-based; risk assessment; machine learning; model; algorithm; predictive; risk; development; validation

Funding

  1. Australian National Health and Medical Research Council [GNT1172873, GNT1193955, GNT1172900]
  2. National Natural Science Foundation of China [81950410639]
  3. Outstanding Young Scholars Support Program [3111500001]
  4. Xi'an Jiaotong University Basic Research and Profession Grant [xtr022019003, xzy032020032]
  5. Epidemiology modeling and risk assessment [20200344]
  6. Xi'an Jiaotong University [YX6J004]

Ask authors/readers for more resources

This study aimed to develop an HIV and STI risk prediction tool using machine learning algorithms. Clinic consultations and external validation data sets were used to develop and validate 34 machine learning models. The results showed that the machine learning-based risk prediction tool performed well and accurately predicted the risk of HIV and STIs.
Background: HIV and sexually transmitted infections (STIs) are major global public health concerns. Over 1 million curable STIs occur every day among people aged 15 years to 49 years worldwide. Insufficient testing or screening substantially impedes the elimination of HIV and STI transmission. Objective: The aim of our study was to develop an HIV and STI risk prediction tool using machine learning algorithms. Methods: We used clinic consultations that tested for HIV and STIs at the Melbourne Sexual Health Centre between March 2, 2015, and December 31, 2018, as the development data set (training and testing data set). We also used 2 external validation data sets, including data from 2019 as external validation data 1 and data from January 2020 and January 2021 as external validation data 2. We developed 34 machine learning models to assess the risk of acquiring HIV, syphilis, gonorrhea, and chlamydia. We created an online tool to generate an individual's risk of HIV or an STI. Results: The important predictors for HIV and STI risk were gender, age, men who reported having sex with men, number of casual sexual partners, and condom use. Our machine learning-based risk prediction tool, named MySTIRisk, performed at an acceptable or excellent level on testing data sets (area under the curve [AUC] for HIV=0.78; AUC for syphilis=0.84; AUC for gonorrhea=0.78; AUC for chlamydia=0.70) and had stable performance on both external validation data from 2019 (AUC for HIV=0.79; AUC for syphilis=0.85; AUC for gonorrhea=0.81; AUC for chlamydia=0.69) and data from 2020-2021 (AUC for HIV=0.71; AUC for syphilis=0.84; AUC for gonorrhea=0.79; AUC for chlamydia=0.69). Conclusions: Our web-based risk prediction tool could accurately predict the risk of HIV and STIs for clinic attendees using simple self-reported questions. MySTIRisk could serve as an HIV and STI screening tool on clinic websites or digital health platforms to encourage individuals at risk of HIV or an STI to be tested or start HIV pre-exposure prophylaxis. The public can use this tool to assess their risk and then decide if they would attend a clinic for testing. Clinicians or public health workers can use this tool to identify high-risk individuals for further interventions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available