4.6 Article

A Novel Approach for Polycystic Ovary Syndrome Prediction Using Machine Learning in Bioinformatics

Journal

IEEE ACCESS
Volume 10, Issue -, Pages 97610-97624

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2022.3205587

Keywords

Bioinformatics; data analysis; infertility; machine learning; pregnancy complications; polycystic ovary syndrome; PCOS prediction; syndrome classification

Funding

  1. University of Hafr Albatin, Saudi Arabia

Ask authors/readers for more resources

Polycystic ovary syndrome (PCOS) is a critical disorder in women during their reproduction phase, commonly caused by excess male hormone and androgen levels, resulting in complications such as miscarriage, infertility issues, and complications during pregnancy. This study aims to predict PCOS using advanced machine learning techniques and achieved satisfactory results through the proposed optimized chi-squared feature selection and gaussian naive bayes model.
Polycystic ovary syndrome (PCOS) is a critical disorder in women during their reproduction phase. The PCOS disorder is commonly caused by excess male hormone and androgen levels. The follicles are the collections of fluid developed by ovaries and may fail to release eggs regularly. The PCOS results in miscarriage, infertility issues, and complications during pregnancy. According to a recent report, PCOS is diagnosed in 31.3% of women from Asia. Studies show that 69% to 70% of women did not avail of a detecting cure for PCOS. A research study is needed to save women from critical complications by identifying PCOS early. The main aim of our research is to predict PCOS using advanced machine learning techniques. The dataset based on clinical and physical parameters of women is utilized for building study models. A novel feature selection approach is proposed based on the optimized chi-squared (CS-PCOS) mechanism. The ten hyper-parametrized machine learning models are applied in comparison. Using the novel CS-PCOS approach, the gaussian naive bayes (GNB) outperformed machine learning models and state-of-the-art studies. The GNB achieved 100% accuracy, precision, recall, and fl-scores with minimal time computations of 0.002 seconds. The k-fold cross-validation of GNB achieved a 100% accuracy score. The proposed GNB model achieved accurate results for critical PCOS prediction. Our study reveals that the dataset features prolactin (PRL), blood pressure systolic, blood pressure diastolic, thyroid stimulating hormone (TSH), relative risk (RR-breaths), and pregnancy are the prominent factors having high involvement in PCOS prediction. Our research study helps the medical community overcome the miscarriage rate and provide a cure to women through the early detection of PCOS.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available