4.7 Article

Cell classification with worse-case boosting for intelligent cervical cancer screening

Journal

MEDICAL IMAGE ANALYSIS
Volume 91, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.media.2023.103014

Keywords

Worse-case boosting; Underrepresentative training datasets; Gradient norm; Cervical cell classification; Intelligent cervical cancer screening

Ask authors/readers for more resources

Cell classification is crucial for intelligent cervical cancer screening, but the variation in cells' appearance and shape poses challenges. A new learning algorithm, worse-case boosting, is proposed to improve classification accuracy for under-represented data. Experimental results demonstrate the effectiveness of this algorithm in two publicly available datasets, achieving a 4% improvement in accuracy.
Cell classification underpins intelligent cervical cancer screening, a cytology examination that effectively decreases both the morbidity and mortality of cervical cancer. This task, however, is rather challenging, mainly due to the difficulty of collecting a training dataset representative sufficiently of the unseen test data, as there are wide variations of cells' appearance and shape at different cancerous statuses. This difficulty makes the classifier, though trained properly, often classify wrongly for cells that are underrepresented by the training dataset, eventually leading to a wrong screening result. To address it, we propose a new learning algorithm, called worse-case boosting, for classifiers effectively learning from under-representative datasets in cervical cell classification. The key idea is to learn more from worse-case data for which the classifier has a larger gradient norm compared to other training data, so these data are more likely to correspond to underrepresented data, by dynamically assigning them more training iterations and larger loss weights for boosting the generalizability of the classifier on underrepresented data. We achieve this idea by sampling worse-case data per the gradient norm information and then enhancing their loss values to update the classifier. We demonstrate the effectiveness of this new learning algorithm on two publicly available cervical cell classification datasets (the two largest ones to the best of our knowledge), and positive results (4% accuracy improvement) yield in the extensive experiments. The source codes are available at: https://github.com/YouyiSong/Worse-Case-Boosting.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available