4.6 Article

Comparison and benchmark of name-to-gender inference services

Journal

PEERJ COMPUTER SCIENCE
Volume -, Issue -, Pages -

Publisher

PEERJ INC
DOI: 10.7717/peerj-cs.156

Keywords

Name-based gender inference; Classification algorithms; Performance evaluation; Gender analysis; Scientometrics; Bibliometrics

Funding

  1. Grants Programme of the International Council for Science (ICSU)

Ask authors/readers for more resources

The increased interest in analyzing and explaining gender inequalities in tech, media, and academia highlights the need for accurate inference methods to predict a person's gender from their name. Several such services exist that provide access to large databases of names, often enriched with information from social media profiles, culture-specific rules, and insights from sociolinguistics. We compare and benchmark five name-to-gender inference services by applying them to the classification of a test data set consisting of 7,076 manually labeled names. The compiled names are analyzed and characterized according to their geographical and cultural origin. We define a series of performance metrics to quantify various types of classification errors, and define a parameter tuning procedure to search for optimal values of the services' free parameters. Finally, we perform benchmarks of all services under study regarding several scenarios where a particular metric is to be optimized.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available