4.6 Article

Thirty Years of The Network Scale-up Method

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
Volume 116, Issue 535, Pages 1548-1559

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/01621459.2021.1935267

Keywords

Aggregated relational data; Key populations; Size estimation; Small area estimation

Funding

  1. National Institute of Allergy and Infectious Diseases of the National Institutes of Health [R01AI136664]

Ask authors/readers for more resources

Estimating the size of hard-to-reach populations using the network scale-up method involves asking respondents how many individuals of interest they know, and utilizes various estimators like direct estimators, maximum likelihood estimators, and Bayesian estimators. This method has been applied to estimate the size of subpopulations and involves complex considerations regarding assumptions, relationships between estimators, and practical implementation. The article provides an in-depth analysis of aggregated relational data properties and data collection techniques, as well as a comprehensive review of different estimation methods, their assumptions, and practical considerations.
Estimating the size of hard-to-reach populations is an important problem for many fields. The network scale-up method (NSUM) is a relatively new approach to estimate the size of these hard-to-reach populations by asking respondents the question, How many X's do you know, where X is the population of interest (e.g., How many female sex workers do you know?). The answers to these questions form aggregated relational data (ARD). The NSUM has been used to estimate the size of a variety of subpopulations, including female sex workers, drug users, and even children who have been hospitalized for choking. Within the network scale-up methodology, there are a multitude of estimators for the size of the hidden population, including direct estimators, maximum likelihood estimators, and Bayesian estimators. In this article, we first provide an in-depth analysis of ARD properties and the techniques to collect the data. Then, we comprehensively review different estimation methods in terms of the assumptions behind each model, the relationships between the estimators, and the practical considerations of implementing the methods. We apply many of the models discussed in the review to one canonical dataset and compare their performance and unique features, presented in the . Finally, we provide a summary of the dominant methods and an extensive list of the applications, and discuss the open problems and potential research directions in this area.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available