4.7 Article

Determining the number of factors for non-negative matrix and its application in source apportionment of air pollution in Singapore

Journal

STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT
Volume 33, Issue 4-6, Pages 1175-1186

Publisher

SPRINGER
DOI: 10.1007/s00477-019-01677-z

Keywords

Air-pollution; Cross-validation; Factor model; Non-negative matrix; Source apportionment

Funding

  1. MOE Tier 1 Grant [R-155-000-193-114]
  2. MOE Grant of Singapore [MOE2014-T2-1-072]
  3. National Natural Science Foundation of China [11771066]

Ask authors/readers for more resources

The non-negative matrix factorization has been used in many disciplines of research, where the number of factors plays a crucial role. However, a fully data-driven method for determining the number is yet not available in the literature. Based on the fact that the most appropriate number of factors should generate the best prediction, in this paper we propose a selection method using a two-step delete-one-out approach, called twice cross-validation. This method is easy to implement and is fully data-driven. It also works when constraints are imposed on the factorization including the sparsity. Intensive simulations and real data analyses suggest that the proposed method performs well in most cases and can select the number of factors correctly when the number of factors is much less than the dimension of variables and the sample size is reasonably large. As an important application, the proposed method is used for source apportionment of air pollution in Singapore, and provides physically reasonable source profiles.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available