4.6 Article

Spatiotemporal Statistical Imbalance: A Long-Term Neglected Defect in UN Comtrade Dataset

Journal

SUSTAINABILITY
Volume 14, Issue 3, Pages -

Publisher

MDPI
DOI: 10.3390/su14031431

Keywords

United Nations International Trade Statistics Database; bilateral trade; statistical imbalance; co-clustering algorithm; geographic information system

Ask authors/readers for more resources

This study utilizes the bilateral trade data from the United Nations International Trade Statistics Database to identify statistical imbalances in trade records. Statistical imbalance refers to the inequality between the import or export trade value of a commodity category and the total value of its subcategories. The findings reveal that statistical imbalance is widespread and exhibits clustering patterns, particularly in specific commodity categories. Thus, it is recommended that researchers pre-screen the data for statistical imbalance to ensure the validity of their research results.
The bilateral trade data provided by the United Nations International Trade Statistics Database are some of the most authoritative trade statistics and have been widely used in many research fields. Here, we propose a new form of inconsistency in its records, namely statistical imbalance, which refers to the phenomenon of inequality between the import or export trade value of a commodity category and the total value of all its subcategories. We investigated the frequency and spatial-temporal patterns of the statistical imbalances of 15 reporters (i.e., Australia, Brazil, Canada, China, France, Germany, India, the Netherlands, the Rep. of Korea, the Russian Federation, Switzerland, the United Arab Emirates, the United States of America, and Vietnam) from 1996-2016 and explored their distributional differences in commodity categories with a co-clustering algorithm. The results show that statistical imbalance is widespread with obvious clustering patterns. Trade records related to specific categories such as fossil fuels, pharmaceuticals, machinery, and unspecified commodity categories presented severe statistical imbalances, which may lead to erroneous trade research results. Since statistical imbalance is difficult to detect in studies focusing only on specific commodity categories, we suggested that researchers should prescreen the data for statistical imbalance to ensure the validity of their results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available