4.2 Article

fastWKendall: an efficient algorithm for weighted Kendall correlation

Journal

COMPUTATIONAL STATISTICS
Volume 33, Issue 4, Pages 1823-1845

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s00180-017-0775-6

Keywords

Nonparametric statistics; Sequence analysis; Cluster analysis; Similarity analysis; Sequence similarity measurement

Funding

  1. Chinese National Natural Science Foundation [61472082]
  2. Natural Science Foundation of Fujian Province of China [2014J01220]
  3. Scientific Research Innovation Team Construction Program of Fujian Normal University [IRTL1702]
  4. US National Science Foundation [IIS-1552860]

Ask authors/readers for more resources

The Kendall correlation is a non- parametric method that measures the strength of dependence between two sequences. Like Pearson correlation and Spearman correlation, Kendall correlation is widely applied in sequence similarity measurements and cluster analysis. We propose an efficient algorithm, fastWKendall, to compute the approximate weighted Kendall correlation in O( n log n) time and O( n) space complexity. This is an improvement to the state- of- the- art O( n2) time requirement. The proposed method can be incorporated to perform conventional sequential similarity measurement and cluster analysis much more rapidly. This is important for analysis of huge- volume datasets, such as genome databases, streaming stock market data, and publicly available huge datasets on the Internet. The code which is implemented in R is available for public access.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available