4.8 Article

Kernelized Bayesian Matrix Factorization

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2014.2313125

Keywords

Automatic relevance determination; biological interaction networks; large margin learning; matrix factorization; multilabel classification; multiple kernel learning; multiple output regression; variational approximation

Funding

  1. Integrative Cancer Biology Program of the National Cancer Institute [1U54CA149237]
  2. Academy of Finland [140057]
  3. Academy of Finland (Finnish Centre of Excellence in Computational Inference Research COIN) [251170]
  4. Academy of Finland (AKA) [140057, 140057] Funding Source: Academy of Finland (AKA)

Ask authors/readers for more resources

We extend kernelized matrix factorization with a full-Bayesian treatment and with an ability to work with multiple side information sources expressed as different kernels. Kernels have been introduced to integrate side information about the rows and columns, which is necessary for making out-of-matrix predictions. We discuss specifically binary output matrices but extensions to real-valued matrices are straightforward. We extend the state of the art in two key aspects: (i) A full-conjugate probabilistic formulation of the kernelized matrix factorization enables an efficient variational approximation, whereas full-Bayesian treatments are not computationally feasible in the earlier approaches. (ii) Multiple side information sources are included, treated as different kernels in multiple kernel learning which additionally reveals which side sources are informative. We then show that the framework can also be used for supervised and semi-supervised multilabel classification and multi-output regression, by considering samples and outputs as the domains where matrix factorization operates. Our method outperforms alternatives in predicting drug-protein interactions on two data sets. On multilabel classification, our algorithm obtains the lowest Hamming losses on 10 out of 14 data sets compared to five state-of-the-art multilabel classification algorithms. We finally show that the proposed approach outperforms alternatives in multi-output regression experiments on a yeast cell cycle data set.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available