4.8 Article

Data Fusion by Matrix Factorization

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2014.2343973

Keywords

Data fusion; intermediate data integration; matrix factorization; data mining; bioinformatics; cheminformatics

Funding

  1. ARRS [P2-0209, J2-5480]
  2. NIH [P01-HD39691]
  3. EU [Health-F5-2010-242038]
  4. EUNICE KENNEDY SHRIVER NATIONAL INSTITUTE OF CHILD HEALTH & HUMAN DEVELOPMENT [P01HD039691] Funding Source: NIH RePORTER

Ask authors/readers for more resources

For most problems in science and engineering we can obtain data sets that describe the observed system from various perspectives and record the behavior of its individual components. Heterogeneous data sets can be collectively mined by data fusion. Fusion can focus on a specific target relation and exploit directly associated data together with contextual data and data about system's constraints. In the paper we describe a data fusion approach with penalized matrix tri-factorization (DFMF) that simultaneously factorizes data matrices to reveal hidden associations. The approach can directly consider any data that can be expressed in a matrix, including those from feature-based representations, ontologies, associations and networks. We demonstrate the utility of DFMF for gene function prediction task with eleven different data sources and for prediction of pharmacologic actions by fusing six data sources. Our data fusion algorithm compares favorably to alternative data integration approaches and achieves higher accuracy than can be obtained from any single data source alone.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available