4.7 Article

Hyppo-X: A Scalable Exploratory Framework for Analyzing Complex Phenomics Data

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2019.2947500

Keywords

Feature extraction; Agriculture; Data mining; Topology; Tools; Data visualization; Biology; Computational phenomics; topological data analysis; graph algorithms; hypothesis extraction; visualization

Funding

  1. U.S. National Science Foundation [1661348, 1819229]
  2. Direct For Mathematical & Physical Scien
  3. Division Of Mathematical Sciences [1819229] Funding Source: National Science Foundation
  4. Div Of Biological Infrastructure
  5. Direct For Biological Sciences [1661348] Funding Source: National Science Foundation

Ask authors/readers for more resources

Phenomics is a new branch of biology that captures environmental and phenotypic traits using high throughput tools, providing insights into how multiple factors interact and contribute to growth and behavior. Hyppo-X is a new algorithmic approach for exploring complex phenomics data visually and characterizing the role of the environment on phenotypic traits.
Phenomics is an emerging branch of modern biology that uses high throughput phenotyping tools to capture multiple environmental and phenotypic traits, often at massive spatial and temporal scales. The resulting high dimensional data represent a treasure trove of information for providing an in-depth understanding of how multiple factors interact and contribute to the overall growth and behavior of different genotypes. However, computational tools that can parse through such complex data and aid in extracting plausible hypotheses are currently lacking. In this article, we present Hyppo-X, a new algorithmic approach to visually explore complex phenomics data and in the process characterize the role of environment on phenotypic traits. We model the problem as one of unsupervised structure discovery, and use emerging principles from algebraic topology and graph theory for discovering higher-order structures of complex phenomics data. We present an open source software which has interactive visualization capabilities to facilitate data navigation and hypothesis formulation. We test and evaluate Hyppo-X on two real-world plant (maize) data sets. Our results demonstrate the ability of our approach to delineate divergent subpopulation-level behavior. Notably, our approach shows how environmental factors could influence phenotypic behavior, and how that effect varies across different genotypes and different time scales. To the best of our knowledge, this effort provides one of the first approaches to systematically formalize the problem of hypothesis extraction for phenomics data. Considering the infancy of the phenomics field, tools that help users explore complex data and extract plausible hypotheses in a data-guided manner will be critical to future advancements in the use of such data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available