4.8 Article

TomExpress, a unified tomato RNA-Seq platform for visualization of expression data, clustering and correlation networks

Journal

PLANT JOURNAL
Volume 92, Issue 4, Pages 727-735

Publisher

WILEY
DOI: 10.1111/tpj.13711

Keywords

tomato; RNA-Seq; database; platform; web tools; gene expression; data mining

Categories

Funding

  1. 'Laboratoire d'Excellence' (LABEX) [ANR-10-LABX-41]
  2. ANR TomEpiSet project
  3. TomGEM H2020 project

Ask authors/readers for more resources

The TomExpress platform was developed to provide the tomato research community with a browser and integrated web tools for public RNA-Seq data visualization and data mining. To avoid major biases that can result from the use of different mapping and statistical processing methods, RNA-Seq raw sequence data available in public databases were mapped de novo on a unique tomato reference genome sequence and post-processed using the same pipeline with accurate parameters. Following the calculation of the number of counts per gene in each RNA-Seq sample, a communal global normalization method was applied to all expression values. This unifies the whole set of expression data and makes them comparable. A database was designed where each expression value is associated with corresponding experimental annotations. Sample details were manually curated to be easily understandable by biologists. To make the data easily searchable, a user-friendly web interface was developed that provides versatile data mining web tools via on-the-fly generation of output graphics, such as expression bar plots, comprehensive in planta representations and heatmaps of hierarchically clustered expression data. In addition, it allows for the identification of co-expressed genes and the visualization of correlation networks of co-regulated gene groups. TomExpress provides one of the most complete free resources of publicly available tomato RNA-Seq data, and allows for the immediate interrogation of transcriptional programs that regulate vegetative and reproductive development in tomato under diverse conditions. The design of the pipeline developed in this project enables easy updating of the database with newly published RNA-Seq data, thereby allowing for continuous enrichment of the resource. Significance Statement After applying a bioinformatics and statistics pipeline, a unified TomExpress platform was developed to provide the tomato research community with a browser and integrated web tools for public RNA-Seq data visualization and data mining via on-the-fly generation of output graphics such as expression bar plots, comprehensive in planta representations and heatmaps of hierarchically clustered expression data. TomExpress provides one of the most complete free resources of publicly available tomato RNA-Seq data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available