4.6 Article

Maximizing the utility of public data

Related references

Note: Only part of the references are listed.
Article Multidisciplinary Sciences

Hierarchical regulation of autophagy during adipocyte differentiation

Mahmoud Ahmed et al.

Summary: This study evaluated the impact of adipogenic transcription factors and co-factors on autophagy gene expression during adipocyte differentiation and established a model to investigate the regulatory mechanisms involved. The findings suggest a hierarchical arrangement between adipogenic transcription factors and co-factors, which collectively regulate autophagy during adipocyte differentiation.

PLOS ONE (2022)

Editorial Material Multidisciplinary Sciences

A wealth of discovery built on the Human Genome Project - by the numbers

Alexander J. Gates et al.

Summary: The new analysis examines the impact of the draft genome on genomics since 2001, highlighting its effects on publications, drug approvals, and understanding of diseases.

NATURE (2021)

Editorial Material Biology

A research parasite's perspective on establishing a baseline to avoid errors in secondary analyses

Ayush T. Raman

Summary: To enhance reproducibility in scientific research, more and more datasets are being made publicly available for secondary analyses. However, these datasets are not perfect and require a better understanding of the assumptions that shaped them.

GIGASCIENCE (2021)

Article Oncology

A Functional Network Model of the Metastasis Suppressor PEBP1/RKIP and Its Regulators in Breast Cancer Cells

Mahmoud Ahmed et al.

Summary: The researchers identified potential drug perturbations on known interactions between metastasis suppressors and their regulators, focusing on the effect of these drugs on RKIP in breast cancer cells. Their approach could discover alternative mechanisms of existing cancer drugs and repurpose them in different disease types, with a focus on understanding the mechanisms by which these drugs produce the desired outcome.

CANCERS (2021)

Article Cell Biology

A Small Fraction of Progenitors Differentiate Into Mature Adipocytes by Escaping the Constraints on the Cell Structure

Mahmoud Ahmed et al.

Summary: 3T3-L1 pre-adipocytes are a mixture of non-identical culture cells, with only a small fraction responding to induction and developing into mature adipocytes. The remaining cells may be under structural constraints or committed to differentiating into alternative phenotypes. These findings suggest that pre-adipocytes exhibit diverse responses to stimuli and only a limited fraction will differentiate into mature adipocytes.

FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY (2021)

Article Biochemical Research Methods

A Computational Framework for Identifying Promoter Sequences in Nonmodel Organisms Using RNA-seq Data Sets

Erin H. Wilson et al.

Summary: This study developed a computational framework to identify constitutively, strongly expressed genes and predict strong promoter signals using standard RNA-seq data sets. The framework was applied to methanotroph Methylotuvimicrobium buryatense 5GB1, identifying 25 genes with high expression levels across diverse experimental conditions. The predicted promoter motifs were experimentally validated and found to be biologically meaningful for engineering diverse microorganisms for biomolecule production.

ACS SYNTHETIC BIOLOGY (2021)

Article Biotechnology & Applied Microbiology

recount3: summaries and queries for large-scale RNA-seq expression and splicing

Christopher Wilks et al.

Summary: recount3 is a resource containing over 750,000 publicly available human and mouse RNA sequencing samples processed by the new Monorail analysis pipeline. Access to the data is facilitated through the recount3 and snapcount R/Bioconductor packages, along with complementary web resources. Monorail can process local and private data, allowing researchers to compare their results directly to any study in recount3.

GENOME BIOLOGY (2021)

Article Mathematical & Computational Biology

LINPS: a database for cancer-cell-specific perturbations of biological networks

Mahmoud Ahmed et al.

Summary: The process of screening for potential cancer therapies using existing large datasets of drug perturbations often requires specialized expertise and resources not readily available to all lab scientists. One solution to this obstacle is to leverage prior knowledge, particularly those encoded in standard formats such as causal biological networks (CBN). By converting large datasets into appropriate structures and analyzing them once, the results can be freely accessible in user-friendly formats. Researchers demonstrated using the Library of Integrated Cellular Signatures to model the cell-specific effects of drug treatments on gene expression, allowing the prediction of treatment effects on various CBN through network perturbation amplitudes analysis.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2021)

Article Biochemistry & Molecular Biology

KnockTF: a comprehensive human gene expression profile database with knockdown/knockout of transcription factors

Chenchen Feng et al.

NUCLEIC ACIDS RESEARCH (2020)

Editorial Material Biology

Data detectives, self-love, and humility: a research parasite's perspective

Claire Duvallet

GIGASCIENCE (2020)

Article Biotechnology & Applied Microbiology

Integrating binding and expression data to predict transcription factors combined function

Mahmoud Ahmed et al.

BMC GENOMICS (2020)

Article Multidisciplinary Sciences

The reuse of public datasets in the life sciences: potential risks and rewards

Katharina Sielemann et al.

PEERJ (2020)

Article Multidisciplinary Sciences

The GenomeAsia 100K Project enables genetic discoveries across Asia

Jeffrey D. Wall et al.

NATURE (2019)

Article Endocrinology & Metabolism

Modelling the gene expression and the DNA-binding in the 3T3-L1 differentiating adipocytes

Mahmoud Ahmed et al.

ADIPOCYTE (2019)

Article Biochemical Research Methods

Combining RNA-seq data and homology-based gene prediction for plants, animals and fungi

Jens Keilwagen et al.

BMC BIOINFORMATICS (2018)

Article Biochemistry & Molecular Biology

Co-Expression Network Analysis of AMPK and Autophagy Gene Products during Adipocyte Differentiation

Mahmoud Ahmed et al.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2018)

Article Mathematical & Computational Biology

BEL Commons: an environment for exploration and analysis of networks encoded in Biological Expression Language

Charles Tapley Hoyt et al.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2018)

Article Multidisciplinary Sciences

Attitudes and norms affecting scientists' data reuse

Renata Goncalves Curty et al.

PLOS ONE (2017)

Article Biochemistry & Molecular Biology

A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles

Aravind Subramanian et al.

Article Multidisciplinary Sciences

Announcement: Where are the data?

NATURE (2016)

Article Multidisciplinary Sciences

A global reference for human genetic variation

David M. Altshuler et al.

NATURE (2015)

Editorial Material Genetics & Heredity

The Cancer Genome Atlas Pan-Cancer analysis project

John N. Weinstein et al.

NATURE GENETICS (2013)

Article Biochemical Research Methods

Assessment of transcript reconstruction methods for RNA-seq

Tamara Steijger et al.

NATURE METHODS (2013)

Article Biochemical Research Methods

Target analysis by integration of transcriptome and ChIP-seq data with BETA

Su Wang et al.

NATURE PROTOCOLS (2013)

Article Mathematical & Computational Biology

curatedOvarianData: clinically annotated data for the ovarian cancer transcriptome

Benjamin Frederick Ganzfried et al.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2013)

Article Mathematical & Computational Biology

Assessment of network perturbation amplitudes by applying high-throughput data to causal biological networks

Florian Martin et al.

BMC SYSTEMS BIOLOGY (2012)

Article Biochemical Research Methods

Unsupervised pattern discovery in human chromatin structure through genomic segmentation

Michael M. Hoffman et al.

NATURE METHODS (2012)

Article Biochemistry & Molecular Biology

The Sequence Read Archive

Rasko Leinonen et al.

NUCLEIC ACIDS RESEARCH (2011)

Editorial Material Multidisciplinary Sciences

Reproducible Research in Computational Science

Roger D. Peng

SCIENCE (2011)

Review Genetics & Heredity

APPLICATIONS OF NEXT-GENERATION SEQUENCING Sequencing technologies - the next generation

Michael L. Metzker

NATURE REVIEWS GENETICS (2010)

Article Multidisciplinary Sciences

Genome-wide detection and characterization of positive selection in human populations

Pardis C. Sabeti et al.

NATURE (2007)

Article Biochemistry & Molecular Biology

ArrayExpress - a public database of microarray experiments and gene expression profiles

H. Parkinson et al.

NUCLEIC ACIDS RESEARCH (2007)

Article Biochemistry & Molecular Biology

Gene Expression Omnibus: NCBI gene expression and hybridization array data repository

R Edgar et al.

NUCLEIC ACIDS RESEARCH (2002)

Review Multidisciplinary Sciences

Initial sequencing and analysis of the human genome

ES Lander et al.

NATURE (2001)