4.6 Article

Scalable transcriptomics analysis with Dask: applications in data science and machine learning

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Biochemical Research Methods

Squidpy: a scalable framework for spatial omics analysis

Giovanni Palla et al.

Summary: Squidpy is a Python framework that combines tools from omics and image analysis to efficiently store, manipulate, and visualize spatial omics data. It is extensible and can be interfaced with other libraries for scalable analysis of spatial omics data.

NATURE METHODS (2022)

Proceedings Paper Computer Science, Hardware & Architecture

Proteome-scale Deployment of Protein Structure Prediction Workflows on the Summit Supercomputer

Mu Gao et al.

Summary: Deep learning has made significant contributions to protein structure prediction, and it is now possible to perform genome-scale structure prediction using state-of-the-art models. The authors describe their efforts to efficiently deploy the AlphaFold v.2 program on the Oak Ridge Leadership Computing Facility's resources, and showcase the predicted structures for a large number of protein sequences.

2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022) (2022)

Review Biochemical Research Methods

Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data

Tallulah S. Andrews et al.

Summary: Single-cell RNA sequencing (scRNA-seq) is a popular and powerful technology for profiling the whole transcriptome of individual cells, but analyzing the large volumes of data requires specialized statistical and computational methods. This article provides an overview of the computational workflow, common tasks and tools for addressing biological questions, as well as guidelines for best practices in computational analyses. It serves as a hands-on guide for experimentalists and an overview for bioinformaticians developing new computational methods.

NATURE PROTOCOLS (2021)

Article Computer Science, Information Systems

Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System

Devin Petersohn et al.

Summary: Dataframes are popular tools in data science, but current systems like pandas have scalability issues. The parallel dataframe system MODIN was developed to address these limitations, translating pandas functions into parallelized operators with metadata independence, allowing for faster operations on large datasets.

PROCEEDINGS OF THE VLDB ENDOWMENT (2021)

Article Chemistry, Physical

LiPyphilic: A Python Toolkit for the Analysis of Lipid Membrane Simulations

Paul Smith et al.

Summary: LiPyphilic is a fast, fully tested, and easy-to-install Python package for analyzing emergent phenomena in lipid membranes through molecular dynamics simulations. It offers various analysis tools and on-the-fly trajectory transformations to handle membranes with complex compositions effectively. Additionally, LiPyphilic addresses the issue of fluctuations in box volume under the NPT ensemble, which has been overlooked in most current implementations.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2021)

Review Immunology

Immunology in the Era of Single-Cell Technologies

Mirjana Efremova et al.

ANNUAL REVIEW OF IMMUNOLOGY, VOL 38 (2020)

Article Biochemical Research Methods

SciPy 1.0: fundamental algorithms for scientific computing in Python

Pauli Virtanen et al.

NATURE METHODS (2020)

Article Genetics & Heredity

Convolutional neural network models for cancer type prediction based on gene expression

Milad Mostavi et al.

BMC MEDICAL GENOMICS (2020)

Review Multidisciplinary Sciences

Array programming with NumPy

Charles R. Harris et al.

NATURE (2020)

Article Computer Science, Information Systems

Towards Scalable Dataframe Systems

Devin Petersohn et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2020)

Review Biotechnology & Applied Microbiology

Applications of machine learning in drug discovery and development

Jessica Vamathevan et al.

NATURE REVIEWS DRUG DISCOVERY (2019)

Article Oncology

Cancer treatment and survivorship statistics, 2019

Kimberly D. Miller et al.

CA-A CANCER JOURNAL FOR CLINICIANS (2019)

Review Biochemistry & Molecular Biology

Current best practices in single-cell RNA-seq analysis: a tutorial

Malte D. Luecken et al.

MOLECULAR SYSTEMS BIOLOGY (2019)

Review Genetics & Heredity

RNA sequencing: the teenage years

Rory Stark et al.

NATURE REVIEWS GENETICS (2019)

Article Biochemical Research Methods

GRNBoost2 and Arboreto: efficient and scalable inference of gene regulatory networks

Thomas Moerman et al.

BIOINFORMATICS (2019)

Article Cell Biology

A Deep Learning Framework for Predicting Response to Therapy in Cancer

Theodore Sakellaropoulos et al.

CELL REPORTS (2019)

Article Biotechnology & Applied Microbiology

Scaling computational genomics to millions of individuals with GPUs

Amaro Taylor-Weiner et al.

GENOME BIOLOGY (2019)

Article Biotechnology & Applied Microbiology

scRNA-seq assessment of the human lung, spleen, and esophagus tissue stability after cold preservation

E. Madissoon et al.

GENOME BIOLOGY (2019)

Article Biochemistry & Molecular Biology

The Encyclopedia of DNA elements (ENCODE): data portal update

Carrie A. Davis et al.

NUCLEIC ACIDS RESEARCH (2018)

Article Astronomy & Astrophysics

Vaex: big data exploration in the era of Gaia

Maarten A. Breddels et al.

ASTRONOMY & ASTROPHYSICS (2018)

Article Astronomy & Astrophysics

Vaex: big data exploration in the era of Gaia

Maarten A. Breddels et al.

ASTRONOMY & ASTROPHYSICS (2018)

Article Biotechnology & Applied Microbiology

Predicting age from the transcriptome of human dermal fibroblasts

Jason G. Fleischer et al.

GENOME BIOLOGY (2018)

Article Medicine, Research & Experimental

Transcriptomics and machine learning predict diagnosis and severity of growth hormone deficiency

Philip G. Murray et al.

JCI INSIGHT (2018)

Review Biochemistry & Molecular Biology

Single-cell RNA sequencing technologies and bioinformatics pipelines

Byungjin Hwang et al.

EXPERIMENTAL AND MOLECULAR MEDICINE (2018)

Article Biochemical Research Methods

Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions

Ciaran Evans et al.

BRIEFINGS IN BIOINFORMATICS (2018)

Review Biochemistry & Molecular Biology

Transcriptome Profiling in Human Diseases: New Advances and Perspectives

Amelia Casamassimi et al.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2017)

Article Biochemical Research Methods

Power analysis of single-cell RNA-sequencing experiments

Valentine Svensson et al.

NATURE METHODS (2017)

Article Cell Biology

Improving genetic diagnosis in Mendelian disease with transcriptome sequencing

Beryl B. Cummings et al.

SCIENCE TRANSLATIONAL MEDICINE (2017)

Article Biotechnology & Applied Microbiology

A comprehensive genomic pan-cancer classification using The Cancer Genome Atlas gene expression data

Yuanyuan Li et al.

BMC GENOMICS (2017)

Review Biochemistry & Molecular Biology

Transcriptional Addiction in Cancer

James E. Bradner et al.

Article Computer Science, Hardware & Architecture

Apache Spark: A Unified Engine for Big Data Processing

Matei Zaharia et al.

COMMUNICATIONS OF THE ACM (2016)

Article Medicine, Research & Experimental

Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data

Alexander Aliper et al.

MOLECULAR PHARMACEUTICS (2016)

Review Genetics & Heredity

Translating RNA sequencing into clinical diagnostics: opportunities and challenges

Sara A. Byron et al.

NATURE REVIEWS GENETICS (2016)

Article Dermatology

Gene expression profiling for molecular staging of cutaneous melanoma in patients undergoing sentinel lymph node biopsy

Pedram Gerami et al.

JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY (2015)

Article Multidisciplinary Sciences

Integrative analysis of 111 reference human epigenomes

Anshul Kundaje et al.

NATURE (2015)

Review Genetics & Heredity

Machine learning applications in genetics and genomics

Maxwell W. Libbrecht et al.

NATURE REVIEWS GENETICS (2015)

Article Multidisciplinary Sciences

Genomic correlates of response to CTLA-4 blockade in metastatic melanoma

Eliezer M. Van Allen et al.

SCIENCE (2015)

Review Multidisciplinary Sciences

Machine learning: Trends, perspectives, and prospects

M. I. Jordan et al.

SCIENCE (2015)

Article

Review The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge

Katarzyna Tomczak et al.

Wspolczesna Onkologia-Contemporary Oncology (2015)

Article Biotechnology & Applied Microbiology

The Impact of Normalization Methods on RNA-Seq Data Analysis

J. Zyprych-Walczak et al.

BIOMED RESEARCH INTERNATIONAL (2015)

Article Biochemical Research Methods

compcodeR-an R package for benchmarking differential expression methods for RNA-seq data

Charlotte Soneson

BIOINFORMATICS (2014)

Review Multidisciplinary Sciences

The causes and consequences of genetic heterogeneity in cancer evolution

Rebecca A. Burrell et al.

NATURE (2013)

Article Multidisciplinary Sciences

Transcriptome and genome sequencing uncovers functional variation in humans

Tuuli Lappalainen et al.

NATURE (2013)

Editorial Material Genetics & Heredity

The Genotype-Tissue Expression (GTEx) project

John Lonsdale et al.

NATURE GENETICS (2013)

Article Biotechnology & Applied Microbiology

Gene expression changes with age in skin, adipose tissue, blood and brain

Daniel Glass et al.

GENOME BIOLOGY (2013)

Article Multidisciplinary Sciences

An integrated encyclopedia of DNA elements in the human genome

Ian Dunham et al.

NATURE (2012)

Review Biochemical Research Methods

Computational methods for transcriptome annotation and quantification using RNA-seq

Manuel Garber et al.

NATURE METHODS (2011)

Article Biochemical Research Methods

Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments

James H. Bullard et al.

BMC BIOINFORMATICS (2010)

Article Biotechnology & Applied Microbiology

A scaling normalization method for differential expression analysis of RNA-seq data

Mark D. Robinson et al.

GENOME BIOLOGY (2010)

Article Biochemical Research Methods

Mapping and quantifying mammalian transcriptomes by RNA-Seq

Ali Mortazavi et al.

NATURE METHODS (2008)

Article Biochemical Research Methods

Tumor classification by partial least squares using microarray gene expression data

DV Nguyen et al.

BIOINFORMATICS (2002)