4.6 Review

Data Integration Challenges for Machine Learning in Precision Medicine

Related references

Note: Only part of the references are listed.
Article Health Care Sciences & Services

Should electronic differential diagnosis support be used early or late in the diagnostic process? A multicentre experimental study of Isabel

Matt Sibbald et al.

Summary: The study found that using EDS, whether early or late in the diagnostic process, increased the number of diagnostic hypotheses and the likelihood of the correct diagnosis appearing in the differential. Early use primarily increased the number of diagnostic hypotheses, while late use increased the likelihood of the correct diagnosis being present in the differential regardless of experience level.

BMJ QUALITY & SAFETY (2022)

Review Biochemical Research Methods

Current RNA-seq methodology reporting limits reproducibility

Joel Simoneau et al.

Summary: The translation highlights that the current standard practice in RNA-seq studies often lacks the necessary methodological information, leading to potential reproducibility issues. This work emphasizes the importance of standardized and explicit display of methodological information in RNA-seq experiments.

BRIEFINGS IN BIOINFORMATICS (2021)

Review Biochemical Research Methods

The road towards data integration in human genomics: players, steps and interactions

Anna Bernasconi et al.

Summary: With thousands of new experimental datasets being generated daily within large cooperative efforts, data integration becomes crucial but faces challenges due to heterogeneity. This paper outlines a technological pipeline for data production to integration, introduces a taxonomy of genomic data players, and focuses on addressing issues in genomic data integration.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Ophthalmology

The Quality of Medical Evidence and Medical Practice March 1987

David M. Eddy et al.

AMERICAN JOURNAL OF OPHTHALMOLOGY (2021)

Review Medicine, Research & Experimental

Precision Medicine, AI, and the Future of Personalized Health Care

Kevin B. Johnson et al.

Summary: The convergence of artificial intelligence and precision medicine promises to revolutionize healthcare by helping solve the most difficult challenges facing precision medicine and facilitating personalized diagnosis and prognostication through the combination of genomic and nongenomic determinants with patient information.

CTS-CLINICAL AND TRANSLATIONAL SCIENCE (2021)

Article Public, Environmental & Occupational Health

AI's gonna have an impact on everything in society, so it has to have an impact on public health: a fundamental qualitative descriptive study of the implications of artificial intelligence for public health

Jason D. Morgenstern et al.

Summary: Experts are cautiously optimistic about the impacts of artificial intelligence on public health practice, especially for improving disease surveillance. However, they identified significant barriers, such as a lack of expertise, and risks, including inadequate regulation. Therefore, investment and research in AI for public health practice could be beneficial, but improving access to high-quality data, educating about AI limitations, and establishing rigorous regulation are necessary to realize these benefits.

BMC PUBLIC HEALTH (2021)

Review Biochemical Research Methods

The status of causality in biological databases: data resources and data retrieval possibilities to support logical modeling

Vasundra Toure et al.

Summary: Causal molecular interactions are crucial for computational modeling and predicting biological behaviors. Different biological interests have led to various ways of describing and annotating these interactions, making it challenging to efficiently explore and utilize the data. Understanding the variety of data formats, biological process representations, and exchange procedures is important in extracting and downloading causal interaction data.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Cell Biology

A multimodal and integrated approach to interrogate human kidney biopsies with rigor and reproducibility: guidelines from the Kidney Precision Medicine Project

Tarek M. El-Achkar et al.

Summary: The Kidney Precision Medicine Project aims to generate 3-D molecular atlases of healthy and diseased kidney biopsies using state-of-the-art omics and imaging technologies. The emphasis is on quality assurance, quality control, validation, and harmonization across different technologies.

PHYSIOLOGICAL GENOMICS (2021)

Article Biochemistry & Molecular Biology

ELIXIR-EXCELERATE: establishing Europe's data infrastructure for the life science research of the future

Jennifer Harrow et al.

EMBO JOURNAL (2021)

Article Health Care Sciences & Services

Artificial Intelligence-Aided Precision Medicine for COVID-19: Strategic Areas of Research and Development

Enrico Santus et al.

Summary: This paper provides an overview of the application of AI technology in diagnosis, drug development, vaccine research, and literature mining in the context of the COVID-19 pandemic. It emphasizes the role of AI in healthcare, which can help improve the efficiency of public health systems in handling outbreaks and enhance patient outcomes.

JOURNAL OF MEDICAL INTERNET RESEARCH (2021)

Review Oncology

Artificial intelligence in cancer research: learning at different levels of data granularity

Davide Cirillo et al.

Summary: From genome-scale experimental studies to imaging data, behavioral footprints, and longitudinal healthcare records, the convergence of big data in cancer research and advances in Artificial Intelligence (AI) are paving the way to develop a systems view of cancer. However, the co-existence of big data and small data resources in this biomedical area highlights the need for a deeper investigation about the crosstalk between different levels of data granularity, including varied sample sizes, labels, data types, and other data descriptors. This review introduces the current challenges, limitations, and solutions of AI in the heterogeneous landscape of data granularity in cancer research, emphasizing the necessity of advancing interoperability among AI approaches and discussing the synergy between discriminative and generative models with examples of techniques and applications.

MOLECULAR ONCOLOGY (2021)

Review Medicine, General & Internal

AI and Big Data in Healthcare: Towards a More Comprehensive Research Framework for Multimorbidity

Ljiljana Trtica Majnaric et al.

Summary: Multimorbidity, the coexistence of two or more chronic diseases in a person, presents unique care needs that current healthcare systems struggle to address due to their focus on single diseases. To improve patient care in these cases, a radical change in medical research and treatment approaches is required, with a shift towards interactive research supported by artificial intelligence and big data analytics.

JOURNAL OF CLINICAL MEDICINE (2021)

Editorial Material Medicine, General & Internal

Trials and Tribulations-11 Reasons Why We Need to Promote Clinical Trials Data Sharing

Atul J. Butte

JAMA NETWORK OPEN (2021)

Editorial Material Biochemistry & Molecular Biology

Precision medicine in 2030-seven ways to transform healthcare

Joshua C. Denny et al.

Summary: Precision medicine promises to improve health by considering individual variability in genes, environment, and lifestyle. It will continue to transform healthcare in the next decade through advancements in genomics, environment, lifestyle, and artificial intelligence.
Article Health Care Sciences & Services

Text-mining in electronic healthcare records can be used as efficient tool for screening and data collection in cardiovascular trials: a multicenter validation study

Wouter B. van Dijk et al.

Summary: This study validates the use of text-mining in electronic healthcare records for patient screening and data extraction in trials. The results showed that automated EHR data screening was more efficient than manual screening, and although the accuracy of automatically extracted data was lower than manual data entry, the overall accuracy was high.

JOURNAL OF CLINICAL EPIDEMIOLOGY (2021)

Editorial Material Biochemical Research Methods

A global view of standards for open image data formats and repositories

Jason R. Swedlow et al.

Summary: Imaging technologies play a crucial role in understanding biological mechanisms and in diagnosis and therapy in animal and human medicine. Establishing globally applicable guidelines for open image data tools and resources can help advance the rapidly developing fields of biological and biomedical imaging.

NATURE METHODS (2021)

Article Multidisciplinary Sciences

NGScloud2: optimized bioinformatic analysis using Amazon Web Services

Fernando Mora-Marquez et al.

Summary: NGScloud2 is an enhanced and expanded version of NGScloud with major technical improvements, such as the ability to run spot instances and the latest AWS instance types, leading to significant cost savings. This improved version includes common applications and tools for de novo RNAseq analysis, as well as workflows for reference-based RNAseq, RADseq, and functional annotation.

PEERJ (2021)

Review Health Care Sciences & Services

The Role of Artificial Intelligence in Managing Multimorbidity and Cancer

Alfredo Cesario et al.

Summary: Personalized Medicine is an innovative approach that takes into consideration individual patient characteristics, such as lifestyle and preferences, to address the care needs of patients with multimorbidity. Traditional disease-centered healthcare paradigms may not be suitable for understanding and managing complex conditions, requiring the integration of heterogeneous data to guide interventions.

JOURNAL OF PERSONALIZED MEDICINE (2021)

Review Health Care Sciences & Services

Translational Research in the Era of Precision Medicine: Where We Are and Where We Will Go

Ruggero De Maria Marchiano et al.

Summary: The emergence of Precision Medicine has revolutionized translational research globally, emphasizing patient-centric therapeutic choices and digitizing individual health status. Real world data-based translational applications offer a promising alternative to traditional evidence-based medicine approaches.

JOURNAL OF PERSONALIZED MEDICINE (2021)

Article Medicine, General & Internal

AUTOMATED REVERSE TRANSCRIPTION POLYMERASE CHAIN REACTION DATA ANALYSIS FOR SARS-COV-2 DETECTION

Laura Gomez-Romero et al.

Summary: ARPA is a sensitive and specific software designed to analyze RT-PCR data for SARS-CoV-2 detection, which can reduce the time required in the diagnostic pipeline.

REVISTA DE INVESTIGACION CLINICA-CLINICAL AND TRANSLATIONAL INVESTIGATION (2021)

Article Computer Science, Interdisciplinary Applications

Helastic: On combining threshold-based and Serverless elasticity approaches for optimizing the execution of bioinformatics applications

Mateus Rauback Aubin et al.

Summary: This article introduces a model called Helastic for exploring cloud elasticity on jModelTest. The model combines traditional reactive methods with Serverless to improve system performance. Testing results show that the dual-elasticity approach has more advantages compared to a single resource rearrangement technique.

JOURNAL OF COMPUTATIONAL SCIENCE (2021)

Article Mathematical & Computational Biology

BioVerbNet: a large semantic-syntactic classification of verbs in biomedicine

Olga Majewska et al.

Summary: This study introduces a new resource BioVerbNet, making progress in the semantic-syntactic classification of biomedical verbs and demonstrating the ability to improve model performance in text classification tasks.

JOURNAL OF BIOMEDICAL SEMANTICS (2021)

Editorial Material Medicine, General & Internal

Assessing Clinical Outcomes in a Data-Rich World-A Reality Check on Real-World Data

Julian C. Hong et al.

JAMA NETWORK OPEN (2021)

Proceedings Paper Computer Science, Artificial Intelligence

The DeepHealth Toolkit: A Unified Framework to Boost Biomedical Applications

Michele Cancilla et al.

Summary: The DeepHealth Toolkit is an open-source deep learning toolkit designed to enhance the productivity of data scientists in the medical field by providing a unified framework for distributed training of neural networks in a transparent manner, leveraging hybrid HPC and cloud computing environments.

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2021)

Article Health Care Sciences & Services

Development of a core competency framework for clinical informatics

Alan Davies et al.

Summary: An iterative and mixed-methods approach was used in developing a core competency framework for clinical informatics in the UK, involving the target community. The final framework consists of 6 primary domains, 36 subdomains, and 111 individual competencies, based on input from over 102 participants. Careful consideration is needed to avoid professional burnout and ensure effective implementation of the framework in practice.

BMJ HEALTH & CARE INFORMATICS (2021)

Article History & Philosophy Of Science

Evidence-based medicine: a brief historical analysis of conceptual landmarks and practical goals for care

Lina Faria et al.

Summary: Evidence-based medicine (EBM) aims to improve the efficiency and quality of health services while reducing costs, with the goal of addressing relevant issues and promoting the social applicability of conclusions. Scholars in the twentieth century have made significant contributions to the development and dissemination of EBM in clinical teaching and healthcare practice, expanding discussions on the relationship between teaching and medical practice.

HISTORIA CIENCIAS SAUDE-MANGUINHOS (2021)

Review Automation & Control Systems

Big Data Analytics in Healthcare - A Systematic Literature Review and Roadmap for Practical Implementation

Sohail Imran et al.

Summary: The emergence of healthcare information management systems has generated a vast amount of healthcare data globally. Big data analytics in healthcare offers great potential for improving diagnosis, treatment, and efficiency of healthcare services. Implementing big data analytics in healthcare presents challenges but also opportunities for significant advancements in patient care.

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2021)

Article Health Policy & Services

Can structured EHR data support clinical coding? A data mining approach

Jose Carlos Ferrao et al.

Summary: This article explores the potential of using structured clinical data to support clinical code assignment, addressing high dimensionality issues, the multi-label nature of coding, and optimizing model parameters. The methodology involves transforming raw data into feature sets, constructing data matrices, and testing combinations of feature selection methods and machine learning models for predicting code assignment. Testing on a real hospital dataset showed varying predictive power across codes, indicating the efficiency and workload reduction benefits of leveraging structured data for clinical coding.

HEALTH SYSTEMS (2021)

Review Biochemical Research Methods

CURATE.AI: Optimizing Personalized Medicine with Artificial Intelligence

Agata Blasiak et al.

SLAS TECHNOLOGY (2020)

Article Computer Science, Software Engineering

The state-of-the-art in container technologies: Application, orchestration and security

Emiliano Casalicchio et al.

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE (2020)

Article Health Care Sciences & Services

Patient Perspectives on the Usefulness of an Artificial Intelligence-Assisted Symptom Checker: Cross-Sectional Survey Study

Ashley N. D. Meyer et al.

JOURNAL OF MEDICAL INTERNET RESEARCH (2020)

Review Biochemistry & Molecular Biology

How Machine Learning Will Transform Biomedicine

Jeremy Goecks et al.

Review Cardiac & Cardiovascular Systems

State-of-the-Art Machine Learning Techniques Aiming to Improve Patient Outcomes Pertaining to the Cardiovascular System

Rahul Kumar Sevakula et al.

JOURNAL OF THE AMERICAN HEART ASSOCIATION (2020)

Article Multidisciplinary Sciences

Reliability and validity of the UK Biobank cognitive tests

Chloe Fawns-Ritchie et al.

PLOS ONE (2020)

Article Medical Informatics

EHR-Independent Predictive Decision Support Architecture Based on OMOP

Philipp Unberath et al.

APPLIED CLINICAL INFORMATICS (2020)

Article Multidisciplinary Sciences

COVID-19 pandemic reveals the peril of ignoring metadata standards

Lynn M. Schriml et al.

SCIENTIFIC DATA (2020)

Review Computer Science, Interdisciplinary Applications

Social media based surveillance systems for healthcare using machine learning: A systematic review

Aakansha Gupta et al.

JOURNAL OF BIOMEDICAL INFORMATICS (2020)

Article Computer Science, Interdisciplinary Applications

Recent advances of HCI in decision-making tasks for optimized clinical workflows and precision medicine

Leonardo Rundo et al.

JOURNAL OF BIOMEDICAL INFORMATICS (2020)

Article Multidisciplinary Sciences

Expanded encyclopaedias of DNA elements in the human and mouse genomes

Jill E. Moore et al.

NATURE (2020)

Editorial Material Medicine, General & Internal

The Case for Algorithmic Stewardship for Artificial Intelligence and Machine Learning Technologies

Stephanie Eaneff et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2020)

Article Cell Biology

Facing multimorbidity in the precision medicine era

Graziano Onder et al.

MECHANISMS OF AGEING AND DEVELOPMENT (2020)

Article Cell Biology

Untangling the complexity of multimorbidity with machine learning

Abdelaali Hassaine et al.

MECHANISMS OF AGEING AND DEVELOPMENT (2020)

Article Multidisciplinary Sciences

LifeTime and improving European healthcare through cell-based interceptive medicine

Nikolaus Rajewsky et al.

NATURE (2020)

Letter Biotechnology & Applied Microbiology

A novel computational architecture for large-scale genomics

Matthias Becker et al.

NATURE BIOTECHNOLOGY (2020)

Editorial Material Biochemistry & Molecular Biology

Minimum information about clinical artificial intelligence modeling: the MI-CLAIM checklist

Beau Norgeot et al.

NATURE MEDICINE (2020)

Article Multidisciplinary Sciences

The GTEx Consortium atlas of genetic regulatory effects across human tissues

Francois Aguet et al.

SCIENCE (2020)

Review Pharmacology & Pharmacy

Road to effective data curation for translational research

Wei Gu et al.

DRUG DISCOVERY TODAY (2020)

Article Health Care Sciences & Services

Characteristics and challenges of the clinical pipeline of digital therapeutics

Nisarg A. Patel et al.

NPJ DIGITAL MEDICINE (2020)

Article Biotechnology & Applied Microbiology

Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework

Tanveer Ahmad et al.

BMC GENOMICS (2020)

Letter Biotechnology & Applied Microbiology

Guidelines for reporting single-cell RNA-seq experiments

Anja Fullgrabe et al.

NATURE BIOTECHNOLOGY (2020)

Article Health Care Sciences & Services

Protected Health Information filter (Philter): accurately and securely de-identifying free-text clinical notes

Beau Norgeot et al.

NPJ DIGITAL MEDICINE (2020)

Article Computer Science, Information Systems

Differential Privacy Techniques for Cyber Physical Systems: A Survey

Muneeb Ul Hassan et al.

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS (2020)

Review Mathematical & Computational Biology

Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine

Zeeshan Ahmed et al.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2020)

Article Computer Science, Theory & Methods

Investigating class rarity in big data

Tawfiq Hasanin et al.

JOURNAL OF BIG DATA (2020)

Article Operations Research & Management Science

Massive datasets and machine learning for computational biomedicine: trends and challenges

Anton Kocheturov et al.

ANNALS OF OPERATIONS RESEARCH (2019)

Article Computer Science, Artificial Intelligence

Machine learning for integrating data in biology and medicine: Principles, practice, and opportunities

Marinka Zitnik et al.

INFORMATION FUSION (2019)

Review Biochemistry & Molecular Biology

Personalized Medicine and the Power of Electronic Health Records

Noura S. Abul-Husn et al.

Editorial Material Medicine, Research & Experimental

Promises, promises, and precision medicine

Michael J. Joyner et al.

JOURNAL OF CLINICAL INVESTIGATION (2019)

Review Genetics & Heredity

Machine Learning and Integrative Analysis of Biomedical Big Data

Bilal Mirza et al.

GENES (2019)

Article Engineering, Electrical & Electronic

IBM Watson, Heal Thyself

Eliza Strickland

IEEE SPECTRUM (2019)

Article Engineering, Electrical & Electronic

BERST: An Engine and Tool for Exploring Biomedical Entities and Relationships

Bai Tian et al.

CHINESE JOURNAL OF ELECTRONICS (2019)

Review Biochemical Research Methods

Big data analytics for personalized medicine

Davide Cirillo et al.

CURRENT OPINION IN BIOTECHNOLOGY (2019)

Article Pharmacology & Pharmacy

Propensity score-integrated power prior approach for incorporating real-world evidence in single-arm clinical studies

Chenguang Wang et al.

JOURNAL OF BIOPHARMACEUTICAL STATISTICS (2019)

Review Genetics & Heredity

Leveraging European infrastructures to access 1 million human genomes by 2022

Gary Saunders et al.

NATURE REVIEWS GENETICS (2019)

Article Biochemical Research Methods

PyGMQL: scalable data extraction and analysis for heterogeneous genomic datasets

Luca Nanni et al.

BMC BIOINFORMATICS (2019)

Article Biochemical Research Methods

Patient Dossier: Healthcare queries over distributed resources

Miguel Vazquez et al.

PLOS COMPUTATIONAL BIOLOGY (2019)

Article Health Care Sciences & Services

Health Care and Precision Medicine Research: Analysis of a Scalable Data Science Platform

Jacob McPadden et al.

JOURNAL OF MEDICAL INTERNET RESEARCH (2019)

Editorial Material Medicine, General & Internal

Deep Learning in Medicine-Promise, Progress, and Challenges

Fei Wang et al.

JAMA INTERNAL MEDICINE (2019)

Editorial Material Public, Environmental & Occupational Health

Challenges and Opportunities for Using Big Health Care Data to Advance Medical Science and Public Health

Susan M. Shortreed et al.

AMERICAN JOURNAL OF EPIDEMIOLOGY (2019)

Article Computer Science, Artificial Intelligence

A Machine Learning Perspective on Personalized Medicine: An Automized, Comprehensive Knowledge Base with Ontology for Pattern Recognition

Frank Emmert-Streib et al.

MACHINE LEARNING AND KNOWLEDGE EXTRACTION (2019)

Article Engineering, Biomedical

Windows into human health through wearables data analytics

Daniel R. Witt et al.

CURRENT OPINION IN BIOMEDICAL ENGINEERING (2019)

Review Health Policy & Services

A systematic perspective on the applications of big data analytics in healthcare management

Sachin S. Kamble et al.

INTERNATIONAL JOURNAL OF HEALTHCARE MANAGEMENT (2019)

Article Medical Informatics

Blockchain Applications for Healthcare Data Management

Dimiter Dimitrov

HEALTHCARE INFORMATICS RESEARCH (2019)

Article Computer Science, Information Systems

MPPDS: Multilevel Privacy-Preserving Data Sharing in a Collaborative eHealth System

Jong Wook Kim et al.

IEEE ACCESS (2019)

Review Medicine, Research & Experimental

Deep learning opens new horizons in personalized medicine

Georgios Z. Papadakis et al.

BIOMEDICAL REPORTS (2019)

Article Computer Science, Theory & Methods

Mining Electronic Health Records (EHRs): A Survey

Pranjul Yadav et al.

ACM COMPUTING SURVEYS (2018)

Article Biochemical Research Methods

SciApps: a cloud-based platform for reproducible bioinformatics workflows

Liya Wang et al.

BIOINFORMATICS (2018)

Article Biochemical Research Methods

Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine

Billy Chiu et al.

BMC BIOINFORMATICS (2018)

Article Cardiac & Cardiovascular Systems

Biomedical Informatics on the Cloud A Treasure Hunt for Advancing Cardiovascular Medicine

Peipei Ping et al.

CIRCULATION RESEARCH (2018)

Article Computer Science, Hardware & Architecture

Applying spark based machine learning model on streaming big data for health status prediction

Lekha R. Nair et al.

COMPUTERS & ELECTRICAL ENGINEERING (2018)

Article Health Care Sciences & Services

Precision Medicine: From Science To Value

Geoffrey S. Ginsburg et al.

HEALTH AFFAIRS (2018)

Review Computer Science, Information Systems

Concurrence of big data analytics and healthcare: A systematic review

Nishita Mehta et al.

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS (2018)

Review Computer Science, Information Systems

Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review

Cao Xiao et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2018)

Article Biotechnology & Applied Microbiology

Secure genome-wide association analysis using multiparty computation

Hyunghoon Cho et al.

NATURE BIOTECHNOLOGY (2018)

Review Biochemistry & Molecular Biology

Panomics for Precision Medicine

Charanjit Sandhu et al.

TRENDS IN MOLECULAR MEDICINE (2018)

Article Biochemistry & Molecular Biology

CellProfiler 3.0: Next-generation image processing for biology

Claire McQuin et al.

PLOS BIOLOGY (2018)

Article Biochemical Research Methods

Cloud computing applications for biomedical science: A perspective

Vivek Navale et al.

PLOS COMPUTATIONAL BIOLOGY (2018)

Article Medicine, General & Internal

From hype to reality: data science enabling personalized medicine

Holger Froehlich et al.

BMC MEDICINE (2018)

Article Medicine, General & Internal

Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data

Milena A. Gianfrancesco et al.

JAMA INTERNAL MEDICINE (2018)

Editorial Material Cardiac & Cardiovascular Systems

Promise and Perils of Big Data and Artificial Intelligence in Clinical Medicine and Biomedical Research

Fatima Rodriguez et al.

CIRCULATION RESEARCH (2018)

Article Multidisciplinary Sciences

The UK Biobank resource with deep phenotyping and genomic data

Clare Bycroft et al.

NATURE (2018)

Article Genetics & Heredity

An atlas of genetic associations in UK Biobank

Oriol Canela-Xandri et al.

NATURE GENETICS (2018)

Article Multidisciplinary Sciences

Realizing private and practical pharmacological collaboration

Brian Hie et al.

SCIENCE (2018)

Editorial Material Medicine, General & Internal

Machine learning in medicine: Addressing ethical challenges

Effy Vayena et al.

PLOS MEDICINE (2018)

Article Cell Biology

Automated muscle histopathology analysis using CellProfiler

Yeh Siang Lau et al.

SKELETAL MUSCLE (2018)

Article Public, Environmental & Occupational Health

Epidemiological Data Challenges: Planning for a More Robust Future Through Data Standards

Geoffrey Fairchild et al.

FRONTIERS IN PUBLIC HEALTH (2018)

Article Health Care Sciences & Services

Scalable and accurate deep learning with electronic health records

Alvin Rajkomar et al.

NPJ DIGITAL MEDICINE (2018)

Article Medical Informatics

Depression detection from social network data using machine learning techniques

Md. Rafiqul Islam et al.

HEALTH INFORMATION SCIENCE AND SYSTEMS (2018)

Proceedings Paper Computer Science, Information Systems

IoT-Based System Health Management Infrastructure as a Service

Gokul Sidarth Thirunavukkarasu et al.

PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2018) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

AI based HealthCare Platform for Real Time, Predictive and Prescriptive Analytics using Reactive Programming

Jagreet Kaur et al.

10TH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (2018)

Review Public, Environmental & Occupational Health

Big Data's Role in Precision Public Health

Shawn Dolley

FRONTIERS IN PUBLIC HEALTH (2018)

Article Medicine, Research & Experimental

Evidence-based medicine and precision medicine: complementary approaches to clinical decision-making

Ngai Chow et al.

PRECISION CLINICAL MEDICINE (2018)

Article Biotechnology & Applied Microbiology

The Human Cell Atlas: Technical approaches and challenges

Chung-Chau Hon et al.

BRIEFINGS IN FUNCTIONAL GENOMICS (2018)

Review Biochemistry & Molecular Biology

Human genomics projects and precision medicine

F. Carrasco-Ramiro et al.

GENE THERAPY (2017)

Article Health Care Sciences & Services

Hijacked evidence-based medicine: stay the course and throw the pirates overboard

John P. A. Ioannidis

JOURNAL OF CLINICAL EPIDEMIOLOGY (2017)

Review Medicine, General & Internal

Progress in evidence-based medicine: a quarter century on

Benjamin Djulbegovic et al.

LANCET (2017)

Article Medicine, General & Internal

Pharmacogenomics: Precision Medicine and Drug Response

Richard M. Weinshilboum et al.

MAYO CLINIC PROCEEDINGS (2017)

Editorial Material Genetics & Heredity

Enhancing GTEx by bridging the gaps between genotype, gene expression, and disease

Barbara E. Stranger et al.

NATURE GENETICS (2017)

Article Mathematical & Computational Biology

BELMiner: adapting a rule-based relation extraction system to extract biological expression language statements from bio-medical literature evidence sentences

K. E. Ravikumar et al.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2017)

Editorial Material Biochemical Research Methods

Ten simple rules for responsible big data research

Matthew Zook et al.

PLOS COMPUTATIONAL BIOLOGY (2017)

Article Medical Informatics

Designing An Individualized EHR Learning Plan For Providers

Yumi DiAngi et al.

Applied Clinical Informatics (2017)

Article Biology

The Human Cell Atlas

Aviv Regev et al.

ELIFE (2017)

Article Computer Science, Information Systems

Intrainstitutional EHR collections for patient-level information retrieval

Stephen Wu et al.

JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY (2017)

Review Mathematical & Computational Biology

Semantic annotation in biomedicine: the current landscape

Jelena Jovanovic et al.

JOURNAL OF BIOMEDICAL SEMANTICS (2017)

Article Multidisciplinary Sciences

rEHR: An R package for manipulating and analysing Electronic Health Record data

David A. Springate et al.

PLOS ONE (2017)

Article Social Sciences, Interdisciplinary

Three lessons from evidence-based medicine and policy: increase transparency, balance inputs and understand power

Kathryn Oliver et al.

PALGRAVE COMMUNICATIONS (2017)

Article Medical Informatics

Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach

Wei-Hung Weng et al.

BMC MEDICAL INFORMATICS AND DECISION MAKING (2017)

Article Public, Environmental & Occupational Health

Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population

Anna Fry et al.

AMERICAN JOURNAL OF EPIDEMIOLOGY (2017)

Review Medicine, Research & Experimental

Personalized medicine could transform healthcare

Sunil Mathur et al.

BIOMEDICAL REPORTS (2017)

Article Cell Biology

A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine

Izumi V. Hinkson et al.

FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY (2017)

Article Health Care Sciences & Services

ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository

Martin Dugas et al.

BMC MEDICAL RESEARCH METHODOLOGY (2016)

Review Biochemistry & Molecular Biology

Use of cloud computing in biomedicine

Vladimir Sobeslav et al.

JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS (2016)

Article Neurosciences

Multimodal population brain imaging in the UK Biobank prospective epidemiological study

Karla L. Miller et al.

NATURE NEUROSCIENCE (2016)

Review Genetics & Heredity

From big data analysis to personalized medicine for all: challenges and opportunities

Akram Alyass et al.

BMC MEDICAL GENOMICS (2015)

Editorial Material Medicine, General & Internal

Integrating big data and actionable health coaching to optimize wellness

Leroy Hood et al.

BMC MEDICINE (2015)

Editorial Material Medicine, General & Internal

A New Initiative on Precision Medicine

Francis S. Collins et al.

NEW ENGLAND JOURNAL OF MEDICINE (2015)

Article

Review The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge

Katarzyna Tomczak et al.

Wspolczesna Onkologia-Contemporary Oncology (2015)

Article Biochemical Research Methods

Ten Simple Rules for Creating a Good Data Management Plan

William K. Michener

PLOS COMPUTATIONAL BIOLOGY (2015)

Editorial Material Oncology

Building Data Infrastructure to Evaluate and Improve Quality: PCORnet

Douglas A. Corley et al.

JOURNAL OF ONCOLOGY PRACTICE (2015)

Article Computer Science, Software Engineering

CLOUD COMPUTING IN HEALTHCARE AND BIOMEDICINE

Barbara Calabrese et al.

SCALABLE COMPUTING-PRACTICE AND EXPERIENCE (2015)

Article Biochemical Research Methods

Bayesian network prior: network analysis of biological data using external knowledge

Senol Isci et al.

BIOINFORMATICS (2014)

Article Biochemical Research Methods

A multivariate approach to the integration of multi-omics datasets

Chen Meng et al.

BMC BIOINFORMATICS (2014)

Article Mathematical & Computational Biology

STATegra EMS: an Experiment Management System for complex next-generation omics experiments

Rafael Hernandez de Diego et al.

BMC SYSTEMS BIOLOGY (2014)

Article Mathematical & Computational Biology

Use of prior knowledge for the analysis of high-throughput transcriptomics and metabolomics data

Polina Reshetova et al.

BMC SYSTEMS BIOLOGY (2014)

Article Mathematical & Computational Biology

The common ground of genomics and systems biology

Ana Conesa et al.

BMC SYSTEMS BIOLOGY (2014)

Editorial Material Mathematical & Computational Biology

Data integration in the era of omics: current and future challenges

David Gomez-Cabrero et al.

BMC SYSTEMS BIOLOGY (2014)

Article Pharmacology & Pharmacy

P4 Medicine Needs P4 Education

Alfredo Cesario et al.

CURRENT PHARMACEUTICAL DESIGN (2014)

Article Medicine, General & Internal

Clinical Interpretation and Implications of Whole-Genome Sequencing

Frederick E. Dewey et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2014)

Article Computer Science, Interdisciplinary Applications

Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses

Bo Liu et al.

JOURNAL OF BIOMEDICAL INFORMATICS (2014)

Article Computer Science, Interdisciplinary Applications

Robust sparse regression and tuning parameter selection via the efficient bootstrap information criteria

Heewon Park et al.

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION (2014)

Article Computer Science, Information Systems

Bionimbus: a cloud for managing, analyzing and sharing large genomics datasets

Allison P. Heath et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2014)

Article Computer Science, Information Systems

Launching PCORnet, a national patient-centered clinical research network

Rachael L. Fleurence et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2014)

Editorial Material Medicine, General & Internal

Learning from Big Health Care Data

Sebastian Schneeweiss

NEW ENGLAND JOURNAL OF MEDICINE (2014)

Review Biotechnology & Applied Microbiology

Ready to Put Metadata on the Post-2015 Development Agenda? Linking Data Publications to Responsible Innovation and Science Diplomacy

Vural Ozdemir et al.

OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY (2014)

Letter Biotechnology & Applied Microbiology

Metadata Checklist for the Integrated Personal OMICS Study: Proteomics and Metabolomics Experiments

Michael Snyder et al.

OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY (2014)

Editorial Material Biotechnology & Applied Microbiology

Toward More Transparent and Reproducible Omics Studies Through a Common Metadata Checklist and Data Publications

Eugene Kolker et al.

OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY (2014)

Review Mathematical & Computational Biology

Cosinor-based rhythmometry

Germaine Cornelissen

THEORETICAL BIOLOGY AND MEDICAL MODELLING (2014)

Article Biochemistry & Molecular Biology

The Transformative Nature of Transparency in Research Funding

Daniel Mietchen

PLOS BIOLOGY (2014)

Editorial Material Biochemistry & Molecular Biology

Best Practices for Scientific Computing

Greg Wilson et al.

PLOS BIOLOGY (2014)

Editorial Material Biochemical Research Methods

Ten Simple Rules for the Care and Feeding of Scientific Data

Alyssa Goodman et al.

PLOS COMPUTATIONAL BIOLOGY (2014)

Article Biochemical Research Methods

MODMatcher: Multi-Omics Data Matcher for Integrative Genomic Analysis

Seungyeul Yoo et al.

PLOS COMPUTATIONAL BIOLOGY (2014)

Article Biology

CGtag: complete genomics toolkit and annotation in a cloud-based Galaxy

Saskia Hiltemann et al.

GIGASCIENCE (2014)

Review Multidisciplinary Sciences

Challenges of Big Data analysis

Jianqing Fan et al.

NATIONAL SCIENCE REVIEW (2014)

Article Biochemical Research Methods

cudaMap: a GPU accelerated program for gene expression connectivity mapping

Darragh G. McArt et al.

BMC BIOINFORMATICS (2013)

Editorial Material Medicine, General & Internal

The Inevitable Application of Big Data to Health Care

Travis B. Murdoch et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2013)

Review Computer Science, Interdisciplinary Applications

'Big data', Hadoop and cloud computing in genomics

Aisling O'Driscoll et al.

JOURNAL OF BIOMEDICAL INFORMATICS (2013)

Article Health Care Sciences & Services

Bringing Big Data to Personalized Healthcare: A Patient-Centered Framework

Nitesh V. Chawla et al.

JOURNAL OF GENERAL INTERNAL MEDICINE (2013)

Editorial Material Multidisciplinary Sciences

THE BIG CHALLENGES OF BIG DATA

Vivien Marx

NATURE (2013)

Editorial Material Genetics & Heredity

The Cancer Genome Atlas Pan-Cancer analysis project

John N. Weinstein et al.

NATURE GENETICS (2013)

Editorial Material Genetics & Heredity

The Genotype-Tissue Expression (GTEx) project

John Lonsdale et al.

NATURE GENETICS (2013)

Review Genetics & Heredity

Computational solutions for omics data

Bonnie Berger et al.

NATURE REVIEWS GENETICS (2013)

Article Biochemistry & Molecular Biology

ATHLATES: accurate typing of human leukocyte antigen through exome sequencing

Chang Liu et al.

NUCLEIC ACIDS RESEARCH (2013)

Article Biochemistry & Molecular Biology

A modular framework for gene set analysis integrating multilevel omics data

Steffen Sass et al.

NUCLEIC ACIDS RESEARCH (2013)

Article Biochemistry & Molecular Biology

The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud

Katherine Wolstencroft et al.

NUCLEIC ACIDS RESEARCH (2013)

Article Multidisciplinary Sciences

Exploring TCGA Pan-Cancer Data at the UCSC Cancer Genomics Browser

Melissa S. Cline et al.

SCIENTIFIC REPORTS (2013)

Article Computer Science, Interdisciplinary Applications

METADATA CHECKLIST FOR THE INTEGRATED PERSONAL OMICS STUDY: Proteomics and Metabolomics Experiments

Michael Snyder et al.

BIG DATA (2013)

Article Mathematical & Computational Biology

Joint generalized models for multidimensional outcomes: A case study of neuroscience data from multimodalities

Xiao-Feng Wang

BIOMETRICAL JOURNAL (2012)

Article Biochemical Research Methods

Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community

Konstantinos Krampis et al.

BMC BIOINFORMATICS (2012)

Article Biochemistry & Molecular Biology

Personal Omics Profiling Reveals Dynamic Molecular and Medical Phenotypes

Rui Chen et al.

Article Statistics & Probability

Variance Estimation Using Refitted Cross-Validation in Ultrahigh Dimensional Regression

Jianqing Fan et al.

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY (2012)

Editorial Material Biochemistry & Molecular Biology

The changing privacy landscape in the era of big data

Eric E. Schadt

MOLECULAR SYSTEMS BIOLOGY (2012)

Article Multidisciplinary Sciences

An integrated map of genetic variation from 1,092 human genomes

David M. Altshuler et al.

NATURE (2012)

Article Genetics & Heredity

Bayesian method to predict individual SNP genotypes from gene expression data

Eric E. Schadt et al.

NATURE GENETICS (2012)

Article Biochemical Research Methods

The 1000 Genomes Project: data management and community access

Laura Clarke et al.

NATURE METHODS (2012)

Review Genetics & Heredity

Mining electronic health records: towards better research applications and clinical care

Peter B. Jensen et al.

NATURE REVIEWS GENETICS (2012)

Editorial Material Medicine, General & Internal

Preparing for Precision Medicine

Reza Mirnezami et al.

NEW ENGLAND JOURNAL OF MEDICINE (2012)

Article Multidisciplinary Sciences

Comparing Statistical Methods for Constructing Large Scale Gene Networks

Jeffrey D. Allen et al.

PLOS ONE (2012)

Article Health Policy & Services

UK Biobank: Current status and what it means for epidemiology

Naomi Allen et al.

HEALTH POLICY AND TECHNOLOGY (2012)

Article Statistics & Probability

Variance estimation using refitted cross-validation in ultrahigh dimensional regression

Jianqing Fan et al.

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY (2012)

Editorial Material Chemistry, Multidisciplinary

Genome-wide Correlation between mRNA and Protein in a Single Cell

Edward S. Yeung

ANGEWANDTE CHEMIE-INTERNATIONAL EDITION (2011)

Article Biochemical Research Methods

GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies

Ling Sing Yung et al.

BIOINFORMATICS (2011)

Article Biotechnology & Applied Microbiology

Full-length transcriptome assembly from RNA-Seq data without a reference genome

Manfred G. Grabherr et al.

NATURE BIOTECHNOLOGY (2011)

Letter Genetics & Heredity

Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology

Eric E. Schadt et al.

NATURE REVIEWS GENETICS (2011)

Editorial Material Multidisciplinary Sciences

Challenges and Opportunities in Mining Neuroscience Data

Huda Akil et al.

SCIENCE (2011)

Article Computer Science, Information Systems

The Meaningful Use of Big Data: Four Perspectives - Four Challenges

Christian Bizer et al.

SIGMOD RECORD (2011)

Article Genetics & Heredity

Phased Whole-Genome Genetic Risk in a Family Quartet Using a Major Allele Reference Sequence

Frederick E. Dewey et al.

PLOS GENETICS (2011)

Article Statistics & Probability

NEARLY UNBIASED VARIABLE SELECTION UNDER MINIMAX CONCAVE PENALTY

Cun-Hui Zhang

ANNALS OF STATISTICS (2010)

Article Biochemical Research Methods

An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics

Ronald C. Taylor

BMC BIOINFORMATICS (2010)

Article Computer Science, Hardware & Architecture

A View of Cloud Computing

Michael Armbrust et al.

COMMUNICATIONS OF THE ACM (2010)

Article Biochemistry & Molecular Biology

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

Aaron McKenna et al.

GENOME RESEARCH (2010)

Article Biochemistry & Molecular Biology

A window into third-generation sequencing

Eric E. Schadt et al.

HUMAN MOLECULAR GENETICS (2010)

Article Computer Science, Information Systems

Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2)

Shawn N. Murphy et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2010)

Editorial Material Medicine, General & Internal

Challenges in the clinical application of whole-genome sequencing

Kelly E. Ormond et al.

LANCET (2010)

Article Medicine, General & Internal

Clinical assessment incorporating a personal genome

Euan A. Ashley et al.

LANCET (2010)

Editorial Material Medicine, General & Internal

The Path to Personalized Medicine

Margaret A. Hamburg et al.

NEW ENGLAND JOURNAL OF MEDICINE (2010)

Editorial Material Pharmacology & Pharmacy

Individual genomes and personalized medicine: life diversity and complexity

Christos Katsios et al.

PERSONALIZED MEDICINE (2010)

Editorial Material Genetics & Heredity

The $1,000 genome, the $100,000 analysis?

Elaine R. Mardis

GENOME MEDICINE (2010)

Article Genetics & Heredity

The 1000 Genomes Project: new opportunities for research and social challenges

Marc Via et al.

GENOME MEDICINE (2010)

Article Biochemistry & Molecular Biology

A platform to standardize, store, and visualize proteomics experimental data

Guangyong Zheng et al.

ACTA BIOCHIMICA ET BIOPHYSICA SINICA (2009)

Article Statistics & Probability

Using Generalized Correlation to Effect Variable Selection in Very High Dimensional Problems

Peter Hall et al.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2009)

Article Environmental Sciences

Minimum Information About a Microarray Experiment (MIAME) - Successes, Failures, Challenges

Alvis Brazma

THESCIENTIFICWORLDJOURNAL (2009)

Article Statistics & Probability

HIGH-DIMENSIONAL CLASSIFICATION USING FEATURES ANNEALED INDEPENDENCE RULES

Jianqing Fan et al.

ANNALS OF STATISTICS (2008)

Article Statistics & Probability

The sparsity and bias of the lasso selection in high-dimensional linear regression

Cun-Hui Zhang et al.

ANNALS OF STATISTICS (2008)

Article Biochemical Research Methods

Integrating biological data - the Distributed Annotation System

Andrew M. Jenkinson et al.

BMC BIOINFORMATICS (2008)

Article Biochemical Research Methods

CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment

Svetlin A. Manavski et al.

BMC BIOINFORMATICS (2008)

Article Statistics & Probability

Theoretical measures of relative performance of classifiers for high dimensional data with small sample sizes

Peter Hall et al.

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY (2008)

Article Biotechnology & Applied Microbiology

1000 Genomes project

Nayanah Siva

NATURE BIOTECHNOLOGY (2008)

Article Biochemical Research Methods

High-throughput sequence alignment using Graphics Processing Units

Michael C. Schatz et al.

BMC BIOINFORMATICS (2007)

Article Statistics & Probability

The Dantzig selector:: Statistical estimation when p is much larger than n

Emmanuel Candes et al.

ANNALS OF STATISTICS (2007)

Article Genetics & Heredity

PLINK: A tool set for whole-genome association and population-based linkage analyses

Shaun Purcell et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2007)

Article Biochemical Research Methods

A multivariate analysis approach to the integration of proteomic and gene expression data

Ailis Fagan et al.

PROTEOMICS (2007)

Article Biochemical Research Methods

Identifying bacterial genes and endosymbiont DNA with Glimmer

Arthur L. Delcher et al.

BIOINFORMATICS (2007)

Article Biochemical Research Methods

Inferring transcriptional networks by mining 'Omics' data

Tim Van den Bulcke et al.

CURRENT BIOINFORMATICS (2006)

Letter Genetics & Heredity

GenePattern 2.0

M Reich et al.

NATURE GENETICS (2006)

Article Biochemistry & Molecular Biology

The International HapMap Project Web site

GA Thorisson et al.

GENOME RESEARCH (2005)

Article Biochemistry & Molecular Biology

Galaxy: A platform for interactive large-scale genome analysis

B Giardine et al.

GENOME RESEARCH (2005)

Article Biochemical Research Methods

FatiGO:: a web tool for finding significant associations of Gene Ontology terms with groups of genes

F Al-Shahrour et al.

BIOINFORMATICS (2004)

Article Biochemistry & Molecular Biology

The Gene Ontology (GO) database and informatics resource

MA Harris et al.

NUCLEIC ACIDS RESEARCH (2004)

Article Biotechnology & Applied Microbiology

Bioconductor: open software development for computational biology and bioinformatics

RC Gentleman et al.

GENOME BIOLOGY (2004)

Article Multidisciplinary Sciences

The International HapMap Project

RA Gibbs et al.

NATURE (2003)

Article Multidisciplinary Sciences

Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms

O Alter et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2003)

Article Biochemistry & Molecular Biology

Cytoscape: A software environment for integrated models of biomolecular interaction networks

P Shannon et al.

GENOME RESEARCH (2003)

Article Biochemistry & Molecular Biology

The bioperl toolkit:: Perl modules for the life sciences

JE Stajich et al.

GENOME RESEARCH (2002)

Article Statistics & Probability

Variable selection via nonconcave penalized likelihood and its oracle properties

JQ Fan et al.

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2001)