4.6 Review

Machine Learning-Assisted Approaches in Modernized Plant Breeding Programs

Related references

Note: Only part of the references are listed.
Review Agronomy

A Critical Review of Climate Change Impact at a Global Scale on Cereal Crop Production

Ahsan Farooq et al.

Summary: Climate change poses a threat to food security, particularly for crops like wheat, maize, and rice. The impact of climate change on these crops varies by region, with colder areas potentially experiencing increased yields while equatorial countries may see decreased production. Water scarcity, amplified by climate change, is likely to reduce rice yields globally. Multiple climate models and bias correction techniques should be used for more accurate predictions. Adaptation measures, such as adjusting planting calendars and improving crop varieties, are recommended to mitigate the adverse effects of climate change.

AGRONOMY-BASEL (2023)

Article Agriculture, Multidisciplinary

UAV-based multi-sensor data fusion and machine learning algorithm for yield prediction in wheat

Shuaipeng Fei et al.

Summary: The use of machine learning and unmanned aerial vehicle-based multi-sensor data fusion can improve the accuracy of wheat yield prediction. Ensemble learning further enhances the prediction accuracy.

PRECISION AGRICULTURE (2023)

Review Plant Sciences

Prediction of thermal degradation of biopolymers in biomass under pyrolysis atmosphere by means of machine learning

Antoine L. Harfouche et al.

Summary: Artificial intelligence (AI) plays a crucial role in global agricultural research, particularly in the field of plant science. AI can effectively analyze large datasets in digital phenomics and discover intricate patterns. This article presents a perspective and primer on the application of AI in phenome research, introducing a novel human-centric explainable AI (X-AI) system architecture and clarifying the difference between post hoc and interpretable by design models. It also provides guidance for utilizing interpretable by design models in phenomic analysis and directs readers to relevant tools and resources for accessible data analytics. An accompanying interactive online tutorial is available.

TRENDS IN PLANT SCIENCE (2023)

Article Computer Science, Theory & Methods

Recent Advances in Bayesian Optimization

Xilu Wang et al.

Summary: This article provides a comprehensive survey of recent advances in Bayesian optimization based on Gaussian processes. It categorizes the existing work into nine main groups and discusses the open questions and promising future research directions in the field.

ACM COMPUTING SURVEYS (2023)

Article Computer Science, Artificial Intelligence

Evolutionary Machine Learning With Minions: A Case Study in Feature Selection

Nick Zhang et al.

Summary: This article introduces a novel algorithm-centric solution using evolutionary multitasking to speed up decision-making in the machine learning pipeline. By creating small data proxies and combining them with the main task, the efficiency of evolutionary search can be improved, accelerating the decision-making process. Experiments show that multitasking can significantly speed up the baseline evolutionary algorithms.

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION (2022)

Review Biotechnology & Applied Microbiology

Machine learning: its challenges and opportunities in plant system biology

Mohsen Hesami et al.

Summary: As sequencing technologies continue to advance, generating massive amounts of multidimensional data in plants, the integration of different omics datasets becomes crucial in order to gain comprehensive insights into plant biological systems. Machine learning offers promising approaches to integrate large datasets and recognize patterns, but optimization is needed to process multi-omics data.

APPLIED MICROBIOLOGY AND BIOTECHNOLOGY (2022)

Article Biochemistry & Molecular Biology

Machine-Learning-Based Genome-Wide Association Studies for Uncovering QTL Underlying Soybean Yield and Its Components

Mohsen Yoosefzadeh-Najafabadi et al.

Summary: A genome-wide association study (GWAS) is a recommended method for discovering marker-trait associations (MTAs) in plant species. This study evaluated the potential use of two machine learning algorithms (SVR and RF) in GWAS and compared them with two conventional methods (MLM and FarmCPU) for identifying MTAs for soybean-yield components. The results demonstrated the potential benefit of using SVR in GWAS for identifying MTAs with potential causal effects on target traits.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2022)

Article Forestry

Machine Learning-Assisted In Vitro Rooting Optimization in Passiflora caerulea

Marziyeh Jafari et al.

Summary: This study successfully predicted the in vitro rooting responses of Passiflora caerulea using a hybrid of generalized regression neural network (GRNN) and genetic algorithm (GA). The results demonstrated that the GRNN-GA model is reliable and accurate.

FORESTS (2022)

Article Multidisciplinary Sciences

Optimizing genomic selection in soybean: An important improvement in agricultural genomics

Mohsen Yoosefzadeh-Najafabadi et al.

Summary: This study explores the potential use of haplotype-based genomic selection (GS) to improve yield prediction accuracy in soybean and identifies a promising haplotype block on chromosome 19 with significant effects on yield.

HELIYON (2022)

Article Management

A conceptual data model promoting data-driven circular manufacturing

Federica Acerbi et al.

Summary: The circular economy paradigm promotes manufacturing companies' sustainability through circular manufacturing strategies. However, the management and sharing of data and information remain significant barriers hindering decision-making processes in circular manufacturing. This study proposes a reference model to standardise and structure the necessary data in circular manufacturing to support manufacturers' decision-making processes.

OPERATIONS MANAGEMENT RESEARCH (2022)

Review Biochemistry & Molecular Biology

Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction

Yunbi Xu et al.

Summary: The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and molecular marker genotypes. However, plant performance is determined by the combined effects of genotype, envirotype, and genotype by environment interaction. Integration of multidimensional information profiles, including spatiotemporal omics, provides predictive breeding with both tremendous opportunities and challenges.

MOLECULAR PLANT (2022)

Article Oncology

Multi-omics Data Integration Model Based on UMAP Embedding and Convolutional Neural Network

Bashier ElKarami et al.

Summary: This paper introduces a multi-omics data integration method based on UMAP and CNN. UMAP is used to embed gene expression, DNA methylation, and copy number alteration into two-dimensional RGB images, and GSN is constructed to integrate other omics data for better prediction. The proposed method achieves high accuracy for predicting Gleason score levels in prostate cancer and tumor stage in breast cancer.

CANCER INFORMATICS (2022)

Review Biochemistry & Molecular Biology

CRISPR-Mediated Engineering across the Central Dogma in Plant Biology for Basic Research and Crop Improvement

Dibyajyoti Pramanik et al.

Summary: The central dogma of molecular biology involves transferring genetic information from DNA to RNA to proteins, which is crucial for gene regulation in plants. Precisely manipulating these processes can accelerate crop improvement to meet the growing global population's demands.

MOLECULAR PLANT (2021)

Review Biochemistry & Molecular Biology

Using Interactome Big Data to Crack Genetic Mysteries and Enhance Future Crop Breeding

Leiming Wu et al.

Summary: This article discusses the significance and methods of dissecting genetic mysteries using interactome big data, explores the potential of combining machine learning, and proposes future breeding strategies for improving crop yields and quality.

MOLECULAR PLANT (2021)

Article Plant Sciences

Deep Learning for Predicting Complex Traits in Spring Wheat Breeding Program

Karansher S. Sandhu et al.

Summary: By comparing different DL algorithms with the traditional GS model, the study found that DL models provided higher prediction accuracy for each trait in both cross-validation and independent validation, especially for grain yield and grain protein content. DL models improved prediction accuracy by optimizing hyperparameters and avoiding overfitting, and should be incorporated into a plant breeder's toolkit.

FRONTIERS IN PLANT SCIENCE (2021)

Article Plant Sciences

Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield From Hyperspectral Reflectance in Soybean

Mohsen Yoosefzadeh-Najafabadi et al.

Summary: Recent advances in high-throughput field phenotyping have provided plant breeders with efficient tools for evaluating a large number of genotypes for important agronomic traits. The study evaluated the robustness of machine learning algorithms in predicting soybean seed yield using hyperspectral reflectance, and found that the random forest algorithm had the highest performance among others.

FRONTIERS IN PLANT SCIENCE (2021)

Article Plant Sciences

Comparative Analysis of Machine Learning and Evolutionary Optimization Algorithms for Precision Micropropagation of Cannabis sativa: Prediction and Validation of in vitro Shoot Growth and Development Based on the Optimization of Light and Carbohydrate Sources

Marco Pepe et al.

Summary: Micropropagation techniques provide the basis for many biotechnological applications, with the need for optimization in in vitro practices for Cannabis sativa L. Research has shown the potential of using machine learning and optimization algorithms to accurately predict optimal conditions for specific developmental responses, showcasing the power of these approaches in complex plant tissue culture interactions.

FRONTIERS IN PLANT SCIENCE (2021)

Article Plant Sciences

Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods

Mohsen Yoosefzadeh-Najafabadi et al.

Summary: The study proposed a method called HypWAS for phenome-phenome association analysis through hierarchical data integration strategy to estimate the predictive power of hyperspectral reflectance bands in predicting soybean seed yield. The results indicated the advantages of using hierarchical data integration strategy and advanced mathematical methods for a better understanding of the biology and genetic backgrounds of hyperspectral reflectance bands affecting soybean yield formation.

FRONTIERS IN PLANT SCIENCE (2021)

Article Agriculture, Multidisciplinary

Sequential forward selection and support vector regression in comparison to LASSO regression for spring wheat yield prediction based on UAV imagery

Sahameh Shafiee et al.

Summary: Traditional plant breeding for grain yield selection is time-consuming and costly, leading to a demand for innovative methods like machine learning to reduce costs and accelerate genetic gains. Remote sensing-based platforms such as UAVs show promise in predicting traits like grain yield. SVR combined with SFS and LASSO regressor were tested in predicting wheat grain yield, with NDVI showing the highest prediction ability and model performance improved by adding MTCI and EVI at earlier stages of grain filling. Both regression methods showed good capability for grain yield prediction, but LASSO regressor was more affordable and time-effective.

COMPUTERS AND ELECTRONICS IN AGRICULTURE (2021)

Article Agriculture, Multidisciplinary

Evaluation of stacking and blending ensemble learning methods for estimating daily reference evapotranspiration

Tianao Wu et al.

Summary: The study evaluated the use of stacking and blending ensemble models for daily ETo estimation, finding that they outperformed basic and empirical models regardless of input combinations, and showed better portability across different climate zones.

COMPUTERS AND ELECTRONICS IN AGRICULTURE (2021)

Review Computer Science, Information Systems

CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope

Dulari Bhatt et al.

Summary: Computer vision is increasingly popular in image processing, with deep CNN benefiting various fields such as video processing, object recognition, and natural language processing. Researchers have explored innovative concepts and architectural advancements to enhance CNN performance and capacity, focusing on leveraging channel and spatial information for information processing.

ELECTRONICS (2021)

Article Plant Sciences

Machine Learning-Mediated Development and Optimization of Disinfection Protocol and Scarification Method for Improved In Vitro Germination of Cannabis Seeds

Marco Pepe et al.

Summary: In vitro seed germination of cannabis faces challenges in uniformity and germination time due to the disinfection procedure. Artificial intelligence models, like the generalized regression neural network (GRNN), were used to optimize different disinfectants and immersion times for contamination reduction. The optimized protocol resulted in 0% contamination and 100% seed germination when combined with seed scarification.

PLANTS-BASEL (2021)

Article Biochemistry & Molecular Biology

Synergizing Off-Target Predictions for In Silico Insights of CENH3 Knockout in Cannabis through CRISPR/Cas

Mohsen Hesami et al.

Summary: The study evaluates the predictive ability of machine learning algorithms and ensemble-bagging strategy to predict sgRNA off-target activity in cannabis, with the random forest (RF) algorithm showing the best performance. Using RF algorithm as a meta-classifier for E-B method enhances prediction accuracy and F-measure. The E-B algorithm demonstrates the highest AUC-PRC and AUC-ROC values, indicating its success as a common ensemble strategy.

MOLECULES (2021)

Article Multidisciplinary Sciences

Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits

Mohsen Yoosefzadeh-Najafabadi et al.

Summary: Improving genetic yield potential in major food grade crops such as soybean is crucial for addressing global food demand. By studying important yield component traits and applying machine learning algorithms, the study found that the Radial Basis Function algorithm was the most accurate and used it in conjunction with an ensemble method to improve prediction accuracy. By incorporating a genetic algorithm, the study also modeled optimal values of yield components for maximizing yield potential in soybean genotypes.

PLOS ONE (2021)

Article Environmental Sciences

Using Hybrid Artificial Intelligence and Evolutionary Optimization Algorithms for Estimating Soybean Yield and Fresh Biomass Using Hyperspectral Vegetation Indices

Mohsen Yoosefzadeh-Najafabadi et al.

Summary: Recent advanced high-throughput field phenotyping combined with sophisticated big data analysis methods have provided plant breeders with unprecedented tools for better prediction of important agronomic traits, such as yield and fresh biomass, at early growth stages. This study demonstrated the potential use of 35 selected hyperspectral vegetation indices in predicting soybean seed yield and FBIO, achieving coefficients of determination of 0.76 and 0.77 for yield and 0.91 and 0.89 for FBIO using deep neural network and ensemble-bagging algorithms, respectively. Additionally, it was found that HVI associated with red, 670 nm, and near-infrared 800 nm regions were the most informative in predicting yield and FBIO.

REMOTE SENSING (2021)

Review Plant Sciences

Robotic Technologies for High-Throughput Plant Phenotyping: Contemporary Reviews and Future Perspectives

Abbas Atefi et al.

Summary: Phenotyping plants using robots has revolutionized the traditional manual measurement methods, providing a more efficient way to monitor changes in plant traits. However, the operation of these robots still faces challenges due to the dynamic nature of plants and agricultural environments.

FRONTIERS IN PLANT SCIENCE (2021)

Proceedings Paper Computer Science, Artificial Intelligence

An ensemble learning approach for predicting phenotypes from genotypes

Tingxi Yu et al.

Summary: Genomic Selection (GS) is a new breeding strategy that estimates breeding values and selects them through high-density markers, with machine learning algorithms improving genomic predictions. An ensemble learning-based Genomic Prediction model (ELGP) outperformed other base learners, showing great potential to enhance prediction abilities in other animals and plants. Selecting appropriate k-fold cross-validation methods is recommended to further improve the model's prediction ability.

20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS) (2021)

Article Biotechnology & Applied Microbiology

Analysis of macro nutrient related growth responses using multivariate adaptive regression splines

Meleksen Akin et al.

PLANT CELL TISSUE AND ORGAN CULTURE (2020)

Article Biochemical Research Methods

Computer vision and machine learning enabled soybean root phenotyping pipeline

Kevin G. Falk et al.

PLANT METHODS (2020)

Article Multidisciplinary Sciences

ZEAMAP, a Comprehensive Database Adapted to the Maize Multi-Omics Era

Songtao Gui et al.

ISCIENCE (2020)

Review Biotechnology & Applied Microbiology

Application of artificial intelligence models and optimization algorithms in plant cell and tissue culture

Mohsen Hesami et al.

APPLIED MICROBIOLOGY AND BIOTECHNOLOGY (2020)

Review Agronomy

Machine Learning for Plant Breeding and Biotechnology

Mohsen Niazian et al.

AGRICULTURE-BASEL (2020)

Article Genetics & Heredity

Applications of Support Vector Machine in Genomic Prediction in Pig and Maize Populations

Wei Zhao et al.

FRONTIERS IN GENETICS (2020)

Review Biotechnology & Applied Microbiology

Accelerating Climate Resilient Plant Breeding by Applying Next-Generation Artificial Intelligence

Antoine L. Harfouche et al.

TRENDS IN BIOTECHNOLOGY (2019)

Article Multidisciplinary Sciences

Machine learning algorithm validation with a limited sample size

Andrius Vabalas et al.

PLOS ONE (2019)

Review Computer Science, Artificial Intelligence

A Review on Dimensionality Reduction Techniques

Xuan Huang et al.

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (2019)

Review Genetics & Heredity

Navigating complexity to breed disease-resistant crops

Rebecca Nelson et al.

NATURE REVIEWS GENETICS (2018)

Article Biotechnology & Applied Microbiology

Predicting minor nutrient requirements of hazelnut shoot cultures using regression trees

Meleksen Akin et al.

PLANT CELL TISSUE AND ORGAN CULTURE (2018)

Article Computer Science, Artificial Intelligence

Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python package)

Maximilian Christ et al.

NEUROCOMPUTING (2018)

Review Mathematical & Computational Biology

Deep Learning for Computer Vision: A Brief Review

Athanasios Voulodimos et al.

COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE (2018)

Article Agriculture, Multidisciplinary

Using Support Vector Machines classification to differentiate spectral signatures of potato plants infected with Potato Virus Y

L. M. Griffel et al.

COMPUTERS AND ELECTRONICS IN AGRICULTURE (2018)

Article Robotics

Artificial Intelligence for Long-Term Robot Autonomy: A Survey

Lars Kunze et al.

IEEE ROBOTICS AND AUTOMATION LETTERS (2018)

Editorial Material Biochemical Research Methods

P values and the search for significance

Naomi Altman et al.

NATURE METHODS (2017)

Article Biotechnology & Applied Microbiology

Modeling some mineral nutrient requirements for micropropagated wild apricot shoot cultures

Irina Y. Kovalchuk et al.

PLANT CELL TISSUE AND ORGAN CULTURE (2017)

Review Plant Sciences

A Concise Review on Multi-Omics Data Integration for Terroir Analysis in Vitis vinifera

Pastor Jullian Fabres et al.

FRONTIERS IN PLANT SCIENCE (2017)

Proceedings Paper Automation & Control Systems

Thorvald II - a Modular and Re-configurable Agricultural Robot

Lars Grimstad et al.

IFAC PAPERSONLINE (2017)

Article Biochemical Research Methods

Transcriptomic and metabolomic data integration

Rachel Cavill et al.

BRIEFINGS IN BIOINFORMATICS (2016)

Article Plant Sciences

PPIM: A Protein-Protein Interaction Database for Maize

Guanghui Zhu et al.

PLANT PHYSIOLOGY (2016)

Review Geography, Physical

Random forest in remote sensing: A review of applications and future directions

Mariana Belgiu et al.

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2016)

Article Plant Sciences

WheatExp: an RNA-seq expression database for polyploid wheat

Stephen Pearce et al.

BMC PLANT BIOLOGY (2015)

Article Biochemistry & Molecular Biology

Superiority of artificial neural networks for a genetic classification procedure

I. C. Sant'Anna et al.

GENETICS AND MOLECULAR RESEARCH (2015)

Review Genetics & Heredity

Machine learning applications in genetics and genomics

Maxwell W. Libbrecht et al.

NATURE REVIEWS GENETICS (2015)

Article Agronomy

Selection in sugarcane families with artificial neural networks

Bruno Portela Brasileiro et al.

CROP BREEDING AND APPLIED BIOTECHNOLOGY (2015)

Article Statistics & Probability

R-Squared Measures for Two-Level Hierarchical Linear Models UsingSAS

Anthony Recchia

Journal of Statistical Software (2015)

Article Agriculture, Multidisciplinary

Neural networks for predicting breeding values and genetic gains

Gabi Nunes Silva et al.

SCIENTIA AGRICOLA (2014)

Review Plant Sciences

Field high-throughput phenotyping: the new crop breeding frontier

Jose Luis Araus et al.

TRENDS IN PLANT SCIENCE (2014)

Article Computer Science, Interdisciplinary Applications

A model-based approach for data integration to improve maintenance management by mixed reality

Danubia Bueno Espindola et al.

COMPUTERS IN INDUSTRY (2013)

Review Neurosciences

Neural Basis of Reinforcement Learning and Decision Making

Daeyeol Lee et al.

ANNUAL REVIEW OF NEUROSCIENCE, VOL 35 (2012)

Article Biochemical Research Methods

PRIN: a predicted rice interactome network

Haibin Gu et al.

BMC BIOINFORMATICS (2011)

Article Statistics & Probability

Variable Importance Assessment in Regression: Linear Regression versus Random Forest

Ulrike Groemping

AMERICAN STATISTICIAN (2009)

Article Genetics & Heredity

Machine Learning in Genome-Wide Association Studies

Silke Szymczak et al.

GENETIC EPIDEMIOLOGY (2009)

Article Biochemistry & Molecular Biology

KEGG for linking genomes to life and the environment

Minoru Kanehisa et al.

NUCLEIC ACIDS RESEARCH (2008)

Article Biochemical Research Methods

PlnTFDB:: an integrative plant transcription factor database

Diego Mauricio Riano-Pachon et al.

BMC BIOINFORMATICS (2007)

Article Statistics & Probability

Unsupervised learning with random forest predictors

T Shi et al.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2006)

Article Computer Science, Artificial Intelligence

Using AUC and accuracy in evaluating learning algorithms

J Huang et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2005)

Article Biochemistry & Molecular Biology

JASPAR: an open-access database for eukaryotic transcription factor binding profiles

A Sandelin et al.

NUCLEIC ACIDS RESEARCH (2004)

Article Computer Science, Artificial Intelligence

Efficient leave-one-out cross-validation of kernel Fisher discriminant classifiers

GC Cawley et al.

PATTERN RECOGNITION (2003)

Article Plant Sciences

AraCyc: A biochemical pathway database for Arabidopsis

LA Mueller et al.

PLANT PHYSIOLOGY (2003)

Review Chemistry, Analytical

Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research

S Agatonovic-Kustrin et al.

JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS (2000)

Article Mathematics, Interdisciplinary Applications

Cross-validation methods

MW Browne

JOURNAL OF MATHEMATICAL PSYCHOLOGY (2000)