315 14 12MB
English Pages [382] Year 2019
APPLICATIONS OF GENETIC AND GENOMIC RESEARCH IN CEREALS
This page intentionally left blank
Woodhead Publishing Series in Food Science, Technology and Nutrition
APPLICATIONS OF GENETIC AND GENOMIC RESEARCH IN CEREALS Edited by
Thomas Miedaner State Plant Breeding Institute, University of Hohenheim, Stuttgart, Germany
Viktor Korzun Global Lead Scientific Affairs, KWS SAAT SE, Einbeck, Germany
An imprint of Elsevier
Woodhead Publishing is an imprint of Elsevier The Officers’ Mess Business Centre, Royston Road, Duxford, CB22 4QH, United Kingdom 50 Hampshire Street, 5th Floor, Cambridge, MA 02139, United States The Boulevard, Langford Lane, Kidlington, OX5 1GB, United Kingdom Copyright © 2019 Elsevier Ltd. All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions. This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein). Notices Knowledge and best practice in this field are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary. Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility. To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein. Library of Congress Cataloging-in-Publication Data A catalog record for this book is available from the Library of Congress British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library ISBN: 978-0-08-102163-7 For information on all Woodhead publications visit our website at https://www.elsevier.com/books-and-journals
Publisher: Andre Gerhard Wolff Acquisition Editor: Nancy Maragioglio Editorial Project Manager: Jaclyn A. Truesdell Production Project Manager: Joy Christel Neumarin Honest Thangiah Cover Designer: Mark Rogers Typeset by SPi Global, India
CONTENTS Contributors xi In Commemoration xv Editorial: Genetics Became to be Genomics xvii
Section 1 Techniques
1
1. High-Throughput Genotyping for Cereal Research and Breeding
3
Martin W. Ganal, Jörg Plieske, Anja Hohmeyer, Andreas Polley, Marion S. Röder
1.1 Introduction 3 1.2 SNP Markers 4 1.3 Genotyping-by-Sequencing (GBS) 6 1.4 Array-Based Genotyping 7 1.5 High-Throughput Individual SNP Marker Genotyping 13 1.6 Summary and Outlook 14 References 15
2. Progress in Sequencing of Triticeae Genomes and Future Uses
M. Timothy Rabanus-Wallace, Nils Stein
19
2.1 Introduction 19 2.2 Triticeae Genome Assemblies 24 2.3 Current and Future Directions 40 References 43
3. Next-Generation Sequencing Enabled Genetics in Hexaploid Wheat
Laura-Jayne Gardiner, Ryan Joynson, Anthony Hall
49
3.1 Forward Genetics 50 3.2 Population Genetics 56 3.3 Reverse Genetics 58 3.4 Hidden Variation 59 References 60
v
vi
Contents
4. Rapid Gene Cloning in Wheat
M. Asyraf Md. Hatta, Burkhard Steuernagel, Brande B.H. Wulff
65
4.1 Map-Based Cloning of Genes in Triticeae 65 4.2 Gene Cloning Using Whole-Genome Sequencing in Plants 66 4.3 Mutational Genomics: Cloning Genes by Sequence Comparison of Multiple Mutant Alleles 70 4.4 Genome Complexity in Triticeae 73 4.5 Whole Genome Assembly of Wheat 73 4.6 Gene Cloning Using Genome Complexity Reduction 77 4.7 The Poor Agronomy of Wild Relatives of Wheat and Barley 82 References 85
5. High-Efficiency Transformation Techniques
Yuji Ishida, Yukoh Hiei, Toshihiko Komari
97
5.1 Introduction 97 5.2 Transformation Methods in Cereals 98 5.3 Wheat 99 5.4 Barley 104 5.5 Other Small Grain Cereals 107 5.6 Future Tasks in Transformation of Small Grain Cereals 109 5.7 Conclusion 113 Acknowledgments 113 References 115
6. Site-Directed Genome Modification in Triticeae Cereals Mediated by Cas Endonucleases
Jochen Kumlehn
121
6.1 Introduction 121 6.2 Functionality of RNA-Guided Cas Endonucleases and Principles of Their Utilization 122 6.3 Use of Cas Endonucleases to Validate Gene Function and to Improve Plant Performance 124 6.4 Further Progress in Technology Development 125 6.5 Off-Target Mutations 127 6.6 Perspective 127 References 131
Contents
Section 2 Traits 7. Marker-Based Harnessing of Genetic Diversity to Improve Resistance of Barley to Fungal and Viral Diseases
Dragan Perovic, Doris Kopahnke, Antje Habekuss, Frank Ordon, Albrecht Serfling
vii
135 137
7.1 Introduction 137 7.2 Marker Development for Major Resistance Genes in Barley 143 7.3 Recent Developments in Marker Technologies and Their Application 144 7.4 Gene Isolation and Detection of Natural Variation 147 7.5 Conclusions 150 References 151
8. Gene-Based Approaches to Durable Disease Resistance in Triticeae Cereals
Patrick Schweizer
165
8.1 Background 165 8.2 Gene-Based Approaches 166 8.3 Conclusions 176 References 177
9. Global Journeys of Adaptive Wheat Genes
Miguel Sanchez-Garcia, Alison R. Bentley
183
9.1 Introduction 183 9.2 Plant Height 185 9.3 Flowering Time 189 9.4 Future Outlook 194 Acknowledgments 195 References 195 Further Reading 200
10. Durum Wheat as a Bridge Between Wild Emmer Wheat Genetic Resources and Bread Wheat
Valentina Klymiuk, Andrii Fatiukha, Lin Huang, Zhen-zhen Wei, Tamar Kis-Papo, Yehoshua Saranga, Tamar Krugman, Tzion Fahima
201
10.1 Introduction 201 10.2 Obstacles for Gene Transfer Between Tetraploid and Hexaploid Wheat 204
viii
Contents
10.3 Changes in Recombination Rates After Crosses With Wheat Wild Relatives 205 10.4 Wild Emmer Wheat Gene Pool 206 10.5 Transferring of High Grain Protein Content QTLs From Wild Emmer 208 10.6 Improvement of Resistance to Stripe Rust Using Wild Emmer Genes 214 10.7 Utilization of Wild Emmer Powdery Mildew Resistance Gene Pool 216 10.8 Introgression of FHB Quantitative Resistance QTLs Originating From Wild Emmer Wheat 218 10.9 Improvement of Drought Resistance Using QTLs From Wild Emmer 219 10.10 Alternative Ways for Transferring of Useful Genes From Durum Into Bread Wheat 220 10.11 Advantages of Using Durum for Bridging Crosses 221 10.12 Modern Genomic Tools That Can Accelerate the Concept of Bridge-Crosses 222 Acknowledgments 224 References 225
Section 3 Breeding 11. Modern Field Phenotyping Opens New Avenues for Selection
Tobias Würschum
231 233
11.1 Phenotyping Platforms for Plant Breeding 234 11.2 Sensors for Precision Phenotyping 238 11.3 The Concept of Sensor Fusion 240 11.4 Data Storage and Analysis 242 11.5 Traits Assessed by Sensors 243 11.6 A Possible Future of Precision Phenotyping in the Field 246 References 247
12. Application of Genetic and Genomic Tools in Wheat for Developing Countries
Susanne Dreisigacker, Deepmala Sehgal, Ravi P. Singh, Carolina Sansaloni, Hans-Joachim Braun
251
12.1 Introduction 251 12.2 Key Breeding Priorities for Wheat Improvement in Developing Countries 252 12.3 Application of Genetic and Genomic Tools in the CIMMYT Wheat Breeding Pipeline 255 12.4 Gene Discovery and Genomic Prediction 256 12.5 Deployment of Genomics-Assisted Breeding Strategies 259 12.6 Exploring Genetic Resources 262 12.7 Tracking Crop Varieties 265
Contents
ix
12.8 Wheat Case Study 266 12.9 Evidence From Other Crops 267 References 268
13. Genomic Selection in Wheat
Daniel W. Sweeney, Jin Sun, Ella Taagen, Mark E. Sorrells
273
13.1 Introduction 273 13.2 High-Throughput Phenotyping 277 13.3 Genotype by Environment Interaction 279 13.4 Genomic Selection for Wheat Disease Resistance 287 13.5 Genomic Selection in Wheat for Nutritional Traits 291 13.6 Genomic Selection for Wheat Quality Traits 292 13.7 Future Prospects 295 13.8 Conclusion 296 References 297 Further Reading 301
14. “SpeedGS” to Accelerate Genetic Gain in Spring Wheat
Kai P. Voss-Fels, Eva Herzog, Susanne Dreisigacker, Sivakumar Sukumaran, Amy Watson, Matthias Frisch, Ben Hayes, Lee T. Hickey
303
14.1 Introduction 303 14.2 Accelerating Genetic Gain Using Modern Breeding Tools 305 14.3 Integrating Rapid Generation Advancement and Genomic Selection 309 14.4 Speeding Up Genetic Gain Through SpeedGS—A Simulation 310 14.5 Exploiting Genetic Resources Using Speed Breeding 314 14.6 Future Opportunities and Challenges 317 14.7 Simulation Materials and Methods 319 References 323
15. Genomics-Based Hybrid Rye Breeding
Thomas Miedaner, Viktor Korzun, Eva Bauer
329
15.1 Introduction 329 15.2 Hybrid-Enabling Technologies 331 15.3 Genomic Resources in Rye 333 15.4 Marker-Based Introgression Libraries for Using PGRs 336 15.5 QTL Mapping of Quantitative Traits 337 15.6 Genome-Wide Prediction and Genomic Selection (GS) 340 15.7 Conclusion 343 References 344 Index 349
This page intentionally left blank
CONTRIBUTORS Eva Bauer Technical University of Munich, TUM School of Life Sciences Weihenstephan, Plant Breeding, Freising, Germany Alison R. Bentley The John Bingham Laboratory, NIAB, Cambridge, United Kingdom Hans-Joachim Braun Centro Internacional de Mejoramiento de Maiz y Trigo (CIMMYT), El Batan, Texcoco, Mexico Susanne Dreisigacker Centro Internacional de Mejoramiento de Maiz y Trigo (CIMMYT), El Batan; International Maize and Wheat Improvement Center, Texcoco, Mexico Tzion Fahima Institute of Evolution; Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel Andrii Fatiukha Institute of Evolution; Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel Matthias Frisch Institute of Agronomy and Plant Breeding II, Justus Liebig University, Giessen, Germany Martin W. Ganal TraitGenetics GmbH, Gatersleben, Germany Laura-Jayne Gardiner Earlham Institute, Norwich, United Kingdom Antje Habekuss Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute (JKI), Quedlinburg, Germany Anthony Hall Earlham Institute; School of Biological Sciences, University of East Anglia, Norwich, United Kingdom M. Asyraf Md. Hatta John Innes Centre, Norwich Research Park, Norwich, United Kingdom; Faculty of Agriculture, Universiti Putra Malaysia, Serdang, Malaysia Ben Hayes Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia
xi
xii
Contributors
Eva Herzog Institute of Agronomy and Plant Breeding II, Justus Liebig University, Giessen, Germany Lee T. Hickey Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia Yukoh Hiei Plant Innovation Center, Japan Tobacco Inc., Iwata, Japan Anja Hohmeyer TraitGenetics GmbH, Gatersleben, Germany Lin Huang Institute of Evolution; Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel; Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China Yuji Ishida Plant Innovation Center, Japan Tobacco Inc., Iwata, Japan Ryan Joynson Earlham Institute, Norwich, United Kingdom Tamar Kis-Papo Institute of Evolution, University of Haifa, Haifa, Israel Valentina Klymiuk Institute of Evolution; Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel Toshihiko Komari Plant Innovation Center, Japan Tobacco Inc., Iwata, Japan Doris Kopahnke Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute (JKI), Quedlinburg, Germany Viktor Korzun KWS SAAT SE, Einbeck, Germany Tamar Krugman Institute of Evolution, University of Haifa, Haifa, Israel Jochen Kumlehn Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany Thomas Miedaner University of Hohenheim, State Plant Breeding Institute, Stuttgart, Germany Frank Ordon Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute (JKI), Quedlinburg, Germany Dragan Perovic Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute (JKI), Quedlinburg, Germany
Contributors
xiii
Jörg Plieske TraitGenetics GmbH, Gatersleben, Germany Andreas Polley TraitGenetics GmbH, Gatersleben, Germany M. Timothy Rabanus-Wallace Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany Marion S. Röder Leibniz-Institute of Plant Genetics and Crop Plant Research, Gatersleben, Germany Miguel Sanchez-Garcia International Center for the Agriculture Research in the Dry Areas (ICARDA), Rabat, Morocco Carolina Sansaloni Centro Internacional de Mejoramiento de Maiz y Trigo (CIMMYT), El Batan, Texcoco, Mexico Yehoshua Saranga The Robert H. Smith Faculty of Agriculture Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel Patrick Schweizer Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Stadt Seeland, Germany Deepmala Sehgal Centro Internacional de Mejoramiento de Maiz y Trigo (CIMMYT), El Batan, Texcoco, Mexico Albrecht Serfling Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute (JKI), Quedlinburg, Germany Ravi P. Singh Centro Internacional de Mejoramiento de Maiz y Trigo (CIMMYT), El Batan, Texcoco, Mexico Mark E. Sorrells Plant Breeding and Genetics, Cornell University, Ithaca, NY, United States Nils Stein Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany Burkhard Steuernagel John Innes Centre, Norwich Research Park, Norwich, United Kingdom Sivakumar Sukumaran International Maize and Wheat Improvement Center, Texcoco, Mexico Jin Sun Plant Breeding and Genetics, Cornell University, Ithaca, NY, United States Daniel W. Sweeney Plant Breeding and Genetics, Cornell University, Ithaca, NY, United States
xiv
Contributors
Ella Taagen Plant Breeding and Genetics, Cornell University, Ithaca, NY, United States Kai P. Voss-Fels Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia Amy Watson Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia Zhen-zhen Wei Institute of Evolution; Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel Brande B.H. Wulff John Innes Centre, Norwich Research Park, Norwich, United Kingdom Tobias Würschum State Plant Breeding Institute, University of Hohenheim, Stuttgart, Germany
IN COMMEMORATION In commemoration of our colleague Patrick Schweizer, who suddenly passed away during the production process of this book and shortly after delivering his chapter “Genebased approaches to durable disease resistance in Triticeae cereals.” Patrick Schweizer (1959–2018) was the head of the research group “Pathogen Stress Genomics” and the coordinator of the research area “Genome Analysis in Breeding Research” at the IPK Gatersleben, Germany. He was internationally recognized by his excellent publications on host-pathogen interaction, including nonhost resistance, preferably for powdery mildew of barley. He developed with others an advanced automated system (HyphArea) for the quantification of the hyphal growth of Blumeria graminis and Rhynchosporium secalis in planta and did path-breaking work on host- induced gene silencing. Besides his excellent scientific reputation, Patrick was a friendly, prudent, and very helpful colleague and it was always a pleasure to work with him. Thomas Miedaner Viktor Korzun
xv
This page intentionally left blank
EDITORIAL: GENETICS BECAME TO BE GENOMICS The starting point of modern genetics was the discovery of the “laws of inheritance” by Gregor Mendel in 1865 and 1866, and their rediscovery in 1900. Afterwards, researchers studied “Mendelian inheritance” for many traits of practical interest and were successful as long as the inheritance was monogenic. Even today, we depend on the research and breeding of such traits, especially for disease resistances, and also for physiological features, like plant height (Rht), flowering time (Ppd), and others. The science of genetics was complemented for polygenically inherited traits by the advent of quantitative genetics starting with Fisher (1930). Another milestone in wheat genetics was the generation of a large number of aneuploids by Sears (1966) and >400 segmental deletion lines developed later by Endo and Gill (1996). In the mid-1980s, the first molecular marker system, restriction fragment length polymorphism (RFLP), was applied to wheat in the United Kingdom and the synteny of complementary DNA-RFLP probes among the Triticeae was the starting point of comparative genomics (Gale and Devos, 1998).To further promote mapping of molecular markers, the International Triticeae Mapping Initiative (ITMI) was founded in 1990, which sparked a new field of research in wheat genetics and especially enabled international cooperation by creating common mapping populations, exchanging mapping tools, markers, and mapping information. Röder et al. (1998) published one of the first molecular maps in bread wheat with a worldwide use, originally comprising 279 SSR loci from the ITMI RILs (W7984 × Opata85). In the same year, the first genetic map of durum wheat was published (Blanco et al., 1998). Molecular markers were also used for mapping quantitative trait loci (QTL). However, it soon became apparent that for traits like grain yield or baking quality a large number of QTL is responsible, which are highly dependent on the environment, and most of them have such a low contribution to the trait that they cannot be detected statistically. Moreover, each mapping population provided mostly different QTL for the same complex traits, illustrating that the existing genetic variation was very much underestimated at that time and proving the expectation of R.A. Fisher that quantitative traits follow an infinitesimal model (Fisher, 1918).
xvii
xviii
Editorial: Genetics Became to be Genomics
Functional genomics in wheat lagged far behind maize, rice, and barley for a long time, because of the large genome size and its high repetitiveness. Techniques like RNA interference, TILLING, and mapping of expressed QTL (eQTL) have been used to unravel the functions of individual genes (Gupta et al. 2008). The first map-based cloning in wheat was performed with the leaf rust resistance genes Lr1, Lr10, Lr21, and the vernalization (VRN) locus by several groups, in 2003 (Gupta et al., 2008). The newly emerging cloning techniques that combine next-generation sequencing (NGS) technologies with recent advances in genomics will make gene cloning in wheat more straightforward (see Chapter 4). One obstacle in the gene-cloning process is to verify the cloned candidate gene by gene transfer to another genotype without the trait of interest. This is still a problem in all Triticeae caused by the limitation of genotypes that can be transformed and the still inefficient protocols for successful transformation (see Chapter 5). Public EST (expressed sequence tags) libraries (e.g., Lazo et al., 2004) were the basis for a great number of studies on developing single-nucleotide polymorphism (SNP) markers, construction of denser maps, gene expression, genome organization, and again comparative genomics. Finally, they led to the construction of the first high-density wheat SNP iSelect assay with 90,000 markers (Wang et al. 2014a). Mapping of about half of these SNP markers provided a highly valuable resource for the dissection of complex traits. Since then, high-throughput genotyping has become a routine (see Chapter 1). QTL mapping is still the method of choice for a better understanding of the genetics of complex traits. On the other hand, genome-wide association studies (GWAS) allow the exploitation of beneficial alleles from large population and genetic resources. NGS will provide new techniques for functional genomics and further improvement of wheat (see Chapter 3). This applies not only to forward genetics, but also to the use of mutant populations, the analysis of epigenetic variation, or the understanding of transposons. The first initiative toward whole genome sequencing (WGS) in wheat was launched at the meeting of the International Genome Research on Wheat (IGROW) held at Winnipeg, Canada, during June 1–4, 2003. This forum developed into the International Wheat Genome Sequencing Consortium (IWGSC) where sequencing of each wheat chromosome was assigned to a country and a consortium of research groups. Chinese Spring was selected for the first WGS since an array of genetic and molecular resources was established in this favorite object of cytogenetics. NGS technologies
Editorial: Genetics Became to be Genomics
xix
very much accelerated this international project (see Chapter 2). The first milestone was the publication of the first draft sequence of the whole bread wheat genome (IWGSC, 2014) and annotation of the first wheat chromosome 3B (Choulet et al., 2014), both published in a special issue in Science on July 18, 2014. The chromosome-based draft provides new insight into the structure, organization, and evolution of the large, complex genome of wheat. In barley, a full physical, genetic, and functional sequence assembly was reached in 2012 (IBGSC, 2012). Since the release of the IWGSC WGA v0.4 in July 2016, the genome assembly has been further refined to produce an IWGSC RefSeq v1.0 assembly through the integration of IWGSC chromosome-based sequences and resources (http://www.wheatgenome. org/Projects/IWGSC-Bread-Wheat-Projects/Reference-genome) that was made public on January 14, 2017 and should be published in 2018 (see Chapter 3). Other assemblies are now available as well (see Chapter 2). The 10+ Wheat Genomes Project aims at a new WGS assembly and annotation of 10 or more elite cultivars from seven countries to define the pan genome of wheat and detect individual variation within the genomes. For rye, a WGS-based draft genome was published recently (Bauer et al., 2017) and further improvement is expected in 2018. In 2012, the CRISPR/Cas nuclease was first introduced into eukaryotes, one year later genome editing was published in model plants (Arabidopsis, Nicotiana benthamiana). Despite the existence of alternative tools, the CRISPR/Cas system rapidly became the most efficient and widely used tool for genome engineering (see Chapter 6). The aims are to induce mutations into open reading frames of interest to modify or remove gene function. The first example in hexaploid bread wheat was the simultaneous knockout of the three homoeoalleles of the Mlo gene that confers durable powdery mildew resistance (Wang et al., 2014b). These techniques, as fashionably as they might seem, are only valuable in a practical sense, when they are used for improving highly demanded traits in wheat and barley breeding. They might especially be valid for harnessing valuable features from genetic resources, as exemplified in the field of resistance breeding (see Chapter 7). Marker-assisted selection (MAS) is then indispensable for the introgression of simply inherited traits, like monogenic powdery mildew and rust resistances. Although MAS accelerates introgression and selection, its use cannot overcome the high flexibility of pathogen populations that even more rapidly adapt to new resistance genes and make them ineffective. Therefore, the generation of durable resistance remains a task for the future (see Chapter 8). Here, a perfect synthesis between new
xx
Editorial: Genetics Became to be Genomics
genomic techniques and breeding enabling new concepts is illustrated, for example, by targeted allele introgression based on cloned resistance genes, switching off susceptibility factors by gene editing, or engineering nonhost resistance.The worldwide distribution and success of wheat would not have been possible without the exploitation of adaptive genes, like genes regulating plant height, photoperiod, and flowering time (see Chapter 9). Although these genes were known for a long time, we are just starting to understand their function and their interactions with other loci. New genetic variation is urgently necessary to maintain or even increase wheat genetic progress. One source could be the broad spectrum of tetraploid wheat species with durum wheat as a bridge (see Chapter 10). Interesting traits could be the high protein content, stripe rust and powdery mildew resistances, Fusarium head blight resistance, or drought tolerance. Today, we are on the border of using genomics not only for cutting-edge research, but also for practical breeding. Despite the enormous progress in genomics in the last decade, exact field phenotyping on a highthroughput level still presents a bottleneck in Triticeae that could be overcome in the future by modern phenotyping platforms integrating different kinds of sensors (see Chapter 11). It is our responsibility to make the new genomic techniques also available to medium-sized breeding companies in the industrial world and to breeding programs in developing countries (see Chapter 12). Here, the key traits are partially different including abiotic stress tolerance, high grain zinc and iron contents, special quality parameters, or resistances to local diseases. For complex traits, MAS proved to be too costly and ineffective. Selecting for several markers simultaneously fixes large parts of the respective chromosomes and increases the risk of co-selecting genome regions/QTL that negatively affect the performance of the progeny (= linkage drag, Miedaner and Korzun, 2012). Thus, genomic selection (GS) seems to be more promising for predicting quantitatively inherited traits in non-tested breeding populations on the basis of genomic models derived from extensive training populations (see Chapter 13). The new approaches use environmental covariates in prediction models or try to exploit epistasis. SpeedGS could further accelerate the breeding progress and enhance gain from selection (see Chapter 14). Speed breeding was established as a new technology for rapid generation advancement, especially in the spring wheat. Shortening the cycle length contributes largely to selection gain in all simulation scenarios. The availability of a medium-dense SNP iSelect 10k assay for rye opens genome-based breeding also for this smaller, but important European
Editorial: Genetics Became to be Genomics
xxi
crop (see Chapter 15). These genomic resources allow the targeted use of introgression libraries, QTL and association mapping, and genomic selection for this crop. With the new toolbox including not only the well-known (cyto-)genetic methods of the last two centuries, but also a large number of genomic approaches from the new century, we can face the future demands of Triticeae breeding in a growing world more confidently. Thomas Miedaner Viktor Korzun
REFERENCES Bauer, E., Schmutzer, T., Barilar, I., Mascher, M., Gundlach, H., Martis, M.M., Twardziok, S.O., Hackauf, B., Gordillo, A.,Wilde, P., Schmidt, M., Korzun,V., Mayer, K.F.X., Schmid, K., Schön, C.-C., Scholz, U., 2017. Towards a whole-genome sequence for rye (Secale cereale L.). Plant J. 89, 853–869. Blanco, A., Bellomo, M.P., Cenci, A., et al., 1998. A genetic linkage map of durum wheat. Theor. Appl. Genet. 97 (5–6), 721–728. Choulet, et al., 2014. Structural and functional partitioning of bread wheat chromosome 3B. Science 345 (6194), 1249721. https://doi.org/10.1126/science.1249721. Endo,T.R., Gill, B.S., 1996.The deletion stocks of common wheat. J. Hered. 87 (4), 295–307. Fisher, R.A., 1918. The correlation between relatives on the supposition of Mendelian inheritance. Trans. R. Soc. Edinb. Earth Environ. Sci. 52, 399–433. Fisher, R.A., 1930. The Genetical Theory of Natural Selection. Clarendon Press, Oxford, ISBN: 0-19-850440-3. Gale, M.D., Devos, K.M., 1998. Plant comparative genetics after 10 years. Science 282, 656–659. Gupta, P.K., Mir, R.R., Mohan, A., Kumar, J., 2008. Wheat genomics: present status and future prospects. Int. J. Plant Genomics 896451. https://doi.org/10.1155/2008/896451. IBGSC—International Barley Genome Sequencing Consortium, Mayer, K.F., Waugh, R., Brown, J.W., Schulman, A., Langridge, P., Platzer, M., Fincher, G.B., Muehlbauer, G.J., Sato, K., et al., 2012. A physical, genetic and functional sequence assembly of the barley genome. Nature 491, 711–716. IWGSC—International Wheat Genome Sequencing Consortium, 2014. A chromosomebased draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. Science 345 (6194), 1251788. https://doi.org/10.1126/science.1251788. Lazo, G.R., Chao, S., Hummel, D.D., Edwards, H., Crossman, C.C., Lui, N., et al., 2004. Development of an expressed sequence tag (EST) resource for wheat (Triticum aestivum L.): EST generation, unigene analysis, probe selection and bioinformatics for a 16,000-locus bin-delineated map. Genetics 168 (2), 585–593. https://doi.org/10.1534/ genetics.104.034777. Mendel, G., 1866.Versuche über Pflanzen-Hybriden. In:Verhandlungen des Naturforschenden Vereins zu Brünn. vol. 4. pp. 3–47. http://vlp.mpiwg-berlin.mpg.de/library/data/ lit26745. (in German). Miedaner, T., Korzun, V., 2012. Marker-assisted selection for disease resistance in wheat and barley breeding. Phytopathology 102, 560–566. https://doi.org/10.1094/ PHYTO-05-11-0157. Röder, M.S., Korzun,V.,Wendehake, K., et al., 1998. A microsatellite map of wheat. Genetics 149 (4), 2007–2023.
xxii
Editorial: Genetics Became to be Genomics
Sears, E.R., 1966. Nullisomic-tetrasomic combinations in hexaploid wheat. In: Riley, R., Lewis, K.R. (Eds.), Chromosome Manipulation and Plant Genetics. Oliver and Boyd, Edinburgh, pp. 29–45. Wang, S.,Wong, D., Forrest, K., Allen, A., Chao, S., Huang, B.E., et al., 2014a. Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array. Plant Biotechnol. J. 12 (6), 787–796. https://doi.org/10.1111/ pbi.12183. Wang, Y., Cheng, X., Shan, Q., Zhang, Y., Liu, J., Gao, C., Qiu, J.L., 2014b. Simultaneous editing of three homoeoalleles in hexaploid bread wheat confers heritable resistance to powdery mildew. Nat. Biotechnol. 32, 947–951. https://doi.org/10.1038/nbt.2969.
FURTHER READING Dubcovsky, J., Luo, M.C., Zhong, G.Y., Brandsteitter, R., Desai, A., Kilian, A., Kleinhofs, A., Dvorak, J., 1996. Genetic map of diploid wheat, Triticum monococcum L. and its comparison with maps of Hordeum vulgare L. Genetics 143, 983–999.
SECTION 1
Techniques
This page intentionally left blank
CHAPTER 1
High-Throughput Genotyping for Cereal Research and Breeding Martin W. Ganal*, Jörg Plieske*, Anja Hohmeyer*, Andreas Polley*, Marion S. Röder† ⁎ TraitGenetics GmbH, Gatersleben, Germany Leibniz-Institute of Plant Genetics and Crop Plant Research, Gatersleben, Germany
†
Contents 1.1 1.2 1.3 1.4
Introduction SNP Markers Genotyping-by-Sequencing (GBS) Array-Based Genotyping 1.4.1 Barley 1.4.2 Wheat 1.4.3 Rye/Triticale 1.4.4 Oat 1.5 High-Throughput Individual SNP Marker Genotyping 1.6 Summary and Outlook References
3 4 6 7 9 9 12 13 13 14 15
1.1 INTRODUCTION Single-nucleotide polymorphisms (SNPs) are the smallest unit of DNA polymorphism and provide an almost infinite source of markers in a given species. High-throughput and large-scale genotyping of SNPs is now a routine tool in plant breeding in all major crop species including cereals (Rasheed et al., 2017). SNP genotyping has meanwhile almost completely replaced other genotyping technologies (e.g., microsatellites or AFLPs) due to their potential for high-throughput, high-speed data generation, repeatability, and cost effectiveness. Individual markers that are very tightly or perfectly linked to specific traits are now intensively used as a surrogate for phenotypic testing. Markers for specific disease resistant-traits, restorer genes, specific quality traits, and major phenology traits (e.g., controlling photoperiod, vernalization, plant height, etc.) are some examples which should be mentioned here. Multiple marker sets either analyzed as individual markers or simultaneously analyzed in large Applications of Genetic and Genomic Research in Cereals https://doi.org/10.1016/B978-0-08-102163-7.00001-6
Copyright © 2019 Elsevier Ltd. All rights reserved.
3
4
Applications of Genetic and Genomic Research in Cereals
groups are frequently used for the reduction of the donor background in marker-assisted selection (MAS)/backcrossing, the analysis of genetic relationships or the genetic mapping of new traits (single genes or quantitative trait loci, QTL). In recent years, the concept of genomic selection (GS) that has initially been developed for animal breeding (Meuwissen et al., 2001) has also been established for cereal breeding (Heffner et al., 2010). In the GS process, a breeding value for lines and individual progeny is being determined by the analysis of many markers and associating a specific effect of each marker for one or more traits (based on high-quality phenotyping data from a training population).The most promising progeny candidates are then retained. The advantage of this method is that the genetic gain per generation is supposed to be much higher than solely using phenotypic evaluation (Hamblin et al., 2011; Jonas and de Koning, 2013; Crossa et al., 2017). Genotyping with individual or multiplexed SNP markers has been revolutionized by some technological developments. Individual marker analysis can now be realized at very small costs using technologies such as TaqMan and KASP. In combination with automation and miniaturization, end-point marker analyses can now be performed routinely and with high quality in volumes of less than 1 μL and the required automated pipelines are commercially available. In combination with automated large-scale DNA extraction of many thousands of samples per day for a large breeding program is now within reach to analyze up to a million or more samples with sets of individual markers within a breeding cycle (Thomson, 2014). Multiplexed marker analysis (i.e., the parallel analysis of many thousands) has also made tremendous progress. Typically, such analyses are performed using microarray platforms from either Illumina (Infinium) or Affymetrix/ Thermo Fisher (Axiom), which permit the parallel analysis of up to more than one million SNPs. While large genotyping arrays have initially been quite expensive, the costs have come down to approximately 20–30 €/$ US per sample through technical optimization and the development of arrays with specific marker sets for breeding that typically contain 10,000–50,000 high-quality markers. Such numbers are necessary in particular within the framework of GS schemes, where typically 10,000 or more samples are being analyzed within a breeding cycle.
1.2 SNP MARKERS In the past, major efforts were required to identify many thousands of SNP markers for large-scale genotyping especially in cereal crops
High-Throughput Genotyping for Cereal Research and Breeding
5
(Ganal et al., 2009). In contrast to many other vegetables and field crops, cereals have a very large genome with approximately 5 Gb for barley, 16 Gb for hexaploid wheat, and 8 Gb for rye, which is at least twice the size of most other major crops. For a long time, this has prevented the availability of fully sequenced reference genomes. Markers have been identified mainly based on comparative sequencing of PCR-amplified segments, full transcriptome analysis, and sequencing of parts of the genome using complexity reduction methods. In general, these approaches have resulted in a limited set of SNPs (1000–10,000s) and in which the SNPs have been identified in only a limited set of lines. Often, this produces a certain level of ascertainment bias meaning that the revealed polymorphisms are skewed toward the analyzed samples or sample type and are not a random representation for the full range of genetic variability of the species. More recently, sequence capture has become a technology of choice to analyze many lines of wheat and barley (Winfield et al., 2012; Mascher et al., 2013b; Jordan et al., 2015). With this technology, it is possible to enrich for genomic regions that are transcribed and also specifically for coding sequences (called exons or over its entirety the exome). Since exome capture analyzes only a small proportion of the genome (a few percent) in the small grain cereals, this approach provides a relatively cost-efficient analysis method with a number of additional benefits. On the one side, exome sequencing provides a comprehensive overview concerning genetic variation in genes. On the other side, the exome sequences are in most cases single copy and, thus, are highly desirable for SNP assay development and functional assays. At present, the full genome sequencing of reference genomes of the cereal grains at high quality is almost completed.The barley reference genome has been published (Mascher et al., 2017) and for wheat the generation of a reference sequence is in its final stages (https://wheat-urgi.versailles.inra. fr). For rye, a reasonable draft sequence is now available (Bauer et al., 2017). Based on these reference sequences, efforts are underway to generate additional high-quality genome sequences of a number of additional lines and varieties (Uauy, 2017) in order to identify sequence polymorphisms over the entire genome of these species. Furthermore, with the development of improved DNA sequencing technologies and reduced costs per generated Gb, it will soon be possible to perform genome resequencing and bioinformatics analyses of cereal genomes in a routine manner for interesting varieties or lines at manageable costs (100,000 genetic profiles describing the diversity of the two biggest wheat genebanks in the world (68,000 from CIMMYT and 32,000 from ICARDA).To multiply the impacts of these results, a genetic resource utilization platform for breeders and researchers is being created, made up of publicly available data and software tools. Part of the data generated has been used for the classification of Mexican landraces, with developed core sets currently being regrown across the country for phenotypic evaluation and subsequent hybridization of the best accessions with modern germplasm (Vikram et al., 2016). In addition, prebreeding efforts in SeeD have developed diversity panels and a large number of advanced lines derived through a three-way top cross strategy (exotic × elite1 × elite2), which are now being extensively characterized at the genotypic and the phenotypic level. The new physical maps and re-sequencing of wheat cultivars will make it possible to translate these large-scale SNP datasets into haplotype maps that can greatly facilitate genome-wide association studies of complex traits and functional investigations of evolutionary changes when genebank
264
Applications of Genetic and Genomic Research in Cereals
accessions are compared to modern germplasm pools. These advances will also accelerate studies on crop designs via genomics-assisted breeding (Huang and Han, 2014). The wheat reference sequence with annotation of genes will eventually allow exploring linkages between SeeD GBS sequencing data and functional variations of major morphology and adaptation genes, to predict the initial performance of any genotyped genebank accession and to simplify the classification and exploration of the CIMMYT and ICARDA genebanks. Similar to SeeD, but at a lower scale, Kabbaj et al. (2017) recently investigated the genetic diversity within a global panel of durum wheat landraces and modern germplasm. With the SNP data derived from the 35K Axiom array, the authors were able to disentangle the history of durum wheat origins based on migration patterns observed within the landraces as well as breeding exchange and cross-hybridization among the modern germplasm. Population stratification in the panel also provided a better understanding of how many of the available alleles have been captured within a specific germplasm, and could have immediate practical impact on breeding. Complementing efforts on genetic characterization of accessions conserved ex situ in genebanks and the documentation, description, and phenotypic evaluation of genetic resources conserved in situ are of fundamental importance (Alsaleh et al., 2016). From 2009 to 2014, a nationwide effort was made to document, collect, conserve, and characterize wheat landraces grown in Turkey, which is considered part of the center of wheat origin. The long-term cultivation and exchange of wheat landraces by local farmers resulted in the continuous enhancement and adaptation of those landraces. Spike samples were collected from >1500 farmers from 59 provinces, planted as single-spike progenies, and classified into species, subspecies, and botanical varieties or morphotypes (Morgounov et al., 2016). Diversity indices based on SNP markers and the number of morphotypes identified regions in Turkey with the highest conserved genetic diversity, and possible conservation efforts are being discussed. Developed core sets are being characterized in detail. Researchers are exploiting the potential of this germplasm by incorporating it into national wheat breeding programs and utilizing rapid selection and novel breeding methodologies to integrate elite traits without losing the desired landrace characteristics. The resulting locally adapted improved wheat varieties can benefit smallholder farmers in rural areas of the developing world, generally affected by harsh farming conditions.
Application of Genetic and Genomic Tools in Wheat for Developing Countries
265
12.7 TRACKING CROP VARIETIES Crop germplasm improvement is a major activity of CGIAR centers and thousands of new varieties are developed to provide higher yields, better nutritional content, improved adaptation to fluctuating climates, and increased resistance to diseases and pests (Reynolds and Borlaug, 2006). Studying to what extent these crop varieties are adopted by farmers is crucial to evaluate the performance and to understand the impact of agricultural development programs. Rigorous impact assessment is also important for informed and evidence-based policy making, for instance, to develop appropriate support policy measures for improved targeting, access, and use of modern varieties (Shiferaw et al., 2014). However, measuring the dissemination of improved crop varieties is challenging.Various methodologies, such as seed sales inquiries, expert opinion estimates by breeders and extension services, as well as surveys at the household and plot levels are used, but the reliability of these approaches has never been verified and each approach has its own inherent limitations. For example, seed sales inquiries require specific surveys, which may not fit into the existing agricultural statistical systems. The estimates of expert panels are only as good as the experts’ knowledge and elicitation protocols, and household surveys are only as reliable as the farmer’s knowledge of the genotype she/he is sowing (Walker et al., 2014). In a major effort to quantify the adoption of improved varieties in Sub-Saharan Africa, DIIVA project 1 has shed light on the convergence of expert opinion estimates with household survey estimates (Walker et al., 2014). Conclusions point toward the fact that expert opinion estimates are likely to overemphasize the uptake of specific varieties, while household surveys are likely to underestimate their importance. The study concluded that “probably neither surveys nor expert panels can do a good job in delivering accurate estimates of cultivar-specific adoption” (Walker, 2015). The study calls for the use of state-of-the-art technology to develop an improved monitoring system that could help track the diffusion of individual modern varieties more accurately and efficiently. Next-generation sequencing technologies have become increasingly affordable in crops, and costs per sample are projected to continue to decrease in the coming decade (Buckler et al., 2016; Unamba et al., 2015). As a survey instrument, DNA fingerprinting on the basis of next-generation sequencing provides the opportunity to conduct a survey validation exercise and assess the accuracy of existing methods of crop varietal identification (Maredia et al., 2016; Rabbi et al., 2012).
266
Applications of Genetic and Genomic Research in Cereals
12.8 WHEAT CASE STUDY A number of studies have assessed the adoption and impact of improved wheat germplasm in developing countries (reviewed by Fisher and Erenstein, 2014). The studies used observational data and published varietal guides or cross-section analyses, and also included special analyses using remotely sensed data. Lantican et al. (2016) documented the global use of improved wheat germplasm and the economic benefit of international collaboration in wheat improvement research by the CGIAR from 1994 to 2014.The study revealed an increase in the adoption of CGIAR-related varieties, covering about 106 million hectares (64%) in the studied countries. The benefits attributable specifically to wheat improvement research by the CGIAR ranged from 2.2 to 3.1 billion US dollars. Shiferaw et al. (2014) used the nationally representative data for Ethiopia to study the adoption and impact of improved wheat varieties, and found an adoption rate of 70%. An adoption analysis has shown that wheat prices, prices of competing crops, sources of information on new varieties, input costs, agroecology, and geographical location influenced the adoption of improved varieties. A survey of 1200 wheat farmers in five states in the Indo-Gangetic Plains of India indicated that the rate of wheat adoption and varietal turnover has slowed down (Ghimire et al., 2012). This study found that various socioeconomic factors influenced the choice of whether to grow new wheat varieties in India; however, the most important determinant was access to seed from different sources. Fisher and Erenstein (2014) noted that the quality data and empirical methods used varied considerably across the diverse impact studies. The authors concluded that to better enable the comparison across studies, it was necessary to apply a uniform set of core survey questions, collect longitudinal data, use existing high-quality datasets and conduct randomized control trials to increase the quality of impact assessment. The use of DNA fingerprinting methods was not suggested. Only one study deploying DNA fingerprinting to track the diffusion of wheat varieties has been published in wheat. Reported by Yirga et al. (2016), DNA fingerprinting was explored among smallholder farmers in certain areas of Oromia, Ethiopia. Three complementary data collection methods were used. The estimates of varietal adoption based on farmer reports and DNA fingerprinting diverged considerably: 63% of the farmers were using improved wheat varieties according to survey respondents, while 96% of the farmers were using them according to DNA fingerprinting. This suggests that the household survey underestimated the economic importance of improved varieties. Besides its usefulness in assessing varietal
Application of Genetic and Genomic Tools in Wheat for Developing Countries
267
adaptation, DNA fingerprinting was considered useful to evaluate more accurately the seed demand, resolve seed quality disputes, and to help the variety release mechanism.
12.9 EVIDENCE FROM OTHER CROPS The concept of essential derivation using DNA fingerprinting has mainly been used to protect breeders’ rights. However, the objective in developing countries is different and aims to help collect accurate variety-specific identification data that can be used to study adoption rates. DNA fingerprinting to address this objective in other crops is also limited to a few recent attempts, mostly pilot studies. Rabbi et al. (2012) used genotypingby-sequencing as an alternative method to track released cassava varieties in farmers’ fields. In total, 88% of the 917 cassava accessions were matched to specific released varieties or landraces in the reference library. Numerous admixtures within accessions were found; this was explained by the fact that cassava farmers grow more than one variety in their fields, allowing cross-breeding. There were many synonymous or homonymous clone names, which would make it difficult to track released varieties by relying on names only. Kosmowski et al. (2016) tested the effectiveness of three household-based survey methods of identifying sweet potato varietal adoptation against DNA fingerprinting. All methods were found to be less accurate than the DNA fingerprinting benchmark. Similar to the study in cassava, variety names given by farmers provided inconsistent varietal identities. Probably the most comprehensive comparison of the approaches used to collect variety-specific adoptation data was published by Maredia et al. (2016) for cassava and beans.The authors compared six different approaches including farmer and expert elicitation. Each method provided different estimates of adoption rates, and no method could be specifically recommended. All methods underestimated the adoption of improved varieties and misclassified improved and local varieties. The authors pointed out that DNA fingerprinting was the only credible method, but the method was only as good as the quality of the reference library. In conclusion, despite limited evidence, DNA fingerprinting seems a viable tool to estimate modern variety adoption for a range of crops, including wheat.The technique has shown additional benefits, for it has been used to resolve seed quality disputes, assess seed demands more accurately and investigate the functioning of the varietal development, release, and dissemination system.The informality of seed systems appears to be the main constraint for more accurate survey-based identification. DNA fingerprinting
268
Applications of Genetic and Genomic Research in Cereals
across large-scale household surveys may pose a substantial logistical challenge, but more and more countries are acquiring the technical capacity to extract DNA from field samples and carry out genotyping. In addition, as the cost of DNA fingerprinting declines further, the cost of conducting a survey will diminish. More evidence is needed to assess whether DNA fingerprinting can be used as a complementary part of crop varietal adoption.
REFERENCES Alsaleh,A., Baloch, F.S., Nachit, M., Özkan, H., 2016. Phenotypic and genotypic intra-diversity among Anatolian durum wheat “Kunduru” landraces. Biochem. Syst. Ecol. 65, 9–16. Battenfield, S.D., Guzmán, C., Gaynor, R.C., Singh, R.P., Peña, R.J., Dreisigacker, S., Fritz, A.K., Poland, J.A., 2016. Genomic selection for processing and end-use quality traits in the CIMMYT spring bread wheat breeding program. Plant Genome 9 (2), 1–12. Braun, H.-J., Atlin, G., Payne, T., 2010. Multi-location testing as a tool to identify plant response to global climate change. In: Reynolds, M.P. (Ed.), Climate Change and Crop Production. vol. 115. CABI Climate Change Series, London, p. 138. Buckler, E.S., Ilut, D.C., Wang, X., Kretzschmar, T., Gore, M.A., Mitchell, S.E., 2016. rAmpSeq: using repetitive sequences for robust genotyping. bioRxiv, 096628. Burgueño, J., de los Campos, G.,Weigel, K., Crossa, J., 2012. Genomic prediction of breeding values when modeling genotype × environment interaction using pedigree and dense molecular markers. Crop Sci. 52, 707–719. Cakmak, I., Ozkan, H., Braun, H.J.,Welch, R.M., Romheld,V., 2000. Zinc and iron concentrations in seeds of wild, primitive, and modern wheats. Food Nutr. Bull. 21, 401–403. Crespo-Herrera, L.A., Crossa, J., Huerta-Espino, J., Autrique, E., Mondal, S.,Velu, G.,Vargas, M., Braun, H.-J., Singh, R.P., 2017. Genetic yield gains in CIMMYT’s international elite spring wheat yield trials by modeling the genotype x environment interaction. Crop Sci. 57, 789–801. Crossa, J., Jarquin, D., Franco, J., Perez-Rodriguez, P., Burgueño, J., Saint-Pierre, C.,Vikram, P., Sansaloni, C., Petroli, C., Akdemir, D., Sneller, C., Reynolds, M., Tattaris, M., Payne, T., Guzman, C., Peña, R.J., Wenzel, P., Sing, S., 2016. Genomic prediction of gene bank wheat landraces. Genes Genomes Genet. 6, 1819–1834. Crossa, J., Pérez, P., Hickey, J., Burgueño, J., Ornella, L., Cerón-Rojas, J., Zhang, X., Dreisigacker, S., Babu, R., Li1,Y., Bonnett, D., Mathews, K., 2014. Genomic prediction in CIMMYT maize and wheat breeding programs. Heredity 112, 48–60. Crossa, J., Pérez, P., de los Campos, G., Mahuku, G., Dreisigacker, S., Magorokosho, C., 2011. Genomic selection and prediction in plant breeding. J. Crop Improv. 25, 239–261. Crossa, J., de los Campos, G., Pérez, P., Gianola, D., Burgueño, J., Araus, J.L., Makumbi, D., Singh, R.P., Dreisigacker, S.,Yan, J., Arief,V., Banziger, M., Braun, H.J., 2010. Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 186, 713–724. Crossa, J., Burgueño, J., Dreisigacker, S.,Vargas, M., Herrera-Foessel, S.A., Lillemo, M., Singh, R.P., Trethowan, R., Warburton, M., Franco, J., Reynolds, M., Crouch, J.H., Ortiz, R., 2007. Association analysis of historical bread wheat germplasm using additive genetic covariance of relatives and population structure. Genetics 177, 1889–1913. Dreisigacker, S., Sukumaran, S., Guzmán, C., He, X., Bonnett, D., Crossa, J., 2016. Molecular marker-based selection tools in spring bread wheat improvement: CIMMYT experience and prospects. In: Rajpal, V.R., Rao, S.R., Raina, S.N. (Eds.), Molecular Breeding for Sustainable Crop Improvement. Springer International Publishing, New York, pp. 421–474.
Application of Genetic and Genomic Tools in Wheat for Developing Countries
269
Dreisigacker, S., Wang, X., Cisneros, B.A.M., Jing, R., Singh, P.K., 2015. Adult-plant resistance to Septoria tritici blotch in hexaploid spring wheat. Theor. Appl. Genet. 128, 2317–2329. Edae, E.A., Byrne, P.F., Haley, S.D., Lopes, M.S., Reynolds, M.P., 2014. Genome-wide association mapping of yield and yield components of spring wheat under contrasting moisture regimes. Theor. Appl. Genet. 127, 791–807. Fisher, M., Erenstein, O., 2014.The extent and nature of adoption and impact of CRP-wheat related research outputs, 2004–2014. In: CRP-Wheat Report, July 2014. CIMMYT, Mexico, D.F. Ghimire, S., Mehar, M., Mittal, S., 2012. Influence of sources of seed on varietal adoption behavior of wheat farmers in indo-Gangetic Plains of India. Agric. Econ. Res. Rev. 25, 399–408. Goddard, M.E., Hayes, B.J., 2007. Genomic selection. J. Anim. Breed. Genet. 124, 323–330. Guzmán, C., Xiao,Y., Crossa, J., González-Santoyo, H., Huerta, J., Singh, R., Dreisigacker, S., 2016. Sources of the highly expressed wheat bread making (wbm) gene in CIMMYT spring wheat germplasm and its effect on processing and bread-making quality. Euphytica 209 (3), 689–692. Hao, Y., Velu, G., Peña, R.J., Singh, S., Singh, R.P., 2014. Genetic loci associated with high grain zinc concentration and pleiotropic effect on kernel weight in wheat (Triticum aestivum L.). Mol. Breed. 34, 1893–1902. He, X., Lillemo, M., Shi, J., Wu, J., Bjørnstad, A., Belova, T., Dreisigacker, S., Etienne, D., Singh, P., 2016. QTL characterization of fusarium head blight resistance in CIMMYT bread wheat line Soru#1. PLoS ONE 11 (6), e0158052. Huang, X., Han, B., 2014. Natural variations and genome-wide association studies in crop plants. Annu. Rev. Plant Biol. 65 (1), 531–551. Juliana, J., Singh, R.P., Singh, P.K., Crossa, J., Huerta-Espino, J., Lan, C., Bhavani, S., Rutkoski, J.E., Poland, J.A., Bergstrom, G.C., Sorrells, M.E., 2017a. Genomic and pedigree-based prediction for leaf, stem, and stripe rust resistance in wheat.Theor. Appl. Genet. 130, 1415–1430. Juliana, J., Singh, R.P., Singh, P.K., Crossa, J., Rutkoski, J.E., Poland, J.A., Bergstrom, G.C., Sorrells, M.E., 2017b. Comparison of models and whole-genome profiling approaches for genomic-enabled prediction of Septoria tritici blotch, Stagonospora nodorum blotch, and tan spot resistance in wheat. Plant Genome 10 (2), 1–16. Juliana, P., Rutkoski, J.E., Poland, J.A., Singh, R.P., Murugasamy, S., Natesan, S., Barbier, H., Sorrells, M.E., 2015. Genome-wide association mapping for leaf tip necrosis and pseudo-black chaff in relation to durable rust resistance in wheat. Plant Genome 8 (2). https://doi.org/10.3835/plantgenome2015.01.0002. Kabbaj, H., Sall, A.T., Al-Abdallat, A., Geleta, M., Amri, A., Filali-Maltouf, A., Belkadi, B., Ortiz, R., Bassi, F.M., 2017. Genetic diversity within a global panel of durum wheat (Triticum durum) landraces and modern germplasm reveals the history of allele exchange. Front. Plant Sci. 8, 1–13. Kosmowski, F., Aragaw, A., Kilian, A., Ambel, A., Ilukor, J., Yigezu, B., Stevenson, J., 2016. Varietal Identification in Household Surveys Results from an Experiment Using DNA Fingerprinting of Sweet Potato Leaves in Southern Ethiopia. . Policy Res. Work. Pap. 7812, Worldbank Group, Sept. 2016. Krattinger, S.G., Lagudah, E.S., Spielmeyer,W., Singh, R.P., Huerta-Espino, J., McFadden, H., Bossolini, E., Selter, L.L., Keller, B., 2009. A putative ABC transporter confers durable resistance to multiple fungal pathogens in wheat. Science 323, 1360–1363. Lado, B., Battenfield, S., Guzmán, C., Quincke, M., Singh, R.P., Dreisigacker, S., Peña, R.J., Fritz, A., Silva, P., Poland, J., Gutiérrez, L., 2017. Strategies to select crosses using genomic prediction in two wheat breeding programs. Plant Genome 10 (2), 1–10. Lan, C., Basnet, B.R., Dreisigacker, S., Sehgal, D., Reyes Jaimez, A.E., Luna Garrido, B., Muñoz Zavala, S., Núñez Ríos, C., et al., 2016. Overview of bi-parental QTL mapping and cloning genes in the context of wheat rust. In: CIMMYT Wheat Molecular Genetics: Laboratory protocols and applications to wheat breeding. CIMMYT, Mexico, D.F., pp. 39–46.
270
Applications of Genetic and Genomic Research in Cereals
Lantican, M.A., Braun, H.-J., Payne, T.S., Singh, R.P., Sonder, K., Baum, M., van Ginkel, M., Erenstein, O., 2016. Impacts of International Wheat Improvement Research, 1994–2014. CIMMYT, Mexico, D.F, ISBN: 978-607-8263-55-4, p. 59. Lopes, M.S., Dreisigacker, S., Peña, R.J., Sukumaran, S., Reynolds, M.P., 2015. Genetic characterization of the wheat association mapping initiative (WAMI) panel for dissection of complex traits in spring wheat. Theor. Appl. Genet. 128, 453–464. Lopez-Cruz, M., Crossa, J., Bonnett, D., Dreisigacker, S., Poland, J., Jannink, J.L., Singh, R.P., Autrique, E., de los Campos, G., 2015. Increased prediction accuracy in wheat breeding trials using a marker × environment interaction genomic selection model. Genes Genomes Genet. 5, 569–582. Maredia, M.K., Reyes, B.A., Manu-Aduening, J., Dankyi, A., Hamazakaza, P., Muimui, K., Rabbi, I., Kulakow, P., Parkes, E., Abdoulaye, T., Katungi, E., Raatz, B., 2016. In: Testing Alternative Methods of Varietal Identification Using DNA Fingerprinting: Results of Pilot Studies in Ghana and Zambia. MSU Int. Dev. Work. Pap. 149, Dep. Agric. Food, Resour. Econ. Dep. Econ. Michigan State Univ. East Lansing, Michigan 48824–1039, U.S.A. McCouch, S., Baute, G.J., Bradeen, J., Bramel, P., Bretting, P.K., Buckler, E., Burke, J.M., Charest, D., Cloutier, S., Cole, G., Dempewolf, H., Dingkuhn, M., Feuillet, C., Gepts, P., Grattapaglia, D., Guarino, L., Jackson, S., Knapp, S., Langridge, P., Lawton-Rauh, A., Lijua, Q., Lusty, C., Michael, T., Myles, S., Naito, K., Nelson, R.L., Pontarollo, R., Richards, C.M., Rieseberg, L., Ross-Ibarra, J., Rounsley, S., Hamilton, R.S., Schurr, U., Stein, N., Tomooka, N., van der Knaap, E., van Tassel, D., Toll, J.,Valls, J.,Varshney, R.K., Ward, J., Waugh, R., Wenzl, P., Zamir, D., 2013. Agriculture: feeding the future. Nature 499, 23–24. Miedaner, T., Korzun, V., 2012. Marker-assisted selection for disease resistance in wheat and barley breeding. Phytopathology 102, 560–566. Mondal, S., Singh, R.P., Mason, E.R., Huerta-Espino, J., Autrique, E., Joshi, A.K., 2016. Grain yield, adaptation and progress in breeding for early-maturing and heat-tolerant wheat lines in South Asia. Field Crops Res. 192, 78–85. Mondal, S., Singh, R.P., Huerta-Espino, J., Kehel, Z., Autrique, E., 2015. Characterization of heat and drought stress tolerance in high-yielding spring wheat. Crop Sci. 55, 1–11. Moore, J.W., Herrera-Foessel, S., Lan, C., Schnippenkoetter, W., Ayliffe, M., Huerta-Espino, J., Lillemo, M., Viccars, L., Milne, R., Periyannan, S., Kong, X., Spielmeyer, W., Talbot, M., Bariana, H., Patrick, J.W., Dodds, P., Singh, R., Lagudah, E., 2015. A recently evolved hexose transporter variant confers resistance to multiple pathogens in wheat. Nat. Genet. 47, 1494–1498. Morgounov, A., Keser, M., Kan, M., Küçükçongar, M., Özdemir, F., Gummadov, N., Muminjanov, H., Zuev, E., Qualset, C.O., 2016. Wheat landraces currently grown in Turkey: distribution, diversity, and use. Crop Sci. 56, 3112–3124. Ortiz-Monasterio, J.I., Palacios-Rojas, N., Meng, E., Pixley, K., Trethowan, R., Peña, R.J., 2007. Enhancing the mineral and vitamin content of wheat and maize through plant breeding. J. Cereal Sci. 46, 293–307. Pérez-Rodríguez, P., Crossa, J., Rutkoski, J., Poland, J., Singh, R., Legarra, A., Autrique, E., de los Campos, G., Burgueño, J., Dreisigacker, S., 2017. Single-step genomic and pedigree genotype × environment interaction models for predicting wheat lines in international environments. Plant Genome 10. https://doi.org/10.3835/plantgenome2016.09.0089. Pérez-Rodríguez, P., Gianola, D., Gonzaléz-Camacho, J.M., Crossa, J., Manés,Y., Dreisigacker, S., 2012. A comparison between linear and non-parametric regression models for genome enabled prediction in wheat. Genes Genet. Genom. 2, 1595–1605. Poland, J., Endelman, J., Dawson, J., Rutkoski, J., Wu, S., Manes, Y., Dreisigacker, S., Crossa, J., Sánchez-Villeda, H., Sorrells, M.E., Jannink, J.L., 2012. Genomic selection in wheat breeding using genotyping-by-sequencing. Plant Genome 5, 103–113.
Application of Genetic and Genomic Tools in Wheat for Developing Countries
271
Rabbi, I.Y., Kulakow, P.A., Manu-Aduening, J.A., Dankyi, A.A., Asibuo, J.Y., Parkes, E.Y., Abdoulaye, T., Girma, G., Gedil, M.A., Ramu, P., Reyes, B., Maredia, M.K., 2012. Tracking crop varieties using genotyping-by-sequencing markers: a case study using cassava (Manihot esculenta Crantz). BMC Genet. 16, 115. Rajaram, S., van Ginkel, M., Fischer, R.A., 1994. In: Li, Z.S., Xin, Z.Y. (Eds.), CIMMYT’s Wheat Breeding Mega-Environments (ME). Proceedings of the 8th International Wheat Genetics Symposium, Beijing, China, 20–25 July 1993. China Agriculture Scientech Press, Beijing, pp. 1101–1106. Reynolds, M.P., Braun, H.-J., Cavalieri, A.J., Chapotin, S., Davies, W.J., Ellul, P., Feuillet, C., Govaerts, B., Kropff, M.J., Lucas, H., Nelson, J., Powell, W., Quilligan, E., Rosegrant, M.W., Singh, R.P., Sonder, K., Tang, H., Visscher, S., Wang, R., 2017. Improving global integration of crop research. Science 357 (6349), 359–360. Reynolds, M.P., Borlaug, N.E., 2006. Impacts of breeding on international collaborative wheat improvement. J. Agric. Sci. 144, 3. Risk, J.M., Selter, L.L., Krattinger, S.G.,Viccars, L.A., Richardson,T.M., Buesing, G., Herren, G., Lagudah, E.S., Keller, B., 2012. Functional variability of the Lr34 durable resistance gene in transgenic wheat. Plant Biotechnol. J. 10, 477–487. Rutkoski, J., Poland, J., Mondal, S., Autrique, E., González-Pérez, L., Crossa, J., Reynolds, M., Singh, R.P., 2016. Canopy temperature and vegetation indices from high-throughput phenotyping improve accuracy of pedigree and genomic selection for grain yield in wheat. Genes Genet. Genom. 6, 2799–2808. Rutkoski, J.E., Poland, J.A., Singh, R.P., Huerta-Espino, J., Bhavani, S., Barbier, H., et al., 2014. Genomic selection for quantitative adult plant stem rust resistance in wheat. Plant Genome 7 (3). https://doi.org/10.3835/plantgenome2014.02.0006. Saint Pierre, C., Burgueño, J., Crossa, J., Fuentes Dávila, G., Figueroa López, P., Solís Moya, E., Ireta Moreno, J., Hernández Muela, V.M., Zamora Villa, V.M., Vikram, P., Mathews, K., Sansaloni, C., Sehgal, D., Jarquin, D., Wenzl, P., Singh, S., 2016. Genomic prediction models for grain yield of spring bread wheat in diverse agro-ecological zones. Sci. Rep. 6, 27312. Sánchez-Martín, J., Steuernagel, B., Ghosh, S., Herren, G., Hurni, S., Adamski, N., 2016. Rapid gene isolation in barley and wheat by mutant chromosome sequencing. Genome Biol. 17, 221. Shiferaw, B., Kassie, M., Jaleta, M.,Yirga, C., 2014. Adoption of improved wheat varieties and impacts on household food security in Ethiopia. Food Policy 44, 272–284. Singh, P.K., Crossa, J., Duveiller, E., Singh, R.P., 2016b. Association mapping for resistance to tan spot induced by Pyrenophora tritici-repentis race 1 in CIMMYT’s historical bread wheat set. Euphytica 207 (3), 515–525. Singh, R.P., Singh, P.K., Rutkoski, J., Hodson, D.P., He, X., Jørgensen, L.N., Hovmøller, M.S., Huerta-Espino, J., 2016a. Disease impact on wheat yield potential and prospects of genetic control. Ann. Rev. Phytopathol. 54, 303–322. Sehgal, D., Autrique, E., Singh, R.P., Ellis, M., Singh, S., Dreisigacker, S., 2017. Identification of genomic regions for grain yield and yield stability and their epistatic interactions. Sci. Rep. 7, 41578. Sehgal, D., Dreisigacker, S., Belen, S., Küçüközdemir, Ü., Mert, Z., Özer, E., Morgounov, A., 2016. Mining centuries old in-situ conserved Turkish wheat landraces for grain yield and stripe rust resistance genes. Front. Genet. 7, 201. Sehgal, D.,Vikram, P., Sansaloni, C.P., Ortiz, C., Saint Pierre, C.S., Payne, T., 2015. Exploring and mobilizing the Gene Bank biodiversity for wheat improvement. PLoS ONE 10 (7), e0132112. Sukumaran, S., Lopes, M.S., Dreisigacker, S., Dixon, L.E., Zikhali, M., Griffiths, S., et al., 2016. Identification of earliness per se flowering time locus in spring wheat through a genome-wide association study. Crop Sci. 56, 2962–2972.
272
Applications of Genetic and Genomic Research in Cereals
Sukumaran, S., Dreisigacker, S., Lopes, M., Chavez, P., Reynolds, M.P., 2015. Genome-wide association study for grain yield and related traits in an elite spring wheat population grown in temperate irrigated environments. Theor. Appl. Genet. 128, 353–363. Tester, M., Langridge, P., 2010. Breeding technologies to increase crop production in a changing world. Science 80 (327), 818–822. Uauy, C., 2017. Wheat genomics comes of age. Curr. Opin. Plant Biol. 36, 142–148. Unamba, C.I., Nag, A., Sharma, R.K., 2015. Next generation sequencing technologies: The doorway to the unexplored genomics of non-model plants. Front. Plant Sci. 6, 1074. Valluru, R., Reynolds, M.P., Davies,W.J., Sukumaran, S., 2017. Phenotypic and genome-wide association analysis of spike ethylene in diverse wheat genotypes under heat stress. New Phytol. 214, 271–283. Varshney, R.K., Singh, V.K., Hickey, J.M., Xun, X., Marshall, D.F., Wang, J., Edwards, D., Ribaut, J.M., 2015. Analytical and decision support tools for genomics-assisted breeding. Trends Plant Sci. 21, 354–363. Vikram, P., Franco, J., Burgueño-Ferreira, J., Li, H., Sehgal, D., Saint Pierre, C., Ortiz, C., Sneller, C., Tattaris, M., Guzman, C., Sansaloni, C.P., Ellis, M., Fuentes-Davila, G., Reynolds, M., Sonder, K., Singh, P., Payne, T., Wenzl, P., Sharma, A., Bains, N.S., Singh, G.P., Crossa, J., Singh, S., 2016. Unlocking the genetic diversity of creole wheats. Sci. Rep. 6, 23092. Wageningen, F.S.C., 2016. Multi-Level Mapping and Exploration of Wheat Production and Consumption and their Potential Contribution to Alleviation of Poverty, Malnutrition and Gender Inequality. In: Final Report WHEAT Competitive Grant. Wageningen University Food Security Center Study, The Netherlands. Walker,T.S., 2015.Validating Adoption Estimates Generated by Expert Opinion and Assessing the Reliability of Adoption Estimates with Different Methods. In: Walker, T.S., Alwang, J. (Eds.), Crop Improvement, Adoption and Impact of Improved Varieties in Food Crops in Sub-Saharan Africa. CABI, Oxfordshire, UK, pp. 406–419. Walker, T., Alene, A., Ndjeunga, J., Labarta, R., Yigezu, Y., Diagne, A., Andrade, R., Andriatsitohaina, R.M., De Groote, H., Mausch, K., Yirga, C., Simtowe, F., Katungi, E., Jogo, W., Jaleta, M., Pandey, S., 2014. Measuring the effectiveness of crop improvement research in Sub-Saharan Africa from the perspectives of varietal output, adoption, and change: 20 crops, 30 countries, and 1150 cultivars in farmers’ fields. CGIAR Independent Science and Partnership Council, Rome. Yirga, C., Alemu, D., Oruko, L., Negisho, K., Traxler, G., 2016. Tracking the Diffusion of Crop Varieties Using DNA Fingerprinting Crop Varieties. Research Report 112. EIAR, Addis Ababa, Ethiopia. Yu, L.X., Lorenz, A., Rutkoski, J., Singh, R.P., Bhavani, S., Huerta-Espino, J., Sorrells, M.E., 2011. Association mapping and gene-gene interaction for stem rust resistance in CIMMYT spring wheat germplasm. Theor. Appl. Genet. 123, 1257–1268. Zhu, Z., Bonnett, D., Ellis, M., He, X., Heslot, N., Dreisigacker, S., Chunbao, G., Singh, P., 2016. Characterization of fusarium head blight resistance in a CIMMYT syntheticderived bread wheat line. Euphytica 106 (2), 367–375.
CHAPTER 13
Genomic Selection in Wheat Daniel W. Sweeney, Jin Sun, Ella Taagen, Mark E. Sorrells Plant Breeding and Genetics, Cornell University, Ithaca, NY, United States
Contents 13.1 Introduction 13.2 High-Throughput Phenotyping 13.2.1 High-Throughput Phenotyping Platforms 13.2.2 Application of High-Throughput Phenotyping in GS 13.3 Genotype by Environment Interaction 13.3.1 Use of Environmental Covariates in Prediction Models 13.3.2 Accounting for and Exploiting Epistasis 13.4 Genomic Selection for Wheat Disease Resistance 13.4.1 Wheat Rusts 13.4.2 Fusarium Head Blight 13.4.3 Other Wheat Diseases 13.5 Genomic Selection in Wheat for Nutritional Traits 13.6 Genomic Selection for Wheat Quality Traits 13.6.1 Milling and Flour Quality 13.6.2 Preharvest Sprouting 13.7 Future Prospects 13.8 Conclusion References
273 277 277 278 279 283 286 287 288 289 290 291 292 292 294 295 296 297
13.1 INTRODUCTION Improved utilization of quantitative traits is imperative to the continuous progress of plant breeding. While marker-assisted selection (MAS) and other marker technologies have aided in manipulating simple traits controlled by one or several genes, they are inefficient methods of selection for quantitative traits, which are polygenic and controlled by many small-effect loci (Bernardo, 2008; Heffner et al., 2009). Genomic selection (GS) is a strategy that utilizes phenotypes and high-density marker scores to predict the genomic breeding values of lines in a population (Heffner et al., 2009). By incorporating all marker information in the prediction model, GS captures more variation due to small-effect quantitative trait loci (QTL) and is suited for improving traits with both high and low heritability. Additionally, GS theory indicates that genetic gain per unit time can be improved as a result of Applications of Genetic and Genomic Research in Cereals https://doi.org/10.1016/B978-0-08-102163-7.00013-2
Copyright © 2019 Elsevier Ltd. All rights reserved.
273
274
Applications of Genetic and Genomic Research in Cereals
reduced phenotyping per generation and selection at any stage. Considering the declining costs of genotyping and sequencing services, and stagnant to rising costs of phenotyping, GS is a tool with the potential to substantially improve breeding for high value quantitative traits that are difficult and/or expensive to phenotype. Application of GS in wheat has the potential to significantly improve the breeding effort for quantitative traits including biotic and abiotic stress tolerance, nutritional and end-use quality, and yield. The GS theory was originally developed by animal breeders because of the high cost of phenotyping and the inability to replicate individual genotypes, and has shown promise in many crops including wheat (Meuwissen et al., 2001; Heffner et al., 2011a,b; Lorenzana and Bernardo, 2009). As the limits of QTL mapping for breeding became apparent, notably that biparental mapping populations are not well adapted to many breeding applications and the statistical methods used to identify loci and implement MAS are insufficient for improving quantitative traits, plant breeders began experimenting with GS (Bernardo and Yu, 2007; Heffner et al., 2009).The GS theory proposes using a training population that has been whole-genome genotyped and phenotyped in the target population of environments to predict the performance of related genotyped individuals that do not have phenotypic records (Meuwissen et al., 2001). Genotypic and phenotypic data from the training population are used to train a prediction model that uses genotypic information to calculate genomic estimated breeding values (GEBVs) for the breeding or selection population. The first empirical studies of GS in small grains were published between 2009 and 2011 (de los Campos et al., 2009; Crossa et al., 2010; Heffner et al., 2011a). Studies by Crossa et al. (2010) and de los Campos et al. (2009) used data from The International Maize and Wheat Improvement Center (CIMMYT) wheat breeding program and concluded that genomic markers in a model provided greater accuracy of the predicted breeding value than a model limited to pedigree relationships. Heffner et al. (2011a,b) compared the prediction accuracies of GS and MAS and phenotypic selection for nine grain quality traits in soft white wheat, and concluded that GS could increase the rate of genetic gain per unit time and cost in a wheat breeding program. The notable advantages of GS to the breeder are the potential for a shortened breeding cycle, and/or increased selection accuracy depending on the trait, with increased rate of genetic gains. Additionally, GS models are useful for predicting unobserved genotypes and/or unobserved environments (Burgueño et al., 2012; Heslot et al., 2014). For a more in-depth review of the methodology see Heffner et al. (2009) and Lorenz et al. (2011).
Genomic Selection in Wheat
275
Different wheat breeding programs will have different breeding methods and unique goals, germplasm, and constraints that influence strategy, models, and application of GS. Important considerations for a wheat breeder initiating GS include timing, training population composition and size, phenotyping and genotyping design, budget, and statistical modeling. GS can be applied to early or late stages of wheat breeding, with the respective goals of rapid cycling or increased selection accuracies. Choosing which individuals to include in the training population can be challenging and directly affects the prediction accuracy. Population structure and identification of potential subpopulations are important, as previous studies have clearly indicated that more closely related training and breeding populations lead to higher GS accuracies (Asoro et al., 2011; Wang et al., 2014). The size of the population required is dependent on multiple factors such as trait heritability and level of relatedness, but generally larger populations offer more accurate predictions (Heffner et al., 2011a,b). Multiple studies have indicated that GS accuracy increases linearly with the training population size, and observed plateaus in accuracy may be trait specific. For example, a linear increase in prediction accuracy was observed by Heffner et al. (2011a) when the training population size was increased from 24 to 96 within biparental populations, as well as for predictions within a population of advanced breeding lines when the training population was increased from 96 to 288 (Heffner et al., 2011b). Population size is also affected by the relationship between the training population and selection candidates, with smaller, closely related populations requiring fewer individuals (Rutkoski et al., 2015b). Density and genome distribution are the primary considerations for genetic markers, and genotyping by sequencing (GBS) has been widely used to generate large numbers of nonredundant evenly distributed markers (Heslot et al., 2013b). As a self-pollinated crop, wheat has relatively high levels of linkage disequilibrium between the markers and QTL, and GS may be relatively accurate using fewer markers than in outcrossing species (Heffner et al., 2011b; Rutkoski et al., 2013). It is important to note that accurate phenotypes are required for training GS models and there is a need for potentially novel phenotyping strategies that can maximize replication of the allele rather than the individual (Heslot et al., 2014). The training population should be frequently updated to maintain accuracy of the model’s phenotype and allele marker correlations. For example, if GS is applied early in a breeding scheme, the model should be retrained with the progeny of the selection candidates after two to three rounds of selection (Heffner et al., 2009; Rutkoski et al., 2015b).
276
Applications of Genetic and Genomic Research in Cereals
A range of GS models have been intensively studied for accurate calculation of GEBVs of individuals in the selection population. For an in-depth comparison of GS models, see Heslot et al. (2012) and Pérez-Rodríguez et al. (2012). To briefly acquaint the reader with models commonly referenced in this chapter, ridge-regression best linear unbiased prediction (RRBLUP) and genomic BLUP (G-BLUP) are the two models most frequently used for the GS. These are mixed models that take relatedness into account via additive relationship or genomic matrices, respectively, and can easily fit models with many more markers than observations. The RR-BLUP and G-BLUP follow the assumptions of the infinitesimal model that all loci along the genome have a small effect on a trait.These models equally shrink all marker effects toward zero and are equivalent (Endelman, 2011). Bayesian models assume prior marker effects distributions and differentially weight marker effects. The most commonly used is Bayes-Cπ, which allows some markers to have zero variance with a probability π (Juliana et al., 2017b). These are potentially appropriate models for traits that may be oligogenic. Reproducing Kernel Hilbert Spaces (RKHS) models are semiparametric and are used in situations to capture nonadditive effects. Random forest (RF) is a machine learning approach. Most published GS studies compare multiple statistical models for accuracy, many of which reported little or no difference in prediction accuracy between models (Heslot et al., 2012). The BLUP models are often preferred because they require considerably less computation time, are relatively easy to use, and typically perform as well or better than Bayesian models (Meuwissen et al., 2001; Heffner et al., 2011b; Juliana et al., 2017b). However, a comparison study of wheat, barley, Arabidopsis, and maize data sets with 11 different models by Heslot et al. (2012) suggested that GS in plant breeding could be based on the Bayesian Lasso or weighted Bayesian shrinkage regression models and RF. Estimation of nonadditive effects may be desirable in some circumstances, especially when predicting fixed genotypes from a clonal or inbred training population. Several wheat studies have compared additive and nonadditive models (Mirdita et al., 2015; He et al., 2016). This chapter reviews recent empirical GS studies in wheat and applications to breeding strategies in wheat. The majority of the studies cited are genomic prediction studies in the sense that they are not actually selecting, crossing, and calculating gain from selection using GEBVs from GS models. However, such studies suggest that in practice GS could lead to faster cycling and enable increased selection intensities. At this point in time, GS
Genomic Selection in Wheat
277
studies in wheat have covered the major trait classes of interest in bread wheat: yield, disease resistance, and quality. High throughput phenotyping is beginning to be integrated into GS modeling, as are crop modeling strategies. GS also provides a new tool for exploiting genetic variation in diversity panels and for dealing with the perennial challenge of genotype by environment interactions. Genomics-enabled breeding strategies are critical for wheat breeders to meet the challenges of climate change, emerging diseases, water scarcity, and population growth in the 21st century. GS is a promising tool to help wheat breeders in their mission to improve the livelihoods of wheat growers and consumers worldwide.
13.2 HIGH-THROUGHPUT PHENOTYPING 13.2.1 High-Throughput Phenotyping Platforms With recent rapid progress in genotyping technology, phenotyping has become a critical factor that could impede further advances of plant breeding because of the time and labor required, as well as the accuracy of the data. For these reasons, considerable effort has been put into the development of high-throughput phenotyping (HTP) platforms in crops, in an attempt to generate large-scale, high-density phenotypes with high accuracy and low cost. Field-based HTP platforms including ground-based platforms, aerial platforms, unmanned aerial systems or vehicles (UAS or UAV), spectral satellite imaging, and others (White et al., 2012; Araus and Cairns, 2014; Shakoor et al., 2017) have been established based on remote or proximal sensing and imaging technology. Remote sensing and imaging techniques usually fall into three categories: visible/near-infrared (VIS/NIR) spectroradiometry, infrared thermometry and thermal imaging, and red, green, and blue (RGB) light color digital photography (Araus and Cairns, 2014). Each of HTP platforms has its own advantages, and different sensor and image technologies could be specifically deployed based on the traits of interest and experimental design in the field (Shakoor et al., 2017). Moreover, they are currently applicable in many crops including wheat (Haghighattalab et al., 2016; Liebisch et al., 2015; Tanger et al., 2017; Watanabe et al., 2017). For example, an UAV mounted with RGB cameras was successfully used to estimate wheat plant height and growth rate (Holman et al., 2016), and an UAS equipped with blue-green-NIR was utilized to capture vegetation indices for large wheat breeding nurseries (Haghighattalab et al., 2016), in which the vegetation indices are capable of predicting complex traits such
278
Applications of Genetic and Genomic Research in Cereals
as grain yield (Rutkoski et al., 2016; Sun et al., 2017) and disease resistance (Bauriegel et al., 2011; Devadas et al., 2015). In addition to the field-based HTP platforms, HTP has also been developed for laboratory level evaluations of plant samples and NIRS analysis to analyze grain characteristics (Araus and Cairns, 2014).
13.2.2 Application of High-Throughput Phenotyping in GS HTP platforms provide an opportunity to apply HTP traits in GS. In addition to genotyping, GS models require reliable phenotypes. HTP platforms allow relatively accurate and rapid phenotypic data collection on a large scale, which is useful for GS model training. For example, Watanabe et al. (2017) applied UAV remote sensing in genomic prediction modeling for height measurement of sorghum, and evaluated the possibility of replacing the traditional manual measurement with the HTP data for GS. Although the genomic prediction accuracy was lower using UAV data (in range of 0.448–0.634) compared to traditional manual measurement (in range of 0.629–0.675), the results indicated that UAV remote sensing is feasible. Because of the cost and labor efficiency, HTP platforms are able to monitor the treatments during multiple plant growth stages, which enables the comparison of plant heights at the same stage (Watanabe et al., 2017). The application of HTP platforms in most crops is still at an early stage. However, in the long term, the prediction accuracy of UAV remote sensing has considerable potential compared to traditional measurements when the measurement errors from HTP platforms are reduced through improvement in the experimental designs and HTP technologies (Watanabe et al., 2017). A HTP platform for wheat plant height has been developed as well (Holman et al., 2016) using UAV-based remote sensing, and similar approaches could be applied for wheat genomic prediction. Traits from HTP platforms could also be applied in multi-trait GS to predict complex traits. A multi-trait GS model (Jia and Jannink, 2012) takes advantage of correlated traits to improve the genomic prediction accuracy for the trait of interest. When different HTP traits are correlated with the trait of interest, such traits can be utilized to improve genetic gain (Sun et al., 2017). Grain yield is a complex quantitative trait that is influenced by environment. Canopy temperature (CT) and normalized difference vegetation index (NDVI) are genetically correlated with grain yield (Rutkoski et al., 2016) and can be easily measured from HTP platforms (Haghighattalab et al., 2016). Rutkoski et al. (2016) and Sun et al. (2017) utilized CT and NDVI from HTP platforms to predict wheat grain yield using multi-trait
Genomic Selection in Wheat
279
GS models and achieved an improvement in the prediction accuracy for grain yield by 70% on average in different environments. Compared to traditional measurements, especially for an environment-dependent trait such as CT, HTP platforms are superior to hand measurements because of the reduced data collection time and measurement errors caused by the external environment. In addition, the time-series data for CT and NDVI from HTP platforms offers the opportunity to select wheat cultivars with high predicted grain yield at early plant growth stages enabling selection to occur before harvest. HTP data can be used in combination with genotypes to improve genomic prediction accuracy, which would be highly beneficial in situations where grain yield could not be phenotyped due to severe weather or because of a lack of seeds (Rutkoski et al., 2016). The applications of HTP platforms in GS demonstrate their capability to expedite crop selection and increase genetic gains. It is expected that more HTP traits from HTP platforms will be accessible in the near future. However, the development of HTP platforms is lagging far behind the genotyping technologies. In order to effectively promote the application of HTP, more efforts are needed to develop low cost and high-performance HTP platforms (Shakoor et al., 2017) and to improve the big data integration, processing, and management from the HTP platforms (Araus and Cairns, 2014; Shakoor et al., 2017).
13.3 GENOTYPE BY ENVIRONMENT INTERACTION When using GS to predict grain yield or plant biomass, genotype by environment (G×E) interaction is an important factor affecting the prediction accuracy, more so than for animals. Whole-genome genotyping has created the opportunity for new approaches to characterize and understand G×E interaction. Studies have evaluated the impact of G×E interaction on prediction accuracy and the use of environmental information in prediction models. Genomic prediction studies can use historical multi-environment phenotypic data together with available genotypic data to base selection on a target set of environments which may encompass more than the typical few years of data used by classical advanced testing programs (Heffner et al., 2009). It is important to note that atypical environments may impact GS more than conventional phenotypic selection because phenotypic data are only used to select or discard breeding lines, not to train a model. Consequently, genetic gain from phenotypic selection is only affected for a short time, whereas genetic gain from GS can impact marker effect e stimates
280
Applications of Genetic and Genomic Research in Cereals
and performance of relatives that will alter selection criteria in future selection cycles. The GS may also be able to predict the performance of individuals in untested environments by using the phenotypes of its relatives that have been tested in those environments. Crossa et al. (2010) and Burgueño et al. (2011) were among the first to report that marker effects were different across environmental conditions. Burgueño et al. (2011) compared linear mixed models and factor analytic models for their predictive ability. When G×E interaction was important, modeling G×E interaction with the factor analytic model increased prediction accuracy, but when G×E interaction was not significant, prediction accuracies were similar for all the models. Burgueño et al. (2012), using the same 599 CIMMYT wheat lines as Crossa et al. (2010), were the first to apply the modeling of G×E covariance structures in multi-environment trials. When they included G×E interaction in the model, the prediction ability for unobserved individuals increased by about 20% compared to a single-environment prediction model. Models that included both markers and pedigrees were more predictive than those that included one or the other. Additionally, prediction accuracies were higher for predicting the performance of genotypes in untested environments than for predicting untested genotypes. This study showed that modeling the genetic covariance between environments could increase the accuracy when predicting performance within specific environments for lines that were only observed in some of the environments. The model was not predictive for lines with no phenotypic information and environmental covariates (ECs) were not incorporated in the model. Other studies that included G×E interaction in the prediction model also observed an increase in accuracy (Heslot et al., 2013a; Jarquín et al., 2014; Lado et al., 2016; Lopez-Cruz et al., 2015; Cuevas et al., 2016a,b; Pérez-Rodríguez et al., 2017). However, Dawson et al. (2013) analyzed 17 years of data from the International Center for Maize and Wheat Improvement’s (CIMMYT) Semi-Arid Wheat Yield Trials (SAWYT) and found that there was no difference in accuracy between the models that took into account G×E interactions and models that did not. They also grouped locations based on genomic predictions for all genotypes in all environments but this did not improve accuracies within groups. They concluded that there was no consistent pattern of G×E interaction among the mega-environments, and the unbalanced dataset could not be partitioned into clusters that had predictive power. Lado et al. (2016) characterized and modeled G×E interaction to design sets of environments having low G×E interaction to improve the prediction
Genomic Selection in Wheat
281
accuracy of wheat genotype performance in untested environments. They employed mixed models to construct a covariance matrix across the environments using a highly unbalanced dataset to predict performance within or across the different sets of environments.They found that the best method for predicting new genotypes was to borrow information from relatives evaluated in other environments and then model the correlation matrix across the environments. For predicting the performance of genotypes in new environments the best approach was either to predict within defined mega-environments or across locations for single years. Lopez-Cruz et al. (2015) modeled G×E interaction using a marker × environment (M×E) interaction GS model. They compared the M×E interaction model with both within-environment and across-environment analyses. When predicting the performance of genotypes in environments in which they have not been tested the M×E interaction model produced 10% higher prediction accuracy than the within-environment analysis and 37% higher accuracy than the across-environment analysis. They concluded that the M×E interaction model could help identify markers linked to genes that contribute to stability across environments as well as those that interact with environments. However, the M×E interaction model is somewhat limited in the ability to interpret patterns of G×E interaction that are neutral or negative so the authors concluded that the model is best suited for the analysis of the positively correlated environments. Cuevas et al. (2016a,b) used a Bayesian genomic kernel model to account for the correlations between environments. The model that included G×E interaction consistently had higher prediction accuracy than single-environment models. In an earlier study, Cuevas et al. (2016a,b) used nonlinear Gaussian kernels to model M×E interaction in the wheat dataset reported in Crossa et al. (2010). The RKHS model was extended to take into account the G×E interaction. For the wheat data, the prediction model including G×E interaction was up to 60% more accurate. Continuing their study of M×E interaction, Crossa et al. (2016) evaluated the use of priors that produce shrinkage (Bayesian ridge regression) and variable selection (BayesB). They evaluated the genomic prediction accuracy of models for M×E interaction, within and across environments in a durum wheat population. They found that the M×E interaction model minimized the model residual variance and improved data-fitting gain for more simply inherited traits compared to more complex traits such as grain yield. Interestingly, the M×E interaction model found markers linked to major genes for heading date and their effects were stable across e nvironments.
282
Applications of Genetic and Genomic Research in Cereals
For grain yield, several large marker effects were identified in all the chromosome groups. The above studies that model G×E interaction are useful for understanding the relationships among environments and for increasing GS prediction accuracy. However, they are based on observed covariances among environments and, consequently, they can only describe past performance instead of being predictive of future performance. Pérez-Rodríguez et al. (2017) used single-step genomic and pedigree models to determine the prediction accuracy of 58,798 CIMMYT wheat lines grown under different management conditions including irrigated, drought, late heat, severe drought, and early heat over seven seasons in Ciudad Obregon, Mexico. Only 29,484 individuals had genotypes consisting of 9045 markers.The trained models were used to predict grain yield of some lines at sites in South Asia. Phenotypes, pedigrees, and genotypes were used to evaluate the lines using a single-step model that combined pedigree and marker information into a unified H matrix. For the models without G×E, pedigree alone gave the highest accuracies (0.16–0.26). When G×E was included in the model, pedigree alone produced the highest accuracies in four of the six South Asia sites (0.25–0.29), markers were best for one site (0.39) and pedigree plus markers was most accurate at one site (0.25).There was a clear advantage to including G×E interaction in the model but overall pedigree alone produced accuracies as good as genotypes or genotypes plus pedigrees. Heslot et al. (2013a) took a novel approach to analyze unbalanced data sets that are typical in plant breeding testing programs. Although all genotypes are not observed in all the environments, all marker effects are observed in all environments allowing the use of Euclidean distances to determine the relationships among environments. When marker effects were used to cluster environments, clear patterns were observed and outlier environments were readily identified (Fig. 13.1). Breeder field notes corroborated the abnormal environments but grouping environments based on similarity of marker effects did not improve the prediction accuracy. In addition, the marker effects can be used to compute the prediction accuracy between pairs of environments to generate a reciprocal prediction accuracy matrix between the environments. When grouping the environments based on the reciprocal prediction accuracies between environments, the prediction accuracy for yield across environments increased significantly. In a second experiment to manage G×E interaction, Heslot et al. (2013a) optimized the composition of the training population by first computing the average predictive accuracy of each environment for predicting the line
Genomic Selection in Wheat
283
Fig. 13.1 Heat map showing the similarity of environments based on Euclidian distances computed using marker effects. Environment comparisons with red shading are more dissimilar and those environments with blue shading are more similar (Figure 3 from Heslot et al., 2013a).
performance in the other environments in the same dataset. Then beginning with the least predictive environments they were removed one at a time and then the model was retrained on the remaining environments.The environments that were removed were placed in an unpredictive set and the prediction accuracy of that set was calculated (Fig. 13.2). If the prediction accuracy of the predictive set improved, the process was repeated until it no longer increased and the remaining environments were considered to be the optimal set. In their study, 18 out of the 58 environments were removed and accuracy increased from 0.54 to 0.61.
13.3.1 Use of Environmental Covariates in Prediction Models Conceptually, it would seem desirable to include environment descriptors in the prediction models to increase the accuracy and to characterize G×E interaction. However, when analyzing large numbers of markers and ECs, methods are required to limit the computational burden. One solution
284
0.4 0.2
Accuracy
0.6
0.8
Applications of Genetic and Genomic Research in Cereals
Legend:
0.0
Predictive set Unpredictive set Prediction 0
10 20 30 40 50 Number of environments excluded from the predictive set
60
Fig. 13.2 Optimization of the training population. The blue dots are cross-validated accuracies for the selected training population (predictive set) and red triangles are prediction accuracies for the environments removed from the training population (unpredictive set). Green squares are the prediction accuracies for a validation set observed in 2011 (Figure 5 from Heslot et al., 2013a).
is to select a few of the most important ECs and a reduced number of markers. However, this can result in the loss of important information. Jarquín et al. (2017) modeled interactions between markers and ECs by describing genetic and environmental gradients as linear functions of the markers and ECs which they referred to as a reaction norm. Their dataset consisted of grain yield for 139 wheat lines grown in 340 environments. The lines were genotyped with 2395 single nucleotide polymorphisms (SNPs) and they compared a model with only main effects with one that included 68 ECs. Prediction accuracy was low when trying to predict new lines based on main effects only (0.175) but increased 35% (0.236) when ECs were included in the model. For lines that had been tested in some environments, including ECs in the model increased the accuracy by 17% from 0.439 to 0.514.
Genomic Selection in Wheat
285
In a different approach, Heslot et al. (2014) used a crop model to integrate ECs into the GS framework to predict G×E, to increase the prediction accuracy, and to better understand the genetic architecture of G×E. Their dataset consisted of 2437 elite inbred lines genotyped with 1287 SNP markers and grown 44 environments over 6 years in France.They used a crop model known as SiriusQuality (Martre et al., 2006) to compute the phenology and synchronize early, medium, and late maturing genotypes with the weather data. Environmental stress covariates (climatic variables at a specific developmental stage) by developmental stage were derived by using prior knowledge about sensitivity of specific growth stages to abiotic stresses from the literature. Environmental stress covariates were used as independent variables in statistical genetic models for effect estimation and prediction. By extending the factorial regression model to the GS context each marker was fit as a main effect and as a sensitivity to each of the stress covariates. The interactions between markers and stress covariates as were captured using a machine learning algorithm and genotype performance was predicted as a main effect plus a G×E deviation. The large number of markers and ECs presented computation issues so they evaluated the variance of marker effects across environments and eliminated the markers with little or no variation. The photoperiod sensitivity gene Ppd-D1 had the highest variance but did not capture a significant part of the G×E variance. Models were evaluated using cross-validation and the best model consisted of 250 markers in combination with the nonlinear soft rule fit algorithm. The most important EC was the sum of the average daily temperature between the meiosis and the flowering. Also important was drought in the early spring measured by “total number of dry days to 350 degree days” and the “sum of precipitation and evapotranspiration potential.” The factor analytic model can predict a G×E response for any genotype in any environment, even if an environment had no phenotypic data for that genotype. This allows the calculation of Euclidean distances for all environments by using the predicted level of genetic correlation between the environments. The G×E interactions can be clustered to show the structure of the TPE. By including the G×E interaction component, prediction accuracy for genotype performance in unobserved environments increased by 11.1% on average and the variability in prediction accuracy decreased by 10.8%. In contrast to the approaches that used covariances among environments to improve prediction accuracy the use of selected ECs made it possible to predict performance in unobserved environments rather than taking a retrospective view of G×E interaction. This approach also contributes
286
Applications of Genetic and Genomic Research in Cereals
information about the plant response to the environment so that the breeder can leverage agronomy and physiology knowledge, reduce dimensionality and nonlinearity, use existing breeding data, and interpret results to identify specific environmental stresses.
13.3.2 Accounting for and Exploiting Epistasis Genomic prediction models can be applied either to the segregating populations or to the advanced inbred lines. In the case of segregating populations, we are concerned with the estimation of breeding value and appropriate linear models (Fig. 13.3). However, when predicting genotypic value of pure lines, models that can account for gene interactions may provide higher prediction accuracy. In many of the studies cited above, multiple models were compared, and linear models often perform as well as nonlinear models. However, some of the reports, especially those using CIMMYT wheat datasets have found that nonlinear models such as RKHS tend to increase prediction accuracy for complex traits such as grain
Integrating genomic selection in a breeding program Recurrent selection stage GS models emphasizing additive genetic variation Spring plant and harvest
Inbreeding and evaluation stage F2 GH/field advance SSD/bulk, MAS F3 field advance bulk or row-pheno, MAS F4 field advance bulk or plot-pheno, MAS
S0 self Winter Crossing
GH crossing and selfing cycle Fall plant and harvest
Summer Crossing S0 self
Training population F4:5, master N., phenotype, select uniform spikes, genotype, GS+PS GS models capturing non additive genetic variation
Advanced regional testing GEBV + phenotype
Planting GS-selected individuals, intermating, self-pollination and S1 seed harvest occur twice a year MAS and phenotyping can be applied to F2–F4 generations F4s can be phenotyped and F5 spikes selected for uniformity and for GBS genotyping Selected lines enter the master training nursery Each year selected lines are entered in the regional trials and/or recycled in the crossing program
Fig. 13.3 Integration of GS in a pureline breeding program. In the rapid cycling phase, GS is used to enhance gain per unit time. In the inbreeding phase, MAS and PS can be imposed until the F4 or F5 generation and then whole-genome genotyping is used to select individuals that enter the training population or are recycled in the crossing program. Each phase is conducted simultaneously and the GS models are updated annually.
Genomic Selection in Wheat
287
yield (e.g., Cuevas et al., 2016a; Crossa et al., 2010; Pérez-Rodríguez et al., 2012). Theoretically, epistatic effects can be estimated for a large number of markers but practically speaking, the large number of epistatic interactions makes it computationally difficult. Models such as RKHS regression and radial basis function neural networks (RBFNN) (González-Camacho et al., 2012) can indirectly map marker effects into a high n-dimensional space that can capture nonadditive genetic effects. Pérez-Rodríguez et al. (2012) compared the prediction accuracies of four linear GS models with three nonlinear models including RKHS regression, Bayesian regularized neural networks (BRNN), and RBFNN for days to heading and grain yield of 306 CIMMYT wheat lines. For days to heading, either RKHS or RBFNN had the highest accuracy for all 12 environments. Prediction accuracy for grain yield was highest for RKHS and RBFNN in six of the seven environments. Crossa et al. (2014) combined pedigree-derived additive and epistatic additive × additive relationship information in a single model-to-model G×E interaction. Including the pedigree additive × additive relationships in the models increased the prediction accuracy in three of the four environments indicating that modeling additive-by-additive epistasis with G×E interaction is important for increasing the accuracy of predictions in wheat breeding populations.
13.4 GENOMIC SELECTION FOR WHEAT DISEASE RESISTANCE Disease resistance in crops can be broadly classified into qualitative and quantitative, or nondurable and durable, resistance. Qualitative resistance is conditioned by single large effect genes, often termed R genes, that provide strong gene-for-gene resistance against specific races of a given pathogen species. Rapid pathogen evolution can overcome such resistance, particularly if the resistance genes are widely deployed in cultivated varieties. Quantitative resistance is conferred by many small-effect genes and provides more durable resistance because the pathogens cannot overcome multiple modes of resistance quickly. However, traditional qualitative disease resistance breeding strategies like pyramiding and MAS are not efficient or effective in breeding for quantitatively inherited disease resistance. GS for disease resistance could overcome these shortcomings and lead to more efficient breeding for disease resistance. Breeding for disease resistance and the applications of GS for disease resistance in crops are comprehensively reviewed in Poland and Rutkoski (2016).
288
Applications of Genetic and Genomic Research in Cereals
13.4.1 Wheat Rusts Wheat rusts are typically caused by one of three main pathogens: Puccinia graminis (stem rust), Puccinia striiformis (stripe/yellow rust), or Puccinia triticina (leaf/brown rust).These obligate biotrophs are global in distribution and historically have been the most serious wheat pathogens (Poland and Rutkoski, 2016). Wheat rust resistance can be qualitative or quantitative. Seedling resistance is qualitative and quantitative resistance expressed at the adult plant stage is termed adult plant resistance (APR) (Poland and Rutkoski, 2016). Durable rust resistance is needed for wheat cultivars worldwide as quickly evolving rust pathogens often overcome R gene resistance. Ornella et al. (2012) were the first to investigate GS for wheat stem and yellow rust. They found moderate to high prediction accuracies by cross-validating within biparental populations and moderate prediction accuracies between populations when they were related. Predicting an environment from a correlated environment was found to be useful as well. Daetwyler et al. (2014) used a diverse group of wheat landrace accessions to train GS models for all the three rusts. Adding diagnostic PCR markers for the R genes Lr34, Sr57, and Yr18 improved prediction accuracy but a linked marker for Sr2 did not improve stem rust prediction accuracy because Sr2 is a recent introgression not present in older landraces. Rutkoski et al. (2014) followed a similar strategy in GS for stem rust APR in spring wheat. They ran genome-wide association studies (GWAS) and fit significant markers as fixed effects in GS models. Markers linked to Sr2 were significant in their germplasm and improved prediction accuracies, but seedling disease scores did not fit as fixed effects.To date, only Rutkoski et al. (2015a) have published the actual GS gain from selection experiment for disease resistance in wheat. They compared two cycles of GS to one cycle of phenotypic selection (PS) and examined inbreeding, genetic variance, and correlated response to selection. GS performed as well as PS per unit time with equal selection intensity but decreased genetic variance and increased inbreeding. Muleta et al. (2017) found that the prediction accuracies for stripe rust did not change appreciably in a diverse set of accessions from the USDA National Small Grains Collection when a subset of markers was used for prediction models instead of the whole marker set. They also found that combining genetically distant clusters in cross-validation led to higher prediction accuracy than cross-validation within clusters. Juliana et al. (2017a) evaluated least squares (LS), G-BLUP, G-BLUP with significant fixed marker effects, and three RKHS models for seedling leaf and stripe rust resistance and leaf, stripe, and stem rust APR using GBS SNPs.
Genomic Selection in Wheat
289
Overall, G-BLUP and RKHS models performed the best but LS models performed as well or better when R genes were present in the training population germplasm.
13.4.2 Fusarium Head Blight Fusarium head blight (FHB) is a devastating fungal disease caused by Fusarium graminearum in humid small grain production areas worldwide. The FHB disease symptoms include accumulation of the potent mycotoxin deoxynivalenol (DON) in the wheat grain. The DON is a particularly difficult phenotype for selection because DON levels are frequently poorly correlated with visible disease symptoms. The FHB resistance is largely quantitative, with a single large effect QTL, Fhb-1, popularly introgressed into breeding material in North America (Jin et al., 2013). Rutkoski et al. (2012) published the first study of GS for FHB resistance in wheat.They found that GS models, especially RF and RKHS, gave higher prediction accuracies for incidence and severity-related traits than multiple linear regression (MLR) models for MAS. Addition of QTL as fixed effects did not improve model performance. However, for DON, MLR performed as well as GS models and using QTL markers alone performed better than genome-wide markers. Mirdita et al. (2015) used a large set of 2325 breeding lines and varieties in 11 environments over 2 years as a training population for FHB severity. The authors tested two epistatic models, RKHS and extended G-BLUP, in addition to RR-BLUP and Bayes-Cπ. They found the best prediction accuracies with epistatic models, with a mean prediction accuracy of 0.6 with RKHS. Arruda et al. (2016) also compared MAS to GS for FHB and specifically looked at the additions of the large effect Fhb-1 locus, independent QTL, and QTL in the training population as fixed effects. GS models consistently outperformed the MAS models and the addition of significant QTL in the training population as fixed effects substantially increased the prediction accuracy. The authors cautioned that this is likely due to what they term “inside trading,” that is, estimating QTL effects in the same population that is used for the training population and including those effects as fixed inflates the prediction accuracy. Jiang et al. (2017) compared independent validation of MAS and GS models with cross-validation using a large set of European wheat breeding lines phenotyped for FHB resistance. By sampling across genotypes and environments, cross-validated GS models were not inflated compared to independently validated models; however, MAS models were inflated using cross-validation.The authors also looked at prediction accuracies of individual genotypes using the reliability
290
Applications of Genetic and Genomic Research in Cereals
criterion, an analysis commonly used in animal breeding but infrequently utilized in plant breeding. Prediction accuracies of individual genotypes using the reliability criterion were four to sixfold higher for individuals with high reliability than with low reliability.
13.4.3 Other Wheat Diseases Juliana et al. (2017b) used two CIMMYT bread wheat screening nurseries to build GS models for three major necrotrophic foliar diseases of wheat: Septoria tritici blotch (STB), Stagonospora nodorum blotch (SNB), and tan spot (TS). The LS, G-BLUP, four Bayesian, and three RKHS models were tested for pedigree and GS using GBS SNP and DArT markers. Prediction accuracies were moderate to high for all traits and for all models. The LS models consistently performed the worst and RKHS models incorporating markers and pedigree data tended to perform the best. Mirdita et al. (2015) used the same strategy for STB prediction as for FHB. They found the best prediction accuracies with epistatic models, with a mean prediction accuracy of 0.5 for STB with RKHS, which is comparable to the results of Juliana et al. (2017b). GS is a promising tool for wheat breeders to improve quantitative disease resistance in wheat. Prediction accuracies for rust resistance at the seedling and adult stage generally were high to moderate and tended to improve when R genes were included as fixed effects. Prediction accuracies for FHB traits in all studies tended to be moderate to high, with GS models consistently outperforming MAS models, a notable exception being DON in Rutkoski et al. (2012). Several studies showed that including significant markers for R genes or large effect QTL can increase prediction accuracy for disease resistance traits (Arruda et al., 2016; Daetwyler et al., 2014; Rutkoski et al., 2014). Association mapping is a useful way to screen large effect markers in diverse populations and is a worthwhile analysis to run when building GS models for disease resistance. The impact of adding such markers as fixed effects to GS models is highly dependent on population structure and the frequency of the causative locus in the training population. However, as Arruda et al. (2016) caution, inclusion of QTL from the training population as fixed effects may unrealistically inflate prediction accuracies. Inclusion of diagnostic markers as fixed effects for large effect QTL like Fhb-1 or R genes like Sr2 known to be present in the training and validation populations may be beneficial depending on the breeding goals and germplasm. Models capturing epistatic effects appear to be useful for some diseases.
Genomic Selection in Wheat
291
13.5 GENOMIC SELECTION IN WHEAT FOR NUTRITIONAL TRAITS Wheat is one of the most widely produced and consumed cereal grains in the world, serving as a primary source of calories for millions of people worldwide. Access to sufficient calories is not adequate for total nutrition. Micronutrient deficiencies are widespread in developed and developing nations alike and are a strong contributor to malnutrition. Breeding for increased vitamin and mineral content, or biofortification, of staple crops such as wheat is a potential avenue for alleviating micronutrient deficiencies and malnutrition. Iron (Fe) and zinc (Zn) deficiencies, respectively, affect over 800 million and 1 billion people worldwide and are particularly harmful for women and children (HarvestPlus, 2017). Recent biofortification efforts such as HarvestPlus, a Consortium of International Agriculture Research Centers (CGIAR) program, have resulted in the release of high Zn wheat varieties in South Asia (HarvestPlus, 2017). Genetic control of concentrations of many grain micronutrients appear to be quantitative and are therefore potential targets for GS (Trethowan et al., 2005; Velu et al., 2014). Micronutrient traits also require specialized phenotyping equipment, restricting breeders with minimal resources and budgets. Genomic prediction models for wheat grain Fe and Zn concentrations were examined by Velu et al. (2016) using the CIMMYT HarvestPlus Association Mapping panel, a population containing landrace progenies and various synthetic-derived progenies. The authors found low to moderate prediction accuracies for grain Fe and Zn concentrations measured across two growing seasons in multiple locations. The best prediction accuracies were obtained using G-BLUP models with both genetic and pedigree relationship matrices and inclusion of both genotype by environment (G×E) and pedigree by environment kernels. Several studies have shown that incorporating G×E interactions from multi-environment trials improves prediction accuracy across environments compared to not accounting for such interactions (Burgueño et al., 2012; Lopez-Cruz et al., 2015).Wheat Fe and Zn concentrations are heavily influenced by native soil Fe and Zn contents and tend to show large G×E interactions (Ortiz-Monasterio et al., 2007). Manickavelu et al. (2017) analyzed a set of Afghan wheat landraces grown in Japan and Afghanistan for grain potassium, phosphorus, magnesium, iron, zinc, and manganese contents. The authors ran GWAS for each trait and discovered only a single significant association for Zn, further suggesting the quantitative inheritance of wheat grain nutrient traits. Multiple models for each grain nutrient trait were fit in each country and accuracies were found
292
Applications of Genetic and Genomic Research in Cereals
to be moderate to high, with macronutrient (P, K, Mg) accuracies higher than micronutrient (Fe, Zn, Mn) accuracies. Prediction accuracies in Japan were higher than that in Afghanistan, which was attributed to large environmental variance and soil conditions in Afghanistan. The G×E interactions were not explicitly modeled and multi-trait models were not evaluated despite significant genetic correlations between macronutrients.
13.6 GENOMIC SELECTION FOR WHEAT QUALITY TRAITS The value of wheat lies not only in yield per se, but also in its quality.Wheat quality is determined by a suite of qualitative and quantitative traits. Milling quality and market class of wheat are largely determined by hardness, an oligogenic trait, and protein content, a quantitative trait (Edwards et al., 2010; Battenfield et al., 2016). Hard wheats have a stronger starch-protein attachment than soft wheats. The additional energy needed to break these attachments in milling creates higher proportions of damaged starch granules, which aid in water absorption and produce better baking performance for leavened products (Battenfield et al., 2016). Soft wheats have less damaged starch and are preferred for cookies, cakes, and pastries. Hard wheats tend to have high protein and soft wheats tend to have low protein. Kernel weight, flour yield, flour protein, flour color, starch damage, and protein quality are additional wheat milling quality traits of importance to breeders and end users.The primary storage proteins in wheat, glutenins and gliadins, give wheat its unique viscoelastic properties in baking (Battenfield et al., 2016). Dough rheology measurements are taken by mixing flour with water to determine dough elasticity, strength, tolerance, and optimal mixing time conferred by glutenin and gliadin contents (Battenfield et al., 2016). Farinographs, mixographs, and alveographs measure different dough rheology components but are time, flour, and cost intensive. Baking quality traits such as loaf volume and loaf texture are valuable as well and also require significant resource inputs that may not be available or feasible in early generation breeding trials.
13.6.1 Milling and Flour Quality Heffner et al. (2011a,b) were the first to develop prediction models for milling and flour quality traits. In both studies phenotypic prediction accuracy performed better for all end-use quality traits than GS models. Heffner et al. (2011a) compared RR-BLUP and Bayes-Cπ models and found that RRBLUP performed better in a classic QTL mapping biparental p opulation
Genomic Selection in Wheat
293
while Bayes-Cπ performed better in an elite by elite population. In general, prediction accuracies for quality traits from multifamily predictions (Heffner et al., 2011b) tended to be higher than the accuracies from the biparental predictions (Heffner et al., 2011a). This is seemingly in contradiction with the general paradigm that prediction accuracies are maximized within biparental populations (Crossa et al., 2014; Lehermeier et al., 2014). However, the training population size in Heffner et al. (2011a) was only 96 individuals in each family and ~450 markers were used whereas in the multifamily study Heffner et al. (2011b) used a training population size of 288 and 1158 markers. Hoffstetter et al. (2016) measured flour yield and flour softness in an F4-derived breeding population of soft winter wheat. The authors tested subsetting the training population and marker sets to improve prediction accuracy. Removing lines with high G×E interaction did not change prediction accuracies within environment for quality traits but subsetting markers to only include markers with significant associations to the trait substantially improved prediction accuracy for flour yield and flour softness compared to using the complete marker set. Other studies testing reduced marker set prediction accuracy typically have found minimal difference between whole and subset marker sets. Several GS studies that have been recently published make use of large diverse training populations and comprehensive HTP of wheat quality traits. Battenfield et al. (2016) utilized a large CIMMYT spring wheat multiyear breeding population data set with full processing and end-use quality phenotypes to test multiple GS models for the accuracy of forward prediction for the next year. Forward prediction accuracies from 2011 to 2015 increased as more years were added to the models but cross-validation with all years outperformed all forward prediction models in all traits. The authors postulated that cross-validation prediction accuracies are inflated due to not accounting for G×E interactions and leveraging information from relatives across years that is not available in forward prediction. Increase in response to selection from GS was greater than 100% for flour yield, grain protein, flour protein, and several dough rheology traits (Battenfield et al., 2016). The authors saw GS for end-use quality as a complement, not a replacement, for phenotypic selection in the CIMMYT breeding program, but expected that gain from selection could ultimately be much higher considering that 10,000 lines can be genotyped at the same cost as phenotyping 1000 lines for end-use quality. Hayes et al. (2017) integrated near-infrared (NIR) and nuclear magnetic resonance (NMR) HTP into multi-trait GS models using a large training population of public and private breeding
294
Applications of Genetic and Genomic Research in Cereals
lines and synthetic derivatives. In all, 44 quality traits were phenotyped in the training set over 2 years, including a set of noodle quality traits; the validation set was composed of training set subsets and Australian National Variety Trials (NVT) over 3 years. Including NIR and NMR phenotypes as correlated traits increased the prediction accuracy for grain, milling, and baking traits but noticeable improvements were not observed for dough rheology traits. Prediction accuracies were largely above 0.5, likely sufficient for effective early generation end-use quality selection (Hayes et al., 2017). Such accuracies in the NVT validation sets, which contain elite breeding lines, showed the robustness of the diverse training set to sufficiently predict elite breeding material.
13.6.2 Preharvest Sprouting Preharvest sprouting (PHS) occurs when wheat is exposed to excessive moisture in the field before harvest and germination processes are initiated. The PHS damage is only visible to the naked eye in the most serious circumstances when rootlets and coleoptiles form on the spike. The PHS damage can be measured directly through germination tests of loose grain and mist tests of individual spikes, or indirectly by falling number, an indirect measurement of alpha-amylase activity through dough rheological properties. GS for PHS resistance in wheat was first tested by Heffner et al. (2011a,b). In the first study, a single biparental population of 96 individuals segregating for PHS resistance was used to train prediction models. The PS accuracy was found to be significantly higher than accuracy for RR-BLUP or Bayes-Cπ, but both GS models performed much better than MAS. The authors cite the high heritability of PHS, small training population size, low-density markers, polygenic nature of PHS, and low G×E as reasons for the superior performance of the PS in the study. The second study used a multifamily training population of soft winter wheat breeding lines and evaluated several genomic prediction models as well as association analysis models to test the prediction accuracies and net merit from selection indices. Prediction accuracies for PHS using GS models outperformed association analysis models but not PS accuracy. However, in both studies, the authors conclude that the reduced cycle time and cost of GS for PHS and other quality traits would be superior to PS. Moore et al. (2017) used a panel of 1118 hard red and white winter wheat breeding lines as a training population with GBS SNP markers and PHS resistance phenotypes from germination tests. White wheats tend to be more susceptible to PHS than red wheats but no significant improvement in prediction accuracy was ob-
Genomic Selection in Wheat
295
served by adding kernel color as a fixed effect. However, they did find that adding five significant markers from GWAS improved prediction accuracy, supporting earlier findings from Bernardo (2014), Daetwyler et al. (2014), and Arruda et al. (2016).
13.7 FUTURE PROSPECTS Long-term GS experiments are still lacking in wheat and empirical results of response to GS in wheat are limited to the study by Rutkoski et al. (2015a). Realized gains from selection using GS are expected to be greater than those from phenotypic selection if selections based on GEBVs can substantially shorten the breeding cycle time. Heffner et al. (2010) simulated a public winter wheat breeding program and estimated long-term gain from selection for GS and MAS. Low to moderate GS prediction accuracies were sufficient to outperform MAS. Even in high heritability scenarios, GS was predicted to result in 1.5-fold greater genetic gain than MAS. Full implementation of GS into a breeding program would require significant reallocation of resources and a reordering of breeding objectives, but costs are expected to be equal to or less than MAS as phenotyping becomes less prevalent and occurs mostly to update prediction models. Resources would be shifted to genotyping and crossing costs instead of quality assays. Instead of phenotyping quality traits on a small number of lines in early or advanced yield trials, genomic prediction models could be used to discard a large percentage of early generation progeny, even F2 lines, with subsequent selection at harvest based on genomic and NIR/NMR predictions (Hayes et al., 2017). The immediate benefits of reduced phenotyping costs are more apparent for quality traits that involve specialized machinery, expensive reagents, and large quantities of flour than for disease resistance that can be scored at the seedling stage. However, Poland and Rutkoski (2016) point out that well trained GS models enable selection for yield at any stage; this allows for yield-based selections in very large early generation disease screens. They also highlighted the ability to simultaneously select for qualitative and quantitative disease resistance, which is typically hindered by R genes masking smaller effect QTL. Genomic prediction models could allow early generation breeding material instead of advanced material to be evaluated for micronutrient content and/or end-use quality. Such models could improve micronutrient breeding efforts in developing nations where consistent access to micronutrient phenotyping equipment is rare and biofortified varieties are most needed. GS for biofortification is additionally attractive because
296
Applications of Genetic and Genomic Research in Cereals
c ulturally important local crops could be efficiently improved without introducing new species or varieties that may not be readily accepted. GS models and breeding schemes accounting for nutritional quality G×E interaction in wheat are attractive because of the large effect of soil on grain micronutrient content. Early selection of quality traits could enable breeders to target specific consumer niches like craft baking, export markets, and health products with more precision which in turn could increase the value of wheat products for growers and processors. Genomic prediction models could be used to predict quality performance in yield trials across diverse environments without phenotyping (Burgueño et al., 2012; Lopez-Cruz et al., 2015). Breeders may be able to breed for several disparate markets at the same time with relative ease by creating differentially weighted selection indices for their prediction models. In a world where improved basic cereal grain nutrition and tailor-made cereal grain products are paradoxically needed, GS for nutritional and end-use quality traits in wheat offers exciting possibilities for breeders, farmers, processors, and consumers.
13.8 CONCLUSION Genomic prediction models have been trained for a large number of economically important traits in winter and spring wheat, including the ultimately most important trait, yield. Genomic prediction has also been shown to be an effective approach to dissect complicated epistatic and genotype by environment interactions in wheat. In the face of increasingly unpredictable weather and annual trends, breeding climate resilient wheat varieties will depend on proper exploitation of these two phenomena. GS provides the opportunity to select for any quantitative trait of interest at any stage of the breeding cycle, giving more power and freedom to today’s breeders. The dawn of the genomics-enabled breeding age is still upon the plant breeding world. Access to, and funding for, consistent genotyping is a requirement of applied GS, yet remains a challenge for public breeding programs globally. The promise of GS has yet to be fully realized in the developing world. Long-term effects of GS on variance and inbreeding in wheat have been theorized but presently have little empirical evidence. Applications of HTP to wheat genomics are actively being pursued but remain in their infancy. There is high potential for many unexplored applications of GS in wheat breeding. GS is not a breeding panacea, but is an invaluable new tool for wheat breeders in every season, every market, and every set of breeding goals.
Genomic Selection in Wheat
297
REFERENCES Araus, J.L., Cairns, J.E., 2014. Field high-throughput phenotyping: the new crop breeding frontier. Trends Plant Sci. 19, 52–61. Arruda, M.P., Lipka, A.E., Brown, P.J., Krill, A.M., Thurber, C., Brown-Guedira, G., Dong, Y., Foresman, B.J., Kolb, F.L., 2016. Comparing genomic selection and marker-assisted selection for Fusarium head blight resistance in wheat (Triticum aestivum L.). Mol. Breed. 36, 1–11. Asoro, F.G., Newell, M.A., Beavis, W.D., Scott, M.P., Jannink, J.-L., 2011. Accuracy and training population design for genomic selection on quantitative traits in elite North American oats. Plant Genome J. 4, 132–144. Battenfield, S.D., Guzmán, C., Gaynor, R.C., Singh, R.P., Peña, R.J., Dreisigacker, S., Fritz, A.K., Poland, J.A., 2016. Genomic selection for processing and end-use quality traits in the CIMMYT spring bread wheat breeding program. Plant Genome 9. Bauriegel, E., Giebel, A., Geyer, M., Schmidt, U., Herppich, W.B., 2011. Early detection of Fusarium infection in wheat using hyper-spectral imaging. Comput. Electron. Agric. 75, 304–312. Bernardo, R., 2008. Molecular markers and selection for complex traits in plants: learning from the last 20 years. Crop Sci. 48, 1649–1664. Bernardo, R., 2014. Genomewide selection when major genes are known. Crop Sci. 54, 68–75. Bernardo, R.,Yu, J., 2007. Prospects for genomewide selection for quantitative traits in maize. Crop Sci. 47, 1082–1090. Burgueño, J., Crossa, J., Cotes, J.M.,Vicente, F.S., Das, B., 2011. Prediction assessment of linear mixed models for multienvironment trials. Crop Sci. 51, 944–954. Burgueño, J., de los Campos, G.,Weigel, K., Crossa, J., 2012. Genomic prediction of breeding values when modeling genotype × environment interaction using pedigree and dense molecular markers. Crop Sci. 52, 707–719. Crossa, J., de los Campos, G., Pérez, P., Gianola, D., Burgueño, J., Araus, J.L., Makumbi, D., Singh, R.P., Dreisigacker, S.,Yan, J., Arief,V., Banziger, M., Braun, H.J., 2010. Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 186, 713–724. Crossa, J., Pérez, P., Hickey, J., Burgueño, J., Ornella, L., Cerón-Rojas, J., Zhang, X., Dreisigacker, S., Babu, R., Li,Y., Bonnett, D., Mathews, K., 2014. Genomic prediction in CIMMYT maize and wheat breeding programs. Heredity 112, 48–60. Crossa, J., de los Campos, G., Maccaferri, M., Tuberosa, R., Burgueño, J., Pérez-Rodríguez, P., 2016. Extending the marker × environment interaction model for genomic-enabled prediction and genome-wide association analysis in durum wheat. Crop Sci. 56, 2193–2209. Cuevas, J., Crossa, J., Soberanis, V., Pérez-Elizalde, S., Pérez-Rodríguez, P., Campos, G.d.l., Montesinos-López, O.A., Burgueño, J., 2016a. Genomic prediction of genotype × environment interaction kernel regression models. Plant Genome 9 (3), 1–20. Cuevas, J., Crossa, J., Montesinos-López, O.A., Burgueño, J., Pérez-Rodríguez, P., Campos, G.d.l, 2016b. Bayesian genomic prediction with genotype × environment interaction kernel models. G3: Genes Genomes Genet 7, 41–53. Daetwyler, H.D., Bansal, U.K., Bariana, H.S., Hayden, M.J., Hayes, B.J., 2014. Genomic prediction for rust resistance in diverse wheat landraces. Theor. Appl. Genet. 127, 1795–1803. Dawson, J.C., Endelman, J.B., Heslot, N., Crossa, J., Poland, J., Dreisigacker, S., Manes, Y., Sorrells, M.E., Jannink, J.L., 2013. The use of unbalanced historical data for genomic selection in an international wheat breeding program. Field Crops Res. 154, 12–22.
298
Applications of Genetic and Genomic Research in Cereals
de los Campos, G., Naya, H., Gianola, D., Crossa, J., Legarra, A., Manfredi, E., Weigel, K., Cotes, J.M., 2009. Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics 182, 375–385. Devadas, R., Lamb, D.W., Backhouse, D., Simpfendorfer, S., 2015. Sequential application of hyperspectral indices for delineation of stripe rust infection and nitrogen deficiency in wheat. Precis. Agric. 16, 477–491. Edwards, M.A., Osborne, B.G., Henry, R.J., 2010. Puroindoline genotype, starch granule size distribution and milling quality of wheat. J. Cereal Sci. 52, 314–320. Endelman, J.B., 2011. Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome J. 4, 250. González-Camacho, J.M., de Los Campos, G., Pérez, P., Gianola, D., Cairns, J.E., Mahuku, G., Babu, R., Crossa, J., 2012. Genome-enabled prediction of genetic values using radial basis function neural networks. Theor. Appl. Genet. 125, 759–771. Haghighattalab, A., González Pérez, L., Mondal, S., Singh, D., Schinstock, D., Rutkoski, J., Ortiz-Monasterio, I., Singh, R.P., Goodin, D., Poland, J., 2016. Application of unmanned aerial systems for high throughput phenotyping of large wheat breeding nurseries. Plant Methods 12, 35. HarvestPlus, 2017. Crops. http://www.harvestplus.org/what-we-do/crops. Hayes, B.J., Panozzo, J., Walker, C.K., Choy, A.L., Kant, S., Wong, D., Tibbits, J., Daetwyler, H.D., Rochfort, S., Hayden, M.J., Spangenberg, G.C., 2017. Accelerating wheat breeding for end-use quality with multi-trait genomic predictions incorporating near infrared and nuclear magnetic resonance-derived phenotypes. Theor. Appl. Genet. 130, 1–15. He, S., Schulthess, A.W., Mirdita, V., Zhao, Y., Korzun, V., Bothe, R., Ebmeyer, E., Reif, J.C., Jiang, Y., 2016. Genomic selection in a commercial winter wheat population. Theor. Appl. Genet. 129, 641–651. Heffner, E.L., Sorrells, M.E., Jannink, J.-L.L., 2009. Genomic selection for crop improvement. Crop Sci. 49, 1–12. Heffner, E.L., Lorenz, A.J., Jannink, J.L., Sorrells, M.E., 2010. Plant breeding with genomic selection: gain per unit time and cost. Crop Sci. 50, 1681–1690. Heffner, E.L., Jannink, J.-L., Iwata, H., Souza, E., Sorrells, M.E., 2011a. Genomic selection accuracy for grain quality traits in biparental wheat populations. Crop Sci. 51, 2597–2606. Heffner, E.L., Jannink, J.-L., Sorrells, M.E., 2011b. Genomic selection accuracy using multifamily prediction models in a wheat breeding program. Plant Genome 4, 65–75. Heslot, N.,Yang, H.-P., Sorrells, M.E., Jannink, J.-L., 2012. Genomic selection in plant breeding: a comparison of models. Crop Sci. 52, 146–160. Heslot, N., Jannink, J.L., Sorrells, M.E., 2013a. Using genomic prediction to characterize environments and optimize prediction accuracy in applied breeding data. Crop Sci. 53 (3), 921–933. Heslot, N., Rutkoski, J., Poland, J., Jannink, J.L., Sorrells, M.E., 2013b. Impact of marker ascertainment bias on genomic selection accuracy and estimates of genetic diversity. PLoS ONE 8, e74612. Heslot, N., Akdemir, D., Sorrells, M.E., Jannink, J.L., 2014. Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions. Theor. Appl. Genet. 127, 463–480. Hoffstetter, A., Cabrera, A., Huang, M., Sneller, C., 2016. Optimizing training population data and validation of genomic selection for economic traits in soft winter wheat. G3 6, 2919–2928. Holman, F.H., Riche, A.B., Michalski, A., Castle, M., Wooster, M.J., Hawkesford, M.J., 2016. High throughput field phenotyping of wheat plant height and growth rate in field plot trials using UAV based remote sensing. Remote Sens. 8, 1031.
Genomic Selection in Wheat
299
Jarquín, D., Crossa, J., Lacaze, X., Du Cheyron, P., Daucourt, J., Lorgeou, J., Perez, P., Calus, M., de los Campos, G., Burgueño, J., 2014. A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theor. Appl. Genet. 127, 595–607. Jarquín, D., Lemes da Silva, C., Gaynor, R.C., Poland, J., Fritz, A., Howard, R., Battenfield, S., Crossa, J., 2017. Increasing genomic-enabled prediction accuracy by modeling genotype × environment interactions in Kansas wheat. Plant Genome 10. Jia, Y., Jannink, J.L., 2012. Multiple trait genomic selection methods increase genetic value prediction accuracy. Genetics 192, 1513–1522. Jiang,Y., Schulthess, A.W., Rodemann, B., Ling, J., Plieske, J., Kollers, S., Ebmeyer, E., Korzun, V., Argillier, O., Stiewe, G., Ganal, M.W., Röder, M.S., Reif, J.C., 2017. Validating the prediction accuracies of marker-assisted and genomic selection of Fusarium head blight resistance in wheat using an independent sample. Theor. Appl. Genet. 130, 471–482. Jin, F., Zhang, D., Bockus, W., Baenziger, P.S., Carver, B., Bai, G., 2013. Fusarium head blight resistance in U.S. winter wheat cultivars and elite breeding lines. Crop Sci. 53, 2006–2013. Juliana, P., Singh, R.P., Singh, P.K., Crossa, J., Huerta-Espino, J., Lan, C., Bhavani, S., Rutkoski, J.E., Poland, J.A., Bergstrom, G.C., Sorrells, M.E., 2017a. Genomic and pedigree-based prediction for leaf, stem, and stripe rust resistance in wheat. Theor. Appl. Genet. 130, 1415–1430. Juliana, P., Singh, R.P., Singh, P.K., Crossa, J., Rutkoski, J.E., Poland, J.A., Bergstrom, G.C., Sorrells, M.E., 2017b. Comparison of models and whole-genome profiling approaches for genomic-enabled prediction of Septoria tritici blotch, Stagonospora nodorum blotch, and tan spot resistance in wheat. Plant Genome 10. Lado, B., Barrios, P.G., Quincke, M., Silva, P., Gutiérrez, L., 2016. Modeling genotype × environment interaction for genomic selection with unbalanced data from a wheat breeding program. Crop Sci. 56, 2165–2179. Lehermeier, C., Krämer, N., Bauer, E., Bauland, C., Camisan, C., Campo, L., Flament, P., Melchinger, A.E., Menz, M., Meyer, N., Moreau, L., Moreno-González, J., Ouzunova, M., Pausch, H., Ranc, N., Schipprack, W., Schönleben, M., Walter, H., Charcosset, A., Schön, C.-C., 2014. Usefulness of multiparental populations of maize (Zea mays L.) for genome-based prediction. Genetics 198, 3–16. Liebisch, F., Kirchgessner, N., Schneider, D., Walter, A., Hund, A., 2015. Remote, aerial phenotyping of maize traits with a mobile multi-sensor approach. Plant Methods 11, 9. Lopez-Cruz, M., Crossa, J., Bonnett, D., Dreisigacker, S., Poland, J., Jannink, J.-L., Singh, R.P., Autrique, E., de los Campos, G., 2015. Increased prediction accuracy in wheat breeding trials using a marker × environment interaction genomic selection model. G3 5, 569–582. Lorenz, A.J., Chao, S., Asoro, F.G., Heffner, E.L., Hayashi, T., Iwata, H., Smith, K.P., Sorrells, M.E., Jannink, J.L., 2011. Genomic selection in plant breeding: knowledge and prospects. Adv. Agron. 110, 77–123. Lorenzana, R.E., Bernardo, R., 2009. Accuracy of genotypic value predictions for marker- based selection in biparental plant populations. Theor. Appl. Genet. 120, 151–161. Manickavelu, A., Hattori, T., Yamaoka, S., Yoshimura, K., Kondou, Y., Onogi, A., Matsui, M., Iwata, H., Ban, T., 2017. Genetic nature of elemental contents in wheat grains and its genomic prediction: toward the effective use of wheat landraces from Afghanistan. PLoS ONE 12, e0169416. Martre, P., Jamieson, P.D., Semenov, M.A., Zyskowski, R.F., Porter, J.R., Triboi, E., 2006. Modelling protein content and composition in relation to crop nitrogen dynamics for wheat. Eur. J. Agron. 25, 138–154. Meuwissen, T.H.E., Hayes, B.J., Goddard, M.E., 2001. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829.
300
Applications of Genetic and Genomic Research in Cereals
Mirdita, V., He, S., Zhao, Y., Korzun, V., Bothe, R., Ebmeyer, E., Reif, J.C., Jiang, Y., 2015. Potential and limits of whole genome prediction of resistance to Fusarium head blight and Septoria tritici blotch in a vast Central European elite winter wheat population. Theor. Appl. Genet. 128, 2471–2481. Moore, J.K., Manmathan, H.K., Anderson,V.A., Poland, J.A., Morris, C.F., Haley, S.D., 2017. Improving genomic prediction for pre-harvest sprouting tolerance in wheat by weighting large-effect quantitative trait loci. Crop Sci. 57, 1315–1324. Muleta, K.T., Bulli, P., Zhang, Z., Chen, X., Pumphrey, M., 2017. Unlocking diversity in germplasm collections via genomic selection: a case study based on quantitative adult plant resistance to stripe rust in spring wheat. Plant Genome 10. Ornella, L., Singh, S., Perez, P., Burgueño, J., Singh, R., Tapia, E., Bhavani, S., Dreisigacker, S., Braun, H.-J., Mathews, K., Crossa, J., 2012. Genomic prediction of genetic values for resistance to wheat rusts. Plant Genome J. 5, 136–148. Ortiz-Monasterio, J.I., Palacios-Rojas, N., Meng, E., Pixley, K., Trethowan, R., Pena, R.J., 2007. Enhancing the mineral and vitamin content of wheat and maize through plant breeding. J. Cereal Sci. 46, 293–307. Pérez-Rodríguez, P., Gianola, D., González-Camacho, J.M., Crossa, J., Manès,Y., Dreisigacker, S., 2012. Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat. G3 2, 1595–1605. Pérez-Rodríguez, P., Crossa, J., Rutkoski, J., Poland, J., Singh, R., Legarra, A., Autrique, E., de los Campos, G., Burgueño, J., Dreisigacker, S., 2017. Single-step genomic and pedigree genotype × environment interaction models for predicting wheat lines ininternational environments. Plant Genome 10. Poland, J., Rutkoski, J., 2016. Advances and challenges in genomic selection for disease resistance. Annu. Rev. Phytopathol. 54, 79–98. Rutkoski, J., Benson, J., Jia, Y., Brown-Guedira, G., Jannink, J.-L., Sorrells, M., 2012. Evaluation of genomic prediction methods for Fusarium head blight resistance in wheat. Plant Genome J. 5, 51. Rutkoski, J.E., Poland, J., Jannink, J.-L., Sorrells, M.E., 2013. Imputation of unordered markers and the impact on genomic selection accuracy. G3 427–439. Rutkoski, J.E., Poland, J.A., Singh, R.P., Huerta-Espino, J., Bhavani, S., Barbier, H., Rouse, M.N., Jannink, J.-L., Sorrells, M.E., 2014. Genomic selection for quantitative adult plant stem rust resistance in wheat. Plant Genome 7. Rutkoski, J., Singh, R.P., Huerta-Espino, J., Bhavani, S., Poland, J., Jannink, J.L., Sorrells, M.E., 2015a. Genetic gain from phenotypic and genomic selection for quantitative resistance to stem rust of wheat. Plant Genome 8. Rutkoski, J., Singh, R.P., Huerta-Espino, J., Bhavani, S., Poland, J., Jannink, J.L., Sorrells, M.E., 2015b. Efficient use of historical data for genomic selection: a case study of stem rust resistance in wheat. Plant Genome 8, 1–10. Rutkoski, J., Poland, J., Mondal, S., Autrique, E., Pérez, L.G., Crossa, J., Reynolds, M., Singh, R., 2016. Canopy temperature and vegetation indices from high-throughput phenotyping improve accuracy of pedigree and genomic selection for grain yield in wheat. G3 6, 2799–2808. Shakoor, N., Lee, S., Mockler, T.C., 2017. High throughput phenotyping to accelerate crop breeding and monitoring of diseases in the field. Curr. Opin. Plant Biol. 38, 184–192. Sun, J., Rutkoski, J.E., Poland, J.A., Crossa, J., Jannink, J.-L., Sorrells, M.E., 2017. Multitrait, random regression, or simple repeatability model in high-throughput phenotyping data improve genomic prediction for wheat grain yield. Plant Genome 10. Tanger, P., Klassen, S., Mojica, J.P., Lovell, J.T., Moyers, B.T., Baraoidan, M., Naredo, M.E.B., McNally, K.L., Poland, J., Bush, D.R., Leung, H., Leach, J.E., McKay, J.K., 2017. Fieldbased high throughput phenotyping rapidly identifies genomic regions controlling yield components in rice. Sci. Rep. 7, 42839.
Genomic Selection in Wheat
301
Trethowan, R.M., Reynolds, M., Sayre, K., Ortiz-Monasterio, I., 2005. Adapting wheat cultivars to resource conserving farming practices and human nutritional needs. Ann. Appl. Biol. 146, 405–413. Velu, G., Ortiz-Monasterio, I., Cakmak, I., Hao,Y., Singh, R.P., 2014. Biofortification strategies to increase grain zinc and iron concentrations in wheat. J. Cereal Sci. 59, 365–372. Velu, G., Crossa, J., Singh, R.P., Hao, Y., Dreisigacker, S., Perez-Rodriguez, P., Joshi, A.K., Chatrath, R., Gupta, V., Balasubramaniam, A., Tiwari, C., Mishra, V.K., Sohu, V.S., Mavi, G.S., 2016. Genomic prediction for grain zinc and iron concentrations in spring wheat. Theor. Appl. Genet. 129, 1595–1605. Wang, Y., Mette, M., Miedaner, T., Gottwald, M., Wilde, P., Reif, J.C., Zhao, Y., 2014. The accuracy of prediction of genomic selection in elite hybrid rye populations surpasses the accuracy of marker-assisted selection and is equally augmented by multiple field evaluation locations and test years. BMC Genomics 15, 556. Watanabe, K., Guo, W., Arai, K., Takanashi, H., Kajiya-Kanegae, H., Kobayashi, M.,Yano, K., Tokunaga, T., Fujiwara, T., Tsutsumi, N., Iwata, H., 2017. High-throughput phenotyping of sorghum plant height using an unmanned aerial vehicle and its application to genomic prediction modeling. Front. Plant Sci. 8. White, J.W., Andrade-Sanchez, P., Gore, M.A., Bronson, K.F., Coffelt, T.A., Conley, M.M., Feldmann, K.A., French, A.N., Heun, J.T., Hunsaker, D.J., Jenks, M.A., Kimball, B.A., Roth, R.L., Strand, R.J., Thorp, K.R., Wall, G.W., Wang, G., 2012. Field-based phenomics for plant genetics research. Field Crop Res. 133, 101–112.
FURTHER READING Bernardo, R., 2014a. Genomewide selection of parental inbreds: classes of loci and virtual biparental populations. Crop Sci. 54, 2586–2595. Crespo-Herrera, L.A., Crossa, J., Huerta-Espino, J., Autrique, E., Mondal, S.,Velu, G.,Vargas, M., Braun, H.J., Singh, R.P., 2017. Genetic yield gains in CIMMYT’s international elite spring wheat yield trials by modeling the genotype × environment interaction. Crop Sci. 57, 789–801. He, S., Reif, J.C., Korzun,V., Bothe, R., Ebmeyer, E., Jiang,Y., 2017. Genome-wide mapping and prediction suggests presence of local epistasis in a vast elite winter wheat populations adapted to Central Europe. Theor. Appl. Genet. 130, 635–647. Heslot, N., Jannink, J.-L., Sorrells, M.E., 2015. Perspectives for genomic selection applications and research in plants. Crop Sci. 55, 1–12. Lado, B., Battenfield, S., Guzmán, C., Quincke, M., Singh, R.P., Dreisigacker, S., Peña, R.J., Fritz, A., Silva, P., Poland, J., Gutiérrez, L., 2017. Strategies for selecting crosses using genomic prediction in two wheat breeding programs. Plant Genome 10. Liu, G., Zhao,Y., Gowda, M., Longin, C.F.H., Reif, J.C., Mette, M.F., 2016. Predicting hybrid performances for quality traits through genomic-assisted approaches in central European wheat. PLoS ONE 11, e0158635. Marulanda, J.J., Mi, X., Melchinger, A.E., Xu, J.-L., Würschum, T., Longin, C.F.H., 2016. Optimum breeding strategies using genomic selection for hybrid breeding in wheat, maize, rye, barley, rice and triticale. Theor. Appl. Genet. 129, 1901–1913. Michel, S., Ametz, C., Gungor, H., Akgöl, B., Epure, D., Grausgruber, H., Löschenberger, F., Buerstmayr, H., 2017. Genomic assisted selection for enhancing line breeding: merging genomic and phenotypic selection in winter wheat breeding programs with preliminary yield trials. Theor. Appl. Genet. 130, 363–376. Paltridge, N.G., Milham, P.J., Ortiz-Monasterio, J.I.,Velu, G.,Yasmin, Z., Palmer, L.J., Guild, G.E., Stangoulis, J.C.R., 2012. Energy-dispersive X-ray fluorescence spectrometry as a tool for zinc, iron and selenium analysis in whole grain wheat. Plant Soil 361, 261–269.
302
Applications of Genetic and Genomic Research in Cereals
Rincent, R., Kuhn, E., Monod, H., Oury, F.-X., Rousset, M., Allard, V., Le Gouis, J., 2017. Optimization of multi-environment trials for genomic selection based on crop models. Theor. Appl. Genet. 130, 1735–1752. Sukumaran, S., Crossa, J., Jarquín, D., Reynolds, M., 2017. Pedigree-based prediction models with genotype × environment interaction in multienvironment trials of CIMMYT wheat. Crop Sci. 57, 1865–1880. Yang,W., Guo, Z., Huang, C., Duan, L., Chen, G., Jiang, N., Fang,W., Feng, H., Xie,W., Lian, X.,Wang, G., Luo, Q., Zhang, Q., Liu, Q., Xiong, L., 2014. Combining high-throughput phenotyping and genome-wide association studies to reveal natural genetic variation in rice. Nat. Commun. 5, 5087. Yuan, L., Zhang, H., Zhang, Y., Xing, C., Bao, Z., 2017. Feasibility assessment of multi- spectral satellite sensors in monitoring and discriminating wheat diseases and insects. Optik 131, 598–608. Zhao, Y., Mette, M.F., Gowda, M., Longin, C.F.H., Reif, J.C., 2014. Bridging the gap between marker-assisted and genomic selection of heading time and plant height in hybrid wheat. Heredity 112, 638–645.
CHAPTER 14
“SpeedGS” to Accelerate Genetic Gain in Spring Wheat Kai P. Voss-Fels⁎, Eva Herzog†, Susanne Dreisigacker‡, Sivakumar Sukumaran‡, Amy Watson⁎, Matthias Frisch†, Ben Hayes⁎, Lee T. Hickey⁎ ⁎
Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia † Institute of Agronomy and Plant Breeding II, Justus Liebig University, Giessen, Germany ‡ International Maize and Wheat Improvement Center, Texcoco, Mexico
Contents 14.1 Introduction 14.2 Accelerating Genetic Gain Using Modern Breeding Tools 14.3 Integrating Rapid Generation Advancement and Genomic Selection 14.4 Speeding Up Genetic Gain Through SpeedGS—A Simulation 14.5 Exploiting Genetic Resources Using Speed Breeding 14.6 Future Opportunities and Challenges 14.7 Simulation Materials and Methods 14.7.1 Software and Replications 14.7.2 Model for Meiosis 14.7.3 Genetic Model 14.7.4 Genetic Architecture of Traits 14.7.5 Generation of Marker Effects for the Breeding and Diversity Set 14.7.6 Assignment of Genetic Effects to Marker Loci 14.7.7 Phenotypes 14.7.8 Crossing and Selection Schemes 14.7.9 Number and Duration of Breeding Cycles 14.7.10 Simulation of Genomic Selection References
303 305 309 310 314 317 319 319 319 319 319 320 320 321 322 322 323 323
14.1 INTRODUCTION The human population is predicted to increase to more than nine billion people by the middle of this century. This challenge, coupled with the constant loss of land suitable for agricultural production, necessitates substantial yield increases for all major commodity crops. Predictions suggest that up to 40% more production will be necessary (Spiertz and Ewert, 2009) Applications of Genetic and Genomic Research in Cereals https://doi.org/10.1016/B978-0-08-102163-7.00014-4
Copyright © 2019 Elsevier Ltd. All rights reserved.
303
304
Applications of Genetic and Genomic Research in Cereals
in order to meet the demand for 60%–70% more food, as predicted by the declaration of the World Summit on Food Security in 2009 (Grainger, 2010). The need to increase the efficiency of cropping systems for globally important cereals, such as wheat, is an ongoing challenge for the agricultural and environmental research community. The Green Revolution resulted in dramatic gains in wheat productivity over the past 60 years, with yields increasing on average by a factor of 2.6 (Alston et al., 2009). However, wheat yield increases have faced stagnation during the past two decades in many parts of the world, including Europe, the United States of America, Asia, and Australia (Brisson et al., 2010; Ray et al., 2012). This stagnation can be partly attributed to a drastic reduction of genetic diversity within elite wheat-breeding gene pools, due to strong selective breeding and an intensive germplasm exchange between different commercial breeding programs (Qian et al., 2017; Reif et al., 2005; Voss-Fels et al., 2015). On the other hand, global wheat production is threatened by an increasing number of extreme weather events due to the onset of climate change. It has recently been estimated that global wheat yields will be reduced by 6% for each 1°C of further temperature increase, accompanied by increasingly variable yield stability (Asseng et al., 2014). Water availability, in particular, is a major limiting factor for wheat in rain-fed agricultural systems worldwide. Depending on their intensity and duration, drought events can impact crop productivity, ranging from yield reduction to a complete failure of the crop. A recent study, comparing the influence of different extreme weather disasters on global crop production from 1964 to 2007, concluded that droughts and extreme heat have had the most serious negative impact on agricultural production, causing average national crop failures of 9%–10% (Lesk et al., 2016). Interestingly, the same study found that recent extreme droughts had a significantly larger effect on cereal production than earlier drought events, with the recent droughts having a more severe effect in developed countries compared to developing countries. According to projections, droughts are very likely to become more frequent in the near future (Chenu et al., 2013; Seneviratne et al., 2012) and extreme heat events are predicted to be increasingly common and severe (Battisti and Naylor, 2009). Given these extremely dynamic demographic and environmental global changes, future wheat production faces novel and unprecedented threats that address all related fields of public and private research. The most promising approach is the reinstitution and/ or creation of wheat cultivars with broad diversity to meet specific demands of local adaptation to new climatic challenges.The improvement of modern
“SpeedGS” to Accelerate Genetic Gain in Spring Wheat
305
varieties to enhance climate resilience is therefore a key aspect to secure future food supply and global wheat production must further be increased by more than 50% by 2050 (Langridge, 2013).The urgency and importance to enhance wheat performance has recently also been recognized by G20 policy makers, who approved the initiation of the International Research Initiative for Wheat Improvement (IRIWI), a huge global collaborative research project aimed at optimizing and strengthening of public and private wheat research endeavors (G20, 2011). In this chapter, we present novel concepts for integrating genomic selection and speed breeding (“SpeedGS”) as a powerful strategy to speed up yield increase in spring wheat. We use simulations to explore the potential for accelerating genetic gain by adopting a combined approach based on real data sets and breeding schemes from the International Maize and Wheat Improvement Center (CIMMYT).We discuss the challenges associated with implementing rapid cycling breeding programs, the opportunity to integrate other breeding technologies, and highlight priorities for further research seeking to optimize crop improvement protocols.
14.2 ACCELERATING GENETIC GAIN USING MODERN BREEDING TOOLS In the era of modern plant breeding, numerous breeding tools and technologies have been developed, including biotechnological approaches, such as doubled haploid technology (Schön and Simianer, 2015).The recent advances in highly efficient agrigenomic approaches, such as ultrahigh-throughput next-generation sequencing technologies that provide thousands to millions of data points at constantly decreasing costs, along with further developments of statistical solutions to exploit large amounts of genomic data have revolutionized wheat breeding (Bassi et al., 2016; Qian et al., 2017; Voss-Fels and Snowdon, 2016). Now, marker-assisted breeding approaches are ubiquitous in wheat genetic studies and modern breeding programs. Genomic selection (GS), one form of marker-assisted selection that considers genome-wide marker data to predict the performance of a genotype based on genomic estimated breeding values, has gained vast popularity because it enables selection among candidate genotypes directly after being generated, rather than after years of intensive and expensive phenotyping. This assists breeders to reduce the duration of the breeding cycle and ultimately increase the rate of genetic gain per unit of time (Heffner et al., 2010; Lorenzana and Bernardo, 2009; Wong and Bernardo, 2008). The term GS
306
Applications of Genetic and Genomic Research in Cereals
was first coined in 2001 (Meuwissen et al., 2001) and has been successfully applied in breeding programs for major crops, such as rice (Spindel et al., 2015) and maize (Riedelsheimer et al., 2012). In wheat, there are an increasing number of studies that report the successful application of GS to improve the efficiency of trait selection. For example, the recent study by Liu et al. (2016) showed that GS outperformed MAS for baking quality-related traits and that GS could be a powerful tool to accelerate progress in baking quality improvement in modern European bread wheat breeding programs. Battenfield et al. (2016) further confirmed these findings by implementing GS in a large wheat breeding program of CIMMYT in Mexico, including almost 8000 wheat lines and over 20,000 SNPs per line, generated by genotyping-by-sequencing (GBS). The large-scale study also found GS to be a powerful tool to facilitate selection for end-use wheat quality in early generations. Hayes et al. (2017) extended the prediction of end-use quality of wheat using GS by also incorporating measures of near-infrared and nuclear magnetic resonance in their predictions. The effectiveness of GS in wheat was further demonstrated for improving stem rust (Rutkoski et al., 2014) and Fusarium head blight resistance (Arruda et al., 2016; Jiang et al., 2017; Rutkoski et al., 2014) and yield (Michel et al., 2017; Norman et al., 2017; Poland et al., 2012a; Song et al., 2017). Holding a great potential for wheat improvement, the true gain from GS in commercial wheat breeding remains elusive at present and studies to date indicate that the optimal GS breeding design depends on the breeder's individual needs, thus necessitating further validation (Bassi et al., 2016). Hybrid breeding presents another promising opportunity to boost wheat productivity and drastically enhance yield in the short term, as the approach resulted in five-fold yield increase in maize yields over the past century (Marulanda et al., 2016). However, there are different factors that currently hamper the wide implementation of hybrid breeding approaches in wheat, such as large-scale hybrid seed production due to the strong self-pollinating nature of wheat (