Development of a SNP identification pipeline for highly heterozygous crops

Tom Ruttink, Lieven Sterck, Elke Vermeulen, Antje Rohde, Isabel Roldan-Ruiz

    Research output: Chapter in Book/Report/Conference proceedingC1: Articles in proceedingspeer-review


    Next Generation Sequencing technologies significantly advance the development of molecular markers for molecular breeding. Dedicated NGS data-analysis procedures must be developed for de novo reference assembly and SNP discovery in crop species without a reference genome sequence. In outcrossing fodder crops, the high degree of polymorphism hampers de novo assembly, contig clustering, read mapping, and SNP discovery. Using selected candidate genes as case studies, we illustrate the reconstruction of a reference transcript sequence from RNA-seq data from multiple genotypes, we validate de novo transcript assembly by Sanger sequencing, and analyse how read mapping and SNP discovery parameters determine sensitivity and specificity during SNP discovery. Thus, we propose a general strategy to construct a non-redundant reference transcriptome for crops without a sequenced genome, using predicted proteins from a closely related model species as a guidance for clustering and annotation. This reference transcriptome is required for candidate gene discovery and exome-wide identification of polymorphisms.

    Original languageEnglish
    Title of host publicationBreeding strategies for sustainable forage and turf grass improvement
    EditorsD Milbourne, S Barth
    Number of pages9
    Place of PublicationDordrecht
    Publication date2012
    Publication statusPublished - 2012
    EventEucarpia 29th Fodder Crops and Amenity Grasses Section Meeting - Dublin, Ireland
    Duration: 4-Sept-20118-Sept-2011


    Dive into the research topics of 'Development of a SNP identification pipeline for highly heterozygous crops'. Together they form a unique fingerprint.

    Cite this