• Login
    View Item 
    •   Athenaeum Home
    • BioMed Central Open Access Articles
    • Open Access Articles by UGA Faculty
    • View Item
    •   Athenaeum Home
    • BioMed Central Open Access Articles
    • Open Access Articles by UGA Faculty
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies

    Thumbnail
    View/Open
    13059_2014_Article_3360.pdf (2.264Mb)
    Date
    2014-03-04
    Author
    Neale, David B
    Wegrzyn, Jill L
    Stevens, Kristian A
    Zimin, Aleksey V
    Puiu, Daniela
    Crepeau, Marc W
    Cardeno, Charis
    Koriabine, Maxim
    Holtz-Morris, Ann E
    Liechty, John D
    Martínez-García, Pedro J
    Vasquez-Gross, Hans A
    Lin, Brian Y
    Zieve, Jacob J
    Dougherty, William M
    Fuentes-Soriano, Sara
    Wu, Le-Shin
    Gilbert, Don
    Marçais, Guillaume
    Roberts, Michael
    Holt, Carson
    Yandell, Mark
    Davis, John M
    Smith, Katherine E
    Dean, Jeffrey F
    Lorenz, W W
    Whetten, Ross W
    Sederoff, Ronald
    Wheeler, Nicholas
    McGuire, Patrick E
    Main, Doreen
    Loopstra, Carol A
    Mockaitis, Keithanne
    deJong, Pieter J
    Yorke, James A
    Salzberg, Steven L
    Langley, Charles H
    Metadata
    Show full item record
    Abstract
    Abstract Background The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. Results We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. Conclusions In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied.
    URI
    http://dx.doi.org/10.1186/gb-2014-15-3-r59
    http://hdl.handle.net/10724/32627
    Collections
    • Open Access Articles by UGA Faculty

    About Athenaeum | Contact Us | Send Feedback
     

     

    Browse

    All of AthenaeumCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    About Athenaeum | Contact Us | Send Feedback