Title

Capturing chloroplast variation for molecular ecology studies: a simple next generation sequencing approach applied to a rainforest tree

Document Type

Article

Publication details

McPherson, H, van der Merwe, M, Delaney, SK, Edwards, MA, Henry, RJ, McIntosh, E, Rymer, PD, Milner, ML, Siow, J & Rossetto, M 2013, 'Capturing chloroplast variation for molecular ecology studies: a simple next generation sequencing approach applied to a rainforest tree', BMC Ecology, vol. 13, no. 8.

Published version available from:

http://doi.org/10.1186/1472-6785-13-8

Peer Reviewed

Peer-Reviewed

Abstract

Background

With high quantity and quality data production and low cost, next generation sequencing has the potential to provide new opportunities for plant phylogeographic studies on single and multiple species. Here we present an approach for in silicio chloroplast DNA assembly and single nucleotide polymorphism detection from short-read shotgun sequencing. The approach is simple and effective and can be implemented using standard bioinformatic tools.

Results

The chloroplast genome of Toona ciliata (Meliaceae), 159,514 base pairs long, was assembled from shotgun sequencing on the Illumina platform using de novo assembly of contigs. To evaluate its practicality, value and quality, we compared the short read assembly with an assembly completed using 454 data obtained after chloroplast DNA isolation. Sanger sequence verifications indicated that the Illumina dataset outperformed the longer read 454 data. Pooling of several individuals during preparation of the shotgun library enabled detection of informative chloroplast SNP markers. Following validation, we used the identified SNPs for a preliminary phylogeographic study of T. ciliata in Australia and to confirm low diversity across the distribution.

Conclusions

Our approach provides a simple method for construction of whole chloroplast genomes from shotgun sequencing of whole genomic DNA using short-read data and no available closely related reference genome (e.g. from the same species or genus). The high coverage of Illumina sequence data also renders this method appropriate for multiplexing and SNP discovery and therefore a useful approach for landscape level studies of evolutionary ecology.