Published on 25 March 2013 |

Version 1

Data from: Next-generation sequencing to inventory taxonomic diversity in eukaryotic communities: a test for freshwater diatoms

View Dataset
Kermarrec, Lenaïg;Franc, Alain;Rimet, Frédéric;Chaumeil, Philippe;Humbert, Jean-François;Bouchez, Agnès;Humbert, J. F.

Description

The recent emergence of barcoding approaches coupled to those of Next Generation Sequencing (NGS) have raised new perspectives for studying environmental communities. In this framework, we tested the possibility to derive accurate inventories of diatom communities from pyrosequencing outputs with an available DNA reference library. We used three molecular markers targeting the nuclear, chloroplast and mitochondrial genomes (SSU rDNA, rbcL, and cox1), and three samples of a mock community composed of 30 known diatom strains belonging to 21 species. In the goal to detect methodological biases, one sample was constituted directly from pooled cultures, whereas the others consisted of pooled PCR products. The NGS reads obtained by pyrosequencing (Roche 454) were compared first to a DNA reference library including the sequences of all the species used to constitute the mock community, and secondly to a complete DNA reference library with a larger taxonomic coverage. A stringent taxonomic assignation, gave inventories that were compared to the real one. We detected biases due to DNA extraction and to PCR amplification that resulted in false-negatives detection. Conversely, pyrosequencing errors appeared to generate false-positives, especially in case of closely allied species. The taxonomic coverage of DNA reference libraries appears to be the most crucial factor, together with marker polymorphism which is essential to identify taxa at the species level. RbcL offers a high resolving power which, together with its large DNA reference library. Though needing further optimization, pyrosequencing is suitable for identifying diatom assemblages and may find applications in the field of freshwater biomonitoring.

Citations (1)

Mentions (0)

Metrics

Dataset Index

2.2

FAIR Score

77%

Citations

1

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Dryad

Assigned Domain

Subfield

Molecular Biology

Field

Biochemistry, Genetics and Molecular Biology

Domain

Life Sciences

Confidence Score

54%

Source

Scholar Data Model

Keywords

MetagenomicsBacillariophyta

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00