Automated Author ProfileZhang, Wenwei
Zhang, Wenwei
Current S-Index
Sum of Dataset Indices for all datasets
Average Dataset Index per Dataset
Average Dataset Index per dataset
Total Datasets
Total datasets for this author
Average FAIR Score
Average FAIR Score per dataset
Total Citations
Total citations to the author's datasets
Total Mentions
Total mentions of the author's datasets
S-Index Interpretation
The S-Index (Sharing Index) is a comprehensive metric that represents the cumulative impact of all your datasets. It is calculated as the sum of Dataset Index scores across all your claimed datasets.
What it means:
- A higher S-index indicates greater overall impact of your datasets relative to typical datasets in their fields of research
- The S-Index grows as you add more datasets or as existing datasets gain more citations and mentions
- It provides a single number to track your research data impact over time
Current S-Index: 17.5 (sum of 16 datasets Dataset Index scores)
More information here.
S-Index Over Time
Cumulative Citations Over Time
Cumulative Mentions Over Time
Datasets
No description available
Authors
- ZHANG, Wenwei ;
- WU, Lipeng ;
- SONG, Xianjin ;
- XU, Cheng ;
- LIU, Guoqiang ;
- LI, Shiqiang
No description available
Authors
- ZHANG, Wenwei ;
- WU, Lipeng ;
- SONG, Xianjin ;
- XU, Cheng ;
- LIU, Guoqiang ;
- LI, Shiqiang
Current microbial sequencing relies on short-read platforms like Illumina and DNBSEQ, favored for their low cost and high accuracy. However, these methods often produce fragmented draft genomes, hindering comprehensive bacterial function analysis. CycloneSEQ, a novel long-read sequencing platform developed by BGI-Research, its sequencing performance and assembly improvements has been evaluated.
Using CycloneSEQ long-read sequencing, the type strain produced long reads with an average length of 11.6 kbp and an average quality score of 14.4. After hybrid assembly with short reads data, the assembled genome exhibited an error rate of only 0.04 mismatches and 0.08 indels per 100 kbp compared to the reference genome. This method was validated across 9 diverse species, successfully assembling complete circular genomes. Hybrid assembly significantly enhances genome completeness by using long reads to fill gaps and accurately assemble multi-copy rRNA genes, which unable be achieved by short reads solely. Through data subsampling, we found that over 500 Mbp of short-read data combined with 100 Mbp of long-read data can result in a high-quality circular assembly. Additionally, using CycloneSEQ long reads effectively improves the assembly of circular complete genomes from mixed microbial communities.
CycloneSEQs read length is sufficient for circular bacterial genomes, but its base quality needs improvement. Integrating DNBSEQ short reads improved accuracy, resulting in complete and accurate assemblies. This efficient approach can be widely applied in microbial sequencing.
Authors
- Liang, Hewei ;
- Zou, Yuanqiang ;
- Wang, Mengmeng ;
- Hu, Tongyuan ;
- Wang, Haoyu ;
- He, Wenxin ;
- Ju, Yanmei ;
- Guo, Rujin ;
- Chen, Junyi ;
- Guo, Fei ;
- Zeng, Tao ;
- Dong, Yuliang ;
- Zhang, Yuning ;
- Wang, Bo ;
- Liu, Chuanyu ;
- Jin, Xin ;
- Zhang, Wenwei ;
- Xu, Xun ;
- Xiao, Liang
No description available
Authors
- Zhang, Wenwei
An entry from the Cambridge Structural Database, the world’s repository for small molecule crystal structures. The entry contains experimental data from a crystal diffraction study. The deposited dataset for this entry is freely available from the CCDC and typically includes 3D coordinates, cell parameters, space group, experimental conditions and quality measures.
Authors
- Chen, Cong ;
- Zhang, Mingxing ;
- Zhang, Wenwei ;
- Bai, Junfeng
An entry from the Cambridge Structural Database, the world’s repository for small molecule crystal structures. The entry contains experimental data from a crystal diffraction study. The deposited dataset for this entry is freely available from the CCDC and typically includes 3D coordinates, cell parameters, space group, experimental conditions and quality measures.
Authors
- Jiang, Jingjing ;
- Lu, Zhiyong ;
- Zhang, Mingxing ;
- Duan, Jingui ;
- Zhang, Wenwei ;
- Pan, Yi ;
- Bai, Junfeng
An entry from the Cambridge Structural Database, the world’s repository for small molecule crystal structures. The entry contains experimental data from a crystal diffraction study. The deposited dataset for this entry is freely available from the CCDC and typically includes 3D coordinates, cell parameters, space group, experimental conditions and quality measures.
Authors
- Jiang, Jingjing ;
- Lu, Zhiyong ;
- Zhang, Mingxing ;
- Duan, Jingui ;
- Zhang, Wenwei ;
- Pan, Yi ;
- Bai, Junfeng
More extensive use of metagenomic shotgun sequencing in microbiome research relies on the development of high-throughput, cost-effective sequencing. Here we present a comprehensive evaluation of the performance of the new high-throughput sequencing platform BGISEQ-500 for metagenomic shotgun sequencing and compare its performance with that of two Illumina platforms.
Using fecal samples from 20 healthy individuals we evaluated the intra-platform reproducibility for metagenomic sequencing on the BGISEQ-500 platform in a setup comprising 8 library replicates and 8 sequencing replicates. Cross-platform consistency, was evaluated by comparing 20 pairwise replicates on the BGISEQ-500 platform versus the Illumina HiSeq 2000 platform and the Illumina HiSeq 4000 platform. In addition, we compared the performance of the two Illumina platforms against each other.
By a newly developed overall accuracy quality control method, an average of 82.45 million high quality reads (96.06% of raw reads) per sample with 90.56% of bases scoring Q30 and above was obtained using the BGISEQ-500 platform. Quantitative analyses revealed extremely high reproducibility between BGISEQ-500 intra-platform replicates. Cross-platform replicates differed slightly more than intra-platform replicates, yet a high consistency was observed. Only a low percentage (2.02% -3.25%) of genes exhibited significant differences in relative abundance comparing the BGISEQ-500 and HiSeq platforms, with a bias towards genes with higher GC content being enriched on the HiSeq platforms.
Our study provides the first set of performance metrics for human gut metagenomic sequencing data using BGISEQ-500. The high accuracy and technical reproducibility confirm the applicability of the new platform for metagenomic studies, though caution is still warranted when combining metagenomic data from different platforms.
Authors
- Fang, Chao ;
- Zhong, Huanzi ;
- Lin, Yuxiang ;
- Chen, Bin ;
- Han, Mo ;
- Ren, Huahui ;
- Lu, Haorong ;
- Luber, Jacob, Mayne ;
- Xia, Min ;
- Li, Wangsheng ;
- Stein, Shayna ;
- Xu, Xun ;
- Zhang, Wenwei ;
- Drmanac, Radoje ;
- Wang, Jian ;
- Yang, Huanming ;
- Hammarström, Lennart ;
- Kostic, Aleksandar, David ;
- Kristiansen, Karsten ;
- Li, Junhua
Ancient DNA research has been revolutionised following development of Next Generation Sequencing platforms. Although a number of such platforms have been applied to ancient DNA samples, the Illumina series are the dominant choice today, mainly because of high production capacities and short read production. Recently a potentially attractive alternative platform for palaeogenomic data generation has been developed, the BGISEQ-500, whose sequence output are comparable with the Illumina series. In this study, we modified the standard BGISEQ-500 library preparation specifically for use on degraded DNA, then directly compared the sequencing performance and data quality of the BGISEQ-500 to the Illumina HiSeq2500 platform, on DNA extracted from eight historic and ancient dog and wolf samples.
The data generated was largely comparable between sequencing platforms, with no statistically significant difference observed for parameters including level (p=0.371) and average sequence length (p=0718) of endogenous nuclear DNA, sequence GC content (p=0.311), double stranded DNA damage rate (p=0.309), and sequence clonality (p=0.093). Small significant differences were found in single strand DNA damage rate (S, slight lower for the BGISEQ-500, p=0.011) and the background rate of difference from the reference genome (, slightly higher for BGISEQ-500, p=0.012). This may result from the differences in amplification cycles used to PCR amplify the libraries. A significant difference was also observed in the mitochondrial DNA percentages recovered (p=0.018), although we believe this is likely a stochastic effect relating to the extremely low levels of mitochondria that were sequenced from three of the samples with overall very low levels of endogenous DNA.
Although we acknowledge our analyses were limited to animal material, our observations suggest that the BGISEQ-500 holds the potential to represent valid and potentially valuable alternative platform for palaeogenomic data generation, that is worthy of future exploration by those interested in the sequencing and analysis of degraded DNA.
Authors
- Mak, Sarah, Siu Tze ;
- Gopalakrishnan, Shyam ;
- Caroe, Christian ;
- Geng, Chunyu ;
- Liu, Shanlin ;
- Sinding, Mikkel-Holger, S ;
- K Kuderna, Lukas, F ;
- Zhang, Wenwei ;
- Fu, Shujin ;
- Vieira, Filipe, G ;
- Germonpré, Mietje ;
- Bocherens, Hervé ;
- Fedorov, Sergey ;
- Petersen, Bent ;
- Sicheritz-Ponten, Thomas ;
- Marques-Bonet, Tomas ;
- Zhang, Guojie ;
- Jiang, Hui ;
- Gilbert, M.Thomas, P
BGISEQ-500 sequencer is a new desktop sequencer developed by BGI. Using DNA nanoballs (DNB) and combinational probe-anchor synthesis (cPAS) developed from Complete Genomics(TM) sequencing technology, it generates short reads at a large scale, which can help fulfill the growing demands for sequencing. Here, we present the first human whole genome sequencing dataset from the BGISEQ-500. The dataset was generated by sequencing the widely used Genome in a Bottle Consortium cell line, HG001 (NA12878) in one sequencing run. And the sequencing data were ~1,000 million paired sequences with the length of 50 bp (PE50). We also include examples of the raw images from the sequencer for reference. Finally, we carried out variation calling based on the dataset and compared it that identified from similar amount of publicly available HiSeq2500 data and the previously identified high confident variations.
Authors
- Huang, Jie ;
- Liang, Xinming ;
- Xuan, Yuankai ;
- Geng, Chunyu ;
- Li, Yuxiang ;
- Lu, Haorong ;
- Qu, Shoufang ;
- Mei, Xianglin ;
- Chen, Hongbo ;
- Yu, Ting ;
- Sun, Nan ;
- Rao, Junhua ;
- Wang, Jiahao ;
- Zhang, Wenwei ;
- Chen, Ying ;
- Liao, Sha ;
- Jiang, Hui ;
- Liu, Xin ;
- Yang, Zhaopeng ;
- Mu, Feng ;
- Gao, Shangxian