Automated Author Profile

Zhang, Yanfeng

Current S-Index

16.7

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

1.5

Average Dataset Index per dataset

Total Datasets

11

Total datasets for this author

Average FAIR Score

64.7%

Average FAIR Score per dataset

Total Citations

10

Total citations to the author's datasets

Total Mentions

0

Total mentions of the author's datasets

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

Data and R scripts for:Diet-induced inflammation and gout prevalence: Sex-specific findings from a cross-sectional analysis in the representative US population (Version: v0.1)

Dataset: datav4.csvSource: The data was derived from the National Health and Nutrition Examination Survey (NHANES).Processing: The raw NHANES data underwent a cleaning and preparation process using the shuffle.R script included in this repository, resulting in the datav4.csv file.R ScriptsThis repository includes three R scripts which should generally be run in the following order:shuffle.R:Function: This script is responsible for data cleaning, preprocessing, and a_gout_study_data_codeMing of the initial NHANES data to generate the final analytical dataset (datav4.csv).calculate.R:Function: This script performs the main statistical calculations and generates most of the tables presented in the manuscript.Dependency: This script uses the datav4.csv file generated by shuffle.R.plot.R:Function: This script is used to generate the figures and plots included in the study.Dependency: This script likely uses data objects created by calculate.R or the datav4.csv file directly.System RequirementsR Version: The scripts were developed and tested using R version 4.4.1.Required R PackagesThe following R packages need to be installed to run the scripts:dplyrtidyrggplot2VIMYou can install these packages in R using the following command:install.packages(c("dplyr", "tidyr", "ggplot2", "VIM"))

Authors

  • Li, Sheng-Guang ;
  • Zhang, Lina ;
  • Gao, Jun ;
  • Zou, Yadan ;
  • Li, Ji ;
  • Zhang, Jing ;
  • Yu, Ruohan ;
  • Long, Ting ;
  • Zhang, Yanfeng
0 Citations0 Mentions73% FAIR1.8 Dataset Index
10.5281/zenodo.154906752025

Data and R scripts for:Diet-induced inflammation and gout prevalence: Sex-specific findings from a cross-sectional analysis in the representative US population (Version: v0.1)

Dataset: datav4.csvSource: The data was derived from the National Health and Nutrition Examination Survey (NHANES).Processing: The raw NHANES data underwent a cleaning and preparation process using the shuffle.R script included in this repository, resulting in the datav4.csv file.R ScriptsThis repository includes three R scripts which should generally be run in the following order:shuffle.R:Function: This script is responsible for data cleaning, preprocessing, and a_gout_study_data_codeMing of the initial NHANES data to generate the final analytical dataset (datav4.csv).calculate.R:Function: This script performs the main statistical calculations and generates most of the tables presented in the manuscript.Dependency: This script uses the datav4.csv file generated by shuffle.R.plot.R:Function: This script is used to generate the figures and plots included in the study.Dependency: This script likely uses data objects created by calculate.R or the datav4.csv file directly.System RequirementsR Version: The scripts were developed and tested using R version 4.4.1.Required R PackagesThe following R packages need to be installed to run the scripts:dplyrtidyrggplot2VIMYou can install these packages in R using the following command:install.packages(c("dplyr", "tidyr", "ggplot2", "VIM"))

Authors

  • Li, Sheng-Guang ;
  • Zhang, Lina ;
  • Gao, Jun ;
  • Zou, Yadan ;
  • Li, Ji ;
  • Zhang, Jing ;
  • Yu, Ruohan ;
  • Long, Ting ;
  • Zhang, Yanfeng
0 Citations0 Mentions69% FAIR1.7 Dataset Index
10.5281/zenodo.154906852025

SS18-SSX2 ChIP-seq data

Flag-SSX ChIP-seq and Input read coverage data in bdg format, Peak data. The ChIP-seq data are processed as follows: The raw ChIP-seq data (FASTQ format) were first adaptor-trimmed, then mapped to the human reference genome (hg19) using Bowtie2 program (version 2.1.0) with the default setting. After removing duplicated reads, we used the MACS2 (version 2.1) software to identify peaks using the matched DNA input data as the control. Based on the coordinate of called peaks, the overlap rate between two TF binding peaks was analyzed using the Bedtools program. The peaks were ranked by the number of mapped reads within the peak interval and the top 10% of peaks were selected for motif discovery. The summits of the top 10% peaks were extended by 100 bp on either side. Motifs between 5 and 30 bp in length were identified on both strands. We employed the MEME 4.9.1 toolkit to search DNA motifs and enrichment significance for candidate TFs.

Authors

  • Zhang, Yanfeng
0 Citations0 Mentions65% FAIR0.7 Dataset Index
10.17632/7mhyv8f6vp2024

SS18-SSX2 ChIP-seq data

Flag-SSX ChIP-seq and Input read coverage data in bdg format, Peak data. The ChIP-seq data are processed as follows: The raw ChIP-seq data (FASTQ format) were first adaptor-trimmed, then mapped to the human reference genome (hg19) using Bowtie2 program (version 2.1.0) with the default setting. After removing duplicated reads, we used the MACS2 (version 2.1) software to identify peaks using the matched DNA input data as the control. Based on the coordinate of called peaks, the overlap rate between two TF binding peaks was analyzed using the Bedtools program. The peaks were ranked by the number of mapped reads within the peak interval and the top 10% of peaks were selected for motif discovery. The summits of the top 10% peaks were extended by 100 bp on either side. Motifs between 5 and 30 bp in length were identified on both strands. We employed the MEME 4.9.1 toolkit to search DNA motifs and enrichment significance for candidate TFs.

Authors

  • Zhang, Yanfeng
0 Citations0 Mentions65% FAIR1.6 Dataset Index
10.17632/7mhyv8f6vp.22024

SS18-SSX2 ChIP-seq data

Flag-SSX ChIP-seq and Input read coverage data in bdg format, Peak data

Authors

  • Zhang, Yanfeng
0 Citations0 Mentions65% FAIR0.7 Dataset Index
10.17632/7mhyv8f6vp.12024

Explanation of all the variables in the database (Version: 1)

No description available

Authors

  • Huang, Yiwen ;
  • Wang, Lijuan ;
  • Huo, Junsheng ;
  • Wu, Qiong ;
  • Wang, Wei ;
  • Chang, Suying ;
  • Zhang, Yanfeng
0 Citations0 Mentions77% FAIR1.9 Dataset Index
10.5061/dryad.57v2100/22019

Data from: High anaemia prevalence and its causes in children aged 6-23 months in rural Qinghai, China: findings from a cross-sectional study (Version: 1)

No description available

Authors

  • Huang, Yiwen ;
  • Wang, Lijuan ;
  • Huo, Junsheng ;
  • Wu, Qiong ;
  • Wang, Wei ;
  • Chang, Suying ;
  • Zhang, Yanfeng
0 Citations0 Mentions77% FAIR1.9 Dataset Index
10.5061/dryad.57v2100/12019

Data of YYB surveys

No description available

Authors

  • Zhang, Yanfeng ;
  • Wu, Qiong ;
  • Wang, Wei ;
  • Van Velthoven, Michelle Helena ;
  • Chang, Suying ;
  • Han, Huijun ;
  • Xing, Ming ;
  • Chen, Li ;
  • Scherpbier, Robert W.
0 Citations0 Mentions77% FAIR1.7 Dataset Index
10.5061/dryad.52mt5/12016

The maternal, newborn and child health (MNCH) household survey data

No description available

Authors

  • Wu, Qiong ;
  • Scherpbier, Robert W. ;
  • Van Velthoven, Michelle Helena ;
  • Chen, Li ;
  • Wang, Wei ;
  • Li, Ye ;
  • Zhang, Yanfeng ;
  • Car, Josip
0 Citations0 Mentions81% FAIR0.3 Dataset Index
10.5061/dryad.bh4kt/12014

Genomic data from the Chinese Rhesus macaque (<em>Macaca mulatta lasiota</em>).

The Chinese rhesus macaque (Macaca mulatta lasiota) is a subspecies of rhesus macaques that mainly resides in western and central China. Due to their anatomical and physiological similarity with human beings, macaques are a common laboratory model. Also, as several macaques species have been sequenced, such as the Indian rhesus macaque and the crab-eating macaque, examination of the Chinese rhesus macaque (CR) genome offers interesting insights into the entire Macaca genus.The DNA sample for data sequencing and analyses was obtained from a five-year old female CR from southwestern China. The genome was sequenced on the IlluminaGAIIx platform, from which 142-Gb of high-quality sequence, representing 47-fold genome coverage for CR. The total size of the assembled CR genome was about 2.84 Gb, providing 47-fold on average. Scaffolds were assigned to the chromosomes according to the synteny displayed with the Indian rhesus macaque and human genome sequences. About 97% of the CR scaffolds could be placed onto chromosomes.

Authors

  • Yan, Guangmei ;
  • Zhang, Guojie ;
  • Fang, Xiaodong ;
  • Zhang, Yanfeng ;
  • Li, Cai ;
  • Ling, Fei ;
  • Cooper, David, N ;
  • Li, Qiye ;
  • Li, Yan ;
  • van Gool, Alain, J ;
  • Du, Hongli ;
  • Chen, Jiesi ;
  • Chen, Ronghua ;
  • Zhang, Pei ;
  • Huang, Zhiyong ;
  • Thompson, John, R ;
  • Meng, Yuhuan ;
  • Bai, Yinqi ;
  • Wang, Jufang ;
  • Zhuo, Min ;
  • Wang, Tao ;
  • Huang, Ying ;
  • Wei, Liqiong ;
  • Li, Jianwen ;
  • Wang, Zhiwen ;
  • Hu, Haofu ;
  • Le, Liang ;
  • Stenson, Peter, D ;
  • Li, Bo ;
  • Liu, Xiaoming ;
  • Ball, Edward, V ;
  • An, Na ;
  • Huang, Quanfei ;
  • Zhang, Yong ;
  • Fan, Wei ;
  • Zhang, Xiuqing ;
  • Li, Yingrui ;
  • Wang, Wen ;
  • Katze, Michael, G ;
  • Su, Bing ;
  • Nielsen, Rasmus ;
  • Yang, Huanming ;
  • Wang, Jun ;
  • Wang, Xiaoning ;
  • Wang, Jian
5 Citations0 Mentions31% FAIR2.5 Dataset Index
10.5524/1000022011