Published on 02 September 2020

The influence of human genetic variation on Epstein-Barr virus sequence diversity

View Dataset
Loetscher Alexis

Description

This project is the first attempt to apply a "genome-to-genome" approach to investigate the impact of the host genetic pressure on the genome of a member of the Herpesviridae family. Namely, 285 pairs of human and EBV genomes were sequenced and multiple GWASes between human and EBV variations were performed. This repository contains the results of downstream analysis on the pathogen-side. The variant calling data was produced from read alignment using BWA mem using GATK HC, SNVer, VarScan2, BCFtools, freebayes and the intersection of the sets of variation from the first three. The "covstats" files contains statistics about the read alignment. The tarball SHCS_EBV_variant_call.tar.gz contains all compressed VCF files. The tarball SHCS_EBV_variant_matrices_stats.tar.gz contains all matrices used as traits in the GWASes, as well as a variety of statistics. The pipeline used to generate this data is publically available here: https://gitlab.com/ezlab/vir_var_calling/

Citations (0)

Mentions (0)

Metrics

Dataset Index

0.3

FAIR Score

13%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Zenodo

Assigned Domain

Subfield

Oncology

Field

Medicine

Domain

Health Sciences

Confidence Score

100%

Source

Open Alex

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00