Published on 05 April 2021 |

Version V1

Additional FIle 6 - Base level analysis of Empirical Base Pair Recall, Pileup Mappability, and GC content across the H37Rv genome

View Dataset
Marin, Maximillian Gabriel

Description

This table (additional File 6) contains the following metrics calculated for all base pair positions of the H37Rv reference genome (NC_000962.3) of Mycobacterium tuberculosis: 1) EBR_36CI: The calculated Empirical Base Pair Recall (EBR) score across 36 clinical Mtb isolates for each genomics position 2) Pmap_K50_E4: Pileup Mappability calculated with parameters set to (k = 50 bp, e = up to 4 mismatches) 3) Pmap_K100_E4: Pileup Mappability calculated with parameters set to (k = 100 bp, e = up to 4 mismatches) 4) Pmap_K150_E4: Pileup Mappability calculated with parameters set to (k = 150 bp, e = up to 4 mismatches) 5) GC%_100bpWindowSize: GC % calculated with a 100 bp window size around the position of interest 6) PLC_Tag: Indicates whether the genomic position of H37Rv is defined as a "putative low confidence" position. (Non-PLC or PLC) NOTE: 1: The scores for each genomic position of H37Rv are provided in order in this table. This means there are 4,411,532 rows corresponding to the 4,411,532 positions of H37Rv. NOTE 2: Pileup Mappability scores were not calculated for the first k base positions. (It was not possible to sample k total k-mers from those positions due to their proximity to the end of genome sequence.)

Citations (0)

Mentions (0)

Metrics

Dataset Index

0.9

FAIR Score

81%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Zenodo

Assigned Domain

Subfield

Molecular Biology

Field

Biochemistry, Genetics and Molecular Biology

Domain

Life Sciences

Confidence Score

91%

Source

Open Alex

Keywords

Mycobacterium tuberculosisGenomics

Normalization Factors

FT

30.77

CTw

1.00

MTw

1.00