Automated Author Profile

Backofen, Rolf

Current S-Index

10.0

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

0.5

Average Dataset Index per dataset

Total Datasets

19

Total datasets for this author

Average FAIR Score

74.5%

Average FAIR Score per dataset

Total Citations

12

Total citations to the author's datasets

Total Mentions

0

Total mentions of the author's datasets

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

Supporting data for "StoatyDive: Evaluation and Classification of Peak Profiles for Sequencing Data"

The prediction of binding sites (peak-calling) is a common task in the data analysis of methods such as crosslinking immunoprecipitation in combination with high-throughput sequencing (CLIP-Seq). The predicted binding sites are often further analyzed to predict sequence motifs or structure patterns. When looking at a typical result of such high-throughput experiments, the obtained peak profiles differ largely on a genomic level. Thus, a tool is missing that evaluates and classifies the predicted peaks based on their shapes. We hereby present StoatyDive, a tool that can be used to filter for specific peak profile shapes of sequencing data such as CLIP.
With StoatyDive we are able to classify peak profile shapes from CLIP-seq data of the histone stem-loop-binding protein (SLBP). We compare the results to existing tools and show that StoatyDive finds more distinct peak shape clusters for CLIP data. Furthermore, we present StoatyDives capabilities as a quality control tool and as a filter to pick different shapes based on biological or technical questions for other CLIP data from different RNA binding proteins with different biological functions and numbers of RNA recognition motifs. We finally show that proteins involved in splicing, such as RBM22 and U2AF1, have potentially sharper-shaped peaks than other RNA binding proteins.
StoatyDive finally fills the demand for a peak shape clustering tool for CLIP-Seq data that fine-tunes downstream analysis steps such as structure or sequence motif predictions and that acts as a quality control.

Authors

  • Heyl, Florian ;
  • Backofen, Rolf
1 Citation0 Mentions31% FAIR1.1 Dataset Index
10.5524/1008962021

Additional file 3 of Improving CLIP-seq data analysis by incorporating transcript information

Additional file 3 Table S5: Motif search results for 9 RBPs and 28 binding motifs collected from various sources (.xlsx)

Authors

  • Uhl, Michael ;
  • Tran, Van Dinh ;
  • Backofen, Rolf
1 Citation0 Mentions85% FAIR0.6 Dataset Index
10.6084/m9.figshare.13467341.v12020

Comprehensive search for accessory proteins encoded with archaeal and bacterial type III CRISPR-<i>cas</i> gene cassettes reveals 39 new <i>cas</i> gene families

A study was undertaken to identify conserved proteins that are encoded adjacent to cas gene cassettes of Type III CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats – CRISPR associated) interference modules. Type III modules have been shown to target and degrade dsDNA, ssDNA and ssRNA and are frequently intertwined with cofunctional accessory genes, including genes encoding CRISPR-associated Rossman Fold (CARF) domains. Using a comparative genomics approach, and defining a Type III association score accounting for coevolution and specificity of flanking genes, we identified and classified 39 new Type III associated gene families. Most archaeal and bacterial Type III modules were seen to be flanked by several accessory genes, around half of which did not encode CARF domains and remain of unknown function. Northern blotting and interference assays in Synechocystis confirmed that one particular non-CARF accessory protein family was involved in crRNA maturation. Non-CARF accessory genes were generally diverse, encoding nuclease, helicase, protease, ATPase, transporter and transmembrane domains with some encoding no known domains. We infer that additional families of non-CARF accessory proteins remain to be found. The method employed is scalable for potential application to metagenomic data once automated pipelines for annotation of CRISPR-Cas systems have been developed. All accessory genes found in this study are presented online in a readily accessible and searchable format for researchers to audit their model organism of choice: http://accessory.crispr.dk.

Authors

  • Shah, Shiraz A. ;
  • Alkhnbashi, Omer S. ;
  • Behler, Juliane ;
  • Wenyuan Han ;
  • Qunxin She ;
  • Hess, Wolfgang R. ;
  • Garrett, Roger A. ;
  • Backofen, Rolf
1 Citation0 Mentions85% FAIR0.5 Dataset Index
10.6084/m9.figshare.65871262020

Comprehensive search for accessory proteins encoded with archaeal and bacterial type III CRISPR-<i>cas</i> gene cassettes reveals 39 new <i>cas</i> gene families

A study was undertaken to identify conserved proteins that are encoded adjacent to cas gene cassettes of Type III CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats – CRISPR associated) interference modules. Type III modules have been shown to target and degrade dsDNA, ssDNA and ssRNA and are frequently intertwined with cofunctional accessory genes, including genes encoding CRISPR-associated Rossman Fold (CARF) domains. Using a comparative genomics approach, and defining a Type III association score accounting for coevolution and specificity of flanking genes, we identified and classified 39 new Type III associated gene families. Most archaeal and bacterial Type III modules were seen to be flanked by several accessory genes, around half of which did not encode CARF domains and remain of unknown function. Northern blotting and interference assays in Synechocystis confirmed that one particular non-CARF accessory protein family was involved in crRNA maturation. Non-CARF accessory genes were generally diverse, encoding nuclease, helicase, protease, ATPase, transporter and transmembrane domains with some encoding no known domains. We infer that additional families of non-CARF accessory proteins remain to be found. The method employed is scalable for potential application to metagenomic data once automated pipelines for annotation of CRISPR-Cas systems have been developed. All accessory genes found in this study are presented online in a readily accessible and searchable format for researchers to audit their model organism of choice: http://accessory.crispr.dk.

Authors

  • Shah, Shiraz A. ;
  • Alkhnbashi, Omer S. ;
  • Behler, Juliane ;
  • Wenyuan Han ;
  • Qunxin She ;
  • Hess, Wolfgang R. ;
  • Garrett, Roger A. ;
  • Backofen, Rolf
0 Citations0 Mentions15% FAIR0.2 Dataset Index
10.6084/m9.figshare.6587126.v32020

CRISPR-Cas systems in multicellular cyanobacteria

Novel CRISPR-Cas systems possess substantial potential for genome editing and manipulation of gene expression. The types and numbers of CRISPR-Cas systems vary substantially between different organisms. Some filamentous cyanobacteria harbor > 40 different putative CRISPR repeat-spacer cassettes, while the number of cas gene instances is much lower. Here we addressed the types and diversity of CRISPR-Cas systems and of CRISPR-like repeat-spacer arrays in 171 publicly available genomes of multicellular cyanobacteria. The number of 1328 repeat-spacer arrays exceeded the total of 391 encoded Cas1 proteins suggesting a tendency for fragmentation or the involvement of alternative adaptation factors. The model cyanobacterium Anabaena sp. PCC 7120 contains only three cas1 genes but hosts three Class 1, possibly one Class 2 and five orphan repeat-spacer arrays, all of which exhibit crRNA-typical expression patterns suggesting active transcription, maturation and incorporation into CRISPR complexes. The CRISPR-Cas system within the element interrupting the Anabaena sp. PCC 7120 fdxN gene, as well as analogous arrangements in other strains, occupy the genetic elements that become excised during the differentiation-related programmed site-specific recombination. This fact indicates the propensity of these elements for the integration of CRISPR-cas systems and points to a previously not recognized connection. The gene all3613 resembling a possible Class 2 effector protein is linked to a short repeat-spacer array and a single tRNA gene, similar to its homologs in other cyanobacteria. The diversity and presence of numerous CRISPR-Cas systems in DNA elements that are programmed for homologous recombination make filamentous cyanobacteria a prolific resource for their study. Abbreviations: Cas: CRISPR associated sequences; CRISPR: Clustered Regularly Interspaced Short Palindromic Repeats; C2c: Class 2 candidate; SDR: small dispersed repeat; TSS: transcriptional start site; UTR: untranslated region.

Authors

  • Shengwei Hou ;
  • Brenes-Álvarez, Manuel ;
  • Reimann, Viktoria ;
  • Alkhnbashi, Omer S. ;
  • Backofen, Rolf ;
  • Muro-Pastor, Alicia M. ;
  • Hess, Wolfgang R.
0 Citations0 Mentions15% FAIR0.2 Dataset Index
10.6084/m9.figshare.6807191.v32020

CRISPR-Cas systems in multicellular cyanobacteria

Novel CRISPR-Cas systems possess substantial potential for genome editing and manipulation of gene expression. The types and numbers of CRISPR-Cas systems vary substantially between different organisms. Some filamentous cyanobacteria harbor > 40 different putative CRISPR repeat-spacer cassettes, while the number of cas gene instances is much lower. Here we addressed the types and diversity of CRISPR-Cas systems and of CRISPR-like repeat-spacer arrays in 171 publicly available genomes of multicellular cyanobacteria. The number of 1328 repeat-spacer arrays exceeded the total of 391 encoded Cas1 proteins suggesting a tendency for fragmentation or the involvement of alternative adaptation factors. The model cyanobacterium Anabaena sp. PCC 7120 contains only three cas1 genes but hosts three Class 1, possibly one Class 2 and five orphan repeat-spacer arrays, all of which exhibit crRNA-typical expression patterns suggesting active transcription, maturation and incorporation into CRISPR complexes. The CRISPR-Cas system within the element interrupting the Anabaena sp. PCC 7120 fdxN gene, as well as analogous arrangements in other strains, occupy the genetic elements that become excised during the differentiation-related programmed site-specific recombination. This fact indicates the propensity of these elements for the integration of CRISPR-cas systems and points to a previously not recognized connection. The gene all3613 resembling a possible Class 2 effector protein is linked to a short repeat-spacer array and a single tRNA gene, similar to its homologs in other cyanobacteria. The diversity and presence of numerous CRISPR-Cas systems in DNA elements that are programmed for homologous recombination make filamentous cyanobacteria a prolific resource for their study. Abbreviations: Cas: CRISPR associated sequences; CRISPR: Clustered Regularly Interspaced Short Palindromic Repeats; C2c: Class 2 candidate; SDR: small dispersed repeat; TSS: transcriptional start site; UTR: untranslated region.

Authors

  • Shengwei Hou ;
  • Brenes-Álvarez, Manuel ;
  • Reimann, Viktoria ;
  • Alkhnbashi, Omer S. ;
  • Backofen, Rolf ;
  • Muro-Pastor, Alicia M. ;
  • Hess, Wolfgang R.
1 Citation0 Mentions85% FAIR0.7 Dataset Index
10.6084/m9.figshare.68071912020

Additional file 1 of Improving CLIP-seq data analysis by incorporating transcript information

Additional file 1 Table S1: Exon overlap statistics of ENCODE eCLIP datasets (.xlsx)

Authors

  • Uhl, Michael ;
  • Tran, Van Dinh ;
  • Backofen, Rolf
1 Citation0 Mentions85% FAIR0.5 Dataset Index
10.6084/m9.figshare.13467333.v12020

Comprehensive search for accessory proteins encoded with archaeal and bacterial Type III CRISPR-Cas gene cassettes reveals 39 new <i>cas</i> gene families

A study was undertaken to identify conserved proteins that are encoded adjacent to cas gene cassettes of Type III CRISPR-Cas (Clusters of Regularly IntersSpaced Palindromic Repeats - CRISPR associated) interference modules. Type III modules have been shown to target and degrade dsDNA, ssDNA and ssRNA and are frequently intertwined with cofunctional accessory genes, including genes encoding CRISPR-associated Rossman Fold (CARF) domains. Using a comparative genomics approach, and defining a Type III association score accounting for coevolution and specificity of flanking genes, we identified and classified 39 new Type III associated gene families. Most archaeal and bacterial Type III modules were seen to be flanked by several accessory genes, around half of which did not encode CARF domains and remain of unknown function. Northern blotting and interference assays in Synechocystis confirmed that one particular non-CARF accessory protein family was involved in crRNA maturation. Non-CARF accessory genes were generally diverse, encoding nuclease, helicase, protease, ATPase, transporter and transmembrane domains with some encoding no known domains. We infer that additional families of non-CARF accessory proteins remain to be found. The method employed is scalable for potential application on metagenomic data once automated pipelines for annotation of CRISPR-Cas systems have been developed. All accessory genes found in this study are presented online in a readily accessible and searchable format for researchers to audit their model organism of choice: http://accessory.crispr.dk.

Authors

  • Shah, Shiraz A. ;
  • Alkhnbashi, Omer S. ;
  • Behler, Juliane ;
  • Wenyuan Han ;
  • Qunxin She ;
  • Hess, Wolfgang R. ;
  • Garrett, Roger A. ;
  • Backofen, Rolf
0 Citations0 Mentions85% FAIR0.1 Dataset Index
10.6084/m9.figshare.6587126.v12018

Comprehensive search for accessory proteins encoded with archaeal and bacterial type III CRISPR-<i>cas</i> gene cassettes reveals 39 new <i>cas</i> gene families

A study was undertaken to identify conserved proteins that are encoded adjacent to cas gene cassettes of Type III CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats – CRISPR associated) interference modules. Type III modules have been shown to target and degrade dsDNA, ssDNA and ssRNA and are frequently intertwined with cofunctional accessory genes, including genes encoding CRISPR-associated Rossman Fold (CARF) domains. Using a comparative genomics approach, and defining a Type III association score accounting for coevolution and specificity of flanking genes, we identified and classified 39 new Type III associated gene families. Most archaeal and bacterial Type III modules were seen to be flanked by several accessory genes, around half of which did not encode CARF domains and remain of unknown function. Northern blotting and interference assays in Synechocystis confirmed that one particular non-CARF accessory protein family was involved in crRNA maturation. Non-CARF accessory genes were generally diverse, encoding nuclease, helicase, protease, ATPase, transporter and transmembrane domains with some encoding no known domains. We infer that additional families of non-CARF accessory proteins remain to be found. The method employed is scalable for potential application to metagenomic data once automated pipelines for annotation of CRISPR-Cas systems have been developed. All accessory genes found in this study are presented online in a readily accessible and searchable format for researchers to audit their model organism of choice: http://accessory.crispr.dk.

Authors

  • Shah, Shiraz A. ;
  • Alkhnbashi, Omer S. ;
  • Behler, Juliane ;
  • Wenyuan Han ;
  • Qunxin She ;
  • Hess, Wolfgang R. ;
  • Garrett, Roger A. ;
  • Backofen, Rolf
1 Citation0 Mentions85% FAIR0.5 Dataset Index
10.6084/m9.figshare.6587126.v22018

CRISPR-Cas Systems in Multicellular Cyanobacteria

Novel CRISPR-Cas systems possess substantial potential for genome editing and manipulation of gene expression. The types and numbers of CRISPR-Cas systems vary substantially between different organisms. Some filamentous cyanobacteria harbor >40 different putative CRISPR repeat-spacer cassettes, while the number of cas gene instances is much lower. Here we addressed the types and diversity of CRISPR-Cas systems and of CRISPR-like repeat-spacer arrays in 171 publicly available genomes of multicellular cyanobacteria. The number of 1328 repeat-spacer arrays exceeded the total of 391 encoded Cas1 proteins suggesting a tendency for fragmentation or the involvement of alternative adaptation factors. The model cyanobacterium Anabaena sp. PCC 7120 contains only three cas1 genes but hosts at least three Class 1, one Class 2 and five orphan repeat-spacer arrays, all of which exhibit crRNA-typical expression patterns suggesting active transcription, maturation and incorporation into CRISPR complexes. The CRISPR-Cas system within the element interrupting the Anabaena sp. PCC 7120 fdxN gene, as well as analogous arrangements in other strains, occupy the genetic elements that become excised during the differentiation-related programmed site-specific recombination. This fact indicates the propensity of these elements for the integration of CRISPR-cas systems and points to a previously not recognized connection. The possible Class 2 effector protein gene all3613 is linked to a short repeat-spacer array and a single tRNA gene, similar to its homologs in other cyanobacteria. The diversity, high number and presence of CRISPR-Cas systems in DNA elements that are programmed for homologous recombination make filamentous cyanobacteria a prolific resource for their study.

Authors

  • Hess, Wolfgang R. ;
  • Shengwei Hou ;
  • Brenes-Álvarez, Manuel ;
  • Reimann, Viktoria ;
  • Alkhnbashi, Omer S. ;
  • Backofen, Rolf ;
  • Muro-Pastor, Alicia M.
0 Citations0 Mentions85% FAIR0.3 Dataset Index
10.6084/m9.figshare.6807191.v12018