Automated Author ProfileScott, Michelle
Scott, Michelle
Current S-Index
Sum of Dataset Indices for all datasets
Average Dataset Index per Dataset
Average Dataset Index per dataset
Total Datasets
Total datasets for this author
Average FAIR Score
Average FAIR Score per dataset
Total Citations
Total citations to the author's datasets
Total Mentions
Total mentions of the author's datasets
S-Index Interpretation
The S-Index (Sharing Index) is a comprehensive metric that represents the cumulative impact of all your datasets. It is calculated as the sum of Dataset Index scores across all your claimed datasets.
What it means:
- A higher S-index indicates greater overall impact of your datasets relative to typical datasets in their fields of research
- The S-Index grows as you add more datasets or as existing datasets gain more citations and mentions
- It provides a single number to track your research data impact over time
Current S-Index: 14.0 (sum of 12 datasets Dataset Index scores)
More information here.
S-Index Over Time
Cumulative Citations Over Time
Cumulative Mentions Over Time
Datasets
[This repository contains the source data for the workflow presented in the manuscript "Reducing the structure bias of RNA-Seq reveals a large number of non-annotated non-coding RNA". The workflow can be found here: http://gitlabscottgroup.med.usherbrooke.ca/gaspard/snakemake_blockbuster ] The study of RNA expression is the fastest growing area of genomic research. However, despite the dramatic increase in the number of sequenced transcriptomes, we still do not have accurate estimates of the number and expression levels of non-coding RNA genes. Non-coding transcripts are often overlooked due to incomplete genome annotation. In this study, we use annotation-independent detection of RNA reads generated using a reverse transcriptase with low structure bias to identify non-coding RNA. Transcripts between 20 and 500 nucleotides were filtered and crosschecked with non-coding RNA annotations revealing 115 non-annotated non-coding RNAs expressed in different cell lines and tissues. Inspecting the sequence and structural features of these transcripts indicated that 60% of these transcripts correspond to new tRNA and snoRNA genes. The identified genes exhibited features of their respective families in terms of structure, expression, conservation and response to depletion of interacting proteins. Together, our data reveal a new group of RNA that are difficult to detect using standard gene prediction and RNA sequencing techniques, suggesting that reliance on actual gene annotation and sequencing techniques distort the perceived architecture of the human transcriptome.
Authors
- Reulet, Gaspard ;
- Scott, Michelle
[This repository contains the source data for the workflow presented in the manuscript "Reducing the structure bias of RNA-Seq reveals a large number of non-annotated non-coding RNA". The workflow can be found here: http://gitlabscottgroup.med.usherbrooke.ca/gaspard/snakemake_blockbuster ] The study of RNA expression is the fastest growing area of genomic research. However, despite the dramatic increase in the number of sequenced transcriptomes, we still do not have accurate estimates of the number and expression levels of non-coding RNA genes. Non-coding transcripts are often overlooked due to incomplete genome annotation. In this study, we use annotation-independent detection of RNA reads generated using a reverse transcriptase with low structure bias to identify non-coding RNA. Transcripts between 20 and 500 nucleotides were filtered and crosschecked with non-coding RNA annotations revealing 115 non-annotated non-coding RNAs expressed in different cell lines and tissues. Inspecting the sequence and structural features of these transcripts indicated that 60% of these transcripts correspond to new tRNA and snoRNA genes. The identified genes exhibited features of their respective families in terms of structure, expression, conservation and response to depletion of interacting proteins. Together, our data reveal a new group of RNA that are difficult to detect using standard gene prediction and RNA sequencing techniques, suggesting that reliance on actual gene annotation and sequencing techniques distort the perceived architecture of the human transcriptome.
Authors
- Reulet, Gaspard ;
- Scott, Michelle
Additional file 4: Pairwise alignments for the 3-CDS benchmark. Zip file containing the sequence file and the pairwise alignment files at the fasta format for the manually-built3-CDS benchmark considered in the “Results” section, for each of the five methods and each parameter configuration.
Authors
- Jammali, Safa ;
- Esaie Kuitche ;
- Rachati, Ayoub ;
- Bélanger, François ;
- Scott, Michelle ;
- Ouangraoua, Aïda
Additional file 5: Pairwise alignments for the 21-CDS dataset. Zip file containing the sequence file and the pairwise alignment files at the fasta format for the 21-CDS benchmarkconsidered in the “Results” section, for each of the five methods and each parameter configuration.
Authors
- Jammali, Safa ;
- Esaie Kuitche ;
- Rachati, Ayoub ;
- Bélanger, François ;
- Scott, Michelle ;
- Ouangraoua, Aïda
Additional file 5: Pairwise alignments for the 21-CDS dataset. Zip file containing the sequence file and the pairwise alignment files at the fasta format for the 21-CDS benchmarkconsidered in the “Results” section, for each of the five methods and each parameter configuration.
Authors
- Jammali, Safa ;
- Esaie Kuitche ;
- Rachati, Ayoub ;
- Bélanger, François ;
- Scott, Michelle ;
- Ouangraoua, Aïda
Additional file 2: CDS of the ten gene families. Zip file containing the CDS files at the fasta format for each of the ten gene families considered in the “Results” section.
Authors
- Jammali, Safa ;
- Esaie Kuitche ;
- Rachati, Ayoub ;
- Bélanger, François ;
- Scott, Michelle ;
- Ouangraoua, Aïda
Additional file 4: Pairwise alignments for the 3-CDS benchmark. Zip file containing the sequence file and the pairwise alignment files at the fasta format for the manually-built3-CDS benchmark considered in the “Results” section, for each of the five methods and each parameter configuration.
Authors
- Jammali, Safa ;
- Esaie Kuitche ;
- Rachati, Ayoub ;
- Bélanger, François ;
- Scott, Michelle ;
- Ouangraoua, Aïda
Additional file 2: CDS of the ten gene families. Zip file containing the CDS files at the fasta format for each of the ten gene families considered in the “Results” section.
Authors
- Jammali, Safa ;
- Esaie Kuitche ;
- Rachati, Ayoub ;
- Bélanger, François ;
- Scott, Michelle ;
- Ouangraoua, Aïda
Classification of all NLS motifs considered in this study, including their sequence, class, regulation modes, number of encoding transcripts and number of encoding exons. (XLSX 23 kb)
Authors
- Mikael-Jonathan Luce ;
- Akpawu, Anna ;
- Tucunduva, Daniel ;
- Mason, Spencer ;
- Scott, Michelle
Classification of all NES motifs considered in this study, including their sequence, regulation modes, number of encoding transcripts and number of encoding exons. (XLSX 15 kb)
Authors
- Mikael-Jonathan Luce ;
- Akpawu, Anna ;
- Tucunduva, Daniel ;
- Mason, Spencer ;
- Scott, Michelle