Automated Organization Profile

Universidade Estadual Paulista - UNESP

Current S-Index

14.2

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

1.4

Average Dataset Index per dataset

Total Datasets

10

Total datasets in this organization

Average FAIR Score

70.8%

Average FAIR Score per dataset

Total Citations

3

Total citations to the organization's datasets

Total Mentions

0

Total mentions of the organization's datasets

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

The Coupler - Arquivos de teste (test files)

Arquivos para teste (Test files) Separadores: Tabulado, virgula, ponto e virgula (tabular, comma, dot and comma)

Authors

  • Castanha, Rafael Gutierres
0 Citations0 Mentions73% FAIR1.6 Dataset Index
10.5281/zenodo.71306132022

The Coupler - Arquivos de teste (test files)

Arquivos para teste (Test files) Separadores: Tabulado, virgula, ponto e virgula (tabular, comma, dot and comma)

Authors

  • Castanha, Rafael Gutierres
0 Citations0 Mentions73% FAIR1.6 Dataset Index
10.5281/zenodo.71306142022

Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Pretraining Datasets raw audios from CORAA

This repository contains all the pretraining datasets used in the paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers by Marcelo Gauy and Marcelo Finger. These datasets are part of a collection of datasets from the TaRSila project (see https://sites.google.com/view/tarsila-c4ai). The audios published here were in part also published with annotations and transcriptions as the CORAA dataset (see https://github.com/nilc-nlp/CORAA). Here we publish the original raw audios from the following datasets (without transcriptions) - ALIP, C-Oral, SP2010, NURC-Recife, NURC-São Paulo and Programa Certas Palavras. In total, the datasets contain about 800 hours of Brazilian Portuguese Speech. The audios have been converted to mp3 to facilitate the upload. ALIP, C-Oral and SP2010 are integrally contained in one file each. Programa Certas Palavras and NURC-Recife are split in 3 parts each, while NURC-SP is split in 7 parts of roughly equal size. More information on the datasets can be found in the paper Acoustic models of Brazilian Portuguese Speech based on Neural Transformers as well as on the original references which created these datasets.

Authors

  • Matheus Gauy, Marcelo ;
  • Finger, Marcelo ;
  • Aluisio, Sandra Maria ;
  • Svartman, Flaviane Romani Fernandes ;
  • Candido Junior, Arnaldo ;
  • Casanova, Edresson ;
  • Leite, Marli Quadros ;
  • Soares, Anderson ;
  • Oliveira, Frederico Santos de ;
  • Oliveira, Lucas ;
  • Fernandes Jr, Ricardo ;
  • Silva, Daniel da ;
  • Fayet, Fernando Gorgulho ;
  • Carlotto, Bruno Baldissera ;
  • Gris, Lucas R ;
  • Santos, Vinícius Gonçalves dos
0 Citations0 Mentions73% FAIR1.8 Dataset Index
10.5281/zenodo.67949242022

Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Pretraining Datasets raw audios from CORAA

This repository contains all the pretraining datasets used in the paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers by Marcelo Gauy and Marcelo Finger. These datasets are part of a collection of datasets from the TaRSila project (see https://sites.google.com/view/tarsila-c4ai). The audios published here were in part also published with annotations and transcriptions as the CORAA dataset (see https://github.com/nilc-nlp/CORAA). Here we publish the original raw audios from the following datasets (without transcriptions) - ALIP, C-Oral, SP2010, NURC-Recife, NURC-São Paulo and Programa Certas Palavras. In total, the datasets contain about 800 hours of Brazilian Portuguese Speech. The audios have been converted to mp3 to facilitate the upload. ALIP, C-Oral and SP2010 are integrally contained in one file each. Programa Certas Palavras and NURC-Recife are split in 3 parts each, while NURC-SP is split in 7 parts of roughly equal size. More information on the datasets can be found in the paper Acoustic models of Brazilian Portuguese Speech based on Neural Transformers as well as on the original references which created these datasets.

Authors

  • Matheus Gauy, Marcelo ;
  • Finger, Marcelo ;
  • Aluisio, Sandra Maria ;
  • Svartman, Flaviane Romani Fernandes ;
  • Candido Junior, Arnaldo ;
  • Casanova, Edresson ;
  • Leite, Marli Quadros ;
  • Soares, Anderson ;
  • Oliveira, Frederico Santos de ;
  • Oliveira, Lucas ;
  • Fernandes Jr, Ricardo ;
  • Silva, Daniel da ;
  • Fayet, Fernando Gorgulho ;
  • Carlotto, Bruno Baldissera ;
  • Gris, Lucas R ;
  • Santos, Vinícius Gonçalves dos
0 Citations0 Mentions79% FAIR0.3 Dataset Index
10.5281/zenodo.67949232022

Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Refinement dataset SPIRA

This dataset was collected over the internet and in hospital wards with the goal of detecting respiratory insufficiency (typically caused by COVID-19). This data collection is part of the SPIRA Project, whose goal is developing a system for recognizing respiratory insufficiency through speech analysis. The datasets presented here were used in the paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers by Marcelo Gauy and Marcelo Finger. The spira_trimmed_data file contains the original ~1 hour dataset collected over the internet (control) and in hospital wards (patients) by the SPIRA Project. This is as described in the paper: Deep learning against COVID-19: Respiratory insufficiency detection in Brazilian Portuguese Speech. We include it here for completeness. The spira_control_full_mp3 file contains the complete ~18 hours control data collected over the internet by the SPIRA project. While not useful for respiratory insufficiency detection, the dataset may be used for identifying age and gender as we mention in our paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers.

Authors

  • Matheus Gauy, Marcelo ;
  • Finger, Marcelo ;
  • Aluisio, Sandra Maria ;
  • Spazzapan, Evelyn Alves ;
  • Berti, Larissa Cristina ;
  • Camargo Neto, Augusto César de ;
  • Candido Junior, Arnaldo ;
  • Casanova, Edresson ;
  • Svartman, Flaviane Romani Fernandes ;
  • Ferreira, Renato Cordeiro ;
  • Fernandes Jr, Ricardo ;
  • Goldman, Alfredo ;
  • Gris, Lucas R ;
  • Leyton, Pedro ;
  • Levin, Anna Sara Shafferman ;
  • Martins, Marcus Vinicíus Moreira ;
  • Queiroz, Marcelo Gomes de ;
  • Quirino, J Henrique ;
  • Medeiros, Beatriz Raposo de ;
  • Sabino, Ester Cerdeira ;
  • Silva, Daniel da
1 Citation0 Mentions77% FAIR2.2 Dataset Index
10.5281/zenodo.66724502022

Acoustic models of Brazilian Portuguese Speech based on Neural Transformers - Refinement dataset SPIRA

This dataset was collected over the internet and in hospital wards with the goal of detecting respiratory insufficiency (typically caused by COVID-19). This data collection is part of the SPIRA Project, whose goal is developing a system for recognizing respiratory insufficiency through speech analysis. The datasets presented here were used in the paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers by Marcelo Gauy and Marcelo Finger. The spira_trimmed_data file contains the original ~1 hour dataset collected over the internet (control) and in hospital wards (patients) by the SPIRA Project. This is as described in the paper: Deep learning against COVID-19: Respiratory insufficiency detection in Brazilian Portuguese Speech. We include it here for completeness. The spira_control_full_mp3 file contains the complete ~18 hours control data collected over the internet by the SPIRA project. While not useful for respiratory insufficiency detection, the dataset may be used for identifying age and gender as we mention in our paper: Acoustic models of Brazilian Portuguese Speech based on Neural Transformers.

Authors

  • Matheus Gauy, Marcelo ;
  • Finger, Marcelo ;
  • Aluisio, Sandra Maria ;
  • Spazzapan, Evelyn Alves ;
  • Berti, Larissa Cristina ;
  • Camargo Neto, Augusto César de ;
  • Candido Junior, Arnaldo ;
  • Casanova, Edresson ;
  • Svartman, Flaviane Romani Fernandes ;
  • Ferreira, Renato Cordeiro ;
  • Fernandes Jr, Ricardo ;
  • Goldman, Alfredo ;
  • Gris, Lucas R ;
  • Leyton, Pedro ;
  • Levin, Anna Sara Shafferman ;
  • Martins, Marcus Vinicíus Moreira ;
  • Queiroz, Marcelo Gomes de ;
  • Quirino, J Henrique ;
  • Medeiros, Beatriz Raposo de ;
  • Sabino, Ester Cerdeira ;
  • Silva, Daniel da
2 Citations0 Mentions77% FAIR2.6 Dataset Index
10.5281/zenodo.66724512022

aplicando_analise_critica_discurso_perspectiva_securitizacao.pdf (Version: 1.1)

No description available

Authors

  • Contrera, Flávio ;
  • Mariano, Karina Lilia Pasquariello ;
  • Menezes, Roberto Goulart
0 Citations0 Mentions15% FAIR0.3 Dataset Index
10.48331/scielodata.vqptcy/1qwqle2021

Notas metodológicas: Aplicando a Análise Crítica do Discurso à perspectiva da securitização (Version: 1.1)

A nota detalha os procedimentos metodológicos do artigo “Retórica da ameaça e securitização. A política migratória dos Estados Unidos na administração Trump”, principalmente o método da Análise Crítica do Discurso.

Authors

  • Contrera, Flávio ;
  • Mariano, Karina Lilia Pasquariello ;
  • Menezes, Roberto Goulart
0 Citations0 Mentions88% FAIR1.9 Dataset Index
10.48331/scielodata.vqptcy2021

SELECTION METHODS TO OPTIMIZE THE GAIN AND GENETIC DIVERSITY IN Pinus caribaea var. caribaea

The proposal of this work was to estimate the genetic variability in orchards of Pinus caribaea var. caribaea based on growth traits and to analyze the best selection method. This study was conducted in two areas of P. caribaea var. caribaea situated in Savannah biome. The first orchard was a randomized complete block design with 76 progenies and 4 controls (area 1), the second orchard, the lattice design was 10x10 with 99 progenies and one control (area 2), 28 and 27 years old, respectively. The software SELEGEN was used to estimate genetic parameters trough REML/BLUP method. Significant variation was observed between and with progeny all traits in area 2 and only between plants within plots for height in area 1. The highest estimates of genetic variation and heritability were obtained for area 1. Without the optimization of selection, the highest gain (4.8%) in the selection between and within with a selection intensity of 52%, for area 1. In area 2, the highest gain (2.86%) in individual selection. We conclude that there is low genetic variability in seedlings orchards of P. caribaea var. caribaea. However, area 1 presents higher genetic control than area 2, and should be better explored. For the next generations, it is recommended the infusion of new genetic material to proceed with a forest improvement program, since it was observed low variability and low gains in the selection of P. caribaea var. caribaea.

Authors

  • Daniele, Zulian,
0 Citations0 Mentions73% FAIR1.6 Dataset Index
10.5281/zenodo.36533832020

SELECTION METHODS TO OPTIMIZE THE GAIN AND GENETIC DIVERSITY IN Pinus caribaea var. caribaea

The proposal of this work was to estimate the genetic variability in orchards of Pinus caribaea var. caribaea based on growth traits and to analyze the best selection method. This study was conducted in two areas of P. caribaea var. caribaea situated in Savannah biome. The first orchard was a randomized complete block design with 76 progenies and 4 controls (area 1), the second orchard, the lattice design was 10x10 with 99 progenies and one control (area 2), 28 and 27 years old, respectively. The software SELEGEN was used to estimate genetic parameters trough REML/BLUP method. Significant variation was observed between and with progeny all traits in area 2 and only between plants within plots for height in area 1. The highest estimates of genetic variation and heritability were obtained for area 1. Without the optimization of selection, the highest gain (4.8%) in the selection between and within with a selection intensity of 52%, for area 1. In area 2, the highest gain (2.86%) in individual selection. We conclude that there is low genetic variability in seedlings orchards of P. caribaea var. caribaea. However, area 1 presents higher genetic control than area 2, and should be better explored. For the next generations, it is recommended the infusion of new genetic material to proceed with a forest improvement program, since it was observed low variability and low gains in the selection of P. caribaea var. caribaea.

Authors

  • Daniele, Zulian,
0 Citations0 Mentions79% FAIR0.3 Dataset Index
10.5281/zenodo.36533822020