Published on 01 January 2022
Bacillus Carbohydrate Metabolism Protvec model
View DatasetDescription
Protvec model trained using 8,743 sequences from the Genome Taxonomy Database (GTDB). Sequences were filtered to remove sequences containing 'X', sequences shorter than 30 amino acids and sequences longer than 1024 amino acids. Training used a vector size of 100 and a context size of 25 to produce a dictionary object containing a 100-dimensional vector for each 3-mer present in the training data.
Model is stored as a .pkl file which can be imported using the Python pickle module.
Citations (1)
Cited on 01 January 2026
Weight: 1.00
Mentions (3)
- https://github.com/susiegriggo/ProtvecBacterialProteinsSoftware Heritage
Mentioned on 20 November 2024
Weight: 1.36
- https://github.com/susiegriggo/ProtvecHierachySoftware Heritage
Mentioned on 09 March 2023
Weight: 1.23
- https://github.com/susiegriggo/ProtvecHierachy-Software Heritage
Mentioned on 08 March 2023
Weight: 1.23
Metrics Over Time
Publication Details
Subfield
Molecular Biology
Field
Biochemistry, Genetics and Molecular Biology
Domain
Life Sciences
Confidence Score
46%
Source
Scholar Data Model