Published on 01 January 2024

Data and Code for B3P2Augur

View Dataset
Gu, Zhifeng;Hao, Yuduo;Wang, Tianyu;Cai, Peiling;Zhang, Yang;Deng, Kejun;Lin, Hao;Lv, Hao

Description

The blood-brain barrier serves as a critical interface between the bloodstream and brain tissue. It plays a pivotal role in safeguarding brain from harmful substances, thus protecting the integrity of the nervous system and preserving overall brain homeostasis. However, the prediction models for B3PPs have been hampered by issue of limited positive data. In response to this challenge, this study aimed to use data augmentation to process the data, in order to develop better prediction models.In this study, we analyzed the amino acid composition and sequence features of blood-brain barrier penetrating peptides, and finally presented B3P2Augur, a novel prediction model using borderline-SMOTE-based data augmentation and machine learning. Further analysis demonstrated that the model performs best on the independent set (AUROC=0.931) with a 25% data augmentation ratio. Additionally, B3P2Augur has been developed into a tool that can be executed on a computer, with the source code freely available.B3P2Augur improves the prediction performance compared with existing models and demonstrates the effectiveness of data augmentation algorithms in predicting blood-brain barrier penetrating peptides, which may be valuable for developing new peptides.

Citations (0)

Mentions (0)

Metrics

Dataset Index

0.1

FAIR Score

13%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

Assigned Domain

Subfield

Computational Theory and Mathematics

Field

Computer Science

Domain

Physical Sciences

Confidence Score

38%

Source

Scholar Data Model

Keywords

Sequence analysisBioinformatic methods development

Normalization Factors

FT

30.77

CTw

1.00

MTw

1.00