Research Article

BERT-PPII: The Polyproline Type II Helix Structure Prediction Model Based on BERT and Multichannel CNN

Table 1

The dataset under strict definition (Strict_data).

DatasetNumber of sequenceNumber of PPIINumber of non-PPIITotal

Training set65613662214944871531109
Test set16409068382819391887
Independent test set9204855201537206392