Research Article
BERT-PPII: The Polyproline Type II Helix Structure Prediction Model Based on BERT and Multichannel CNN
Table 1
The dataset under strict definition (Strict_data).
| Dataset | Number of sequences | Number of PPII | Number of non-PPII | Total |
| --- | --- | --- | --- | --- |
| Training set | 6561 | 36622 | 1494487 | 1531109 |
| Test set | 1640 | 9068 | 382819 | 391887 |
| Independent test set | 920 | 4855 | 201537 | 206392 |
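
The counts in Table 1 imply a strong class imbalance between PPII and non-PPII entries. The following is a minimal sketch, not part of the original work, that checks each row's total against the sum of its two classes and reports the PPII share per split. The names `SPLITS`, `ppii_fraction`, and `totals_consistent` are hypothetical and introduced only for illustration; the numbers are copied from Table 1.

```python
# Counts copied from Table 1 (Strict_data); the dictionary layout is an assumption.
SPLITS = {
    "Training set": {"ppii": 36622, "non_ppii": 1494487, "total": 1531109},
    "Test set": {"ppii": 9068, "non_ppii": 382819, "total": 391887},
    "Independent test set": {"ppii": 4855, "non_ppii": 201537, "total": 206392},
}


def ppii_fraction(counts):
    """Return the share of PPII entries among all entries in one split."""
    return counts["ppii"] / (counts["ppii"] + counts["non_ppii"])


def totals_consistent(counts):
    """Check that the PPII and non-PPII counts add up to the reported total."""
    return counts["ppii"] + counts["non_ppii"] == counts["total"]


if __name__ == "__main__":
    for name, counts in SPLITS.items():
        share = ppii_fraction(counts)
        ok = totals_consistent(counts)
        print(f"{name}: PPII share = {share:.2%}, totals consistent = {ok}")
```

In every split the PPII class makes up between 2% and 3% of the entries, so plain accuracy would be a weak indicator of model quality on this data.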