Skip to main content

Table 1 Details of training dataset, validation dataset and independent testing dataset

From: Large-scale prediction of protein ubiquitination sites using a multimodal deep architecture

Data set

Description

Number of sequences

Number of positive data

Number of negative data

Note

Training

12,100

7733

250,054

Random partitioning in each training iteration

Validation

1547

50,010

Testing

1345

6293

46,080

Reservation