Benchmark Data
Online Supporting Information A. The Benchmark Dataset. The benchmark dataset Sbench consists of 80 proteins. Listed below are their PDB codes and amino acid sequences. The number in brackets after each PDB code is the value of ln(Kf), where Kf is the experimental apparent folding rate constant of the corresponding protein. See the text of the paper for further explanation. To download the data in Online Supporting Information A, click Supp-A.
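The record layout described above (a PDB code, a bracketed ln(Kf) value, and an amino acid sequence) can be read with a short script. The following is only a minimal sketch: the exact file layout of Supp-A is not specified here, so the header pattern (a four-character code followed by a number in square or round brackets, with the sequence on the following lines) is an assumption, and the function name parse_benchmark is hypothetical. The sample records are made-up placeholders, not entries from the dataset.

```python
import math
import re

# ASSUMED record layout (not confirmed by the source): a header line such as
# "TST1 [2.0]" followed by one or more lines of one-letter amino acid codes.
HEADER = re.compile(r"^\s*(\w{4})\s*[\[(]\s*(-?\d+(?:\.\d+)?)\s*[\])]\s*$")


def parse_benchmark(text):
    """Return a list of dicts with keys pdb, ln_kf, kf, and sequence."""
    records = []
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank separator lines
        match = HEADER.match(line)
        if match:
            # Start a new record at each header line.
            current = {
                "pdb": match.group(1),
                "ln_kf": float(match.group(2)),
                "sequence": "",
            }
            records.append(current)
        elif current is not None:
            # Any other non-empty line continues the current sequence.
            current["sequence"] += line
    for rec in records:
        # Recover Kf from the tabulated natural logarithm.
        rec["kf"] = math.exp(rec["ln_kf"])
    return records


if __name__ == "__main__":
    # Made-up placeholder records, NOT taken from the benchmark dataset.
    sample = "TST1 [2.0]\nACDE\nFGHI\n\nTST2 (-1.5)\nKLMN\n"
    for rec in parse_benchmark(sample):
        print(rec["pdb"], rec["ln_kf"], len(rec["sequence"]))
```

If the downloaded file uses a different layout (for example, the code and sequence on a single line), only the HEADER pattern and the line-joining step would need to change.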
 
Online Supporting Information B. Nine different features derived from the 80 protein sequences in Online Supporting Information A. For the definitions of these features, see Eq. 5 of the paper and the accompanying text. The PDB codes are given here only to indicate which proteins were used in this study. Note that only the sequence information of these proteins, and none of their 3D structural information, was used to train the predictor Pred-PFR, as mentioned in the paper. To download the data in Online Supporting Information B, click Supp-B.