Benchmark Data
Online Supporing Infomation A:   (1) List of 9,832 enzyme protein sequences classified into 6 main functional families: (EC.1) Oxidoreductases; (EC.2) Transferases; (EC.3) Hydrolases; (EC.4) Lyases; (EC.5) Isomerase; and (EC.6) Ligases. (2) List of 9,850 non-enzyme protein sequences. None of proteins included has more than 40% sequence identity to any other in a same basic subset. See the text of the paper for further explanation. To download the Online Supporting Information A, click Supp-A.
 
Online Supporing Infomation B:   List of 10,442 enzyme sequences for 18 sub-family classes of EC.1 (Oxidoreductases), 8 sub-family classes of EC.2 (Transferases), 5 sub-family classes of EC.3 (Hydrolases), 6 sub-family classes of EC.4 (Lyases), 6 sub-family classes of EC.5 (Isomerases), and 6 sub-family classes of EC.6 (Ligases). None of proteins included has more than 40% sequence identity to any other in a same basic subset. To download the Online Supporting Information B, click Supp-B.