Supplementary material for the paper:

1. Datasets for the word-vector SVM model and the PC linear model:

NLS Data

Non-NLS Data

2. Datasets for frequent pattern mining:

Nuclear Data

Non-nuclear Data

3. Datasets for the machine learning model:

Training protein dataset

Yeast protein dataset

Hybrid protein dataset

4. All NLSs generated by frequent pattern mining:

Unique NLSs ordered by score

Unique NLSs ordered by enrichment

NLSs including all sources