NLSExplorer Result Page
Position | Prediction type NLS indicates Nuclear Localization Signals, NIA means potential Nuclear Important Area in protein sequence. | Length | Recommendation score | Entropy score | Rcommended NLS segment |
130-141 | NLS | 12 | 0.99021 | 2.9183 | TGTKRKLDEYLD |
144-163 | NLS | 20 | 0.9553 | 3.44145 | QGVVGQFNKIKLRPKYKKST |
Segment Similarity Information
For each NLS, we present the top 5 predicted segments from the NLS candidate library based on their similarity. The NLS candidate library was constructed by running NLSExplorer on nuclear proteins from Swiss-Prot, it contains a large number of known and potential NLSs.
NLS | Similar NLS | Uni ID | Species | Probability | Position |
---|---|---|---|---|---|
TGTKRKLDEYLD | SKKRKSP | O13911 | Schizosaccharomyces pombe (strain 972 / ATCC 24843) | 0.9855445027351379 | 2-8 |
AMKRKL | Q0UZ59 | Phaeosphaeria nodorum (strain SN15 / ATCC MYA-4574 / FGSC 10173) | 0.9783025979995728 | 1-6 | |
RGTKRRR | Q9U1Y5 | Caenorhabditis elegans | 0.9877992868423462 | 2-8 | |
KKRAR | Q4HZK7 | Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) | 0.9893062710762024 | 2-6 | |
ATKRKA | Q4WRC2 | Aspergillus fumigatus (strain ATCC MYA-4609 / CBS 101355 / FGSC A1100 / Af293) | 0.9835029244422913 | 1-6 | |
QGVVGQFNKIKLRPKYKKST | PKRVLKG | Q5AR44 | Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) | 0.9051908254623413 | 1-7 |
QLRKRKRV | Q02749 | Saccharomyces cerevisiae (strain ATCC 204508 / S288c) | 0.9466467499732971 | 3-10 | |
KKLKGKRSKI | P39529 | Saccharomyces cerevisiae (strain ATCC 204508 / S288c) | 0.9561310410499573 | 14-23 | |
KKRGVL | Q68F54 | Xenopus laevis | 0.9105245471000671 | 1-6 | |
ASGKKPGTKGTKRHRKIT | Q09922 | Schizosaccharomyces pombe (strain 972 / ATCC 24843) | 0.9869093298912048 | 2-19 |
Position | Type NLS indicates Nuclear Localization Signals, NIA means potential Nuclear Important Area in protein sequence. | Length | Recommendation score | Entropy score | Segment |
455-460 | NIA | 6 | 0.01461 | 1.25163 | KKVKKE |
531-538 | NIA | 8 | 0.01387 | 2.5 | VIGKVNRR |
226-235 | NIA | 10 | 0.01269 | 2.37095 | FVVKKSEVKP |
564-579 | NIA | 16 | 0.01094 | 3.25 | AITFAEQKLNCKYKKF |
418-423 | NIA | 6 | 0.01048 | 2.25163 | VNKKST |
406-411 | NIA | 6 | 0.00915 | 2.25163 | PNKYVV |
518-526 | NIA | 9 | 0.00879 | 2.9477 | VIQKYNRFK |
555-562 | NIA | 8 | 0.0073 | 2.40564 | LQGLVPLS |
540-552 | NIA | 13 | 0.00681 | 2.4997 | STTLHNNLLKLLA |
259-264 | NIA | 6 | 0.00667 | 2.25163 | DNRVFV |
13-22 | NIA | 10 | 0.00656 | 2.64644 | ASTPSRASFD |
389-394 | NIA | 6 | 0.0063 | 2.58496 | VSQILK |
508-513 | NIA | 6 | 0.00628 | 2.58496 | GEIKDV |
25-36 | NIA | 12 | 0.00581 | 3.25163 | YSEFCDKQPNDY |
211-216 | NIA | 6 | 0.00568 | 2.25163 | SNRFSD |
170-175 | NIA | 6 | 0.00561 | 2.58496 | LEQTIN |
301-310 | NIA | 10 | 0.00531 | 2.72193 | NCKKCHFVDV |
489-503 | NIA | 15 | 0.00515 | 2.92323 | KNEERLTIAKKNKEF |
Attention distribution along the sequence
The distribution of attention highlights the focus of our computational model on the sequence, with higher attention points indicating a greater likelihood of influencing the sequence's ability to localize to the nucleus.
Sequence visualization according to the attention map
NLSExplorer's inference focus is automatically presented intuitively in the prediction result. It provides a detailed profile of the nuclear localization relevance for each amino acid site, demonstrating both the inherent reliability of the learning model and the potential biological meaningful sites.

The structure statistical of recommended segment
Regarding the large-scale recommended segments, we compile the structural composition of each segment.
The recommended segment visualization on 3d structure
The green, blue, and yellow regions refer to the top three recommended NLS, while the red region represents other regions important for nuclear transport(top-5)