; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013954 (gene) of Snake gourd v1 genome

Gene IDTan0013954
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionnucleolar protein 12
Genome locationLG01:26157000..26160947
RNA-Seq ExpressionTan0013954
SyntenyTan0013954
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR034220 - RBM34, RNA recognition motif 1
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596731.1 Inositol-tetrakisphosphate 1-kinase 5, partial [Cucurbita argyrosperma subsp. sororia]2.4e-25381.43Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGKKKTN P +  +E+  GSAPSSVF+TLFGNVG E+ GVSIFST+NPFRRKD++S P AA AEK  +GG+DDH IDDV+SRKK KEKRVG DLDSAE G
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE
        VKSSSE+KKSK+KEKSRDREL+N  +E DD E GF+SR V+KKNK+ G        E+ SEFSG+LKIDEQRSD+KLG +VKL KE+KKRKRDELEREYE
Subjt:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE

Query:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
        AKKYGASEVA+D  EGS G VVGKKRKA+DD SEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK LGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
Subjt:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK

Query:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
        KVNEDADSSHAYVVFKT ESAQASL+HNMAVFAGNHIRVDRACPPRKKLKVGS+PIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
Subjt:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP

Query:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG
        NVNIGKGFAYVFFKT+EAANSV+N QKL+LRDR LRLFHAKPNL S PLKKRN PSTE D S AKKRAVDSGLRTPDSSKRVTPKA VASYQGLRASK G
Subjt:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG

Query:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK
        SQKK  AKG+ TKW KS+SNSKEK V HKR               KRPEK SERKGKRPAVANRKAVA AS+NG+ AAPKQ GVKRKSDSRTPESSHRNK
Subjt:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK

Query:  RIK
        R+K
Subjt:  RIK

KAG7028265.1 nop12, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-25581.35Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGKKKTN P +  +E+  GSAPSSVF+TLFGNVG E+ GVSIFST+NPFRRKD++S P AA AEK  +GG+DDH IDDV+SRKK KEKRVG DLDSAE G
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE
        VKSSSE+KKSK+KEKSRDREL+N  +E DD E GF+SR V+KKNK+ G        E+ SEFSG+LKIDEQRSD+KLG +VKL KE+KKRKRDELEREYE
Subjt:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE

Query:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
        AKKYGASEVA+D  EGS G VVGKKRKA+DD SEML+TKEGFDDESKLLRTVFVGNLPLKVKKK LGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
Subjt:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK

Query:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
        KVNEDADSSHAYVVFKT ESAQASL+HNMAVFAGNHIRVDRACPPRKKLKVGS+PIYDPKRTVFVGNLPFDVKDEELYQLFCG+ENVGSSVEAVRVIRDP
Subjt:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP

Query:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG
        NVNIGKGFAYVFFKT+EAANSV+N QKL+LRDR LRLFHAKPNL STPLKKRN PSTE D S AKKRAVDSGLRTPDSSKRVTPKA VASYQGLRASK G
Subjt:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG

Query:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK
        SQKK  AKG+ TKW KS+SNSKEK VDHKR               KRPEK SERKGKRPAVANRKAVA AS+NG+ AAPKQ GVKRKSDSRTPESSHRNK
Subjt:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK

Query:  RIKRLR
        R+KR R
Subjt:  RIKRLR

XP_022922379.1 nucleolar protein 12 [Cucurbita moschata]2.8e-25782.51Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGKKKTN P +  +E+  GSAPSSVF+TLFGNVG E+ GVSIFSTDNPFRRKD++  P AA AEKS +GG+DDH IDDVKSRKK KEKRVG DLDSAE G
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE
        VKSSSE+KKSK+KEKSRDREL+N  +EKDD E GF+SR V+KKNK+ G        E+ SEFSG+LKIDEQRSD+KLG +VKL KE+KKRKRDELEREYE
Subjt:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE

Query:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
        AKKYGASEVA+D  EGS G VVGKKRKA+DD SEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK LGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
Subjt:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK

Query:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
        KVNEDADSSHAYVVFKT ESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGS+PIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
Subjt:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP

Query:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG
        NVNIGKGFAYVFFKT+EAANSV+N QKL+LRDR LRLFHAKPNL STPLKKRN PSTE D S AKKRAVDSGLRTPDSSKRVTPKA VASYQGLRASK G
Subjt:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG

Query:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK
        SQKK  AKG+ TKW KS+SNSKEK V HKR               KRPEK SERKGKRPAVANRKAVA AS+NGV AAPKQ GVKRKSDSRTPESSHRNK
Subjt:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK

Query:  RIKRLR
        R+KRLR
Subjt:  RIKRLR

XP_023005336.1 nucleolar protein 12 [Cucurbita maxima]3.7e-25481.52Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGKKKTN+P +  +E+  GSAPSSVF+TLFGNVG E+ GVSIFST+NPFRRKD++S P AA AEKS +GG+DDH IDDVKSRKK KEKRVG DLDSAE G
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE
        VKSSS++KKSK+K+KSRDREL+N  +EKDD E GF+SR V+KKNK+ G        E+ SEFSG+LKIDEQRSD+KLG +VKL KE+KKRKRDELEREYE
Subjt:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE

Query:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
        AKKYGASEVA+D  EGS G VVGKKRKA+DD SEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK LGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
Subjt:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK

Query:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
        KVNEDADSSHAYVVFKT ESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGS+PIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
Subjt:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP

Query:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG
        NVNIGKGFAYVFFKT+EAANSV+N QKL+LRDR LRLFHAKPNL STPLKKRN PSTE D S AKKRAVDSGLRTPDSSKR   KA VASYQGLRASK  
Subjt:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG

Query:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK
        SQKK  AKG+ TKW KS+SNSKEK VDHKR               KRPEK SERKGKRPAVANRKAVA AS+NGV AAPKQ GVKRKSDSRTPESSHRNK
Subjt:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK

Query:  RIKRLR
        R+KR R
Subjt:  RIKRLR

XP_023540437.1 nucleolar protein 12 [Cucurbita pepo subsp. pepo]2.4e-25381.19Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGKKKTN P +  +E+  GSA SSVF+TLFGNVG E+ GVSIFST+NPFRRKD++S P AA AEKS +GG+D H IDDVKSR+K KEKRVG DLDSAE G
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE
        VKSSSE+KKSK+KEKSRDREL+N  +EKDD E GF+SR V+KKNK+ G        E+ SEFSG+LKIDEQRSD+KLG +VKL KE+KKRKRDELEREYE
Subjt:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE

Query:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
        AKKYGASEVA+D  EGS G VVGKKRKA+DD SEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK LGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
Subjt:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK

Query:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
        KVNEDADSSHAYVVFKT ESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGS+PIY+PKRTVFVGNLPFDVKDEELYQLFCGIENVGS+VEAVRVIRDP
Subjt:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP

Query:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG
        NVNIGKGFAYVFFKT+EAANSV+N QKL+LRDR LRLFHAKPNL STPLKKRN PSTE D S AKKRAVDSGLRTPDSSKR TPKA VASYQGLRASK G
Subjt:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG

Query:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK
        SQKK  AKG+ TKW KS+SNSKEK VD+KR               KRPEK SERKGKRPAVANRKAVA AS+NGV AAPKQ G KRKSDSRTPESSHRNK
Subjt:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK

Query:  RIKRLR
        R+KR R
Subjt:  RIKRLR

TrEMBL top hitse value%identityAlignment
A0A1S3BK91 nucleolar protein 124.5e-22977.18Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGK K  +  NPPDEI T + P SVFDTLFG+ GVEN  VSIFS+DNPFRRKDS+SI    A                  SR KGKEKRVG DLDS  EG
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKK--KEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELERE
        VK+SSEIKKSKK  KEKSRDREL+N     DDGE GFES+  LK N+KKG   GSET E S  F            +KLGENVKLMKERKKRKRDELERE
Subjt:  VKSSSEIKKSKK--KEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELERE

Query:  YEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVP--LANSKKPRKGA
        YEAKKYG S+VA+D  EGS G VVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK L KEFSQFGEI+SVRIRSVP  +ANSKKPRKGA
Subjt:  YEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVP--LANSKKPRKGA

Query:  ILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRV
        I+SKK+NE ADSSHAYVVFKT ESAQASLSHNMAVFAGNHIRVDRACPP KKLKVG+ PIYDPKRTVFVGNLPFDVKDEELYQLFCGI+ +GSSVEAVRV
Subjt:  ILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRV

Query:  IRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRA
        IRDP VN+GKGFAYVFFKTREA NSVVN QKLELR RTLRLFHAK N  STP KKRN+P TEADH+PAKK AVDSGL TPDSSKRVTPKA  ASYQGLRA
Subjt:  IRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRA

Query:  SKTGSQKKIHAKGNSTKWPKSYSNSKEKPVDH-KRKRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKRLR
        SK+GSQKKIH KG+STKWPKS+SNSKEKP+DH KRK PEKTSERKGKRPAVANRKAVA A+KNG+ A PKQ G+KRKSDSR+P SSHRNKR+K+ R
Subjt:  SKTGSQKKIHAKGNSTKWPKSYSNSKEKPVDH-KRKRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKRLR

A0A5A7UHL8 Nucleolar protein 122.6e-22977.35Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGK K  +  NPPDEI T + P SVFDTLFG+ GVEN  VSIFS+DNPFRRKDS+SI    A                  SR KGKEKRVG DLDS  EG
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKK--KEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELERE
        VK+SSEIKKSKK  KEKSRDREL+N     DDGE GFES+  LK N+KKG   GSET E S  F            +KLGENVKLMKERKKRKRDELERE
Subjt:  VKSSSEIKKSKK--KEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELERE

Query:  YEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVP--LANSKKPRKGA
        YEAKKYG S+VA+D  EGS G VVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK L KEFSQFGEI+SVRIRSVP  +ANSKKPRKGA
Subjt:  YEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVP--LANSKKPRKGA

Query:  ILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRV
        I+SKK+NE ADSSHAYVVFKT ESAQASLSHNMAVFAGNHIRVDRACPP KKLKVG+ PIYDPKRTVFVGNLPFDVKDEELYQLFCGI+ +GSSVEAVRV
Subjt:  ILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRV

Query:  IRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRA
        IRDP VN+GKGFAYVFFKTREAANSVVN QKLELR RTLRLFHAK N  STP KKRN+P TEADH+PAKK AVDSGL TPDSSKRVTPKA  ASYQGLRA
Subjt:  IRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRA

Query:  SKTGSQKKIHAKGNSTKWPKSYSNSKEKPVDH-KRKRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKRLR
        SK+GSQKKIH KG+STKWPKS+SNSKEKP+DH KRK PEKTSERKGKRPAVANRKAVA A+KNG+ A PKQ G+KRKSDSR+P SSHRNKR+K+ R
Subjt:  SKTGSQKKIHAKGNSTKWPKSYSNSKEKPVDH-KRKRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKRLR

A0A6J1CWE3 nucleolar protein 125.6e-24077.44Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRK----DSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDS
        MGKKKT++P NP + + + SA SSVF+TLFGNVGV++ GVSIFS+DNPFRR+    DS+SIP AAAAE++   GS+DH +DDVKS K GKEKR G DL S
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRK----DSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDS

Query:  AEEGVKSSSEIKKSKKKEKSRD-RELNNGPVEKDDGES---GFESRSVLKKNKKKGDVSGSE---TLEMSSEFSGKLKIDEQ-------------RSDNK
        AEE  KSSSEIKKSK+KEK+RD R+L NG +E++ GE+   G +S  VLKKNK+KG  SGSE   T E S EFSG LK++EQ             R+D+K
Subjt:  AEEGVKSSSEIKKSKKKEKSRD-RELNNGPVEKDDGES---GFESRSVLKKNKKKGDVSGSE---TLEMSSEFSGKLKIDEQ-------------RSDNK

Query:  LGENVKLMKERKKRKRDELEREYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEI
        LGE  KLMKE+KKRKRDELEREYEA+KYGASEVA+DGEEGS G V GKKRKALDDPSEMLV+KEGFDDESKLLRTVFVGNLPLKVKKK LGKEFS+FGEI
Subjt:  LGENVKLMKERKKRKRDELEREYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEI

Query:  ESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEE
        ESVRIRSVPLANSKKPRKGAILSKKVNE A+SSHAY+VFK VESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEE
Subjt:  ESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEE

Query:  LYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTP
        LYQLFCGI+N+G+SVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDR+LRLFHA PNL STPLKKR++P TEAD +PAKK AV  G  TP
Subjt:  LYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTP

Query:  DSSKRVTPKAPVASYQGLRASKTGS-QKKIHAKGNSTKWPKSYSNSKEKPVDHKRKRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDS
        DSSKR TPK+  ASYQGLR+SKTGS QKK   K    KWPKS SNSKEKPVDHKRK+PEKT ERKGKRPAVANRKA A+ASKNG  AAPKQ G KRKS S
Subjt:  DSSKRVTPKAPVASYQGLRASKTGS-QKKIHAKGNSTKWPKSYSNSKEKPVDHKRKRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDS

Query:  RTPESSHRNKRIKRLR
        RTPESSHRNKR+KR R
Subjt:  RTPESSHRNKRIKRLR

A0A6J1E378 nucleolar protein 121.3e-25782.51Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGKKKTN P +  +E+  GSAPSSVF+TLFGNVG E+ GVSIFSTDNPFRRKD++  P AA AEKS +GG+DDH IDDVKSRKK KEKRVG DLDSAE G
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE
        VKSSSE+KKSK+KEKSRDREL+N  +EKDD E GF+SR V+KKNK+ G        E+ SEFSG+LKIDEQRSD+KLG +VKL KE+KKRKRDELEREYE
Subjt:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE

Query:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
        AKKYGASEVA+D  EGS G VVGKKRKA+DD SEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK LGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
Subjt:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK

Query:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
        KVNEDADSSHAYVVFKT ESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGS+PIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
Subjt:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP

Query:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG
        NVNIGKGFAYVFFKT+EAANSV+N QKL+LRDR LRLFHAKPNL STPLKKRN PSTE D S AKKRAVDSGLRTPDSSKRVTPKA VASYQGLRASK G
Subjt:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG

Query:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK
        SQKK  AKG+ TKW KS+SNSKEK V HKR               KRPEK SERKGKRPAVANRKAVA AS+NGV AAPKQ GVKRKSDSRTPESSHRNK
Subjt:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK

Query:  RIKRLR
        R+KRLR
Subjt:  RIKRLR

A0A6J1KUP0 nucleolar protein 121.8e-25481.52Show/hide
Query:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG
        MGKKKTN+P +  +E+  GSAPSSVF+TLFGNVG E+ GVSIFST+NPFRRKD++S P AA AEKS +GG+DDH IDDVKSRKK KEKRVG DLDSAE G
Subjt:  MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEG

Query:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE
        VKSSS++KKSK+K+KSRDREL+N  +EKDD E GF+SR V+KKNK+ G        E+ SEFSG+LKIDEQRSD+KLG +VKL KE+KKRKRDELEREYE
Subjt:  VKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYE

Query:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
        AKKYGASEVA+D  EGS G VVGKKRKA+DD SEMLVTKEGFDDESKLLRTVFVGNLPLKVKKK LGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK
Subjt:  AKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSK

Query:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
        KVNEDADSSHAYVVFKT ESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGS+PIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP
Subjt:  KVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDP

Query:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG
        NVNIGKGFAYVFFKT+EAANSV+N QKL+LRDR LRLFHAKPNL STPLKKRN PSTE D S AKKRAVDSGLRTPDSSKR   KA VASYQGLRASK  
Subjt:  NVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTG

Query:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK
        SQKK  AKG+ TKW KS+SNSKEK VDHKR               KRPEK SERKGKRPAVANRKAVA AS+NGV AAPKQ GVKRKSDSRTPESSHRNK
Subjt:  SQKKIHAKGNSTKWPKSYSNSKEKPVDHKR---------------KRPEKTSERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNK

Query:  RIKRLR
        R+KR R
Subjt:  RIKRLR

SwissProt top hitse value%identityAlignment
O13741 Nucleolar protein 123.9e-2831.55Show/hide
Query:  ESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRK----RDELEREY----------EAKKYGASEVADDGEEGSWGIV
        +S+ + K   K+  V   + +E+      +   +   SD K  +N+K   ++KK+K     D++E +Y          E  K  A  + D+ ++      
Subjt:  ESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRK----RDELEREY----------EAKKYGASEVADDGEEGSWGIV

Query:  VGKKRKALDD-PSEMLVTKEGFDDESKLLRTVFVGNLPLKV-----KKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVF
        V ++R + +D  SE  V ++  ++  K  +TVFV NLP +V       K L K F QFG ++S+R RS+  + +  PRK A   KK + + D+ +AY+VF
Subjt:  VGKKRKALDD-PSEMLVTKEGFDDESKLLRTVFVGNLPLKV-----KKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVF

Query:  KTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLF--CGIENVGSSVEAVRVIRDPNVNIGKGFAYVFF
        +   SA+++LS N  +F   H+RVD    P  +         D KR VFVGNL F+ ++E L++ F  CG      S++ VR++RDP  N+GKGFAY+ F
Subjt:  KTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLF--CGIENVGSSVEAVRVIRDPNVNIGKGFAYVFF

Query:  K-TREAANSVVNKQKLELRDRTLRLFHAKPNL-ASTPLKKRNKPSTEADHSPAKK
        K T     +++  +K     RTLR+  AK     S    KR    T      A+K
Subjt:  K-TREAANSVVNKQKLELRDRTLRLFHAKPNL-ASTPLKKRNKPSTEADHSPAKK

P42696 RNA-binding protein 347.6e-2431.64Show/hide
Query:  RSVLKKNKKKGDVSGSETLE--MSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEM
        +  +KK K+  +   +  +E  +S E + K+K  ++ ++       KL          +LE E   K+    + +  G       V    RK LDD  + 
Subjt:  RSVLKKNKKKGDVSGSETLE--MSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEM

Query:  LVT---KEGFDDESKLL---RTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNM
        +V+   K   + E + L   RTVFVGNLP+   KK L   F ++G+IESVR RS+  A     +K A + +K++ D  + +AYVVFK   +A  +L  N 
Subjt:  LVT---KEGFDDESKLL---RTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNM

Query:  AVFA-GNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDE--ELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQ
        A  A G  IRVD A     +           KR+VFVGNLP+ V++   E + L CG      S+ AVR++RD    IGKGF YV F+  ++ +  +   
Subjt:  AVFA-GNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDE--ELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQ

Query:  KLELRDRTLRLFHA------KPNLASTPLKKRNKP
          EL  R LR+  +      K   ++  LK  +KP
Subjt:  KLELRDRTLRLFHA------KPNLASTPLKKRNKP

Q5AHI7 Nucleolar protein 129.6e-1928.4Show/hide
Query:  DHNIDDV-KSRKKGKEKRVGTDLDSAEEGVKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQ
        D NI+ + K+ + G   +   +L   +  V    ++   K++    +  LN    E D+ E  +   S      +  D   +E  E +S      K D++
Subjt:  DHNIDDV-KSRKKGKEKRVGTDLDSAEEGVKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQ

Query:  RSDNKLGENVKLMKERKKRKRDELEREYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPL-----KVKKKTL
          + +     KL+ E+ + ++DE +   EAK   +S+VA++  +      V  K K L+                K  RTVFVGN+P      K+  K  
Subjt:  RSDNKLGENVKLMKERKKRKRDELEREYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPL-----KVKKKTL

Query:  GKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVG
           F  +G+I+S+R RS+   +   PRK A   K +++  DS +AY+V+K   ++ A+   N  VF  +H+RVD    P  K         D KRT+FVG
Subjt:  GKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVG

Query:  NLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANS--VVNKQKLEL----RDRTLRLFHAKPNLASTPLKKRNKPSTEAD
        NL F+ K+E L++ F     +   VE+VR+IRD   N+GKGFA V FK   + N   ++N + LE     + R LR+  AK N A   L   N      D
Subjt:  NLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANS--VVNKQKLEL----RDRTLRLFHAKPNLASTPLKKRNKPSTEAD

Query:  HSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTGSQKKI------HAKGNSTKWPKSYSNSKEKPVDHKRKRPEKTSERK
        +   K  A  S  +  D+ K    +A     +  R S  G  K+I        KG + K  K     K+      R+R  K  E +
Subjt:  HSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTGSQKKI------HAKGNSTKWPKSYSNSKEKPVDHKRKRPEKTSERK

Q5M9F1 RNA-binding protein 343.6e-2635.19Show/hide
Query:  DDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFA-GNHIRVDR
        ++  K  RTVFVGNLP+   KK L   F ++G++ESVR RSV  A     +K A + +K + D  S +AYVVFK   +A  +L  N A  A G  IRVD 
Subjt:  DDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFA-GNHIRVDR

Query:  ACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLF--CGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFH
        A     +           KR+VFVGNLP+ V +  L + F  CG      S+ AVR++R+P   +G+GF YV F+  +A +  +     EL  R LR+  
Subjt:  ACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLF--CGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFH

Query:  AKPNLASTPLKKRN-KPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTGSQKKIHAK
           ++    LK++N  PS + D S +K+R       T    K  +  A +     L   K G +KK   K
Subjt:  AKPNLASTPLKKRN-KPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTGSQKKIHAK

Q8C5L7 RNA-binding protein 342.8e-2637.12Show/hide
Query:  DDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFA-GNHIRVDR
        ++  K  RTVFVGNLP+   KK L   F ++G++ESVR RSV  A     +K A + +K + D  S +AYVVFK   +A  +L  N A  A G  IRVD 
Subjt:  DDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFA-GNHIRVDR

Query:  ACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLF--CGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFH
        A     +           KR+VFVGNLP+ ++D  L + F  CG      S+ AVR++R+P   +G+GF YV F+  +A +  +     EL  R LR+  
Subjt:  ACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLF--CGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFH

Query:  AKPNLASTPLKKRN-KPSTEADHSPAKKR
           ++    LK++N  PS + D    K+R
Subjt:  AKPNLASTPLKKRN-KPSTEADHSPAKKR

Arabidopsis top hitse value%identityAlignment
AT2G18510.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.7e-0821.91Show/hide
Query:  TVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLS-HNMAVFAGNHIRVDRACPPRKKL
        TV+VG L  ++ ++ L + F Q G + +V +    + N                    ++ ++ +++ E A  ++   NM    G  IRV++A   +K L
Subjt:  TVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLS-HNMAVFAGNHIRVDRACPPRKKL

Query:  KVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNK---QKLELRDRTLRLFHAKPN---
         VG+         +F+GNL  DV ++ LY  F     + S+    +++RDP+    +GF ++ + + EA+++ +     Q L  R  T+   + K     
Subjt:  KVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNK---QKLELRDRTLRLFHAKPN---

Query:  ----------LASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTP
                   A+ P  ++++P T     P       +GL  P ++  + P
Subjt:  ----------LASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTP

AT3G13224.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.4e-0428.32Show/hide
Query:  VFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNK-QKLE
        V  G  + + R  P   K   G+       + +FVG +P  V ++EL   F    NV   VE  +VIRD   N  +GF +V F + E  + +++K   ++
Subjt:  VFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNK-QKLE

Query:  LRDRTLRLFHAKP
        + D  + +  A+P
Subjt:  LRDRTLRLFHAKP

AT4G24770.1 31-kDa RNA binding protein3.5e-0825.49Show/hide
Query:  DELEREYEAKKYGASEVAD-DGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKK
        +E E   E++     + ++ D  EG     V +  ++  D SE  V++     E      +FVGNL   V  + L   F Q G +E   +          
Subjt:  DELEREYEAKKYGASEVAD-DGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKK

Query:  PRKGAILSKKVNEDADSSHAYVVFKTVESAQASL-SHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSS
             I +++   D      +V   +V+ A+ ++   N     G  + V++A  PR      +  +Y+P   V+VGNLP+DV +  L QLF      G  
Subjt:  PRKGAILSKKVNEDADSSHAYVVFKTVESAQASL-SHNMAVFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSS

Query:  VEAVRVIRDPNVNIGKGFAYVFFKTREAAN---SVVNKQKLELRDRTLRLFHAKP
        VEA RV+ D      +GF +V     +  N   S ++ Q LE R   + +   +P
Subjt:  VEAVRVIRDPNVNIGKGFAYVFFKTREAAN---SVVNKQKLELRDRTLRLFHAKP

AT5G46840.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.8e-10646.08Show/hide
Query:  MGKKKT--NNPINPPDEIETGSAPSSVFDTLFGNVGVENSG-VSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSA
        MGKKK+    P +  DE +T      +F TLF    VE+SG  S+FS DNPFRRK    I  ++                 +   KKG ++    + +  
Subjt:  MGKKKT--NNPINPPDEIETGSAPSSVFDTLFGNVGVENSG-VSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSA

Query:  EEGVKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELER
        EE      + KKSKK++K  D              SG E                 ET+  + E SG                  L+ +RKKRKRDE+E 
Subjt:  EEGVKSSSEIKKSKKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELER

Query:  EYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAI
        EYE KKYG+ E+ +          VG+KRK  D+ ++ +V+KEGFDDESKLLRTVFVGNLPLKVKKK + KEFS+FGE+ESVRIRSVP+ +SK+ RKGAI
Subjt:  EYEAKKYGASEVADDGEEGSWGIVVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAI

Query:  LSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLK-VGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRV
        + K++NE A S HAYVVF+T +SA ASL+HNM++  GNH+RVDRACPPRKK K      +YDPKRTVF+GNLPFDVKDEE+YQLF G  N+ +S+EAVRV
Subjt:  LSKKVNEDADSSHAYVVFKTVESAQASLSHNMAVFAGNHIRVDRACPPRKKLK-VGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRV

Query:  IRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRA
        IRDP++NIGKG AYV FKTREAAN V+ K  L+LR+R LR+   KP+   TP K+++ PS EA +SPA+KR     + TP  + +        SYQG+RA
Subjt:  IRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRA

Query:  SKTGSQKKIHAKGNSTKWPKSYSNSKEKP-----VDHKRKRPEKTS-ERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKR
        SK+G  KK       T + KS + +K +P      D+K+      S ER  KRPAVA RKA A A  +  +   +  G KRK ++RTPES  + K+ KR
Subjt:  SKTGSQKKIHAKGNSTKWPKSYSNSKEKP-----VDHKRKRPEKTS-ERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKR

AT5G46840.2 RNA-binding (RRM/RBD/RNP motifs) family protein2.6e-9656Show/hide
Query:  VGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESA
        VG+KRK  D+ ++ +V+KEGFDDESKLLRTVFVGNLPLKVKKK + KEFS+FGE+ESVRIRSVP+ +SK+ RKGAI+ K++NE A S HAYVVF+T +SA
Subjt:  VGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESA

Query:  QASLSHNMAVFAGNHIRVDRACPPRKKLK-VGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAAN
         ASL+HNM++  GNH+RVDRACPPRKK K      +YDPKRTVF+GNLPFDVKDEE+YQLF G  N+ +S+EAVRVIRDP++NIGKG AYV FKTREAAN
Subjt:  QASLSHNMAVFAGNHIRVDRACPPRKKLK-VGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAAN

Query:  SVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTGSQKKIHAKGNSTKWPKSYSN
         V+ K  L+LR+R LR+   KP+   TP K+++ PS EA +SPA+KR     + TP  + +        SYQG+RASK+G  KK       T + KS + 
Subjt:  SVVNKQKLELRDRTLRLFHAKPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTGSQKKIHAKGNSTKWPKSYSN

Query:  SKEKP-----VDHKRKRPEKTS-ERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKR
        +K +P      D+K+      S ER  KRPAVA RKA A A  +  +   +  G KRK ++RTPES  + K+ KR
Subjt:  SKEKP-----VDHKRKRPEKTS-ERKGKRPAVANRKAVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAAGAAAACTAACAACCCCATAAATCCCCCCGACGAAATCGAAACTGGTTCTGCTCCGTCCAGCGTTTTCGATACTCTGTTCGGCAACGTCGGAGTAGAAAA
CTCCGGCGTTTCTATCTTCTCCACGGACAACCCCTTTCGTAGGAAAGATTCAAATTCCATTCCTGCGGCTGCAGCTGCTGAAAAATCTTCCAATGGTGGCTCTGATGATC
ATAATATCGATGATGTAAAGAGTAGGAAGAAGGGCAAGGAGAAAAGGGTTGGAACTGATTTGGATTCTGCTGAGGAAGGTGTCAAAAGTTCGTCGGAGATCAAGAAATCG
AAGAAGAAGGAGAAGTCTCGGGACCGGGAATTGAATAATGGTCCTGTCGAGAAGGATGATGGAGAAAGCGGTTTCGAATCGCGAAGCGTATTGAAGAAGAACAAAAAGAA
AGGTGATGTTTCGGGCTCTGAAACTTTAGAAATGAGTTCTGAATTTAGCGGAAAATTGAAGATAGACGAGCAAAGGAGTGACAATAAACTCGGTGAGAATGTGAAATTGA
TGAAAGAGAGGAAGAAGAGGAAGAGAGATGAACTTGAGAGAGAGTACGAAGCGAAAAAGTATGGTGCATCAGAGGTAGCTGATGATGGAGAAGAAGGTTCATGGGGCATT
GTTGTTGGGAAGAAGAGGAAAGCATTGGACGATCCTTCAGAGATGTTGGTAACGAAGGAAGGATTTGATGATGAAAGCAAGCTTCTGAGAACTGTGTTTGTTGGGAACTT
GCCATTGAAAGTGAAGAAAAAAACTTTAGGCAAGGAGTTTAGCCAATTTGGAGAGATAGAATCTGTTAGGATCCGGTCTGTGCCACTTGCAAATAGCAAAAAACCGAGGA
AAGGGGCAATTCTTTCAAAGAAAGTCAATGAAGATGCGGACAGTTCTCATGCATATGTTGTTTTCAAAACGGTGGAATCAGCACAGGCTTCTCTGTCCCATAACATGGCT
GTGTTTGCAGGAAATCATATACGTGTTGATAGGGCATGCCCTCCGCGTAAAAAGTTGAAAGTAGGAAGTGCTCCAATCTATGATCCCAAGAGAACTGTTTTTGTGGGTAA
CCTTCCATTTGATGTAAAGGATGAAGAATTGTATCAATTATTTTGCGGAATCGAGAACGTGGGATCCAGTGTTGAAGCTGTTCGGGTCATCAGAGACCCTAATGTGAACA
TAGGGAAGGGCTTTGCATATGTCTTCTTTAAAACAAGGGAAGCAGCAAATTCCGTAGTTAACAAACAAAAACTAGAGTTGCGTGATAGAACGCTGAGGCTCTTTCATGCC
AAACCAAATCTGGCATCGACCCCACTCAAGAAACGGAATAAACCATCTACCGAAGCCGATCACTCCCCAGCAAAGAAAAGGGCAGTGGATTCAGGTTTGAGGACACCAGA
CAGCAGCAAGAGGGTAACGCCGAAAGCTCCTGTTGCATCCTATCAGGGCCTGCGTGCGAGCAAAACTGGCTCCCAGAAGAAGATCCATGCTAAAGGCAATAGCACGAAAT
GGCCGAAATCATATTCAAACAGTAAGGAGAAGCCAGTAGATCACAAGAGGAAAAGACCAGAGAAGACAAGTGAAAGGAAGGGGAAAAGACCAGCAGTTGCCAACAGAAAG
GCTGTGGCAAAAGCGTCGAAAAATGGCGTTGCAGCAGCGCCAAAACAAGTCGGGGTGAAGCGCAAGTCCGATAGCCGAACTCCAGAGAGCTCTCACAGAAACAAGAGAAT
CAAAAGGTTAAGATAG
mRNA sequenceShow/hide mRNA sequence
CGTTCCCAATCCAACACGGTGTAAGCCCTAGGGCTCAAAATTTCATCCTCTCCAATCTCCATCGTTCGCCTCAATTACTTTCAACCTGTACATTCGCAGAATCCGCAGAA
AACCCAGGAGAATAGTAATGGGGAAGAAGAAAACTAACAACCCCATAAATCCCCCCGACGAAATCGAAACTGGTTCTGCTCCGTCCAGCGTTTTCGATACTCTGTTCGGC
AACGTCGGAGTAGAAAACTCCGGCGTTTCTATCTTCTCCACGGACAACCCCTTTCGTAGGAAAGATTCAAATTCCATTCCTGCGGCTGCAGCTGCTGAAAAATCTTCCAA
TGGTGGCTCTGATGATCATAATATCGATGATGTAAAGAGTAGGAAGAAGGGCAAGGAGAAAAGGGTTGGAACTGATTTGGATTCTGCTGAGGAAGGTGTCAAAAGTTCGT
CGGAGATCAAGAAATCGAAGAAGAAGGAGAAGTCTCGGGACCGGGAATTGAATAATGGTCCTGTCGAGAAGGATGATGGAGAAAGCGGTTTCGAATCGCGAAGCGTATTG
AAGAAGAACAAAAAGAAAGGTGATGTTTCGGGCTCTGAAACTTTAGAAATGAGTTCTGAATTTAGCGGAAAATTGAAGATAGACGAGCAAAGGAGTGACAATAAACTCGG
TGAGAATGTGAAATTGATGAAAGAGAGGAAGAAGAGGAAGAGAGATGAACTTGAGAGAGAGTACGAAGCGAAAAAGTATGGTGCATCAGAGGTAGCTGATGATGGAGAAG
AAGGTTCATGGGGCATTGTTGTTGGGAAGAAGAGGAAAGCATTGGACGATCCTTCAGAGATGTTGGTAACGAAGGAAGGATTTGATGATGAAAGCAAGCTTCTGAGAACT
GTGTTTGTTGGGAACTTGCCATTGAAAGTGAAGAAAAAAACTTTAGGCAAGGAGTTTAGCCAATTTGGAGAGATAGAATCTGTTAGGATCCGGTCTGTGCCACTTGCAAA
TAGCAAAAAACCGAGGAAAGGGGCAATTCTTTCAAAGAAAGTCAATGAAGATGCGGACAGTTCTCATGCATATGTTGTTTTCAAAACGGTGGAATCAGCACAGGCTTCTC
TGTCCCATAACATGGCTGTGTTTGCAGGAAATCATATACGTGTTGATAGGGCATGCCCTCCGCGTAAAAAGTTGAAAGTAGGAAGTGCTCCAATCTATGATCCCAAGAGA
ACTGTTTTTGTGGGTAACCTTCCATTTGATGTAAAGGATGAAGAATTGTATCAATTATTTTGCGGAATCGAGAACGTGGGATCCAGTGTTGAAGCTGTTCGGGTCATCAG
AGACCCTAATGTGAACATAGGGAAGGGCTTTGCATATGTCTTCTTTAAAACAAGGGAAGCAGCAAATTCCGTAGTTAACAAACAAAAACTAGAGTTGCGTGATAGAACGC
TGAGGCTCTTTCATGCCAAACCAAATCTGGCATCGACCCCACTCAAGAAACGGAATAAACCATCTACCGAAGCCGATCACTCCCCAGCAAAGAAAAGGGCAGTGGATTCA
GGTTTGAGGACACCAGACAGCAGCAAGAGGGTAACGCCGAAAGCTCCTGTTGCATCCTATCAGGGCCTGCGTGCGAGCAAAACTGGCTCCCAGAAGAAGATCCATGCTAA
AGGCAATAGCACGAAATGGCCGAAATCATATTCAAACAGTAAGGAGAAGCCAGTAGATCACAAGAGGAAAAGACCAGAGAAGACAAGTGAAAGGAAGGGGAAAAGACCAG
CAGTTGCCAACAGAAAGGCTGTGGCAAAAGCGTCGAAAAATGGCGTTGCAGCAGCGCCAAAACAAGTCGGGGTGAAGCGCAAGTCCGATAGCCGAACTCCAGAGAGCTCT
CACAGAAACAAGAGAATCAAAAGGTTAAGATAGGAAACACGGGAAGAAAGAAGTTGATGTTGTGGACTGAACTGAATTCTTTGAACTTTCTTTGTATTCTTTTATGTGGC
TTATCAACTCCCTTCTGGATTAGGGGAAAGTTCATTTAATCAACTTTTCTATGTAGTGCTTTTGCCTCTTATTTTCTATGTAGTGCTTTTGCCTCTTAT
Protein sequenceShow/hide protein sequence
MGKKKTNNPINPPDEIETGSAPSSVFDTLFGNVGVENSGVSIFSTDNPFRRKDSNSIPAAAAAEKSSNGGSDDHNIDDVKSRKKGKEKRVGTDLDSAEEGVKSSSEIKKS
KKKEKSRDRELNNGPVEKDDGESGFESRSVLKKNKKKGDVSGSETLEMSSEFSGKLKIDEQRSDNKLGENVKLMKERKKRKRDELEREYEAKKYGASEVADDGEEGSWGI
VVGKKRKALDDPSEMLVTKEGFDDESKLLRTVFVGNLPLKVKKKTLGKEFSQFGEIESVRIRSVPLANSKKPRKGAILSKKVNEDADSSHAYVVFKTVESAQASLSHNMA
VFAGNHIRVDRACPPRKKLKVGSAPIYDPKRTVFVGNLPFDVKDEELYQLFCGIENVGSSVEAVRVIRDPNVNIGKGFAYVFFKTREAANSVVNKQKLELRDRTLRLFHA
KPNLASTPLKKRNKPSTEADHSPAKKRAVDSGLRTPDSSKRVTPKAPVASYQGLRASKTGSQKKIHAKGNSTKWPKSYSNSKEKPVDHKRKRPEKTSERKGKRPAVANRK
AVAKASKNGVAAAPKQVGVKRKSDSRTPESSHRNKRIKRLR