; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017388 (gene) of Snake gourd v1 genome

Gene IDTan0017388
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionnucleolin 1-like isoform X1
Genome locationLG04:83128395..83138379
RNA-Seq ExpressionTan0017388
SyntenyTan0017388
Gene Ontology termsGO:0043488 - regulation of mRNA stability (biological process)
GO:1900364 - negative regulation of mRNA polyadenylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008143 - poly(A) binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR002483 - PWI domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily
IPR040366 - Nuclear polyadenylated RNA-binding protein Nab2/ZC3H14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606874.1 Polyadenylate-binding protein 2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.99Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSM+LYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        +EVPR KSP AEPD   GSHNLESDLERGKSEK+S RRRNREWKGIANDETRVTPRSEVSRVKHSSPEQ PSHRKR+RAD+HQG EREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVAT +PS+ AKEPLSKRLRSVVSTSNSDTT+ PRRLQSVAKVPNPMATVIKAVSEAAED IRVKSSSVFDRLG QSRD+D  E+SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVTAVEDHKYGDMNHTQDRPYSATYL  +NYSGKY P EA+F+AETGLASDSTSESEDVTI GH+VFDDSWTAESGVRKGGNLRTAPFRVVEN DDERMT
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT
        +Y KQK QPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE GG+KFLQESELRGTRSAV VTENGEPVTIVNQQKKPAA+LQKEFQKP  SANGLAAT
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT

Query:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA
        RPLEDADARTIFVSNVHFAATK SLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRKEAAE+ALSLDGTSF+SRILKV RKNASQ EG SIV WPRA
Subjt:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA

Query:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        VRGSPYPTPRF+RVPFPRG+PGGFRPRPPIKLGARSMQWKRDSQTT  DNGAS+ G SVPS+GARSLTYVRT+ KPADK
Subjt:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

KAG7036580.1 Polyadenylate-binding protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.28Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSM+LYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        +EVPR KSP AEPD   GSHNLESDLERGKSEK+S RRRNREWKGIANDETRVTPRSEVSRVKHSSPEQ P HRKRSRAD+HQG EREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVAT +PS+ AKEPLSKRLRSVVSTSNSDTT+ PRRLQSVAKVPNPMATVIKAVSEAAED IRVKSSSVFDRLGRQSRD+D  E+SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVTAVEDHKYGDMNHTQDRPYSATYL  +NYSGKY P EA+F+AETGLASDSTSESEDVTI GH+VFDDSWTAESGVRKGGNLRTAPFRVVEN DDERMT
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT
        +Y KQK QPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE GG+KFLQESELRGTRSAV VTENGEPVTIVNQQKKPAA+LQKEFQKP  SANGLAAT
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT

Query:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA
        RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRKEAAE+ALSLDGTSF+SRILKV RKNASQ EG SIV WPRA
Subjt:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA

Query:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        VRGSPYPTPRF+RVPFPRG+PGGFRPRPPIKLGARSMQWKRDSQTT  DNGAS+ G SVPS+GARSLTYVRT+ KPADK
Subjt:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

XP_022949230.1 nucleolin 1-like isoform X1 [Cucurbita moschata]0.0e+0090.43Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSM+LYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        +EVP  KSP AEPD   GSHNLESDLERGKSEK+S RRRNREWKGIANDETRVTPRSEVSRVKHSSPEQ PSHRKRSRAD+HQG EREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVAT +PS+ AKEPLSKRLRSVVSTS SDTT+ PRRLQSVAKVPNPMATVIKAVSEAAED IRVKSSSVFDRLGRQSRD+D  E+SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVTAVEDHKYGDMNHTQDRPYSATYL  +NYSGKY P EA+F+AETGLASDSTSESEDVTI GH+VFDDSWTAESGVRKGGNLRTAPFRVVEN DDERMT
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT
        +Y KQK QPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE GG+KFLQESELRGTRSAV VTENGEPVTIVNQQKKPAA+LQKEFQKP  SANGLAAT
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT

Query:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA
        RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRKEAAE+ALSLDGTSF+SRILKV RKNASQ EG SIV WPRA
Subjt:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA

Query:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        VRGSPYPTPRF+RVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTT  DNGAS+ G SVPS+GARSLTYVRTE KPADK
Subjt:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

XP_022998409.1 uncharacterized protein LOC111493050 isoform X1 [Cucurbita maxima]0.0e+0089.99Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSM+LYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        +EVPR KSP AEPD   GSHNLESDLERGKSEK+S RRRNREWKGIAN+ETRVTPRSEVSRVKHSSPEQ PSHRKRSRAD+HQG EREA FQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVAT +PS+ AKEPLSKRLRSVVSTSNSDTT+ PRRLQSVAKVPNPMATVIKAVSEAAED IRVKSSSVFDRLGRQSRD DL E+SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVT VEDHKYGDMNHTQDRPYSATYL  +NYSGKY P EA+F+AETGLASDSTSESEDVTI GH+VFDDSWTAESGVRKGGNLRTAPFRVVEN DDERM 
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT
        +Y KQK QPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE GG+KFLQ SELRGTRSAV VTENGEPVTIVNQQKKP A+LQKEFQKP  SANGLAAT
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT

Query:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA
        RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRKEAAE+ALSLDGTSF+SRILKV RKNASQ EG SIV WPRA
Subjt:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA

Query:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        VRGSPYPTPRF+RVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTT  DNGAS+ G SVPS+GARSLTYVRTE KPADK
Subjt:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

XP_023525351.1 nucleolin 1-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0090.29Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSM+LYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        +EVPR KSP AEPD   GSHNLESDLERGKSEK+S RRRNREWKGIANDETRVTPRSEVSRVKHSSPEQ PSHRKR RAD+HQG EREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVAT +PS+ AKEPLSKRLRSVVSTSNSDT + PRRLQS+AKVPNPMATVIKAVSEAAED IRVKSSSVFDRLGRQSRD+DL E+SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVTAVEDHKYGDMNHTQDRPYSATYL  +NYSGKY P EA+F+AETGLASDSTSESEDVTI GH+VFDDSWTAESGVRKGGNLRTAPFRVVEN DDERMT
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT
        +Y KQK QPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE GG+KFLQESELRGTRSAV VTENGEPVTIVNQQKKPAA+LQKEFQKP  SANGLAAT
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT

Query:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA
        RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRKEAAE+ALSLDGTSF+SRILKV RKNASQ EG SIV WPRA
Subjt:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA

Query:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQ-TTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        VRGSPYPTPRF+RVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQ TT  DNGAS+ G SVPS+GARSLTYVRTE KPADK
Subjt:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQ-TTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

TrEMBL top hitse value%identityAlignment
A0A1S3CFE4 nucleolin 1 isoform X10.0e+0085.74Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDRVDDRTFKVDF+GEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRR+EEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        NEVPRPKSP AEPD  N SHNLESD ERGK+EK+S RRRNREW+GIAN+ETRV P+SEVSRVKHSSPEQ P+HRKRSR D+ QGTEREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVATTRPS+AAKEP SKRLRSVVS SNSDTTNRPRRLQSVAKVPNPMATVIKAV+EA+ED +RVKSSSVFDRLGRQSRDMDLTE+SG+  EY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYL-RNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVT+VE+ +YGDMNH++DRPYSATYL R+NY GKY+ +E +FE ETGLASDS SE+EDV I+GHRVFDDSWTAESGVRKG NLRT  FR V+N+DDER+ 
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYL-RNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANG-LAA
        QY  QKDQPSL ANSSRDIVNISVNVNTWKPPHYQD GQI EL G+KFLQESEL+GTRSAV VTENGEPVT+VNQ+K PA+NLQKEFQKP LSANG  A+
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANG-LAA

Query:  TRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPR
        TRPLE+ADARTIFVSNVHFAATKDSLSRHFNKFGE++KVIIVTDATTGQPKGSAYVEFMRKE+AE+ALSLDGTSFMSRILKVVRKNASQ EG S V WPR
Subjt:  TRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPR

Query:  AVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        AVRGSP+PT RFTRVPF RGVPGGFR RPP+KLGARSMQWKRD+QT  ADNGAS+SGNS+PS GARSLTYVRTE KPADK
Subjt:  AVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

A0A5A7U3W4 Nucleolin 1 isoform X10.0e+0085.74Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDRVDDRTFKVDF+GEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRR+EEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        NEVPRPKSP AEPD  N SHNLESD ERGK+EK+S RRRNREW+GIAN+ETRV P+SEVSRVKHSSPEQ P+HRKRSR D+ QGTEREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVATTRPS+AAKEP SKRLRSVVS SNSDTTNRPRRLQSVAKVPNPMATVIKAV+EA+ED +RVKSSSVFDRLGRQSRDMDLTE+SG+  EY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYL-RNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVT+VE+ +YGDMNH++DRPYSATYL R+NY GKY+ +E +FE ETGLASDS SE+EDV I+GHRVFDDSWTAESGVRKG NLRT  FR V+N+DDER+ 
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYL-RNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANG-LAA
        QY  QKDQPSL ANSSRDIVNISVNVNTWKPPHYQD GQI EL G+KFLQESEL+GTRSAV VTENGEPVT+VNQ+K PA+NLQKEFQKP LSANG  A+
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANG-LAA

Query:  TRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPR
        TRPLE+ADARTIFVSNVHFAATKDSLSRHFNKFGE++KVIIVTDATTGQPKGSAYVEFMRKE+AE+ALSLDGTSFMSRILKVVRKNASQ EG S V WPR
Subjt:  TRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPR

Query:  AVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        AVRGSP+PT RFTRVPF RGVPGGFR RPP+KLGARSMQWKRD+QT  ADNGAS+SGNS+PS GARSLTYVRTE KPADK
Subjt:  AVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

A0A6J1DFE3 protein gar2 isoform X10.0e+0087.22Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGS DRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGR K+EA+NELNVFLADDSHSFVSWLWDHLASSMDLYVE PTK S 
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        NEVP P SP AEPDR N SH+LE D ERGKSEK+S RRRNREWKGIANDETRV PRSEVSRVKHSSPEQ PSHRKRSRAD+HQGTEREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVATTRPS+ AKEP SKRLRSVVSTSN+DTTNRPRRLQSVAKVPNPMATVIKAV+EAAEDAIRVKSSSVFDRLGRQS DMDLTE SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYL-RNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADD-ERM
         VTAVED KYGD+ HTQD+PYS TYL R+NYSGKY  NE +FE +TGLASDSTSE++DV +QGHR FDDSWTAESGVRK GNLR+ PFRVVENAD+  R+
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYL-RNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADD-ERM

Query:  TQYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANG-LA
        TQY KQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE G +KFLQESEL+G+RSAV VTENG+ VTIVNQQK PAAN QKEFQKP  SANG  A
Subjt:  TQYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANG-LA

Query:  ATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWP
        ATRPLEDADARTIFVSNVHF ATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRK+AAE+ALSLDGTSFMSRILKV+RKNASQ EG SIV WP
Subjt:  ATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWP

Query:  RAVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        RAVRGSPYP+PRF+R PF RG+PGGFRPRPPIKLGARSMQWKRDSQ T+ADNGAS+SGNS+ SSGARSLTYVRTE KPADK
Subjt:  RAVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

A0A6J1GC70 nucleolin 1-like isoform X10.0e+0090.43Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSM+LYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        +EVP  KSP AEPD   GSHNLESDLERGKSEK+S RRRNREWKGIANDETRVTPRSEVSRVKHSSPEQ PSHRKRSRAD+HQG EREAAFQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVAT +PS+ AKEPLSKRLRSVVSTS SDTT+ PRRLQSVAKVPNPMATVIKAVSEAAED IRVKSSSVFDRLGRQSRD+D  E+SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVTAVEDHKYGDMNHTQDRPYSATYL  +NYSGKY P EA+F+AETGLASDSTSESEDVTI GH+VFDDSWTAESGVRKGGNLRTAPFRVVEN DDERMT
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT
        +Y KQK QPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE GG+KFLQESELRGTRSAV VTENGEPVTIVNQQKKPAA+LQKEFQKP  SANGLAAT
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT

Query:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA
        RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRKEAAE+ALSLDGTSF+SRILKV RKNASQ EG SIV WPRA
Subjt:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA

Query:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        VRGSPYPTPRF+RVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTT  DNGAS+ G SVPS+GARSLTYVRTE KPADK
Subjt:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

A0A6J1KE94 uncharacterized protein LOC111493050 isoform X10.0e+0089.99Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGSEDR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSM+LYVEPP K SA
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
        +EVPR KSP AEPD   GSHNLESDLERGKSEK+S RRRNREWKGIAN+ETRVTPRSEVSRVKHSSPEQ PSHRKRSRAD+HQG EREA FQVSIAAPRR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY
        LLQFAMRDAVAT +PS+ AKEPLSKRLRSVVSTSNSDTT+ PRRLQSVAKVPNPMATVIKAVSEAAED IRVKSSSVFDRLGRQSRD DL E+SGQ AEY
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEY

Query:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT
        GVT VEDHKYGDMNHTQDRPYSATYL  +NYSGKY P EA+F+AETGLASDSTSESEDVTI GH+VFDDSWTAESGVRKGGNLRTAPFRVVEN DDERM 
Subjt:  GVTAVEDHKYGDMNHTQDRPYSATYLR-NNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMT

Query:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT
        +Y KQK QPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAE GG+KFLQ SELRGTRSAV VTENGEPVTIVNQQKKP A+LQKEFQKP  SANGLAAT
Subjt:  QYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAAT

Query:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA
        RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRKEAAE+ALSLDGTSF+SRILKV RKNASQ EG SIV WPRA
Subjt:  RPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRA

Query:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK
        VRGSPYPTPRF+RVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTT  DNGAS+ G SVPS+GARSLTYVRTE KPADK
Subjt:  VRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESKPADK

SwissProt top hitse value%identityAlignment
Q6NVP7 Polyadenylate-binding protein 26.2e-1432.47Show/hide
Query:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG   EL G + +++ EL   ++ V  + E  E +  +  + +   N+      P  +   + +     +ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRA---VRGSPYPT-PRFTR--VPFPRG
         + +V I+ D  TG PKG AY+EF  KE+  ++L+LD + F  R +KVV K  ++  G+S     +PRA    R S Y +  RF     P PRG
Subjt:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRA---VRGSPYPT-PRFTR--VPFPRG

Q7ZXB8 Polyadenylate-binding protein 2-B4.8e-1432.47Show/hide
Query:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG   EL G + +++ EL   ++ V  + E  E +  +  + +   N+      P  +   + +     +ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRA---VRGSPYPT-PRFTR--VPFPRG
         + +V I+ D  TG PKG AY+EF  KE+  ++L+LD + F  R +KVV K  ++  G+S     +PRA    R S Y +  RF     P PRG
Subjt:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRA---VRGSPYPT-PRFTR--VPFPRG

Q86U42 Polyadenylate-binding protein 26.9e-1328.71Show/hide
Query:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG +    G   +++ EL   ++ V  + E  E +  +  + +   N+      P  +   + +     +ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRAVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIK
         + +V I+ D  +G PKG AY+EF  KE+  ++L+LD + F  R +KV+ K  ++  G+S     +PRA   +       +R  F  G     RPR  + 
Subjt:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRAVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIK

Query:  LG-ARSMQW
         G AR+  W
Subjt:  LG-ARSMQW

Q8CCS6 Polyadenylate-binding protein 29.0e-1328.71Show/hide
Query:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG +    G   +++ EL   ++ V  + E  E +  +  + +   N+      P  +   + +     +ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRAVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIK
         + +V I+ D  +G PKG AY+EF  KE+  ++L+LD + F  R +KV+ K  ++  G+S     +PR+   +       +R  F  G     RPR  I 
Subjt:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRAVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIK

Query:  LG-ARSMQW
         G AR+  W
Subjt:  LG-ARSMQW

Q9DDY9 Polyadenylate-binding protein 2-A4.8e-1432.47Show/hide
Query:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG   EL G + +++ EL   ++ V  + E  E +  +  + +   N+      P  +   + +     +ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGGKKFLQESELRGTRSAV-HVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRA---VRGSPYPT-PRFTR--VPFPRG
         + +V I+ D  TG PKG AY+EF  KE+  ++L+LD + F  R +KVV K  ++  G+S     +PRA    R S Y +  RF     P PRG
Subjt:  EIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIV--AWPRA---VRGSPYPT-PRFTR--VPFPRG

Arabidopsis top hitse value%identityAlignment
AT2G24350.1 RNA binding (RRM/RBD/RNP motifs) family protein3.4e-1524.88Show/hide
Query:  VDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA--NEVP
        VD  TF +    E  ++L+  I   +  F  DY+DD L EYV VL+ NG+ + +A  +L  FL + S  FV  LW+ L      Y       S    +V 
Subjt:  VDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA--NEVP

Query:  RPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRRLLQF
           + T     ++     + D +          +       I + E  V+P+ E  ++       +P  R R R  E+           S    R++L+ 
Subjt:  RPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRRLLQF

Query:  AMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSS----SVFDRLGRQSRDMDLTEASGQHAEY
         +  A       + AK   S   RS         T + R      ++ +  A   +AVS    DA   + +    SV+DRLGR S +  L   S   +++
Subjt:  AMRDAVATTRPSSAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSS----SVFDRLGRQSRDMDLTEASGQHAEY

Query:  GV----TAVEDHKYGDMNHTQDRPYSATYLRNNYSGKYAPNEALFEAETGLASDSTSESEDVT-IQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADD
        G+    T V               +S  + R   +  Y      F+       D   +SE +T  + H  ++ S     G+            V  N+ +
Subjt:  GV----TAVEDHKYGDMNHTQDRPYSATYLRNNYSGKYAPNEALFEAETGLASDSTSESEDVT-IQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADD

Query:  ERMTQYNKQKDQ----PSLVANSS--RDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKP
               KQ +Q    PSL+++ S  +DI +   NV                   K+ +QE ELR  +S     +       V ++K    + + ++Q+ 
Subjt:  ERMTQYNKQKDQ----PSLVANSS--RDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKP

Query:  TLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHF-NKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQS
                     + +++R I V+NV++AA K+++S  F +K G +  VI+VTD  T  PKG+A+V F  KE+   A++L GT F SR +K VR +   S
Subjt:  TLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHF-NKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQS

Query:  EGVSIVAWPRAVRGS
          VS  A P+ V GS
Subjt:  EGVSIVAWPRAVRGS

AT3G12640.1 RNA binding (RRM/RBD/RNP motifs) family protein1.0e-12044.14Show/hide
Query:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA
        MGS D VDDRTF  DFS EG+AKL+E +K K+KE+MGDYTDD LVEYVIVLLRNGRRKEEA NEL +FL DDS SFV+WLWDHLA S+D Y       S 
Subjt:  MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSA

Query:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR
         E    KS             L+S+ ++G+S+K +  RR R+W+         +  + VS +      +A S RKRSR D+ +  +REA   VS    RR
Subjt:  NEVPRPKSPTAEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTS--NSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSS-SVFDRLGRQS------RDMDLT
        LLQFA+RDA+A +RP++++ E   KRLRSVVSTS  NS   +  R+++SVA+V NPMATV+KAV+EAAEDA + KS  SVFDR+   +      ++M L 
Subjt:  LLQFAMRDAVATTRPSSAAKEPLSKRLRSVVSTS--NSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSS-SVFDRLGRQS------RDMDLT

Query:  EASGQHAE-----YGVTAVEDHKYGDMNHTQDRPYSATYLRNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAP
        E S +  E      G  AV+      + +TQ    +  Y  N  +     N   F ++ G    S + S   T  G+R+               N  +  
Subjt:  EASGQHAE-----YGVTAVEDHKYGDMNHTQDRPYSATYLRNNYSGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAP

Query:  FRVVENADDERMTQYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHV-TENGEPVTIVNQQKKPAANLQKE
         R+V+  D +R+   N Q   P +   +     + S N++T K    ++  +I ++G ++++ E  L  + +   + T+     TI N   KPAA++++E
Subjt:  FRVVENADDERMTQYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGGKKFLQESELRGTRSAVHV-TENGEPVTIVNQQKKPAANLQKE

Query:  FQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVR-KN
                  L+ TRPLEDA +RTIFV+NVHF ATKDSLSRHFNKFGE++K  IVTD  TGQP GSAY+EF RKEAAE+ALSLDGTSFMSRILK+V+  N
Subjt:  FQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVR-KN

Query:  ASQSEGVSIVAWPRAVRGSPYPTPRFTRVP-FPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESK
            E  S ++W R          RFTR P + RG  G  R R  ++ G RSMQWKRDS    AD G   + N+V  + ARSLTYVR ESK
Subjt:  ASQSEGVSIVAWPRAVRGSPYPTPRFTRVP-FPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPSSGARSLTYVRTESK

AT5G10350.2 RNA-binding (RRM/RBD/RNP motifs) family protein6.6e-1137.38Show/hide
Query:  ANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILK
        A ++KE       A+  A     E+ DAR+++V NV +A T + +  HF   G + +V I+ D   GQPKG AYVEF+  EA + AL L+ +    R LK
Subjt:  ANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILK

Query:  VVRKNAS
        V  K  +
Subjt:  VVRKNAS

AT5G51120.2 polyadenylate-binding protein 16.0e-1237.14Show/hide
Query:  LQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVV
        L+ E++K  +++   AA +  E+ D+R+I+V NV +A T + + +HF   G + +V I+TD   GQPKG AYVEF+  EA +++L L+ +    R +KV 
Subjt:  LQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVV

Query:  RKNAS
         K  +
Subjt:  RKNAS

AT5G65260.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.6e-1237.14Show/hide
Query:  EDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRAVRG
        E+ DAR++FV NV +A T + + +HF   G + +V I+TD   GQPKG AYVEF+  EA + AL L+ +    R LKV++K  +    V  +   R  R 
Subjt:  EDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKGSAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRAVRG

Query:  SPYPTPRFTRVPFPRGV---PGGFRPRPPIKLGARSMQWK
        +PY   RF R PF       P G+   P  +   R M ++
Subjt:  SPYPTPRFTRVPFPRGV---PGGFRPRPPIKLGARSMQWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGTGAGGATCGTGTCGATGATCGGACGTTCAAGGTTGATTTCAGTGGCGAGGGAATGGCCAAACTCAGAGAGAGGATAAAGCTGAAAATGAAGGAATTCATGGG
CGATTATACGGACGACACTCTTGTGGAATACGTGATTGTCTTGTTGAGGAACGGAAGGCGCAAAGAAGAAGCACAGAATGAACTAAACGTATTTTTGGCGGACGATAGCC
ATTCTTTTGTATCTTGGCTTTGGGACCATCTGGCTTCAAGTATGGATTTATATGTGGAACCGCCTACTAAACCTTCTGCCAATGAAGTGCCCAGACCGAAGTCTCCAACA
GCAGAACCAGATAGAATAAATGGTTCTCACAATCTTGAGTCTGATTTAGAAAGAGGGAAGTCTGAAAAAGTATCTATCAGGCGGCGTAATAGGGAATGGAAAGGAATTGC
TAATGATGAAACCCGTGTTACTCCAAGATCTGAAGTTAGTCGTGTCAAGCATTCTTCTCCTGAACAAGCTCCTAGTCATAGGAAAAGAAGCCGTGCTGATGAGCATCAAG
GCACTGAGAGGGAGGCAGCCTTTCAGGTAAGCATCGCTGCCCCTCGGCGGCTGCTCCAGTTTGCGATGCGAGATGCTGTGGCCACCACAAGGCCCTCTAGTGCAGCTAAG
GAACCCCTTTCGAAGCGTCTCCGTTCGGTAGTTTCTACATCCAATAGTGACACAACTAATCGTCCTAGGCGGCTTCAATCTGTTGCAAAAGTGCCAAATCCTATGGCAAC
TGTTATTAAGGCTGTTTCGGAGGCAGCTGAAGATGCGATAAGAGTTAAATCCTCCAGTGTTTTTGACCGACTTGGTCGTCAATCTCGTGATATGGATTTAACAGAGGCCA
GTGGCCAACATGCAGAATATGGAGTAACTGCCGTAGAGGACCATAAATATGGGGATATGAATCATACACAGGATCGACCTTACTCAGCCACTTATCTCAGAAACAATTAT
AGTGGGAAATATGCTCCCAATGAGGCCTTGTTTGAAGCTGAAACTGGATTAGCATCAGATTCTACATCTGAAAGTGAAGATGTTACTATTCAGGGTCATAGAGTATTTGA
CGATTCTTGGACTGCAGAATCAGGAGTAAGAAAGGGAGGCAATTTGCGGACTGCGCCATTCAGGGTAGTTGAGAATGCTGATGATGAACGCATGACACAATATAATAAGC
AGAAGGATCAACCTTCCTTAGTTGCAAATTCCTCACGCGACATTGTAAATATCTCTGTGAATGTTAATACTTGGAAACCACCTCACTATCAGGACCCAGGGCAGATTGCT
GAACTTGGTGGTAAAAAGTTTTTGCAGGAGAGTGAGTTACGAGGTACCAGATCTGCTGTGCATGTAACAGAGAATGGCGAGCCAGTCACTATTGTTAACCAACAGAAAAA
GCCTGCAGCAAACCTTCAAAAAGAGTTCCAGAAACCTACTTTATCTGCTAATGGCCTTGCTGCCACTCGTCCCTTAGAGGACGCTGATGCTCGAACCATTTTTGTTAGCA
ATGTTCATTTTGCTGCTACAAAGGATAGCCTGTCTAGGCATTTTAACAAGTTTGGAGAGATTGTAAAAGTCATCATTGTAACCGATGCAACCACCGGCCAACCCAAAGGG
TCGGCCTATGTGGAGTTCATGAGAAAAGAAGCTGCAGAGAGTGCATTGTCTCTTGATGGGACCTCGTTTATGTCTAGGATTCTGAAGGTCGTAAGGAAAAATGCGTCACA
ATCAGAAGGTGTTTCAATTGTCGCCTGGCCTCGTGCTGTGCGTGGCTCACCATATCCTACTCCCAGATTTACCAGAGTTCCTTTCCCGAGGGGCGTTCCCGGTGGATTTA
GGCCTCGTCCCCCCATCAAACTTGGTGCCAGGAGCATGCAGTGGAAGCGGGATAGTCAGACCACAAATGCTGACAATGGTGCTTCTGTCTCTGGTAATTCTGTACCCTCA
TCTGGTGCTCGTAGTCTCACCTATGTTCGAACAGAATCTAAGCCGGCGGACAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATTTTTCATTTTTAATTTTTTTCTTCTGTTCATCTCAAATCGCTCTCCCTCAGCCATGTGCTCTGATGAAATCCGCGCTCTGTTGATTTTCTGCGTTTAATTTCCAGCCT
CTTCCCTGTTAATCGGTTTCTCTGTCCAGTTCACAGGATTCGGTGGAACCGGCTAGGGTTCATGGAAGCTGCAATTCGAGAGCTCAATATTTGAGGGTTTGAATATATCA
ATTTAACTCCGGTGTTAAGTCTTAAGAGTGAGCGTTCGGCGTTCTGGATTGTGGAGCTTTTTGGAAAAAGAAAAGACGCGTCGGAGATGGGGAGTGAGGATCGTGTCGAT
GATCGGACGTTCAAGGTTGATTTCAGTGGCGAGGGAATGGCCAAACTCAGAGAGAGGATAAAGCTGAAAATGAAGGAATTCATGGGCGATTATACGGACGACACTCTTGT
GGAATACGTGATTGTCTTGTTGAGGAACGGAAGGCGCAAAGAAGAAGCACAGAATGAACTAAACGTATTTTTGGCGGACGATAGCCATTCTTTTGTATCTTGGCTTTGGG
ACCATCTGGCTTCAAGTATGGATTTATATGTGGAACCGCCTACTAAACCTTCTGCCAATGAAGTGCCCAGACCGAAGTCTCCAACAGCAGAACCAGATAGAATAAATGGT
TCTCACAATCTTGAGTCTGATTTAGAAAGAGGGAAGTCTGAAAAAGTATCTATCAGGCGGCGTAATAGGGAATGGAAAGGAATTGCTAATGATGAAACCCGTGTTACTCC
AAGATCTGAAGTTAGTCGTGTCAAGCATTCTTCTCCTGAACAAGCTCCTAGTCATAGGAAAAGAAGCCGTGCTGATGAGCATCAAGGCACTGAGAGGGAGGCAGCCTTTC
AGGTAAGCATCGCTGCCCCTCGGCGGCTGCTCCAGTTTGCGATGCGAGATGCTGTGGCCACCACAAGGCCCTCTAGTGCAGCTAAGGAACCCCTTTCGAAGCGTCTCCGT
TCGGTAGTTTCTACATCCAATAGTGACACAACTAATCGTCCTAGGCGGCTTCAATCTGTTGCAAAAGTGCCAAATCCTATGGCAACTGTTATTAAGGCTGTTTCGGAGGC
AGCTGAAGATGCGATAAGAGTTAAATCCTCCAGTGTTTTTGACCGACTTGGTCGTCAATCTCGTGATATGGATTTAACAGAGGCCAGTGGCCAACATGCAGAATATGGAG
TAACTGCCGTAGAGGACCATAAATATGGGGATATGAATCATACACAGGATCGACCTTACTCAGCCACTTATCTCAGAAACAATTATAGTGGGAAATATGCTCCCAATGAG
GCCTTGTTTGAAGCTGAAACTGGATTAGCATCAGATTCTACATCTGAAAGTGAAGATGTTACTATTCAGGGTCATAGAGTATTTGACGATTCTTGGACTGCAGAATCAGG
AGTAAGAAAGGGAGGCAATTTGCGGACTGCGCCATTCAGGGTAGTTGAGAATGCTGATGATGAACGCATGACACAATATAATAAGCAGAAGGATCAACCTTCCTTAGTTG
CAAATTCCTCACGCGACATTGTAAATATCTCTGTGAATGTTAATACTTGGAAACCACCTCACTATCAGGACCCAGGGCAGATTGCTGAACTTGGTGGTAAAAAGTTTTTG
CAGGAGAGTGAGTTACGAGGTACCAGATCTGCTGTGCATGTAACAGAGAATGGCGAGCCAGTCACTATTGTTAACCAACAGAAAAAGCCTGCAGCAAACCTTCAAAAAGA
GTTCCAGAAACCTACTTTATCTGCTAATGGCCTTGCTGCCACTCGTCCCTTAGAGGACGCTGATGCTCGAACCATTTTTGTTAGCAATGTTCATTTTGCTGCTACAAAGG
ATAGCCTGTCTAGGCATTTTAACAAGTTTGGAGAGATTGTAAAAGTCATCATTGTAACCGATGCAACCACCGGCCAACCCAAAGGGTCGGCCTATGTGGAGTTCATGAGA
AAAGAAGCTGCAGAGAGTGCATTGTCTCTTGATGGGACCTCGTTTATGTCTAGGATTCTGAAGGTCGTAAGGAAAAATGCGTCACAATCAGAAGGTGTTTCAATTGTCGC
CTGGCCTCGTGCTGTGCGTGGCTCACCATATCCTACTCCCAGATTTACCAGAGTTCCTTTCCCGAGGGGCGTTCCCGGTGGATTTAGGCCTCGTCCCCCCATCAAACTTG
GTGCCAGGAGCATGCAGTGGAAGCGGGATAGTCAGACCACAAATGCTGACAATGGTGCTTCTGTCTCTGGTAATTCTGTACCCTCATCTGGTGCTCGTAGTCTCACCTAT
GTTCGAACAGAATCTAAGCCGGCGGACAAGTAGGAACCACTTGATTGAGAGAGTTGTTCTCTCCTCAAGGATTTGCTCAAAATTATTATGCGAAAAGAGAATCACGCAGA
TCAAGTATGACCTCCACAAATTGTTTGTTTGATTTTTTTGGTGGTTCATGATTCTTCGTGCATCATGGTGGAGGAAACTGAGAAAGAAGACCACGGGAAGTTTTAGCAGT
TTCTAGGAGAAATATAATTTTTGTTTCCATCGGGGAATGGAAATTGGTGAGTTTATAGCTAGGAAACTCATAATGGCGGTTCGGATGATTTCATATTTTTGAGCTCCCCC
TCCCATTCAGAATTCTTATTCAGTTACTAGATTAAAATTTGTGAATAGAGTAAGAGTAGACGTTTTGTTTTTTGTTTAACGGATATTATGATATTTTGTAATTTTTATTA
CTGATGTGACAGAAGAGAATGAAAAAAAAGAAAAATAGAGTTACATGAAAAGAAAATGAATAAATTTTACCTAGTTTGTATATCAGCAGGTCTTTGTACCAGTTTATAAG
CATGTTTTTACCTAGTTTGA
Protein sequenceShow/hide protein sequence
MGSEDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKEEAQNELNVFLADDSHSFVSWLWDHLASSMDLYVEPPTKPSANEVPRPKSPT
AEPDRINGSHNLESDLERGKSEKVSIRRRNREWKGIANDETRVTPRSEVSRVKHSSPEQAPSHRKRSRADEHQGTEREAAFQVSIAAPRRLLQFAMRDAVATTRPSSAAK
EPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVSEAAEDAIRVKSSSVFDRLGRQSRDMDLTEASGQHAEYGVTAVEDHKYGDMNHTQDRPYSATYLRNNY
SGKYAPNEALFEAETGLASDSTSESEDVTIQGHRVFDDSWTAESGVRKGGNLRTAPFRVVENADDERMTQYNKQKDQPSLVANSSRDIVNISVNVNTWKPPHYQDPGQIA
ELGGKKFLQESELRGTRSAVHVTENGEPVTIVNQQKKPAANLQKEFQKPTLSANGLAATRPLEDADARTIFVSNVHFAATKDSLSRHFNKFGEIVKVIIVTDATTGQPKG
SAYVEFMRKEAAESALSLDGTSFMSRILKVVRKNASQSEGVSIVAWPRAVRGSPYPTPRFTRVPFPRGVPGGFRPRPPIKLGARSMQWKRDSQTTNADNGASVSGNSVPS
SGARSLTYVRTESKPADK