; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g26710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g26710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein SPT2 homolog
Genome locationchr1:19037839..19040940
RNA-Seq ExpressionMoc01g26710
SyntenyMoc01g26710
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010847 - regulation of chromatin assembly (biological process)
GO:0043486 - histone exchange (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0042393 - histone binding (molecular function)
InterPro domainsIPR013256 - Chromatin SPT2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022133105.1 myb-like protein I [Momordica charantia]5.1e-257100Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
        MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR

Query:  VIQESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQ
        VIQESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQ
Subjt:  VIQESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQ

Query:  PPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRP
        PPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRP
Subjt:  PPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRP

Query:  VSNSNNGNGPGPKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPY
        VSNSNNGNGPGPKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPY
Subjt:  VSNSNNGNGPGPKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPY

Query:  SDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKGQ
        SDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKGQ
Subjt:  SDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKGQ

XP_023518428.1 protein SPT2 homolog isoform X1 [Cucurbita pepo subsp. pepo]2.3e-17774.55Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
        MEH +EDY++YDEYE    DD + ++ EEEYEEVED KPTKEELEYL LRQRLKE+IRKQ     SKKDGGSH+ S  KKLPYDNFGSFFGPSQPVISQR
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR

Query:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK
        VIQESKSLLENQHLASRVS H HDN+KS G NS A KPRVPPKIINEKKTKVQ LKDTRDYSFLFSED +VPAP KESS+ SV  PSTEARSAH  +K+K
Subjt:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK

Query:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNG-----
        QP  N RQN+HV    KKSV +N GQMQS+NK  SSGN N S+MKAK  LGN +NGNGPGRP+ NS+NGNGPGRP+ NS+NGNGPGRP+ N  NG     
Subjt:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNG-----

Query:  --------NGPGRPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQV
                NGPGRPV NSNNG GPG     PKAPSAL+Q+K SLP +KS VP VH+PLP+K L DKRNE+R P+KAK+ PNR +SSSRPQMSKP PQRQ+
Subjt:  --------NGPGRPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQV

Query:  SSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLK
        SSRP LN QRPKKRPARPYSD+EDD +GG+AISLIRKMF YNP +FS DDDDSDMEANF+DIMMEEKRSA+IARKEDEEQLRLIQEEEERERRARIKRLK
Subjt:  SSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLK

Query:  RAK
        RAK
Subjt:  RAK

XP_023518430.1 protein SPT2 homolog isoform X2 [Cucurbita pepo subsp. pepo]8.0e-17875.92Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
        MEH +EDY++YDEYE    DD + ++ EEEYEEVED KPTKEELEYL LRQRLKE+IRKQ     SKKDGGSH+ S  KKLPYDNFGSFFGPSQPVISQR
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR

Query:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK
        VIQESKSLLENQHLASRVS H HDN+KS G NS A KPRVPPKIINEKKTKVQ LKDTRDYSFLFSED +VPAP KESS+ SV  PSTEARSAH  +K+K
Subjt:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK

Query:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGR
        QP  N RQN+HV    KKSV +N GQMQS+NK  SSGN N S+MKAK  LGN +NGNGPGRP+ NS+NGNGPGRP+ N  NGNGPGRP+ N +  NGPGR
Subjt:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGR

Query:  PVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKK
        PV NSNNG GPG     PKAPSAL+Q+K SLP +KS VP VH+PLP+K L DKRNE+R P+KAK+ PNR +SSSRPQMSKP PQRQ+SSRP LN QRPKK
Subjt:  PVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKK

Query:  RPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAK
        RPARPYSD+EDD +GG+AISLIRKMF YNP +FS DDDDSDMEANF+DIMMEEKRSA+IARKEDEEQLRLIQEEEERERRARIKRLKRAK
Subjt:  RPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAK

XP_038881803.1 protein SPT2 homolog isoform X1 [Benincasa hispida]2.7e-18177.24Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRD-KKLPYDNFGSFFGPSQPVISQ
        MEH +EDY+ Y+EY+EYE DDQM EE+EEEYEEVED KPTKEE+EYLELRQRLKE+IR+QSKKDGS     SH+ S D KKLPYDNFGSFFGPSQPVISQ
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRD-KKLPYDNFGSFFGPSQPVISQ

Query:  RVIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN
        RVIQESKSLLENQHLASRVS H H NKKS G NS A KPRVPPK++NEK+TKVQKLKDTRDYSFLFSED +VPAPTKE SS SV  PSTEARSA   MK+
Subjt:  RVIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN

Query:  KQPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG
        KQP  N RQNIH GHKDKKSV IN GQMQS+NK  SSGN N S+MKAK QLGN  NGNGPGRP+ N +NG+GPGRP+ NS+NG+GPGRP+ NS N NGPG
Subjt:  KQPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG

Query:  RPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPK
        RP+ NSNNGNGPG     PKA SA++Q+K SLPGTK+S P VHKPLPSK L DKRN++R PAKAKV P+R +SSSRPQ+SK   QRQV SR A+N QRPK
Subjt:  RPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPK

Query:  KRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKG
        KRPAR YSD+EDD EG +AISLIRKMF YNP KFSRDDDDSDMEANF+DI+MEEKRSA+IARKEDEEQLRLIQEEEERERRARIKRLKRAKG
Subjt:  KRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKG

XP_038881805.1 protein SPT2 homolog isoform X2 [Benincasa hispida]1.8e-17776.83Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRD-KKLPYDNFGSFFGPSQPVISQ
        MEH +EDY+ Y+EY+EYE DDQM EE+EEEYEEVED KPTKEE+EYLELRQRLKE+IR+QSKKDGS     SH+ S D KKLPYDNFGSFFGPSQPVISQ
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRD-KKLPYDNFGSFFGPSQPVISQ

Query:  RVIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN
        RVIQESKSLLENQHLASRVS H H NKKS G NS A KPRVPPK    K+TKVQKLKDTRDYSFLFSED +VPAPTKE SS SV  PSTEARSA   MK+
Subjt:  RVIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN

Query:  KQPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG
        KQP  N RQNIH GHKDKKSV IN GQMQS+NK  SSGN N S+MKAK QLGN  NGNGPGRP+ N +NG+GPGRP+ NS+NG+GPGRP+ NS N NGPG
Subjt:  KQPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG

Query:  RPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPK
        RP+ NSNNGNGPG     PKA SA++Q+K SLPGTK+S P VHKPLPSK L DKRN++R PAKAKV P+R +SSSRPQ+SK   QRQV SR A+N QRPK
Subjt:  RPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPK

Query:  KRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKG
        KRPAR YSD+EDD EG +AISLIRKMF YNP KFSRDDDDSDMEANF+DI+MEEKRSA+IARKEDEEQLRLIQEEEERERRARIKRLKRAKG
Subjt:  KRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKG

TrEMBL top hitse value%identityAlignment
A0A0A0KLN3 Uncharacterized protein6.2e-17671.43Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRD-KKLPYDNFGSFFGPSQPVISQ
        MEH +EDY+ Y++Y+EYE DDQM EEEEEEYEEVE  KPTKEE EYLELRQRLKE+IR+QSK+DGS     SH+ S D KKLPYDNFGSFFGPSQPVISQ
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRD-KKLPYDNFGSFFGPSQPVISQ

Query:  RVIQESKSLLENQHLASRVS-HHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN
        RVIQESKSLLENQHLASRVS H H NKKS G NS A KPRV PK+++EK+TKVQKLKDTRDYSFLFSED +VPAP+KE SS SV  PSTEARSA   MK+
Subjt:  RVIQESKSLLENQHLASRVS-HHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN

Query:  KQPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQ--------------------------------------LGNRHNGNGPGR
        K PPSN RQNIHV HK+KKSV +N G MQS+NK ASSGN N S+MKAK Q                                      LGN +NGNGPGR
Subjt:  KQPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQ--------------------------------------LGNRHNGNGPGR

Query:  PVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPV-VHKPLPSKALPDKRNEIREP
        P+ NS+NGNGPGRP+ NS+NGNGPGRPL NS+NGNGPGRP+ NSNNGNGPG     PKA SA++Q++ SLPGT++SVP  VHKPLPSK L DKRN++R P
Subjt:  PVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPV-VHKPLPSKALPDKRNEIREP

Query:  AKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIA
        AKAKV P+R +SSSRPQMSK    RQVSSRP +N QRPKKRPAR YSD+EDD EG +AISLIRKMFRYNP KFSRDDDDSDMEANF+DIMMEE+RSARIA
Subjt:  AKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIA

Query:  RKEDEEQLRLIQEEEERERRARIKRLKRAKGQ
        RKEDEEQLRLIQEEEE+ERRAR+KRLKRAKGQ
Subjt:  RKEDEEQLRLIQEEEERERRARIKRLKRAKGQ

A0A6J1BU33 myb-like protein I2.4e-257100Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
        MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR

Query:  VIQESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQ
        VIQESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQ
Subjt:  VIQESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQ

Query:  PPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRP
        PPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRP
Subjt:  PPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRP

Query:  VSNSNNGNGPGPKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPY
        VSNSNNGNGPGPKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPY
Subjt:  VSNSNNGNGPGPKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPY

Query:  SDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKGQ
        SDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKGQ
Subjt:  SDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKGQ

A0A6J1HGF8 protein SPT2 homolog isoform X41.1e-17575.51Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
        MEH +EDY++YDEYE    DD + ++ EEEYEEVED KPTKEELEYL LRQRLKE+IRKQ     SKKDGGSH+ S  KKLPYDNFGSFFGPSQPVISQR
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR

Query:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK
        VIQESKSLLENQHLASRVS H HDN+KS G NS A KPRVPPKIINEKKTKVQ LKDTRDYSFLFSED +VPAP KESS+ SV  PSTEA SAH L+K+K
Subjt:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK

Query:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGR
        QP  N RQNIHV    KKSV +N GQMQS+NK  SSGN N S+MK K  LGN +NGNGPGRP+ NS+NGNGPGRP+ N  NGNGPGRP+ N +  NGPGR
Subjt:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGR

Query:  PVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKK
        PV NSNNG GPG     PKAPSAL+Q+K  LP +KS VP VH+PLP+K L DKRNE+R P+KAK+ PNR +SSSRPQMSKP PQRQ+SSRP LN QRPKK
Subjt:  PVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKK

Query:  RPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAK
        RPARPYSD+EDD +GG+AISLIRKMF YNP +F+ DDDDSDMEANF+DIMMEEKRSA+IARKEDEEQLRLIQEEEERERRARIKRLKRAK
Subjt:  RPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAK

A0A6J1HIF5 protein SPT2 homolog isoform X13.1e-17572.09Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
        MEH +EDY++YDEYE    DD + ++ EEEYEEVED KPTKEELEYL LRQRLKE+IRKQ     SKKDGGSH+ S  KKLPYDNFGSFFGPSQPVISQR
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR

Query:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK
        VIQESKSLLENQHLASRVS H HDN+KS G NS A KPRVPPKIINEKKTKVQ LKDTRDYSFLFSED +VPAP KESS+ SV  PSTEA SAH L+K+K
Subjt:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK

Query:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGR
        QP  N RQNIHV    KKSV +N GQMQS+NK  SSGN N S+MK K  LGN +NGNGPGRP+ NS+NGNGPGRP+ NS+NGNGPGRP+ NS+NGNGPGR
Subjt:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGR

Query:  PVSNSNNGNGPG-------------------------------PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSS
        P+ N +NGNGPG                               PKAPSAL+Q+K  LP +KS VP VH+PLP+K L DKRNE+R P+KAK+ PNR +SSS
Subjt:  PVSNSNNGNGPG-------------------------------PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSS

Query:  RPQMSKPAPQRQVSSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEE
        RPQMSKP PQRQ+SSRP LN QRPKKRPARPYSD+EDD +GG+AISLIRKMF YNP +F+ DDDDSDMEANF+DIMMEEKRSA+IARKEDEEQLRLIQEE
Subjt:  RPQMSKPAPQRQVSSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEE

Query:  EERERRARIKRLKRAK
        EERERRARIKRLKRAK
Subjt:  EERERRARIKRLKRAK

A0A6J1HIG2 AAC-rich mRNA clone AAC11 protein-like isoform X33.1e-17574.16Show/hide
Query:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR
        MEH +EDY++YDEYE    DD + ++ EEEYEEVED KPTKEELEYL LRQRLKE+IRKQ     SKKDGGSH+ S  KKLPYDNFGSFFGPSQPVISQR
Subjt:  MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQR

Query:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK
        VIQESKSLLENQHLASRVS H HDN+KS G NS A KPRVPPKIINEKKTKVQ LKDTRDYSFLFSED +VPAP KESS+ SV  PSTEA SAH L+K+K
Subjt:  VIQESKSLLENQHLASRVSHH-HDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK

Query:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNG-----
        QP  N RQNIHV    KKSV +N GQMQS+NK  SSGN N S+MK K  LGN +NGNGPGRP+ NS+NGNGPGRP+ NS+NGNGPGRP+ N  NG     
Subjt:  QPPSNSRQNIHVGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNG-----

Query:  --------NGPGRPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQV
                NGPGRPV NSNNG GPG     PKAPSAL+Q+K  LP +KS VP VH+PLP+K L DKRNE+R P+KAK+ PNR +SSSRPQMSKP PQRQ+
Subjt:  --------NGPGRPVSNSNNGNGPG-----PKAPSALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQV

Query:  SSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLK
        SSRP LN QRPKKRPARPYSD+EDD +GG+AISLIRKMF YNP +F+ DDDDSDMEANF+DIMMEEKRSA+IARKEDEEQLRLIQEEEERERRARIKRLK
Subjt:  SSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLK

Query:  RAK
        RAK
Subjt:  RAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22720.1 SPT2 chromatin protein3.3e-1232.85Show/hide
Query:  PAPTKESSSHSVSVPSTEARSAHALMKNKQPPSNSRQNIHVGHKDKKSVSINGGQMQSR------NKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSN
        PA +     +S    S     + A+  + +P S+  Q  +   ++ +  S  G QMQ R       +PASSG+          + G+  N   P RP  +
Subjt:  PAPTKESSSHSVSVPSTEARSAHALMKNKQPPSNSRQNIHVGHKDKKSVSINGGQMQSR------NKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSN

Query:  SHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRPVS-----NSNNGNGPGPKAPSA-------LVQRKTSLPGTKSSVPVVHKPLPSKAL-PDKRNEI
            NG       S N NG     S+S        PV      +S+NG GPG  A +A        ++RK S+   KSS+    +P  S+ +  D R  +
Subjt:  SHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRPVS-----NSNNGNGPGPKAPSA-------LVQRKTSLPGTKSSVPVVHKPLPSKAL-PDKRNEI

Query:  REPAKAKVTPNRLMSSSR--PQMSKPAPQRQVSSRPALNVQRP-------------KKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSR-DDDDSD
         E  K     +R M++ R  P+ S P  + Q+ S+PAL  +RP             KK+PAR     ED E    A  ++R++    P +FSR DDDD +
Subjt:  REPAKAKVTPNRLMSSSR--PQMSKPAPQRQVSSRPALNVQRP-------------KKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSR-DDDDSD

Query:  MEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKR
        MEA FEDI  EE+RSARIAR+EDE +L+L++EEE RER  + ++L R
Subjt:  MEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKR

AT2G22720.2 SPT2 chromatin protein5.2e-5037.16Show/hide
Query:  DYDE---YDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQRVIQ
        D DE   YD+Y  Y GD+  YE+EEE     ED +P KEELE+LE RQ+LKE IR   KK G+          R +KLPY++FGSFFGPS+PVIS RVIQ
Subjt:  DYDE---YDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQRVIQ

Query:  ESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVP----PKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK
        ESKSLLEN+    ++S+    KK   P + +    V     PK++NE + KV+ LKDTRDYSFLFS+D  +P P KES S S S P++EARSA    + K
Subjt:  ESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVP----PKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNK

Query:  QPPSNSRQNIHVGHK-DKKSVSIN--------------------------------------------GGQMQSR-----NKPASSGN-------LNSSV
        Q    + +  H  H+ +K+ VS N                                            G QMQSR      +PASSG+        NS  
Subjt:  QPPSNSRQNIHVGHK-DKKSVSIN--------------------------------------------GGQMQSR-----NKPASSGN-------LNSSV

Query:  MKAKTQLGNRHNGNGPGRPVSNSHN---GNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG-------------------------RPVSNSNNGNGPGPKA
          A +Q+  R   +G  RP S+       +G  RP  +S N   P RP  +    NG                           R   +S+NG GPG  A
Subjt:  MKAKTQLGNRHNGNGPGRPVSNSHN---GNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG-------------------------RPVSNSNNGNGPGPKA

Query:  PSA-------LVQRKTSLPGTKSSVPVVHKPLPSKAL-PDKRNEIREPAKAKVTPNRLMSSSR--PQMSKPAPQRQVSSRPALNVQRP------------
         +A        ++RK S+   KSS+    +P  S+ +  D R  + E  K     +R M++ R  P+ S P  + Q+ S+PAL  +RP            
Subjt:  PSA-------LVQRKTSLPGTKSSVPVVHKPLPSKAL-PDKRNEIREPAKAKVTPNRLMSSSR--PQMSKPAPQRQVSSRPALNVQRP------------

Query:  -KKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSR-DDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKR
         KK+PAR     ED E    A  ++R++    P +FSR DDDD +MEA FEDI  EE+RSARIAR+EDE +L+L++EEE RER  + ++L R
Subjt:  -KKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSR-DDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKR

AT2G22720.3 SPT2 chromatin protein5.2e-5037.1Show/hide
Query:  EDYDE---YDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQRVI
        +D DE   YD+Y  Y GD+  YE+EEE     ED +P KEELE+LE RQ+LKE IR   KK G+          R +KLPY++FGSFFGPS+PVIS RVI
Subjt:  EDYDE---YDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQRVI

Query:  QESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVP----PKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN
        QESKSLLEN+    ++S+    KK   P + +    V     PK++NE + KV+ LKDTRDYSFLFS+D  +P P KES S S S P++EARSA    + 
Subjt:  QESKSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVP----PKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKN

Query:  KQPPSNSRQNIHVGHK-DKKSVSIN--------------------------------------------GGQMQSR-----NKPASSGN-------LNSS
        KQ    + +  H  H+ +K+ VS N                                            G QMQSR      +PASSG+        NS 
Subjt:  KQPPSNSRQNIHVGHK-DKKSVSIN--------------------------------------------GGQMQSR-----NKPASSGN-------LNSS

Query:  VMKAKTQLGNRHNGNGPGRPVSNSHN---GNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG-------------------------RPVSNSNNGNGPGPK
           A +Q+  R   +G  RP S+       +G  RP  +S N   P RP  +    NG                           R   +S+NG GPG  
Subjt:  VMKAKTQLGNRHNGNGPGRPVSNSHN---GNGPGRPVSNSHNGNGPGRPLSNSHNGNGPG-------------------------RPVSNSNNGNGPGPK

Query:  APSA-------LVQRKTSLPGTKSSVPVVHKPLPSKAL-PDKRNEIREPAKAKVTPNRLMSSSR--PQMSKPAPQRQVSSRPALNVQRP-----------
        A +A        ++RK S+   KSS+    +P  S+ +  D R  + E  K     +R M++ R  P+ S P  + Q+ S+PAL  +RP           
Subjt:  APSA-------LVQRKTSLPGTKSSVPVVHKPLPSKAL-PDKRNEIREPAKAKVTPNRLMSSSR--PQMSKPAPQRQVSSRPALNVQRP-----------

Query:  --KKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSR-DDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKR
          KK+PAR     ED E    A  ++R++    P +FSR DDDD +MEA FEDI  EE+RSARIAR+EDE +L+L++EEE RER  + ++L R
Subjt:  --KKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKFSR-DDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKR

AT4G37860.1 SPT2 chromatin protein2.0e-2231.09Show/hide
Query:  EYLELRQRLKEQIRKQSKKDGS--KKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQRVIQESKSLLENQHLASRVSHHHDN---------------KKS
        E+LELRQR+KE IR + +   +       + ++     LPYD FGSFFGPSQ VI+ RV+QESK LLEN+  A+++ +   N               KKS
Subjt:  EYLELRQRLKEQIRKQSKKDGS--KKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQRVIQESKSLLENQHLASRVSHHHDN---------------KKS

Query:  HGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQPPSNSRQNIHVGHKDKKSVSINGGQMQ
            S         K  NE K K +KLKD RDYSFLFS+D  +P   KE        P T   S+       +P S++    H     ++  +IN  + +
Subjt:  HGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQPPSNSRQNIHVGHKDKKSVSINGGQMQ

Query:  SRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRPVSNSNNGNGPGPKAPSALVQRKTSLPG
        S NK                                        G+P S      G  RPLS++       +P+ + +            + QR  SL  
Subjt:  SRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRPVSNSNNGNGPGPKAPSALVQRKTSLPG

Query:  TKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKF
        T+S +P                    PAK                       Q+ S+P L  +R KK+P +   DD        A+ ++RKM +   ++F
Subjt:  TKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMFRYNPNKF

Query:  SRDDDDSD---MEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAK
        +  D+D D   MEANF+DIM EEKRS R+A+KED EQLRL+ EEEER RR + ++L   K
Subjt:  SRDDDDSD---MEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACATCCAATTGAAGACTATGATGAATATGATGAATACGAAGAATATGAAGGTGATGATCAAATGTATGAAGAAGAAGAGGAGGAATATGAAGAGGTAGAA
GACCGTAAGCCTACCAAGGAAGAATTAGAATACCTTGAACTAAGGCAACGATTGAAAGAGCAAATTAGAAAGCAGTCAAAAAAAGATGGCAGTAAAAAAGATGGC
GGTTCTCACATGAACTCCAGAGATAAGAAGCTTCCCTATGATAATTTTGGTTCATTCTTTGGTCCTTCCCAACCTGTCATTTCTCAAAGAGTAATTCAAGAAAGC
AAGTCATTATTAGAAAACCAGCACTTGGCATCTAGGGTTTCCCACCATCATGATAATAAAAAGAGTCACGGTCCAAATAGTGCAGCTCCCAAACCTAGGGTTCCA
CCCAAAATCATAAATGAGAAGAAGACGAAAGTTCAAAAGCTTAAGGACACCAGAGACTACTCCTTTCTATTTTCCGAGGATGGACATGTTCCTGCTCCTACAAAA
GAGTCCTCATCTCATAGTGTTTCGGTTCCAAGTACTGAGGCACGTTCAGCTCATGCTCTCATGAAAAATAAGCAGCCTCCGAGTAATTCTCGTCAGAATATTCAT
GTGGGTCATAAAGACAAAAAGTCTGTTTCCATCAATGGCGGCCAAATGCAGTCTAGAAACAAGCCAGCATCCTCTGGCAACCTCAATTCGTCAGTAATGAAGGCT
AAGACACAATTGGGTAATAGGCACAACGGGAATGGTCCTGGCCGGCCAGTGAGTAATAGCCACAACGGGAATGGTCCTGGCCGGCCAGTGAGTAATAGCCACAAC
GGGAACGGTCCTGGCCGGCCATTGAGTAATAGCCACAACGGAAACGGTCCTGGCCGGCCAGTGAGTAACAGCAACAATGGGAATGGTCCTGGCCCGAAAGCCCCA
TCTGCTCTCGTGCAGAGGAAGACTTCTCTACCAGGTACAAAGAGTTCCGTACCTGTTGTTCACAAGCCATTACCATCAAAAGCACTTCCAGATAAGAGGAATGAA
ATTCGAGAACCTGCTAAGGCTAAAGTAACACCGAATCGTCTCATGTCATCGTCAAGACCCCAGATGAGCAAGCCAGCACCACAAAGACAAGTTTCATCTCGTCCA
GCTTTGAATGTTCAACGGCCGAAGAAAAGGCCTGCAAGGCCTTACTCTGATGATGAGGATGATGAGGAGGGCGGAAAAGCTATTAGTCTTATCAGAAAAATGTTC
AGATACAATCCGAACAAGTTTTCCCGAGATGATGACGACAGTGATATGGAGGCCAATTTCGAAGATATAATGATGGAGGAAAAAAGAAGTGCAAGAATAGCAAGG
AAGGAGGATGAAGAGCAACTTCGTTTGATCCAAGAAGAAGAAGAACGGGAGAGGCGTGCGAGAATTAAAAGGCTTAAAAGAGCGAAAGGGCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACATCCAATTGAAGACTATGATGAATATGATGAATACGAAGAATATGAAGGTGATGATCAAATGTATGAAGAAGAAGAGGAGGAATATGAAGAGGTAGAA
GACCGTAAGCCTACCAAGGAAGAATTAGAATACCTTGAACTAAGGCAACGATTGAAAGAGCAAATTAGAAAGCAGTCAAAAAAAGATGGCAGTAAAAAAGATGGC
GGTTCTCACATGAACTCCAGAGATAAGAAGCTTCCCTATGATAATTTTGGTTCATTCTTTGGTCCTTCCCAACCTGTCATTTCTCAAAGAGTAATTCAAGAAAGC
AAGTCATTATTAGAAAACCAGCACTTGGCATCTAGGGTTTCCCACCATCATGATAATAAAAAGAGTCACGGTCCAAATAGTGCAGCTCCCAAACCTAGGGTTCCA
CCCAAAATCATAAATGAGAAGAAGACGAAAGTTCAAAAGCTTAAGGACACCAGAGACTACTCCTTTCTATTTTCCGAGGATGGACATGTTCCTGCTCCTACAAAA
GAGTCCTCATCTCATAGTGTTTCGGTTCCAAGTACTGAGGCACGTTCAGCTCATGCTCTCATGAAAAATAAGCAGCCTCCGAGTAATTCTCGTCAGAATATTCAT
GTGGGTCATAAAGACAAAAAGTCTGTTTCCATCAATGGCGGCCAAATGCAGTCTAGAAACAAGCCAGCATCCTCTGGCAACCTCAATTCGTCAGTAATGAAGGCT
AAGACACAATTGGGTAATAGGCACAACGGGAATGGTCCTGGCCGGCCAGTGAGTAATAGCCACAACGGGAATGGTCCTGGCCGGCCAGTGAGTAATAGCCACAAC
GGGAACGGTCCTGGCCGGCCATTGAGTAATAGCCACAACGGAAACGGTCCTGGCCGGCCAGTGAGTAACAGCAACAATGGGAATGGTCCTGGCCCGAAAGCCCCA
TCTGCTCTCGTGCAGAGGAAGACTTCTCTACCAGGTACAAAGAGTTCCGTACCTGTTGTTCACAAGCCATTACCATCAAAAGCACTTCCAGATAAGAGGAATGAA
ATTCGAGAACCTGCTAAGGCTAAAGTAACACCGAATCGTCTCATGTCATCGTCAAGACCCCAGATGAGCAAGCCAGCACCACAAAGACAAGTTTCATCTCGTCCA
GCTTTGAATGTTCAACGGCCGAAGAAAAGGCCTGCAAGGCCTTACTCTGATGATGAGGATGATGAGGAGGGCGGAAAAGCTATTAGTCTTATCAGAAAAATGTTC
AGATACAATCCGAACAAGTTTTCCCGAGATGATGACGACAGTGATATGGAGGCCAATTTCGAAGATATAATGATGGAGGAAAAAAGAAGTGCAAGAATAGCAAGG
AAGGAGGATGAAGAGCAACTTCGTTTGATCCAAGAAGAAGAAGAACGGGAGAGGCGTGCGAGAATTAAAAGGCTTAAAAGAGCGAAAGGGCAATAG
Protein sequenceShow/hide protein sequence
MEHPIEDYDEYDEYEEYEGDDQMYEEEEEEYEEVEDRKPTKEELEYLELRQRLKEQIRKQSKKDGSKKDGGSHMNSRDKKLPYDNFGSFFGPSQPVISQRVIQES
KSLLENQHLASRVSHHHDNKKSHGPNSAAPKPRVPPKIINEKKTKVQKLKDTRDYSFLFSEDGHVPAPTKESSSHSVSVPSTEARSAHALMKNKQPPSNSRQNIH
VGHKDKKSVSINGGQMQSRNKPASSGNLNSSVMKAKTQLGNRHNGNGPGRPVSNSHNGNGPGRPVSNSHNGNGPGRPLSNSHNGNGPGRPVSNSNNGNGPGPKAP
SALVQRKTSLPGTKSSVPVVHKPLPSKALPDKRNEIREPAKAKVTPNRLMSSSRPQMSKPAPQRQVSSRPALNVQRPKKRPARPYSDDEDDEEGGKAISLIRKMF
RYNPNKFSRDDDDSDMEANFEDIMMEEKRSARIARKEDEEQLRLIQEEEERERRARIKRLKRAKGQ