; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0274 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0274
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionchromo domain protein LHP1-like
Genome locationMC05:1997000..2000719
RNA-Seq ExpressionMC05g0274
SyntenyMC05g0274
Gene Ontology termsGO:0031507 - heterochromatin assembly (biological process)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR023779 - Chromo domain, conserved site
IPR023780 - Chromo domain
IPR044251 - Chromo domain-containing protein LHP1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049056.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]7.23e-18759.47Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF
        MGRGKKK  GSS+ E +AL  P   FT STH NG  DSAPS  +NNG+          +S+ N+SVQ P  T       GED        SE+  LDEGF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL
        FE+E+IRRKRVRK Q +++    GWPET NTWEP +NLQSC EFI+E+EE     +SGKQR+RKRK+GD ++  ++EK  +I+A+DNVTDV I+ +DDRL
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL

Query:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC
        S+APLN K+  DLP PQ P+D  HEGE                    +D K DG RK+DEYD++L +  A +S NMV+SD+   AS DVSLV D+SKADC
Subjt:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC

Query:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV
        +VGS QGSH  GAKRRKSSRVKRFTKD+ALSE   QGL+QNA T  IEPTDP++QL  +N S SGHSRNV+ ITRII+P+GYSVSV NNI DV+VTFL V
Subjt:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV

Query:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW
        RSDGKEVTVNNKFLK NNP L  +  A   SS        +   +   ++ + + MG   +GPSP+VPCIIVGFLG+IIF PT  SIWES+E LLELGIW
Subjt:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW

Query:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL
        VAVIL+FLLLLVH LSIFFPVL  SSTFAVQH+SSPGYDADGFGFG G   LFL LLFLVLY LL
Subjt:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL

TYK17507.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]3.02e-19660.88Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF
        MGRGKKK  GSS+ E +AL  P   FT STH NG  DSAPS  +NNG+          +S+ N+SVQ P  T       GED        SE+  LDEGF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL
        FE+E+IRRKRVRKGQLQYLVKW GWPET NTWEP +NLQSC EFI+E+EE     +SGKQR+RKRK+GD ++  ++EK  +I+A+DNVTDV I+ +DDRL
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL

Query:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC
        S+APLN K+  DLP PQ P+D  HEGE                    +D K DG RK+DEYD++L +  A +S NMV+SD+   AS DVSLV D+SKADC
Subjt:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC

Query:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV
        +VGS QGSH  GAKRRKSSRVKRFTKD+ALSE   QGL+QNA T  IEPTDP++QL  +N S SGHSRNV+ ITRII+P+GYSVSV NNI DV+VTFL V
Subjt:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV

Query:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW
        RSDGKEVTVNNKFLK NNP L  +  A   SS        +   +   ++ + + MG   +GPSP+VPCIIVGFLG+IIF PT  SIWES+E LLELGIW
Subjt:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW

Query:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL
        VAVIL+FLLLLVH LSIFFPVL  SSTFAVQH+SSPGYDADGFGFG G   LFL LLFLVLY LL
Subjt:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL

XP_022147101.1 chromo domain protein LHP1-like [Momordica charantia]1.37e-273100Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGNASVQNSSVQTPQVTQVGEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKW
        MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGNASVQNSSVQTPQVTQVGEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKW
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGNASVQNSSVQTPQVTQVGEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKW

Query:  HGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHE
        HGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHE
Subjt:  HGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHE

Query:  GEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFT
        GEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFT
Subjt:  GEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFT

Query:  KDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
        KDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
Subjt:  KDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL

XP_038877763.1 probable chromo domain-containing protein LHP1 isoform X1 [Benincasa hispida]2.57e-17267.7Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQ-VGE-----------DTSEQPKLDEGF
        MGRGKKK VGSS+ E  AL +P   FT STH NG  DS PS  +NNGN          +S+QNSSVQ P  T   GE             S +  LDEGF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQ-VGE-----------DTSEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL
        FE+E+IRRKRVRKGQLQYLVKW GWPET NTWEP +NLQ+CSEFIDEFEES    +SGKQR+RKRK+GD +N+P++EK+ ++LA+DNVTDV I  VDDRL
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL

Query:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC
        S+APLN K   DLP PQ PVD THEGEFGSHLN TKT  T++V+NG +DGK DG RK+DEYDL+L ELKA ISANMV+SD+ A AS D++LV D+SKADC
Subjt:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC

Query:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV
        +VGS Q SH IGAKRRKSSRVKRFTKD+ALSE+SEQ L+QNA T SIEPTD +KQ   EN SLSGHSRNV  ITRIIKP+GYSVSV NNI DV+VTFL V
Subjt:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV

Query:  RSDGKEVTVNNKFLKDNNPLL
        RSDGKEVTVNNKFLK NNP L
Subjt:  RSDGKEVTVNNKFLKDNNPLL

XP_038877764.1 chromo domain protein LHP1-like isoform X2 [Benincasa hispida]2.01e-17367.94Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQ-VGE-----------DTSEQPKLDEGF
        MGRGKKK VGSS+ E  AL +P   FT STH NG  DS PS  +NNGN          +S+QNSSVQ P  T   GE             S +  LDEGF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQ-VGE-----------DTSEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSA
        FE+E+IRRKRVRKGQLQYLVKW GWPET NTWEP +NLQ+CSEFIDEFEE  +SGKQR+RKRK+GD +N+P++EK+ ++LA+DNVTDV I  VDDRLS+A
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSA

Query:  PLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVG
        PLN K   DLP PQ PVD THEGEFGSHLN TKT  T++V+NG +DGK DG RK+DEYDL+L ELKA ISANMV+SD+ A AS D++LV D+SKADC+VG
Subjt:  PLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVG

Query:  STQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSD
        S Q SH IGAKRRKSSRVKRFTKD+ALSE+SEQ L+QNA T SIEPTD +KQ   EN SLSGHSRNV  ITRIIKP+GYSVSV NNI DV+VTFL VRSD
Subjt:  STQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSD

Query:  GKEVTVNNKFLKDNNPLL
        GKEVTVNNKFLK NNP L
Subjt:  GKEVTVNNKFLKDNNPLL

TrEMBL top hitse value%identityAlignment
A0A5A7U472 Chromo domain protein LHP1-like3.50e-18759.47Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF
        MGRGKKK  GSS+ E +AL  P   FT STH NG  DSAPS  +NNG+          +S+ N+SVQ P  T       GED        SE+  LDEGF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL
        FE+E+IRRKRVRK Q +++    GWPET NTWEP +NLQSC EFI+E+EE     +SGKQR+RKRK+GD ++  ++EK  +I+A+DNVTDV I+ +DDRL
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL

Query:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC
        S+APLN K+  DLP PQ P+D  HEGE                    +D K DG RK+DEYD++L +  A +S NMV+SD+   AS DVSLV D+SKADC
Subjt:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC

Query:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV
        +VGS QGSH  GAKRRKSSRVKRFTKD+ALSE   QGL+QNA T  IEPTDP++QL  +N S SGHSRNV+ ITRII+P+GYSVSV NNI DV+VTFL V
Subjt:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV

Query:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW
        RSDGKEVTVNNKFLK NNP L  +  A   SS        +   +   ++ + + MG   +GPSP+VPCIIVGFLG+IIF PT  SIWES+E LLELGIW
Subjt:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW

Query:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL
        VAVIL+FLLLLVH LSIFFPVL  SSTFAVQH+SSPGYDADGFGFG G   LFL LLFLVLY LL
Subjt:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL

A0A5D3D0R0 Chromo domain protein LHP1-like1.46e-19660.88Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF
        MGRGKKK  GSS+ E +AL  P   FT STH NG  DSAPS  +NNG+          +S+ N+SVQ P  T       GED        SE+  LDEGF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQV-----GEDT-------SEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL
        FE+E+IRRKRVRKGQLQYLVKW GWPET NTWEP +NLQSC EFI+E+EE     +SGKQR+RKRK+GD ++  ++EK  +I+A+DNVTDV I+ +DDRL
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESL---QSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL

Query:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC
        S+APLN K+  DLP PQ P+D  HEGE                    +D K DG RK+DEYD++L +  A +S NMV+SD+   AS DVSLV D+SKADC
Subjt:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC

Query:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV
        +VGS QGSH  GAKRRKSSRVKRFTKD+ALSE   QGL+QNA T  IEPTDP++QL  +N S SGHSRNV+ ITRII+P+GYSVSV NNI DV+VTFL V
Subjt:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV

Query:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW
        RSDGKEVTVNNKFLK NNP L  +  A   SS        +   +   ++ + + MG   +GPSP+VPCIIVGFLG+IIF PT  SIWES+E LLELGIW
Subjt:  RSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRER-MG--EAGPSPLVPCIIVGFLGMIIFGPTFVSIWESVESLLELGIW

Query:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL
        VAVIL+FLLLLVH LSIFFPVL  SSTFAVQH+SSPGYDADGFGFG G   LFL LLFLVLY LL
Subjt:  VAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL

A0A6J1D1C0 chromo domain protein LHP1-like6.64e-274100Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGNASVQNSSVQTPQVTQVGEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKW
        MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGNASVQNSSVQTPQVTQVGEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKW
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGNASVQNSSVQTPQVTQVGEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKW

Query:  HGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHE
        HGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHE
Subjt:  HGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHE

Query:  GEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFT
        GEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFT
Subjt:  GEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFT

Query:  KDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
        KDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
Subjt:  KDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL

A0A6J1GGE8 chromo domain-containing protein LHP1-like2.28e-15863.66Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQVGEDT------------SEQPKLDEGF
        MGRGKKK VGSS++EA+AL  PV GFTDST  NG  DSAPSN +NNGN          +S+QNSSVQTP +   G +             SE  KLD+GF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQVGEDT------------SEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQ---SGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL
        F +E+IRRKRVRKGQLQYLVKWHGWPETANTWEP +NLQSC+EFI+EFE+ ++   SGKQR+RKRK+GD  N+ ++EK+  ++A DNVT+V +S VDD L
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQ---SGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL

Query:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC
        S+ PLN  I +DLPTPQV +D T               GT NV+NG M  K DG R++DEYDL+L ELKA ISANMV+SD+ AE+SKD+ LV D SKADC
Subjt:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC

Query:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV
         VGSTQGSH IGAKRRKSSRVKRFTK+A  SE+S+  L+QN +  ++EPTD N+QL  EN SLSGHSRNVA I RIIKP+GYSVSVSNNI DV+VTFL V
Subjt:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV

Query:  RSDGKEVTVNNKFLKDNNPLL
        RSDGKEVTVNNKFLK NNPLL
Subjt:  RSDGKEVTVNNKFLKDNNPLL

A0A6J1IGX1 chromo domain-containing protein LHP1-like1.05e-15663.66Show/hide
Query:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQVGEDT------------SEQPKLDEGF
        MGRGKKK VGSS++EA+AL  PV GFTDST  NG  DSAPSN +NNGN          +S+QNSSVQTP +   G +             SE  KLD+GF
Subjt:  MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGN----------ASVQNSSVQTPQVTQVGEDT------------SEQPKLDEGF

Query:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQ---SGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL
        F +E+IRRKRVRKGQLQYLVKWHGWPETANTWEP +NLQSC+EFI+EFE+ ++   SGKQR+RKRK+GD  N+ ++EK+  ++A DNVT+V +S VDD L
Subjt:  FEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQ---SGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRL

Query:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC
        S+ PLN  I +DLPTPQV +D T               GT NV+NG M  K DG RK+DEYDL+L EL A ISANMV+SD  AE+SKD+ LV D SKADC
Subjt:  SSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADC

Query:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV
         VGSTQGSH IGAKRRKSSRVKRFTK+AA +E+SEQ L+QN VT ++   D N+Q+  EN SLSGHSRNVA I RIIKP+GYSVSVSNNI DV+VTFL V
Subjt:  MVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVV

Query:  RSDGKEVTVNNKFLKDNNPLL
        RSDGKEVTVNNKFLK NNPLL
Subjt:  RSDGKEVTVNNKFLKDNNPLL

SwissProt top hitse value%identityAlignment
O00257 E3 SUMO-protein ligase CBX43.7e-0955.32Show/hide
Query:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENL
        E P + E  F +ESI +KR+RKG+++YLVKW GW    NTWEP EN+
Subjt:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENL

O55187 E3 SUMO-protein ligase CBX43.7e-0955.32Show/hide
Query:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENL
        E P + E  F +ESI +KR+RKG+++YLVKW GW    NTWEP EN+
Subjt:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENL

Q339W7 Probable chromo domain-containing protein LHP12.2e-3033.53Show/hide
Query:  PKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQ-RRRKRK------NGDSQNRPKKEKRREILAVDNVT
        PKL EG++EIE IRR+R+RKG+LQYLVKW GWPE+ANTWEP ENL +CS+ ID FE  LQS +  R+RKRK       G + +  K+ + R         
Subjt:  PKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQ-RRRKRK------NGDSQNRPKKEKRREILAVDNVT

Query:  DVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSD--GRRKKDEYDLQLSELKAEISANMVNSDENAEASK
           +       + AP   ++P      +   + + +   G   + +     +  +N   +G S    R    E  L +     +   ++VN   N+E   
Subjt:  DVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKNGTMDGKSD--GRRKKDEYDLQLSELKAEISANMVNSDENAEASK

Query:  DVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQG---LEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSV
          +LV         V  +QG    GAK+RKS  V+RF ++       E G   + ++  +   E  D  K     N            IT+IIKP+ ++ 
Subjt:  DVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQG---LEQNAVTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSV

Query:  SVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
        +V+N++Q V +TF  +RSDG+EV V++K LK NNPLL
Subjt:  SVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL

Q944N1 Chromo domain protein LHP11.9e-4236.15Show/hide
Query:  NGNGDSAPSNCDNNGNA----SVQNSSVQTPQVTQV------GEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEF
        NG  + A S C+    A           + P+V +V      G     +PKL EGF+EIE++RR+R  KG++ YL+KW GWPE+ANTWEP  NL SC++ 
Subjt:  NGNGDSAPSNCDNNGNA----SVQNSSVQTPQVTQV------GEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEF

Query:  IDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREI---LAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVK
        ID +EESL+SGK RRRKRK G +Q  P  +++R     +A  N   V + ++++   S PLN+    DL      VD       GS LN +K +  VN  
Subjt:  IDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREI---LAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVK

Query:  NGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKD--AALSEDSEQGLEQNA
             G     R+++E +L+LSELK   S N    D +        L N   K +      Q   C GAK+RKS  V+RF ++  +A+ +D++  L    
Subjt:  NGTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKD--AALSEDSEQGLEQNA

Query:  VTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
        +   ++    N  +  ++      S++   IT+++ P+ Y  S SN++ DV VTF+  R+DG  V V+NKFLK NNPLL
Subjt:  VTASIEPTDPNKQLALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL

Q946J8 Chromo domain-containing protein LHP11.5e-4238.84Show/hide
Query:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQ-RRRKRKNGDSQNRPKKEKRREILAVDNVTDVDI
        E+PKLDEGF+EIE+IRRKRVRKG++QYL+KW GWPETANTWEP ENLQS ++ ID FE SL+ GK  R+RKRK     ++ KK++R        +T    
Subjt:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQ-RRRKRKNGDSQNRPKKEKRREILAVDNVTDVDI

Query:  SMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHL--NDTKTNGTVNVKNGTMDGKSDGR--RKKDEYDLQLSELKAEI-SANMVNSDENAEASKD
           +   SS  LN     D+P    P+D +        +   +   +  V   +G++      R    + EYD  L+EL+  + ++N     +      +
Subjt:  SMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHL--NDTKTNGTVNVKNGTMDGKSDGR--RKKDEYDLQLSELKAEI-SANMVNSDENAEASKD

Query:  VSLV--NDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLA---------LENSSLSGHSR-NVAAITRI
           V  N L K        + S  IGAKRRKS  VKRF +D + S +     +QN +T  +   D   ++A         +EN +LS  ++     IT+I
Subjt:  VSLV--NDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLA---------LENSSLSGHSR-NVAAITRI

Query:  IKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
        +KP+ ++ SVS+N+Q+VLVTFL +RSDGKE  V+N+FLK +NP L
Subjt:  IKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL

Arabidopsis top hitse value%identityAlignment
AT5G17690.1 like heterochromatin protein (LHP1)1.0e-4338.84Show/hide
Query:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQ-RRRKRKNGDSQNRPKKEKRREILAVDNVTDVDI
        E+PKLDEGF+EIE+IRRKRVRKG++QYL+KW GWPETANTWEP ENLQS ++ ID FE SL+ GK  R+RKRK     ++ KK++R        +T    
Subjt:  EQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTWEPSENLQSCSEFIDEFEESLQSGKQ-RRRKRKNGDSQNRPKKEKRREILAVDNVTDVDI

Query:  SMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHL--NDTKTNGTVNVKNGTMDGKSDGR--RKKDEYDLQLSELKAEI-SANMVNSDENAEASKD
           +   SS  LN     D+P    P+D +        +   +   +  V   +G++      R    + EYD  L+EL+  + ++N     +      +
Subjt:  SMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHL--NDTKTNGTVNVKNGTMDGKSDGR--RKKDEYDLQLSELKAEI-SANMVNSDENAEASKD

Query:  VSLV--NDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLA---------LENSSLSGHSR-NVAAITRI
           V  N L K        + S  IGAKRRKS  VKRF +D + S +     +QN +T  +   D   ++A         +EN +LS  ++     IT+I
Subjt:  VSLV--NDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQLA---------LENSSLSGHSR-NVAAITRI

Query:  IKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL
        +KP+ ++ SVS+N+Q+VLVTFL +RSDGKE  V+N+FLK +NP L
Subjt:  IKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCGAGGGAAGAAGAAGGTGGTAGGAAGCTCCGATTCTGAGGCATTGGCGCTTGCGCTTCCAGTTCTTGGCTTCACTGATTCTACTCATGGCAATGGCAATGGAGA
TTCAGCTCCCTCGAACTGTGACAATAATGGAAACGCTTCGGTTCAGAACAGTTCAGTGCAGACTCCACAAGTGACCCAAGTCGGAGAAGACACTTCTGAGCAACCAAAGC
TCGACGAAGGCTTCTTCGAAATTGAATCTATTCGGCGTAAGAGAGTTCGAAAGGGGCAGCTTCAGTACCTCGTCAAATGGCATGGCTGGCCAGAGACTGCCAATACATGG
GAACCCTCGGAGAATCTCCAGTCATGCTCTGAATTTATTGATGAATTTGAAGAAAGCTTGCAATCAGGAAAGCAGCGTAGGCGCAAGCGCAAGAATGGGGATAGCCAAAA
TCGACCTAAGAAGGAAAAACGGCGTGAAATCCTAGCTGTTGACAATGTCACAGATGTAGATATCAGTATGGTGGATGATCGCCTATCATCTGCTCCTCTAAACATAAAAA
TTCCTTTTGATCTTCCTACTCCTCAAGTACCTGTAGACTTTACTCATGAAGGAGAGTTTGGCAGCCATCTTAATGATACCAAAACTAATGGAACAGTTAATGTTAAAAAT
GGAACTATGGATGGGAAATCTGATGGAAGAAGAAAGAAAGATGAATACGATCTTCAACTTAGTGAGCTCAAGGCAGAAATTTCTGCCAATATGGTCAATTCTGATGAAAA
TGCTGAGGCTTCTAAAGATGTTAGCCTTGTTAATGATCTTTCCAAGGCTGATTGCATGGTGGGTTCCACTCAGGGAAGTCACTGCATTGGAGCCAAGAGAAGGAAGTCTA
GTAGGGTGAAAAGGTTCACTAAGGATGCAGCCTTATCTGAAGACTCTGAACAGGGATTAGAGCAAAATGCAGTGACTGCAAGTATTGAGCCTACTGACCCAAACAAACAA
TTAGCGCTCGAGAATTCTAGTTTGTCAGGTCACTCCAGAAATGTGGCTGCTATCACAAGGATTATCAAGCCTATTGGTTATTCAGTTTCAGTATCAAATAACATCCAGGA
TGTACTTGTAACCTTCTTGGTTGTGAGGTCTGATGGAAAGGAAGTGACAGTGAATAACAAATTTCTTAAGGATAACAATCCACTTCTGCATGGAAATGGTTCAGCTTTTG
AAAAGTCATCTCCTACCACCACCAATAAGCCTCCTCAATATCTCTATGCTACAACCCAGGTTCTGATTAGAGAGAGAATGGGAGAGGCAGGGCCATCTCCTCTTGTACCT
TGTATCATAGTTGGATTTCTGGGGATGATAATTTTTGGGCCAACTTTTGTATCCATTTGGGAGAGTGTAGAGTCCCTACTTGAACTGGGTATTTGGGTTGCAGTGATTCT
TGTTTTCCTTTTACTGCTTGTGCACTTGCTTTCTATTTTCTTTCCTGTGCTTCAGGTTTCATCCACTTTTGCAGTTCAGCATACCAGCAGCCCTGGGTATGATGCTGACG
GATTCGGTTTCGGTTTCGGGTTAGGAACTCTGTTTCTGGTTCTTCTCTTCCTTGTGCTCTATAATCTATTG
mRNA sequenceShow/hide mRNA sequence
AGAGAGGGAAAGGACTAAGGAGGGATAAAACGAAAATGGGGCGAGGGAAGAAGAAGGTGGTAGGAAGCTCCGATTCTGAGGCATTGGCGCTTGCGCTTCCAGTTCTTGGC
TTCACTGATTCTACTCATGGCAATGGCAATGGAGATTCAGCTCCCTCGAACTGTGACAATAATGGAAACGCTTCGGTTCAGAACAGTTCAGTGCAGACTCCACAAGTGAC
CCAAGTCGGAGAAGACACTTCTGAGCAACCAAAGCTCGACGAAGGCTTCTTCGAAATTGAATCTATTCGGCGTAAGAGAGTTCGAAAGGGGCAGCTTCAGTACCTCGTCA
AATGGCATGGCTGGCCAGAGACTGCCAATACATGGGAACCCTCGGAGAATCTCCAGTCATGCTCTGAATTTATTGATGAATTTGAAGAAAGCTTGCAATCAGGAAAGCAG
CGTAGGCGCAAGCGCAAGAATGGGGATAGCCAAAATCGACCTAAGAAGGAAAAACGGCGTGAAATCCTAGCTGTTGACAATGTCACAGATGTAGATATCAGTATGGTGGA
TGATCGCCTATCATCTGCTCCTCTAAACATAAAAATTCCTTTTGATCTTCCTACTCCTCAAGTACCTGTAGACTTTACTCATGAAGGAGAGTTTGGCAGCCATCTTAATG
ATACCAAAACTAATGGAACAGTTAATGTTAAAAATGGAACTATGGATGGGAAATCTGATGGAAGAAGAAAGAAAGATGAATACGATCTTCAACTTAGTGAGCTCAAGGCA
GAAATTTCTGCCAATATGGTCAATTCTGATGAAAATGCTGAGGCTTCTAAAGATGTTAGCCTTGTTAATGATCTTTCCAAGGCTGATTGCATGGTGGGTTCCACTCAGGG
AAGTCACTGCATTGGAGCCAAGAGAAGGAAGTCTAGTAGGGTGAAAAGGTTCACTAAGGATGCAGCCTTATCTGAAGACTCTGAACAGGGATTAGAGCAAAATGCAGTGA
CTGCAAGTATTGAGCCTACTGACCCAAACAAACAATTAGCGCTCGAGAATTCTAGTTTGTCAGGTCACTCCAGAAATGTGGCTGCTATCACAAGGATTATCAAGCCTATT
GGTTATTCAGTTTCAGTATCAAATAACATCCAGGATGTACTTGTAACCTTCTTGGTTGTGAGGTCTGATGGAAAGGAAGTGACAGTGAATAACAAATTTCTTAAGGATAA
CAATCCACTTCTGCATGGAAATGGTTCAGCTTTTGAAAAGTCATCTCCTACCACCACCAATAAGCCTCCTCAATATCTCTATGCTACAACCCAGGTTCTGATTAGAGAGA
GAATGGGAGAGGCAGGGCCATCTCCTCTTGTACCTTGTATCATAGTTGGATTTCTGGGGATGATAATTTTTGGGCCAACTTTTGTATCCATTTGGGAGAGTGTAGAGTCC
CTACTTGAACTGGGTATTTGGGTTGCAGTGATTCTTGTTTTCCTTTTACTGCTTGTGCACTTGCTTTCTATTTTCTTTCCTGTGCTTCAGGTTTCATCCACTTTTGCAGT
TCAGCATACCAGCAGCCCTGGGTATGATGCTGACGGATTCGGTTTCGGTTTCGGGTTAGGAACTCTGTTTCTGGTTCTTCTCTTCCTTGTGCTCTATAATCTATTG
Protein sequenceShow/hide protein sequence
MGRGKKKVVGSSDSEALALALPVLGFTDSTHGNGNGDSAPSNCDNNGNASVQNSSVQTPQVTQVGEDTSEQPKLDEGFFEIESIRRKRVRKGQLQYLVKWHGWPETANTW
EPSENLQSCSEFIDEFEESLQSGKQRRRKRKNGDSQNRPKKEKRREILAVDNVTDVDISMVDDRLSSAPLNIKIPFDLPTPQVPVDFTHEGEFGSHLNDTKTNGTVNVKN
GTMDGKSDGRRKKDEYDLQLSELKAEISANMVNSDENAEASKDVSLVNDLSKADCMVGSTQGSHCIGAKRRKSSRVKRFTKDAALSEDSEQGLEQNAVTASIEPTDPNKQ
LALENSSLSGHSRNVAAITRIIKPIGYSVSVSNNIQDVLVTFLVVRSDGKEVTVNNKFLKDNNPLLHGNGSAFEKSSPTTTNKPPQYLYATTQVLIRERMGEAGPSPLVP
CIIVGFLGMIIFGPTFVSIWESVESLLELGIWVAVILVFLLLLVHLLSIFFPVLQVSSTFAVQHTSSPGYDADGFGFGFGLGTLFLVLLFLVLYNLL