CuGenDBv2

CmoCh08G009600 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh08G009600
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: chromo domain-containing protein LHP1-like
Genome location: Cmo_Chr08:6226570..6229815
RNA-Seq Expression: CmoCh08G009600
Synteny: CmoCh08G009600
Gene Ontology terms:
GO:0031507 - heterochromatin assembly (biological process)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
InterPro domains:
IPR000953 - Chromo/chromo shadow domain
IPR008251 - Chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR017984 - Chromo domain subgroup
IPR023779 - Chromo domain, conserved site
IPR023780 - Chromo domain
IPR044251 - Chromo domain-containing protein LHP1


Homology
GenBank top hits | e value | % identity | Alignment
KAG6593747.1 Chromo domain-containing protein LHP1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.6e-215 | % identity: 89.5
Query:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
        MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNS    NNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
Subjt:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY

Query:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
        EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE            RSLMTGKQRKRKRKHGVVH QTKKRQQGANFSTYNVTDVE
Subjt:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE

Query:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
        ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGD                       KDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
Subjt:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD

Query:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP
        NNGPAA LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIG+ENSGYRGESFIRNN TDDARNELRIIKIIKP
Subjt:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP

Query:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        LGYSASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

KAG7026079.1 Chromo domain-containing protein LHP1, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 7.3e-210 | % identity: 94.92
Query:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
        MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNS    NNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
Subjt:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY

Query:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
        EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE            RSLMTGKQRKRKRKHGVVH QTKKRQQGANFSTYNVTDVE
Subjt:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE

Query:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
        ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
Subjt:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD

Query:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP
        NNG AA LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIG+ENSGYRGESFIRNN TDDARNELRIIKIIKP
Subjt:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP

Query:  LGYSASVSNNIQD
        LGYSASVSNNIQD
Subjt:  LGYSASVSNNIQD

XP_022964471.1 chromo domain-containing protein LHP1-like [Cucurbita moschata] | e value: 1.7e-238 | % identity: 96.27
Query:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
        MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
Subjt:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY

Query:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
        EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE            RSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
Subjt:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE

Query:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
        ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
Subjt:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD

Query:  NNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPL
        NNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPL
Subjt:  NNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPL

Query:  GYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        GYSASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  GYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

XP_022999903.1 chromo domain-containing protein LHP1-like [Cucurbita maxima] | e value: 3.7e-230 | % identity: 93
Query:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
        MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNS++NNNNN+AIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
Subjt:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY

Query:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
        EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE            RSL TGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
Subjt:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE

Query:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
        ISVLDQRLPSAPINMSSLTHHY HSQ+IVCNHEGDKNGD+IALERGKKI+IDNVGKN TQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEART D
Subjt:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD

Query:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP
        NNGPAA LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIG+ENSGY GE FIRNNKTD+ARNELRIIKIIKP
Subjt:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP

Query:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        LGYSASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

XP_023514742.1 chromo domain-containing protein LHP1-like [Cucurbita pepo subsp. pepo] | e value: 7.5e-231 | % identity: 93.87
Query:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
        MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNS    NNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADF+AQRTNLDDGFY
Subjt:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY

Query:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
        EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE            RSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
Subjt:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE

Query:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
        ISVLDQRLPSAPINM+SLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEART D
Subjt:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD

Query:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP
        NNGPAA LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIG+ENSGY GE FIRNNKTDDARNELRIIKIIKP
Subjt:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP

Query:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        LGYSASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CBU8 chromo domain-containing protein LHP1-like | e value: 4.4e-176 | % identity: 74.58
Query:  MKIKGS-GRKKSASGSTV--------AMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGEL-DDGGDGEEEDEHDGDEADFAA
        MK KG  GRKKSAS S          AMD GE VHGG+SDYAN N NNN  N +     EPS SHL++THQ+QNL E  DD G+G+EEDE DGDEA FA+
Subjt:  MKIKGS-GRKKSASGSTV--------AMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGEL-DDGGDGEEEDEHDGDEADFAA

Query:  QRTNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKR--QQG
        QRTNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPE ANTWEPLENLHTCSDFI+AFE            +SLMTGKQRKRKRKHGVVHTQTKKR  QQ 
Subjt:  QRTNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKR--QQG

Query:  ANFSTYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQR---KKDEHEYDPKLSELKATVLTNVAI
         +FS YNVTDVEISV+DQRLPSAP+N+SSLT+ YAHSQ++V NHEG+KNGDV A+ERGK+ DIDN+G+NATQR   KKDEHEYDPKLSELKATVLTN+A+
Subjt:  ANFSTYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQR---KKDEHEYDPKLSELKATVLTNVAI

Query:  GDKLAVNFQEARTTDNNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKT
         DK  +N Q+ARTT+NNG AA LSK   VE   DNRC GA+RRKSGSV+RFR DSTLSELP +Q+AELTLAVVESG +VEPIG+ENSGY GES  RNNKT
Subjt:  GDKLAVNFQEARTTDNNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKT

Query:  DDARNELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        DDARNE  IIKIIKPLGYSASVSNN+QDVLVT     SDGTEV+VDNKFLKA NPLLLINFYEQHLRYTTRS
Subjt:  DDARNELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A6J1H4P7 chromo domain-containing protein LHP1-like | e value: 1.4e-177 | % identity: 75.21
Query:  MKIKGSGRKKSASGSTV--------AMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQR
        MKIKG  RKKSASGST         AMDGGE VHGGDSDY +MN+NNN  N       EPSTSHLS+THQEQNL E D   + +E+DE DG EA F AQR
Subjt:  MKIKGSGRKKSASGSTV--------AMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQR

Query:  TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFS
        TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCS+FI+AFE             SL++GKQRKRKRKHGVVH QTKKRQQ ANFS
Subjt:  TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFS

Query:  TYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQR---KKDEHEYDPKLSELKATVLTNVAIGDKL
        TYNVTDVEISV+DQRLPSAPIN+ SLT+ YA S+++V NHEG+KNGDV A+ERG+  DIDN G+NATQR   KK EHEYDPKLSELKATVLTN+AIGDKL
Subjt:  TYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQR---KKDEHEYDPKLSELKATVLTNVAIGDKL

Query:  AVNFQEARTTDNNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDAR
        A++F +ARTT+NN PAA LSK+G+VE   +NRC GAKRRKSGSVKRFRQDSTLSELPV QN ELTLAVVESG   EPIG+ENSGY GES   NN+TD+AR
Subjt:  AVNFQEARTTDNNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDAR

Query:  NELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        NE  IIKI+KPLG SASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHL YTTRS
Subjt:  NELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A6J1HIZ9 chromo domain-containing protein LHP1-like | e value: 8.1e-239 | % identity: 96.27
Query:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
        MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
Subjt:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY

Query:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
        EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE            RSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
Subjt:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE

Query:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
        ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
Subjt:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD

Query:  NNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPL
        NNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPL
Subjt:  NNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPL

Query:  GYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        GYSASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  GYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A6J1KL10 chromo domain-containing protein LHP1-like | e value: 1.8e-230 | % identity: 93
Query:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
        MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNS++NNNNN+AIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY
Subjt:  MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFY

Query:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
        EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE            RSL TGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE
Subjt:  EIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVE

Query:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD
        ISVLDQRLPSAPINMSSLTHHY HSQ+IVCNHEGDKNGD+IALERGKKI+IDNVGKN TQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEART D
Subjt:  ISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTD

Query:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP
        NNGPAA LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIG+ENSGY GE FIRNNKTD+ARNELRIIKIIKP
Subjt:  NNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKP

Query:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        LGYSASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  LGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A6J1KVS2 chromo domain-containing protein LHP1-like | e value: 1.2e-178 | % identity: 75.64
Query:  MKIKGSGRKKSASGSTV--------AMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQR
        MKIKG  RKKSASGST         AMDGGE VHGGDSDY +MN+NNN  N       EPSTSHLS+THQEQNL E D   + +E+DE DG EA FAAQR
Subjt:  MKIKGSGRKKSASGSTV--------AMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQR

Query:  TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFS
        TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCS+FI+AFE            +SL++GKQRKRKRKHGVVH QTKKRQQ ANFS
Subjt:  TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFS

Query:  TYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQR---KKDEHEYDPKLSELKATVLTNVAIGDKL
        TYNVTDVEISV+DQRLPSAPINM SLT+ YA S+++V NHEG+KNGDV A+ERG+  DIDNVG+NATQR   KK EHEYDPKLSELKAT LTN+AIGDKL
Subjt:  TYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQR---KKDEHEYDPKLSELKATVLTNVAIGDKL

Query:  AVNFQEARTTDNNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDAR
        A++F +ARTT+NN  AA LSK+G+VE   +NRC GAKRRKSGSVKRFRQDSTLSELPV QN ELTLAVVES   VEPIG+ENSGY GE   RNN+TD+AR
Subjt:  AVNFQEARTTDNNGPAA-LSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDAR

Query:  NELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS
        NE  IIKI+KPLGYSASVSNNIQDVLVT     SDGTEVMVDNKFLKANNPLLLINFYEQHL YTTRS
Subjt:  NELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRYTTRS

SwissProt top hits | e value | % identity | Alignment
O95931 Chromobox protein homolog 7 | e value: 9.9e-08 | % identity: 55.26
Query:  YEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENL
        + +E+IR+KRVRKG+++YL+KW+GWP   +TWEP E++
Subjt:  YEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENL

P05205 Heterochromatin protein 1 | e value: 4.4e-08 | % identity: 41.79
Query:  DGDEADFAAQRTNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE
        D  E+         ++  Y +E I  +RVRKG+++Y +KW+G+PET NTWEP  NL  C D I  +E
Subjt:  DGDEADFAAQRTNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFE

Q339W7 Probable chromo domain-containing protein LHP1 | e value: 2.4e-38 | % identity: 33.49
Query:  QEQNLGELDDGGDGEEEDEHD-------GDEADFAAQR----------------------TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWE
        +EQ  GE ++  +GEEE+E +       G+E++ AA                          L +G+YEIE IRR+R+RKG+LQYL+KWRGWPE+ANTWE
Subjt:  QEQNLGELDDGGDGEEEDEHD-------GDEADFAAQR----------------------TNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWE

Query:  PLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNH
        PLENL  CSD IDAFE R          +S   G++RKRK         T     G+N S        +         AP     L    +  +A  C+ 
Subjt:  PLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNH

Query:  EGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSV
        +      V  L+    +  + + +N  Q        +   S +  T    + +  +L     E      NG +       V  +   +  GAK+RKSG+V
Subjt:  EGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSV

Query:  KRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNK
        +RF Q+      P     E    VV      E +G    G  G+      KT+   N + I KIIKP+ ++A+V+N++Q V +T     SDG EVMVD+K
Subjt:  KRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNK

Query:  FLKANNPLLLINFYEQHLRYTTRS
         LKANNPLLLI++YEQ LRY   S
Subjt:  FLKANNPLLLINFYEQHLRYTTRS

Q944N1 Chromo domain protein LHP1 | e value: 1.7e-47 | % identity: 35.87
Query:  SKTHQEQNLGELDDGGDGEEEDE-------HDGDEADFAAQ-RTNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFER
        SK  +E   GE +  G+GEE+DE         GD    A + +  L +GFYEIE +RR+R  KG++ YLIKWRGWPE+ANTWEP  NL +C+D IDA+E 
Subjt:  SKTHQEQNLGELDDGGDGEEEDE-------HDGDEADFAAQ-RTNLDDGFYEIEAIRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFER

Query:  RYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQ---GANFSTYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERG
                    SL +GK R+RKRK G   T    +QQ    A  +TYN   V++ ++++  PS P+N+   T               D NG  +     
Subjt:  RYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQ---GANFSTYNVTDVEISVLDQRLPSAPINMSSLTHHYAHSQAIVCNHEGDKNGDVIALERG

Query:  KKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPV
            +D V      R ++++E + KLSELK    TN   G+ + +       + N       K    E    +RC GAK+RKSG V+RF++++T +    
Subjt:  KKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSELPV

Query:  AQNAELTLAVVESGAQVEPIG-IENSGYRGESFIRNNKTDDARNELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINF
         Q+A      +  G    P+      G      +     DD+++   I +++ P+ Y AS SN++ DV VT     +DG  V+VDNKFLK NNPLLLINF
Subjt:  AQNAELTLAVVESGAQVEPIG-IENSGYRGESFIRNNKTDDARNELRIIKIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINF

Query:  YEQHLRY
        YE+++RY
Subjt:  YEQHLRY

Q946J8 Chromo domain-containing protein LHP1 | e value: 2.7e-58 | % identity: 39.52
Query:  GSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFYEIEA
        G  RK S  G   + DGG    GG  +       ++    +                +E+   + DDGGD EE++E +G+      +R  LD+GFYEIEA
Subjt:  GSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFYEIEA

Query:  IRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQ-RKRKRKHGVVHTQTKKRQQ--GANFSTYNVTDVEI
        IRRKRVRKG++QYLIKWRGWPETANTWEPLENL + +D IDAFE             SL  GK  RKRKRK+   H+Q KK+Q+    +      +D   
Subjt:  IRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQ-RKRKRKHGVVHTQTKKRQQ--GANFSTYNVTDVEI

Query:  SVLDQRLPSAP----INMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEAR
        S+ +  LP  P    ++ SSL +    ++    +++ + N   + + R  ++ IDN           E EYDP L+EL+  V  N + G   +       
Subjt:  SVLDQRLPSAP----INMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEAR

Query:  TTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSE---LPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRII
          DN  P  L K    E   ++R IGAKRRKSGSVKRF+QD + S     P  QN    L  ++S  ++  +G E  G   E+   + KT     EL I 
Subjt:  TTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSE---LPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRII

Query:  KIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRY
        KI+KP+ ++ASVS+N+Q+VLVT     SDG E +VDN+FLKA+NP LLI FYEQHL+Y
Subjt:  KIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRY

Arabidopsis top hits | e value | % identity | Alignment
AT5G17690.1 like heterochromatin protein (LHP1) | e value: 1.9e-59 | % identity: 39.52
Query:  GSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFYEIEA
        G  RK S  G   + DGG    GG  +       ++    +                +E+   + DDGGD EE++E +G+      +R  LD+GFYEIEA
Subjt:  GSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFYEIEA

Query:  IRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQ-RKRKRKHGVVHTQTKKRQQ--GANFSTYNVTDVEI
        IRRKRVRKG++QYLIKWRGWPETANTWEPLENL + +D IDAFE             SL  GK  RKRKRK+   H+Q KK+Q+    +      +D   
Subjt:  IRRKRVRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQ-RKRKRKHGVVHTQTKKRQQ--GANFSTYNVTDVEI

Query:  SVLDQRLPSAP----INMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEAR
        S+ +  LP  P    ++ SSL +    ++    +++ + N   + + R  ++ IDN           E EYDP L+EL+  V  N + G   +       
Subjt:  SVLDQRLPSAP----INMSSLTHHYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEAR

Query:  TTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSE---LPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRII
          DN  P  L K    E   ++R IGAKRRKSGSVKRF+QD + S     P  QN    L  ++S  ++  +G E  G   E+   + KT     EL I 
Subjt:  TTDNNGPAALSKSGTVEATADNRCIGAKRRKSGSVKRFRQDSTLSE---LPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRII

Query:  KIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRY
        KI+KP+ ++ASVS+N+Q+VLVT     SDG E +VDN+FLKA+NP LLI FYEQHL+Y
Subjt:  KIIKPLGYSASVSNNIQDVLVT-----SDGTEVMVDNKFLKANNPLLLINFYEQHLRY


Sequences
CDS sequence
ATGAAAATCAAAGGGAGTGGAAGGAAGAAAAGTGCGAGCGGTTCAACGGTGGCCATGGATGGTGGTGAAGTCGTCCATGGAGGCGACTCGGATTACGCTAATATGAACAG
CAACAACAACAACAACAACAACAACGCCATTATTAGCGTTGAGCCTTCGACTTCACATTTATCGAAGACCCATCAAGAACAAAACCTAGGAGAGCTTGATGACGGCGGCG
ATGGGGAGGAAGAAGATGAACACGACGGCGATGAGGCTGATTTTGCTGCTCAGAGAACCAATCTCGATGATGGGTTCTATGAAATTGAAGCAATTCGTCGAAAAAGAGTT
CGTAAGGGACAGCTTCAGTACTTGATCAAATGGCGAGGCTGGCCGGAGACGGCTAATACTTGGGAGCCCTTGGAGAATCTCCATACGTGCTCCGATTTCATCGACGCATT
TGAACGGAGGTATAATGTTGTTGAAATTCGGTTTTGTTTTCGAAGTTTAATGACAGGAAAGCAACGGAAGCGAAAGAGAAAACACGGGGTTGTTCATACTCAAACTAAGA
AAAGGCAGCAGGGAGCTAACTTTTCTACTTACAATGTCACAGATGTTGAAATCAGTGTTCTTGATCAACGTCTGCCGTCTGCTCCGATAAACATGTCTAGCCTTACTCAT
CATTACGCTCATTCGCAAGCGATTGTTTGTAATCACGAAGGAGATAAGAATGGAGATGTAATAGCTCTTGAAAGAGGCAAGAAAATCGATATTGATAACGTGGGGAAGAA
TGCTACTCAACGAAAGAAAGATGAACATGAGTATGATCCGAAACTTAGTGAGCTTAAGGCAACAGTATTAACGAACGTAGCCATTGGCGATAAGCTTGCAGTCAATTTTC
AAGAAGCTAGGACGACGGATAACAATGGCCCGGCAGCTCTTTCTAAAAGTGGCACTGTGGAAGCAACCGCTGACAATCGGTGCATTGGGGCTAAGAGAAGGAAGTCGGGT
TCGGTTAAAAGGTTTAGACAAGATTCAACTTTATCTGAACTACCTGTGGCTCAAAATGCAGAATTGACATTGGCTGTAGTAGAATCCGGTGCTCAAGTAGAACCGATTGG
GATCGAGAATTCTGGATATCGTGGGGAAAGTTTCATTCGCAATAACAAGACCGATGACGCCAGAAACGAGCTGCGAATCATCAAGATTATCAAACCGTTAGGCTATTCTG
CATCTGTGTCGAACAACATTCAGGACGTGTTGGTAACGTCTGATGGAACCGAAGTGATGGTCGATAACAAGTTCCTCAAGGCTAACAATCCGCTACTGTTGATCAACTTC
TACGAGCAACATCTCCGCTATACGACTAGATCATGA
mRNA sequence
AAAAAAAAAAAAAAAAGGAACAAGTTCATCTTATTCCCATTTCAACTCACGCAATATCATCTTGAATTCGGTTCACCAACATCAACGCTGTGGAGTTAGAACAGACGGCG
TAATCCGAAATTGGGAGCAAAAGAACGACACCCACATCAGATTTGGCAGAAAAAAGAGAGGCAAAATATAAAACGTTTCATACAGGAAAAGGAAAATTTCAATAAATGTG
GCACCTCAGTCCATTTTGTAGTGACATTTATCTTCGTTTTGGAAAAATGTTGAGAACCCACATAAACCCTTGAGCGTAGGTGAATTTGCAGTGGAATTTTGCGGGTTCCC
AATCTCCAAAAACCCACAAAACCCCTTGAAATTAAAGAGCATGTGCGTTTTATCTTCAATTTCTAAGTTCGTAAGGAGAAGTACAGAGGGAGAGAGGATTTTAGAATGAA
AATCAAAGGGAGTGGAAGGAAGAAAAGTGCGAGCGGTTCAACGGTGGCCATGGATGGTGGTGAAGTCGTCCATGGAGGCGACTCGGATTACGCTAATATGAACAGCAACA
ACAACAACAACAACAACAACGCCATTATTAGCGTTGAGCCTTCGACTTCACATTTATCGAAGACCCATCAAGAACAAAACCTAGGAGAGCTTGATGACGGCGGCGATGGG
GAGGAAGAAGATGAACACGACGGCGATGAGGCTGATTTTGCTGCTCAGAGAACCAATCTCGATGATGGGTTCTATGAAATTGAAGCAATTCGTCGAAAAAGAGTTCGTAA
GGGACAGCTTCAGTACTTGATCAAATGGCGAGGCTGGCCGGAGACGGCTAATACTTGGGAGCCCTTGGAGAATCTCCATACGTGCTCCGATTTCATCGACGCATTTGAAC
GGAGGTATAATGTTGTTGAAATTCGGTTTTGTTTTCGAAGTTTAATGACAGGAAAGCAACGGAAGCGAAAGAGAAAACACGGGGTTGTTCATACTCAAACTAAGAAAAGG
CAGCAGGGAGCTAACTTTTCTACTTACAATGTCACAGATGTTGAAATCAGTGTTCTTGATCAACGTCTGCCGTCTGCTCCGATAAACATGTCTAGCCTTACTCATCATTA
CGCTCATTCGCAAGCGATTGTTTGTAATCACGAAGGAGATAAGAATGGAGATGTAATAGCTCTTGAAAGAGGCAAGAAAATCGATATTGATAACGTGGGGAAGAATGCTA
CTCAACGAAAGAAAGATGAACATGAGTATGATCCGAAACTTAGTGAGCTTAAGGCAACAGTATTAACGAACGTAGCCATTGGCGATAAGCTTGCAGTCAATTTTCAAGAA
GCTAGGACGACGGATAACAATGGCCCGGCAGCTCTTTCTAAAAGTGGCACTGTGGAAGCAACCGCTGACAATCGGTGCATTGGGGCTAAGAGAAGGAAGTCGGGTTCGGT
TAAAAGGTTTAGACAAGATTCAACTTTATCTGAACTACCTGTGGCTCAAAATGCAGAATTGACATTGGCTGTAGTAGAATCCGGTGCTCAAGTAGAACCGATTGGGATCG
AGAATTCTGGATATCGTGGGGAAAGTTTCATTCGCAATAACAAGACCGATGACGCCAGAAACGAGCTGCGAATCATCAAGATTATCAAACCGTTAGGCTATTCTGCATCT
GTGTCGAACAACATTCAGGACGTGTTGGTAACGTCTGATGGAACCGAAGTGATGGTCGATAACAAGTTCCTCAAGGCTAACAATCCGCTACTGTTGATCAACTTCTACGA
GCAACATCTCCGCTATACGACTAGATCATGAATAGCAAAACGAACGAACGTATCCTCCACCGTGGTCACGTCTCCTTCTTGCTGTGAAGAAACGAGATAAGAAGTTCGAG
GCTTAGAATCTTTCCGCTAATTATGATATTAATGTTTTATATGTAGGTAGCTGCGTCGTTCCTAAGGTTTTGTATATAGTACCGTCGTGTCGATCTCGTTGCAGTCAATG
CACTTGTTTAATCTTTGATTTCTCAAGAGTCAAGAGCAATGGATAGAATTGGTTGTAACTTAAAGAGTCCTTAGGGAGTCAATGCAATCATTGTGGATTGCATATATATG
GTCTTTTTTTCCCCCTTTTCTTGTGTGTTTTTGAGAACGATACGATAGTATGTTTCCTCGAGGTGTTTTTTGTTCTTTTCTATGCTTTTGGAGACGACATCGTGTCGAAG
TTTACATGGGAAGAGAGCGAATTTCTGTTTCCGAGCAATGTGGCTCGCAATTTATGCACTAAAAATGAAAAAAGAGTGATAATTACTCCTC
Protein sequence
MKIKGSGRKKSASGSTVAMDGGEVVHGGDSDYANMNSNNNNNNNNAIISVEPSTSHLSKTHQEQNLGELDDGGDGEEEDEHDGDEADFAAQRTNLDDGFYEIEAIRRKRV
RKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIDAFERRYNVVEIRFCFRSLMTGKQRKRKRKHGVVHTQTKKRQQGANFSTYNVTDVEISVLDQRLPSAPINMSSLTH
HYAHSQAIVCNHEGDKNGDVIALERGKKIDIDNVGKNATQRKKDEHEYDPKLSELKATVLTNVAIGDKLAVNFQEARTTDNNGPAALSKSGTVEATADNRCIGAKRRKSG
SVKRFRQDSTLSELPVAQNAELTLAVVESGAQVEPIGIENSGYRGESFIRNNKTDDARNELRIIKIIKPLGYSASVSNNIQDVLVTSDGTEVMVDNKFLKANNPLLLINF
YEQHLRYTTRS
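
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the choice of NCBI translation table 1 are assumptions made for illustration, and unknown codons are rendered as `X`.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper) up to the first stop codon.

    Whitespace and line breaks are ignored, so a wrapped sequence can be
    pasted in directly. Codons not in the table (e.g. containing N) give 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 12 nt of the CDS sequence listed above.
print(translate("ATGAAAATCAAAGGGAGTGGAAGG"))  # MKIKGSGR
print(translate("ACTAGATCATGA"))              # TRS
```

The two fragments reproduce the first eight residues (MKIKGSGR) and the last three residues (TRS) of the protein sequence listed above; passing the full CDS to the same function allows the whole translation to be compared.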