CuGenDBv2

Tan0001677 (gene) of Snake gourd v1 genome

Gene ID: Tan0001677
Organism: Trichosanthes anguina (Snake gourd v1)
Description: chromo domain-containing protein LHP1-like
Genome location: LG06:72164429..72168999
RNA-Seq Expression: Tan0001677
Synteny: Tan0001677
Gene Ontology terms:
  GO:0031507 - heterochromatin assembly (biological process)
  GO:0000792 - heterochromatin (cellular component)
  GO:0005634 - nucleus (cellular component)
InterPro domains:
  IPR000953 - Chromo/chromo shadow domain
  IPR008251 - Chromo shadow domain
  IPR016197 - Chromo-like domain superfamily
  IPR023779 - Chromo domain, conserved site
  IPR023780 - Chromo domain
  IPR044251 - Chromo domain-containing protein LHP1
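
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int, int]:
    """Split 'SEQID:START..END' into (seqid, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
    return seqid, start, end, end - start + 1


print(parse_location("LG06:72164429..72168999"))
# ('LG06', 72164429, 72168999, 4571)
```

Under that assumption the gene spans 4,571 bp on LG06. The span covers the whole gene model, so it is longer than the spliced mRNA listed further down.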


Homology
GenBank top hits (E-value, % identity, alignment)
KAG7013388.1 Chromo domain-containing protein LHP1, partial [Cucurbita argyrosperma subsp. argyrosperma] (E-value: 5.2e-208; identity: 85.37%)
Query:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDD
        MKIKGG RKKSASGS+EVV+GSIQDAMD GE VHGGDSDY ++N+NNN ING EPSTSHLS+THQEQNL+EP++    +E+DEQDG EAAF AQRTNLDD
Subjt:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDD

Query:  GFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRLP
        GFYEIEAIRRKR+RKGQLQYLIKWRGWPETANTWEPLENLHTCS+FIEAFEQSLI+GKQRKRKRKHGVVH QTKKRQQR NFSTYNVTDVEISV+DQRLP
Subjt:  GFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRLP

Query:  SAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAA
        SAPIN+ +LT+PYA S+S+VYNHEG+KNGDVTA+ER +  DI+N GRNATQR   KK EHEYDPKLSELKATVLTNIAIGDKLAI+F +AR TENN PAA
Subjt:  SAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAA

Query:  GLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSAS
        GLSK+GSVE V ENRCTGAKRRKSGSVKRF+QDSTLSELPV QN ELTLAVVESGV VEPIGVENSGYHGESL RNN+TDEARNE SIIKI+KPLGYSAS
Subjt:  GLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSAS

Query:  VSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        VSNNIQDVLVTFVAMRSDGTEV+VDNKFLKANNPLLLINFYEQHL YTTRS
Subjt:  VSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

XP_008460094.1 PREDICTED: chromo domain-containing protein LHP1-like [Cucumis melo] (E-value: 1.8e-208; identity: 85.27%)
Query:  MKIKGG-GRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEP-EDDGDGEEEDEQDGDEAAFAAQRTNL
        MK KGG GRKKSAS S EVVVGSIQDAMD+GE VHGG+SDYAN N NNN ING+EPS SHL+ETHQ+QNLEEP +DDG+G+EEDEQDGDEAAFA+QRTNL
Subjt:  MKIKGG-GRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEP-EDDGDGEEEDEQDGDEAAFAAQRTNL

Query:  DDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLD
        DDGFYEIEAIRRKR+RKGQLQYLIKWRGWPE ANTWEPLENLHTCSDFIEAFEQSL+TGKQRKRKRKHGVVHTQTKKR  QQR +FS YNVTDVEISV+D
Subjt:  DDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLD

Query:  QRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENN
        QRLPSAP+N+S+LT+PYAHSQS+VYNHEG+KNGDVTA+ER KQTDI+N+GRNATQR   KKDEHEYDPKLSELKATVLTNIA+ DK  IN Q+AR TENN
Subjt:  QRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENN

Query:  GPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLG
        G AAGLSK   VEPV +NRCTGA+RRKSGSV+RF+ DSTLSELP +Q+AELTLAVVESGVRVEPIGVENSGYHGESL RNNKTD+ARNE SIIKIIKPLG
Subjt:  GPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLG

Query:  YSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        YSASVSNN+QDVLVTFVAMRSDGTEV+VDNKFLKA NPLLLINFYEQHLRYTTRS
Subjt:  YSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

XP_022964471.1 chromo domain-containing protein LHP1-like [Cucurbita moschata] (E-value: 1.8e-208; identity: 85.87%)
Query:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-----NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQR
        MKIKG GRKKSASGS+         AMD GEVVHGGDSDYAN+NS     NNNAI   EPSTSHLS+THQEQNL E +D GDGEEEDE DGDEA FAAQR
Subjt:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-----NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQR

Query:  TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVL
        TNLDDGFYEIEAIRRKR+RKGQLQYLIKWRGWPETANTWEPLENLHTCSDFI+AFE+SL+TGKQRKRKRKHGVVHTQTKKRQQ  NFSTYNVTDVEISVL
Subjt:  TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVL

Query:  DQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP
        DQRLPSAPINMS+LTH YAHSQ+IV NHEGDKNGDV A+ER K+ DI+NVG+NATQRKKDEHEYDPKLSELKATVLTN+AIGDKLA+NFQEAR T+NNGP
Subjt:  DQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP

Query:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS
        AA LSKSG+VE  A+NRC GAKRRKSGSVKRF+QDSTLSELPVAQNAELTLAVVESG +VEPIG+ENSGY GES IRNNKTD+ARNEL IIKIIKPLGYS
Subjt:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS

Query:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        ASVSNNIQDVLVTFVAMRSDGTEV+VDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

XP_023514742.1 chromo domain-containing protein LHP1-like [Cucurbita pepo subsp. pepo] (E-value: 1.6e-209; identity: 86.19%)
Query:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLD
        MKIKG GRKKSASGS+         AMD GEVVHGGDSDYAN+NS NNNAI   EPSTSHLS+THQEQNL E +D GDGEEEDE DGDEA F+AQRTNLD
Subjt:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLD

Query:  DGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRL
        DGFYEIEAIRRKR+RKGQLQYLIKWRGWPETANTWEPLENLHTCSDFI+AFE+SL+TGKQRKRKRKHGVVHTQTKKRQQ  NFSTYNVTDVEISVLDQRL
Subjt:  DGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRL

Query:  PSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGL
        PSAPINM++LTH YAHSQ+IV NHEGDKNGDV A+ER K+ DI+NVG+NATQRKKDEHEYDPKLSELKATVLTN+AIGDKLA+NFQEAR  +NNGPAA L
Subjt:  PSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGL

Query:  SKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVS
        SKSG+VE  A+NRC GAKRRKSGSVKRF+QDSTLSELPVAQNAELTLAVVESG +VEPIGVENSGYHGE  IRNNKTD+ARNEL IIKIIKPLGYSASVS
Subjt:  SKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVS

Query:  NNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        NNIQDVLVTFVAMRSDGTEV+VDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  NNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

XP_038874378.1 chromo domain-containing protein LHP1 [Benincasa hispida] (E-value: 1.4e-216; identity: 87.2%)
Query:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDD
        MKIKGGGRKKSAS S EVVVGS QDAMD GE VHGGDSDYANVN+NNN IN +EPSTSHL ET Q+QNLEEP+DDG+G+EEDEQDGD+AAFAAQRTNLDD
Subjt:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDD

Query:  GFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLDQR
        GFYEIEAIRR+R+RKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSL+ GKQRKRKRKHGVVHTQTKKR  QQR +FSTYNVTDVEISV+DQR
Subjt:  GFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLDQR

Query:  LPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP
        LPSAP+NMS+LT+PYAHSQS+VYNHEG+KNGDVT +ER KQ DI+N+GRNATQR   KKDEHEYDPKLSELK TVLTNIAIGDKLAINFQ+AR TENNG 
Subjt:  LPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP

Query:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS
        AAGL K GSVEPV +NRCTGA+RRKSGSVKRF+QDSTLS++P++QNAELTLAVVESGVRVEPIGVENSGYHGESL RNN+TD+ARNE SIIKIIKPLGYS
Subjt:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS

Query:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        ASVSNNIQDVLVTFVAMRSDGTEV+VDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0KBN0 Chromo domain-containing protein (E-value: 2.4e-206; identity: 84.4%)
Query:  MKIKGG-GRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEE-PEDDGDGEEEDEQDGDEAAFAAQRTNL
        MK KGG GRKKS+S S EV VGSIQDAMD+GE VHGG+SDYANVN NN  ING+EPS SHL+ETHQ+QNLEE  +DDG+G+EEDEQDGDEAAFA+QRTNL
Subjt:  MKIKGG-GRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEE-PEDDGDGEEEDEQDGDEAAFAAQRTNL

Query:  DDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLD
        DDGFYEIEAIRRKR+RKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSL+TGKQRKRKRKHGVVHTQTKKR  QQR +FS YNVTDVEISV+D
Subjt:  DDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLD

Query:  QRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENN
        QRLPSAP+NMS+LT+P+AHSQS+VYNHEG+KNGDVTA+ER KQTDI+N+GR ATQR   KKDEHEYDPKLSELKATVLTNIAI DK  INFQ++R TENN
Subjt:  QRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENN

Query:  GPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLG
        G AAGLSK   VEPV +NRCTGA+RRKSGSV+RF+ DSTLS LP +QNAELTLAVVESG RVEPIGVENSGYHGESL RNNKTD+ARNE+SI KIIKPLG
Subjt:  GPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLG

Query:  YSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        YSASVSNN+QDVLVTFVAMRSDGTEV+VDNKFLKA NPLLLINFYEQHLRYTTRS
Subjt:  YSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A1S3CBU8 chromo domain-containing protein LHP1-like (E-value: 8.7e-209; identity: 85.27%)
Query:  MKIKGG-GRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEP-EDDGDGEEEDEQDGDEAAFAAQRTNL
        MK KGG GRKKSAS S EVVVGSIQDAMD+GE VHGG+SDYAN N NNN ING+EPS SHL+ETHQ+QNLEEP +DDG+G+EEDEQDGDEAAFA+QRTNL
Subjt:  MKIKGG-GRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEP-EDDGDGEEEDEQDGDEAAFAAQRTNL

Query:  DDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLD
        DDGFYEIEAIRRKR+RKGQLQYLIKWRGWPE ANTWEPLENLHTCSDFIEAFEQSL+TGKQRKRKRKHGVVHTQTKKR  QQR +FS YNVTDVEISV+D
Subjt:  DDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKR--QQRTNFSTYNVTDVEISVLD

Query:  QRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENN
        QRLPSAP+N+S+LT+PYAHSQS+VYNHEG+KNGDVTA+ER KQTDI+N+GRNATQR   KKDEHEYDPKLSELKATVLTNIA+ DK  IN Q+AR TENN
Subjt:  QRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENN

Query:  GPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLG
        G AAGLSK   VEPV +NRCTGA+RRKSGSV+RF+ DSTLSELP +Q+AELTLAVVESGVRVEPIGVENSGYHGESL RNNKTD+ARNE SIIKIIKPLG
Subjt:  GPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLG

Query:  YSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        YSASVSNN+QDVLVTFVAMRSDGTEV+VDNKFLKA NPLLLINFYEQHLRYTTRS
Subjt:  YSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A6J1HIZ9 chromo domain-containing protein LHP1-like (E-value: 8.7e-209; identity: 85.87%)
Query:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-----NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQR
        MKIKG GRKKSASGS+         AMD GEVVHGGDSDYAN+NS     NNNAI   EPSTSHLS+THQEQNL E +D GDGEEEDE DGDEA FAAQR
Subjt:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-----NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQR

Query:  TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVL
        TNLDDGFYEIEAIRRKR+RKGQLQYLIKWRGWPETANTWEPLENLHTCSDFI+AFE+SL+TGKQRKRKRKHGVVHTQTKKRQQ  NFSTYNVTDVEISVL
Subjt:  TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVL

Query:  DQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP
        DQRLPSAPINMS+LTH YAHSQ+IV NHEGDKNGDV A+ER K+ DI+NVG+NATQRKKDEHEYDPKLSELKATVLTN+AIGDKLA+NFQEAR T+NNGP
Subjt:  DQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP

Query:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS
        AA LSKSG+VE  A+NRC GAKRRKSGSVKRF+QDSTLSELPVAQNAELTLAVVESG +VEPIG+ENSGY GES IRNNKTD+ARNEL IIKIIKPLGYS
Subjt:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS

Query:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        ASVSNNIQDVLVTFVAMRSDGTEV+VDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A6J1KL10 chromo domain-containing protein LHP1-like (E-value: 4.3e-208; identity: 85.21%)
Query:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-----NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQR
        MKIKG GRKKSASGS+         AMD GEVVHGGDSDYAN+NS     NN+AI   EPSTSHLS+THQEQNL E +D GDGEEEDE DGDEA FAAQR
Subjt:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNS-----NNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQR

Query:  TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVL
        TNLDDGFYEIEAIRRKR+RKGQLQYLIKWRGWPETANTWEPLENLHTCSDFI+AFE+SL TGKQRKRKRKHGVVHTQTKKRQQ  NFSTYNVTDVEISVL
Subjt:  TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVL

Query:  DQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP
        DQRLPSAPINMS+LTH Y HSQSIV NHEGDKNGD+ A+ER K+ +I+NVG+N TQRKKDEHEYDPKLSELKATVLTN+AIGDKLA+NFQEAR  +NNGP
Subjt:  DQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGP

Query:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS
        AA LSKSG+VE  A+NRC GAKRRKSGSVKRF+QDSTLSELPVAQNAELTLAVVESG +VEPIGVENSGYHGE  IRNNKTDEARNEL IIKIIKPLGYS
Subjt:  AAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYS

Query:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        ASVSNNIQDVLVTFVAMRSDGTEV+VDNKFLKANNPLLLINFYEQHLRYTTRS
Subjt:  ASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

A0A6J1KVS2 chromo domain-containing protein LHP1-like (E-value: 1.4e-206; identity: 85.14%)
Query:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDD
        MKIKGG RKKSASGS+EVV+G IQDAMD GE VHGGDSDY ++N+NNN ING EPSTSHLSETHQEQNL+EP++    +E+DEQDG EAAFAAQRTNLDD
Subjt:  MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDD

Query:  GFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRLP
        GFYEIEAIRRKR+RKGQLQYLIKWRGWPETANTWEPLENLHTCS+FIEAFEQSLI+GKQRKRKRKHGVVH QTKKRQQR NFSTYNVTDVEISV+DQRLP
Subjt:  GFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRLP

Query:  SAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAA
        SAPINM +LT+PYA S+S+VYNHEG+KNGDVTA+ER +  DI+NVGRNATQR   KK EHEYDPKLSELKAT LTNIAIGDKLAI+F +AR TENN  AA
Subjt:  SAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQR---KKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAA

Query:  GLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSAS
        GLSK+GSVE V ENRCTGAKRRKSGSVKRF+QDSTLSELPV QN ELTLAVVES V VEPIGVENSGYHGE L RNN+TDEARNE SIIKI+KPLGYSAS
Subjt:  GLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSAS

Query:  VSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS
        VSNNIQDVLVTFVAMRSDGTEV+VDNKFLKANNPLLLINFYEQHL YTTRS
Subjt:  VSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRYTTRS

SwissProt top hits (E-value, % identity, alignment)
O00257 E3 SUMO-protein ligase CBX4 (E-value: 9.8e-08; identity: 57.89%)
Query:  YEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENL
        + +E+I +KRIRKG+++YL+KWRGW    NTWEP EN+
Subjt:  YEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENL

O55187 E3 SUMO-protein ligase CBX4 (E-value: 9.8e-08; identity: 57.89%)
Query:  YEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENL
        + +E+I +KRIRKG+++YL+KWRGW    NTWEP EN+
Subjt:  YEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENL

Q339W7 Probable chromo domain-containing protein LHP1 (E-value: 6.7e-41; identity: 34.28%)
Query:  ETHQEQNLEEPEDDGDGEEEDE-------QDGDEAAFAAQR----------------------TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETAN
        E  +EQ   E E+  +GEEE+E       + G+E+  AA                          L +G+YEIE IRR+R+RKG+LQYL+KWRGWPE+AN
Subjt:  ETHQEQNLEEPEDDGDGEEEDE-------QDGDEAAFAAQR----------------------TNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETAN

Query:  TWEPLENLHTCSDFIEAFEQSLITGKQ-RKRKRK------HGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEG
        TWEPLENL  CSD I+AFE  L + +  RKRKRK       G   +  K+ + R +  ++                 P   S        S+++      
Subjt:  TWEPLENLHTCSDFIEAFEQSLITGKQ-RKRKRK------HGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEG

Query:  DKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVK
        D +G V           N + +N  Q        +   S +  T    + +  +L     E  +   +  +  L K   V P    + TGAK+RKSG+V+
Subjt:  DKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVK

Query:  RFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKF
        RF+Q+      P     E    VV      E +G    G  G+      KT+   N + I KIIKP+ ++A+V+N++Q V +TF A+RSDG EV+VD+K 
Subjt:  RFKQDSTLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKF

Query:  LKANNPLLLINFYEQHLRYTTRS
        LKANNPLLLI++YEQ LRY   S
Subjt:  LKANNPLLLINFYEQHLRYTTRS

Q944N1 Chromo domain protein LHP1 (E-value: 1.3e-57; identity: 39.39%)
Query:  EEP---EDDGDGEEEDEQ---------DGDEAAFAAQRTNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITG
        EEP   E +G+GEE+DE          DGD  A    +  L +GFYEIE +RR+R  KG++ YLIKWRGWPE+ANTWEP  NL +C+D I+A+E+SL +G
Subjt:  EEP---EDDGDGEEEDEQ---------DGDEAAFAAQRTNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITG

Query:  KQRKRKRKHGVVHTQTKKRQQR---TNFSTYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKK
        K R+RKRK G   T    +QQR      +TYN   V++ ++++  PS P+N+   T        +V ++  + N  V  V       +N  G     R +
Subjt:  KQRKRKRKHGVVHTQTKKRQQR---TNFSTYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKK

Query:  DEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVR
        +++E + KLSELK    TN   G+ + I+         NG   G  K    E    +RCTGAK+RKSG V+RFK+++T +     Q+A          + 
Subjt:  DEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSELPVAQNAELTLAVVESGVR

Query:  VEPIGV-ENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRY
          P+      G H   ++     D++++  +I +++ P+ Y AS SN++ DV VTFVA R+DG  V+VDNKFLK NNPLLLINFYE+++RY
Subjt:  VEPIGV-ENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRY

Q946J8 Chromo domain-containing protein LHP1 (E-value: 6.7e-65; identity: 44.82%)
Query:  EQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQ-RKRKR
        ++  EE ED+ DG +E++++G+      +R  LD+GFYEIEAIRRKR+RKG++QYLIKWRGWPETANTWEPLENL + +D I+AFE SL  GK  RKRKR
Subjt:  EQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQ-RKRKR

Query:  KHGVVHTQTKKRQQRTNFS--TYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKD-EHEYDP
        K+   H+Q KK+Q+ T+ S      +D   S+ +  LP  P        P   S S + N   D       V    + +  +VG     R  D E EYDP
Subjt:  KHGVVHTQTKKRQQRTNFS--TYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKD-EHEYDP

Query:  KLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSE---LPVAQNAELTLAVVESGVRVEPI
         L+EL+  V  +   G              +N    GL K    E    +R  GAKRRKSGSVKRFKQD + S     P  QN    L  ++S  R+  +
Subjt:  KLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSE---LPVAQNAELTLAVVESGVRVEPI

Query:  GVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRY
        G E  G   E+   + KT     EL I KI+KP+ ++ASVS+N+Q+VLVTF+A+RSDG E +VDN+FLKA+NP LLI FYEQHL+Y
Subjt:  GVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRY

Arabidopsis top hits (E-value, % identity, alignment)
AT5G17690.1 like heterochromatin protein (LHP1) (E-value: 4.7e-66; identity: 44.82%)
Query:  EQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQ-RKRKR
        ++  EE ED+ DG +E++++G+      +R  LD+GFYEIEAIRRKR+RKG++QYLIKWRGWPETANTWEPLENL + +D I+AFE SL  GK  RKRKR
Subjt:  EQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDDGFYEIEAIRRKRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQ-RKRKR

Query:  KHGVVHTQTKKRQQRTNFS--TYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKD-EHEYDP
        K+   H+Q KK+Q+ T+ S      +D   S+ +  LP  P        P   S S + N   D       V    + +  +VG     R  D E EYDP
Subjt:  KHGVVHTQTKKRQQRTNFS--TYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIVYNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKD-EHEYDP

Query:  KLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSE---LPVAQNAELTLAVVESGVRVEPI
         L+EL+  V  +   G              +N    GL K    E    +R  GAKRRKSGSVKRFKQD + S     P  QN    L  ++S  R+  +
Subjt:  KLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQDSTLSE---LPVAQNAELTLAVVESGVRVEPI

Query:  GVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRY
        G E  G   E+   + KT     EL I KI+KP+ ++ASVS+N+Q+VLVTF+A+RSDG E +VDN+FLKA+NP LLI FYEQHL+Y
Subjt:  GVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQHLRY
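
The % identity figures in the hit tables can be checked against the match line printed between each Query and Subjt row. The sketch below assumes the usual BLAST-style convention, in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap, and it assumes every alignment column counts toward the denominator. The function name `percent_identity` is hypothetical. The example uses the single-block CBX4 (O00257) alignment, with the eight-character label padding removed from the match line.

```python
def percent_identity(midline: str) -> float:
    """Percent identity implied by a BLAST-style match line.

    Letters mark identical residues; '+' and ' ' mark non-identical
    columns. Every column counts toward the alignment length.
    """
    if not midline:
        raise ValueError("empty match line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Match line of the O00257 (CBX4) alignment, copied with its internal spaces.
cbx4_midline = "+ +E+I +KRIRKG+++YL+KWRGW    NTWEP EN+"
print(percent_identity(cbx4_midline))  # 57.89 (22 identical columns out of 38)
```

For multi-block alignments the match lines of all blocks would need to be concatenated first, and trailing spaces must be preserved because they are alignment columns.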


Sequences
CDS sequence
ATGAAAATCAAAGGGGGAGGAAGGAAGAAAAGTGCGAGCGGTTCGTCGGAGGTTGTAGTGGGTTCGATTCAGGACGCCATGGATGCTGGTGAAGTCGTCCATGGAGGCGA
CTCGGATTATGCTAATGTGAATAGCAACAACAACGCCATTAATGGCGCTGAGCCTTCGACTTCCCATTTATCGGAGACCCATCAAGAGCAAAACCTAGAAGAGCCTGAGG
ACGATGGCGATGGGGAGGAAGAAGATGAACAGGATGGAGATGAGGCTGCTTTTGCTGCTCAGAGAACCAATCTCGATGATGGGTTCTATGAAATTGAAGCCATTCGTCGA
AAAAGGATTCGCAAGGGCCAGCTTCAGTACTTGATCAAATGGCGTGGCTGGCCTGAGACGGCTAATACCTGGGAGCCCTTGGAAAATCTCCATACGTGCTCTGATTTCAT
AGAGGCATTCGAACAGAGTTTAATAACGGGAAAGCAGCGGAAACGGAAGCGAAAACATGGGGTTGTTCATACTCAAACGAAGAAGAGGCAGCAGCGAACCAACTTTTCTA
CTTACAATGTCACAGATGTTGAAATCAGTGTTCTTGATCAACGTCTGCCCTCTGCTCCTATAAACATGTCTAACCTTACTCATCCTTATGCTCATTCCCAATCCATAGTT
TATAATCATGAAGGAGATAAGAATGGAGATGTAACTGCTGTTGAAAGATGCAAGCAAACCGATATCAATAATGTAGGCAGGAATGCTACTCAACGAAAGAAAGATGAACA
TGAGTATGATCCCAAACTTAGTGAGCTTAAGGCAACAGTATTAACGAACATAGCCATTGGCGATAAGCTTGCAATCAATTTTCAAGAAGCCAGAATGACGGAGAACAATG
GCCCTGCAGCTGGTCTTTCTAAAAGTGGCTCTGTGGAGCCAGTTGCCGAAAATCGGTGCACTGGGGCTAAGAGAAGGAAGTCTGGTTCGGTTAAAAGGTTTAAACAAGAT
TCAACTTTATCTGAACTGCCAGTGGCTCAAAATGCAGAATTGACATTGGCTGTAGTAGAATCTGGTGTCCGAGTGGAACCGATAGGGGTTGAGAATTCTGGATATCATGG
GGAAAGTTTAATTCGCAATAACAAAACCGATGAAGCCAGAAATGAGCTGAGTATCATCAAGATTATCAAACCATTAGGCTATTCAGCATCTGTGTCAAACAACATTCAAG
ATGTGTTGGTAACTTTTGTGGCCATGAGGTCTGATGGAACAGAAGTGATAGTTGATAACAAGTTCCTCAAGGCTAACAATCCACTACTGTTGATCAACTTCTACGAGCAA
CATCTCCGCTATACTACCAGATCATGA
mRNA sequence
TTTCATCTAACAGAGAACACCGAACAATCCGAAAAGAAAGAATCGTTTTAAAAACACAAAGAGGAATCAATTCCCTTTCATTTGCCCTTTCAATTGACGCTGTATCGTCT
ACAATTTGTTCCACCAACGGATCAAAGATGCTCTCGAGCTAGAACAGAAGGCGGAATCGAGAATTGGGAGAGAAAGAAAGAGACCCACATTAGATTTTGCAGAAAAAAGA
AACGCGAAATCATAAGATGATTCATATAGAGAAAGGGAAAGTTCAATAAATGTGGCTGTTCAGTCTCAGTCTCAGTCTCAGTCTCAGTCTCTGCTCCATTTTGTAGTGAC
ATTTGTCTTCGTTTTGGAAAATTTCTTTTGAGAGGTTGTTTTTCTCTGTGTATTCAAAAACCCTCTCTCTCTCTCTCTTGTGGGTAGGTGAATTTGTAGTGGAATTTTCA
GCACTCCCATTTACTTCAAGAACCCACAAAAAAACCCTTGAAATTTTAGCATGTATGTTTTTTTTTTTTTCCCCCACGATTTCTAAGTTTCAGAAGAGATAAATAGAGAG
AGCGAAAGAGGGTTTTAGAATGAAAATCAAAGGGGGAGGAAGGAAGAAAAGTGCGAGCGGTTCGTCGGAGGTTGTAGTGGGTTCGATTCAGGACGCCATGGATGCTGGTG
AAGTCGTCCATGGAGGCGACTCGGATTATGCTAATGTGAATAGCAACAACAACGCCATTAATGGCGCTGAGCCTTCGACTTCCCATTTATCGGAGACCCATCAAGAGCAA
AACCTAGAAGAGCCTGAGGACGATGGCGATGGGGAGGAAGAAGATGAACAGGATGGAGATGAGGCTGCTTTTGCTGCTCAGAGAACCAATCTCGATGATGGGTTCTATGA
AATTGAAGCCATTCGTCGAAAAAGGATTCGCAAGGGCCAGCTTCAGTACTTGATCAAATGGCGTGGCTGGCCTGAGACGGCTAATACCTGGGAGCCCTTGGAAAATCTCC
ATACGTGCTCTGATTTCATAGAGGCATTCGAACAGAGTTTAATAACGGGAAAGCAGCGGAAACGGAAGCGAAAACATGGGGTTGTTCATACTCAAACGAAGAAGAGGCAG
CAGCGAACCAACTTTTCTACTTACAATGTCACAGATGTTGAAATCAGTGTTCTTGATCAACGTCTGCCCTCTGCTCCTATAAACATGTCTAACCTTACTCATCCTTATGC
TCATTCCCAATCCATAGTTTATAATCATGAAGGAGATAAGAATGGAGATGTAACTGCTGTTGAAAGATGCAAGCAAACCGATATCAATAATGTAGGCAGGAATGCTACTC
AACGAAAGAAAGATGAACATGAGTATGATCCCAAACTTAGTGAGCTTAAGGCAACAGTATTAACGAACATAGCCATTGGCGATAAGCTTGCAATCAATTTTCAAGAAGCC
AGAATGACGGAGAACAATGGCCCTGCAGCTGGTCTTTCTAAAAGTGGCTCTGTGGAGCCAGTTGCCGAAAATCGGTGCACTGGGGCTAAGAGAAGGAAGTCTGGTTCGGT
TAAAAGGTTTAAACAAGATTCAACTTTATCTGAACTGCCAGTGGCTCAAAATGCAGAATTGACATTGGCTGTAGTAGAATCTGGTGTCCGAGTGGAACCGATAGGGGTTG
AGAATTCTGGATATCATGGGGAAAGTTTAATTCGCAATAACAAAACCGATGAAGCCAGAAATGAGCTGAGTATCATCAAGATTATCAAACCATTAGGCTATTCAGCATCT
GTGTCAAACAACATTCAAGATGTGTTGGTAACTTTTGTGGCCATGAGGTCTGATGGAACAGAAGTGATAGTTGATAACAAGTTCCTCAAGGCTAACAATCCACTACTGTT
GATCAACTTCTACGAGCAACATCTCCGCTATACTACCAGATCATGAATTGCAAAGCAATGAACGGTACGACTTTTGGGATCCCCGTTTTGCTAATTATTTACATTTTAGT
GCATTTGTACTGCGAGATCGAGCGTAAGGAGATAGTTAAACAAGGTCTGAAAGTTTTAGCGATAACAGTATCCTCCATCGTCCCATCTACCTCTTGCTGTGCAGAACAGA
GATTAGAAGTTAGAGGCTTAGAATCTTTCTGCTAATTATGAAATTAATGTTTTATATATAGCTGTGTTGTTCCAAAGGTTTTTTTGTACATATTACCGTTCTGTGTTGAT
CTCCTTGTAGTCAGTGCACTTGTTTAACCTTATTTTTGGTTTCTCAAAGAGCAATGGAAAGAATTGGTTGTAATTTTTGGAGGGTCCTTGGGGAGTTAATGCAATCGTTG
TGGATTGCAGATATATGGGTCTTTTTGTGTGTGTGTTTTTGTTTTCCGAAAGTTCGAGAACGATATGATGGTATGTTTTCTCGAGGTGTTTTTCGTTCCTTTTGGTGCTT
TTGGAGACAACGTCATGTCGGAGTTTACCTGTAAAGAGAGCAAATTTCTGTTTCCAAGCAATGTGGC
Protein sequence
MKIKGGGRKKSASGSSEVVVGSIQDAMDAGEVVHGGDSDYANVNSNNNAINGAEPSTSHLSETHQEQNLEEPEDDGDGEEEDEQDGDEAAFAAQRTNLDDGFYEIEAIRR
KRIRKGQLQYLIKWRGWPETANTWEPLENLHTCSDFIEAFEQSLITGKQRKRKRKHGVVHTQTKKRQQRTNFSTYNVTDVEISVLDQRLPSAPINMSNLTHPYAHSQSIV
YNHEGDKNGDVTAVERCKQTDINNVGRNATQRKKDEHEYDPKLSELKATVLTNIAIGDKLAINFQEARMTENNGPAAGLSKSGSVEPVAENRCTGAKRRKSGSVKRFKQD
STLSELPVAQNAELTLAVVESGVRVEPIGVENSGYHGESLIRNNKTDEARNELSIIKIIKPLGYSASVSNNIQDVLVTFVAMRSDGTEVIVDNKFLKANNPLLLINFYEQ
HLRYTTRS
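
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. This is a minimal sketch under that assumption (the record does not name a translation table); `translate` and `CODON_TABLE` are illustrative names, and codons containing ambiguous bases such as N are not handled. The two calls use the first 24 and the last 27 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGAAAATCAAAGGGGGAGGAAGG"))     # MKIKGGGR
print(translate("CATCTCCGCTATACTACCAGATCATGA"))  # HLRYTTRS*
```

Both fragments agree with the start (MKIKGGGR) and end (HLRYTTRS) of the protein sequence, with the final TGA codon giving the stop.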