CuGenDBv2

CcUC05G083580 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC05G083580
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Chromo domain protein LHP1-like
Genome location: CicolChr05:2225806..2229833
RNA-Seq Expression: CcUC05G083580
Synteny: CcUC05G083580
Gene Ontology terms:
  GO:0031507 - heterochromatin assembly (biological process)
  GO:0000792 - heterochromatin (cellular component)
  GO:0005634 - nucleus (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
  IPR000953 - Chromo/chromo shadow domain
  IPR016197 - Chromo-like domain superfamily
  IPR017984 - Chromo domain subgroup
  IPR023779 - Chromo domain, conserved site
  IPR023780 - Chromo domain
  IPR044251 - Chromo domain-containing protein LHP1
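
The genome location string above can be split into a sequence name and a coordinate range. A minimal Python sketch follows, assuming the `name:start..end` layout seen in this record and assuming 1-based inclusive coordinates (the page does not state the convention). `parse_location` is a hypothetical helper, not part of CuGenDB.

```python
import re

# Hypothetical helper: splits a "name:start..end" location string.
def parse_location(location):
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    name, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return name, start, end

name, start, end = parse_location("CicolChr05:2225806..2229833")
# Span length under the assumed 1-based inclusive convention.
span = end - start + 1
print(name, start, end, span)
```

Under that assumption the gene spans 4028 bp on CicolChr05.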


Homology
GenBank top hits (e value, % identity, alignment):
KAA0049056.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]; e value 7.4e-225; identity 78.94%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNN+G+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEP+DNLQSCFEFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
        LN+KLH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKAD VVGSAQGSHS GAKRRKSS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTKDSAL   SEQGLKQNAAT  IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P  +P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

TYK17507.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]; e value 6.3e-232; identity 80.4%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNN+G+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSCFEFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
        LN+KLH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKAD VVGSAQGSHS GAKRRKSS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTKDSAL   SEQGLKQNAAT  IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P  +P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

XP_008438194.2 PREDICTED: chromo domain protein LHP1-like [Cucumis melo]; e value 2.2e-176; identity 82.09%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNN+G+E  IP P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSCFEFI+E+E  FC SRSGKQRKRKRKDGDIE++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
        LN+KLH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKAD VVGSAQGSHS GAKRRKSS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTKDSAL   SEQGLKQNAAT  IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HL
        HL
Subjt:  HL

XP_038877763.1 probable chromo domain-containing protein LHP1 isoform X1 [Benincasa hispida]; e value 4.9e-192; identity 85.85%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKAVGSSEPET ALPIPDFTQSTHLNGDSGPSISNN+GNEP+IPSPYPPSSLQNS VQIPLPTD+AGEV+ EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQ+C EFIDEFE SFC SRSGKQRKRKRKDGDIENQP EEKQLQ++AIDNVTDVVI TVDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGS
        LN K  CD   PQAPVDSTHEG FGSH                VDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSDK+AVASNDL+ VYDVSKAD VVGS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGS

Query:  AQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRV+RFTKDSALSE SEQ LKQNAAT SIEPTDR +Q GPENPS SGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKFLKANNPHL
        KEVTVNNKFLK NNPHL
Subjt:  KEVTVNNKFLKANNPHL

XP_038877764.1 chromo domain protein LHP1-like isoform X2 [Benincasa hispida]; e value 3.3e-188; identity 85.13%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKAVGSSEPET ALPIPDFTQSTHLNGDSGPSISNN+GNEP+IPSPYPPSSLQNS VQIPLPTD+AGEV+ EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQ+C EFIDEFE     SRSGKQRKRKRKDGDIENQP EEKQLQ++AIDNVTDVVI TVDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGS
        LN K  CD   PQAPVDSTHEG FGSH                VDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSDK+AVASNDL+ VYDVSKAD VVGS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGS

Query:  AQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRV+RFTKDSALSE SEQ LKQNAAT SIEPTDR +Q GPENPS SGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKFLKANNPHL
        KEVTVNNKFLK NNPHL
Subjt:  KEVTVNNKFLKANNPHL

TrEMBL top hits (e value, % identity, alignment):
A0A0A0L6G7 Chromo domain-containing protein; e value 2.2e-174; identity 81.59%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALPIPDFTQSTHLNGDS PSISNN+G+EP I  P+ PSSL N+ VQIPLP D+AG V+GEDN +PDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSCFEFI+E+E  FC SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVIST+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCDFP--QAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
         N+KLH D P  Q P+DS HEG      +DGKFDGSRKKDEYDLKLI+  ASIS NMVDS+K+ VASND+S VYDVSKAD VVGSAQ SHS GAKRRKSS
Subjt:  LNRKLHCDFP--QAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTKDSAL    EQGLKQNAAT SIEP D  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HL
        HL
Subjt:  HL

A0A1S3AVS6 chromo domain protein LHP1-like; e value 1.1e-176; identity 82.09%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNN+G+E  IP P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSCFEFI+E+E  FC SRSGKQRKRKRKDGDIE++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
        LN+KLH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKAD VVGSAQGSHS GAKRRKSS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTKDSAL   SEQGLKQNAAT  IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HL
        HL
Subjt:  HL

A0A5A7U472 Chromo domain protein LHP1-like; e value 3.6e-225; identity 78.94%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNN+G+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEP+DNLQSCFEFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
        LN+KLH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKAD VVGSAQGSHS GAKRRKSS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTKDSAL   SEQGLKQNAAT  IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P  +P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

A0A5D3D0R0 Chromo domain protein LHP1-like; e value 3.0e-232; identity 80.4%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNN+G+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSCFEFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
        LN+KLH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKAD VVGSAQGSHS GAKRRKSS
Subjt:  LNRKLHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTKDSAL   SEQGLKQNAAT  IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P  +P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--FPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

A0A6J1GGE8 chromo domain-containing protein LHP1-like; e value 1.6e-148; identity 71.43%
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKAVGSSE E +ALP+P FT ST +NGDS PS SNN+GNE  I S +P SS+QNS VQ PL   E GEV GE+NAV DV+ASE T LD+GFF VE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKW GWPET NTWEP DNLQSC EFI+EFE     SRSGKQRKRKRKDGD  NQ  EEKQ +++A DNVT+V +STVDD LSA P
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRKLHCDF--PQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS
        LN  +H D   PQ  +DST      +  +  KFDGSR++DEYDLKLIELKA+ISANMVDSDK+A +S DL  VYD SKAD  VGS QGSHSIGAKRRKSS
Subjt:  LNRKLHCDF--PQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSS

Query:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RV+RFTK++  SENS+  LKQN    ++EPTD+ EQ+GPENPS SGHSRNV+TI RIIKPVGYSVSVSNNIPDV+VTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVRRFTKDSALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSF
         L  +F
Subjt:  HLGSSF

SwissProt top hits (e value, % identity, alignment):
O95931 Chromobox protein homolog 7; e value 1.8e-08; identity 53.19%
Query:  ERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNL
        E + + E  F VE+IR+KRVRKG+++YLVKW+GWP   +TWEP +++
Subjt:  ERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNL

P05205 Heterochromatin protein 1; e value 1.1e-08; identity 38.2%
Query:  EVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKR
        ++D  +++     A E    +E  + VE I  +RVRKG+++Y +KW+G+PETENTWEP +NL  C + I ++EAS          K+ R
Subjt:  EVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKR

Q339W7 Probable chromo domain-containing protein LHP1; e value 1.5e-31; identity 32.66%
Query:  EDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDI--ENQPHEEK
        E  A   V       L EG++E+E IRR+R+RKG+LQYLVKWRGWPE+ NTWEP++NL +C + ID FE    S R G++RKRK     +   N  H ++
Subjt:  EDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDI--ENQPHEEK

Query:  QLQIVAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDG-KFDGSRKKDEYDLKLIELKAS------------ISANMVDSDKQ
            +   + T           + AP  ++L C                S TV G    GS  +++    +++  +S            +S  + D   +
Subjt:  QLQIVAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDG-KFDGSRKKDEYDLKLIELKAS------------ISANMVDSDKQ

Query:  AVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKDSALSENSEQG---LKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKP
            N  S   ++ K    V  +QG    GAK+RKS  VRRF ++       E G   + ++  +   E  D+++  G  N            IT+IIKP
Subjt:  AVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKDSALSENSEQG---LKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKP

Query:  VGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF
        V ++ +V+N++  V +TF A+RSDG+EV V++K LKANNP L  S+
Subjt:  VGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF

Q944N1 Chromo domain protein LHP1; e value 1.4e-37; identity 36.42%
Query:  DEAGEVDGEDNAVPD-VSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIE
        DEA EV   +    D V+   +  L EGF+E+E +RR+R  KG++ YL+KWRGWPE+ NTWEP  NL SC + ID +E S    +SGK R+RKRK G  +
Subjt:  DEAGEVDGEDNAVPD-VSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIE

Query:  NQPHEEKQLQI---VAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDGKFDGS----RKKDEYDLKLIELKASISANMVDSDK
          P  ++Q +    VA  N   V +  +++   + PLN     D     VDS   G   +  VD   +G+    R+++E +LKL ELK + S N    D 
Subjt:  NQPHEEKQLQI---VAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDGKFDGS----RKKDEYDLKLIELKASISANMVDSDK

Query:  QAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKD--SALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKP
          ++ N L+  +   K +G     Q     GAK+RKS  VRRF ++  SA+ ++++  L        ++       +  ++      S++  TIT+++ P
Subjt:  QAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKD--SALSENSEQGLKQNAATASIEPTDRREQIGPENPSFSGHSRNVSTITRIIKP

Query:  VGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF
        V Y  S SN++ DV VTF+A R+DG  V V+NKFLK NNP L  +F
Subjt:  VGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF

Q946J8 Chromo domain-containing protein LHP1; e value 1.6e-44; identity 36.97%
Query:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRK--------
        DE GE +G           ER  LDEGF+E+EAIRRKRVRKG++QYL+KWRGWPET NTWEP++NLQS  + ID FE S    + G++RKRK        
Subjt:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRK--------

Query:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------
        +K   + +  H+  EK     +++N +   I    D   ++ LNR +  +   A V +  E   GS  +  +      + EYD  L EL+  ++      
Subjt:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------

Query:  ---ANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAA--TASIEPTDRREQIGPENPSFSGHSR
              + S+   V  N L  VY   + D      + S  IGAKRRKS  V+RF +D + S N      QN      +++   R  ++G E P     + 
Subjt:  ---ANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAA--TASIEPTDRREQIGPENPSFSGHSR

Query:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP
        N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N+FLKA+NPHL   F +     N+ P
Subjt:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP

Arabidopsis top hits (e value, % identity, alignment):
AT5G17690.1 like heterochromatin protein (LHP1); e value 1.1e-45; identity 36.97%
Query:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRK--------
        DE GE +G           ER  LDEGF+E+EAIRRKRVRKG++QYL+KWRGWPET NTWEP++NLQS  + ID FE S    + G++RKRK        
Subjt:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRK--------

Query:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------
        +K   + +  H+  EK     +++N +   I    D   ++ LNR +  +   A V +  E   GS  +  +      + EYD  L EL+  ++      
Subjt:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------

Query:  ---ANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAA--TASIEPTDRREQIGPENPSFSGHSR
              + S+   V  N L  VY   + D      + S  IGAKRRKS  V+RF +D + S N      QN      +++   R  ++G E P     + 
Subjt:  ---ANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAA--TASIEPTDRREQIGPENPSFSGHSR

Query:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP
        N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N+FLKA+NPHL   F +     N+ P
Subjt:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP


Sequences
CDS sequence
ATGGGGAGAGGGAAGAAGAAGGCGGTGGGAAGCTCTGAGCCTGAGACAGTGGCGCTTCCAATCCCTGATTTCACTCAATCTACTCATCTTAATGGAGATTCAGGCCCTTC
CATCTCTAACAACGATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTCTTCACTTCAGAATAGTTATGTGCAAATTCCACTACCCACCGATGAGGCCGGAGAAG
TCGACGGAGAAGATAATGCTGTACCTGATGTTTCTGCTTCCGAGCGAACTAACCTCGACGAAGGCTTCTTCGAAGTCGAAGCTATTCGGCGGAAAAGAGTTCGTAAGGGA
CAGCTTCAGTACCTCGTCAAATGGCGTGGGTGGCCAGAGACAGAAAATACATGGGAACCCGTGGACAATCTCCAATCATGCTTTGAATTTATTGACGAATTTGAAGCAAG
CTTTTGTAGCTCGCGATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGCTATTGATAATG
TCACGGATGTAGTTATCAGTACTGTGGATGATCGTCTATCGGCGGCTCCTTTAAACAGAAAACTTCATTGTGATTTTCCTCAAGCACCGGTAGACTCTACTCATGAAGGA
GGGTTTGGAAGCCATACCGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGACGAATATGATCTGAAACTTATTGAGCTGAAGGCATCAATCTCTGCCAATATGGTTGA
TTCTGATAAACAAGCAGTGGCTTCTAACGATCTTAGCGCTGTTTATGATGTTTCGAAGGCCGATGGCGTGGTGGGTTCTGCTCAGGGAAGTCACTCCATTGGAGCCAAGA
GAAGGAAATCTAGTAGGGTGAGAAGGTTCACTAAGGATTCAGCCTTGTCTGAAAACTCTGAACAAGGATTAAAACAAAATGCAGCGACTGCAAGCATTGAGCCTACTGAT
CGAAGAGAACAAATAGGACCCGAGAATCCTAGTTTTTCAGGCCACTCCAGAAATGTGTCTACCATCACAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGTATCAAA
TAACATCCCAGATGTAATCGTAACCTTCTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGTGAATAACAAATTTCTTAAGGCTAACAATCCACATCTGGGGTCAAGCT
TTTCCAAAGTCACTCCTACCACCAACAAACAGCCTTTTCCCCCTCAATATCACTCTTTTGCAACCCAGGTTCTGATTCAGAGAGAGATAATGGGATATACTACTGCAGGA
CCATCACCTGTTGTTCCGTGTATCATAGTTGGATTTCTGGGGATGATAATATTTTGGCCAACTCTTCAATCCATCTGGGAGAGTGTAGAGTCTCTACTTGAACTGGGCAT
TTGGGTTGCAGTGATTCTTCTTTTCCTCTTACTGCTTGTACATTTGCTTTCTATTTTCTTTCCTGTTCTTCATGTTTCATCCACTTTTGCAGTTCAGCATTCCAGCAGCC
CTGGCTACGATGCTGATGGATTTGGTTTCGGGTCAGGAGCATTGTTTCTAGGTCTTCTCTTCCTTGTCCTCTATAATCTGTTGTAA
mRNA sequence
ATGGGGAGAGGGAAGAAGAAGGCGGTGGGAAGCTCTGAGCCTGAGACAGTGGCGCTTCCAATCCCTGATTTCACTCAATCTACTCATCTTAATGGAGATTCAGGCCCTTC
CATCTCTAACAACGATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTCTTCACTTCAGAATAGTTATGTGCAAATTCCACTACCCACCGATGAGGCCGGAGAAG
TCGACGGAGAAGATAATGCTGTACCTGATGTTTCTGCTTCCGAGCGAACTAACCTCGACGAAGGCTTCTTCGAAGTCGAAGCTATTCGGCGGAAAAGAGTTCGTAAGGGA
CAGCTTCAGTACCTCGTCAAATGGCGTGGGTGGCCAGAGACAGAAAATACATGGGAACCCGTGGACAATCTCCAATCATGCTTTGAATTTATTGACGAATTTGAAGCAAG
CTTTTGTAGCTCGCGATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGCTATTGATAATG
TCACGGATGTAGTTATCAGTACTGTGGATGATCGTCTATCGGCGGCTCCTTTAAACAGAAAACTTCATTGTGATTTTCCTCAAGCACCGGTAGACTCTACTCATGAAGGA
GGGTTTGGAAGCCATACCGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGACGAATATGATCTGAAACTTATTGAGCTGAAGGCATCAATCTCTGCCAATATGGTTGA
TTCTGATAAACAAGCAGTGGCTTCTAACGATCTTAGCGCTGTTTATGATGTTTCGAAGGCCGATGGCGTGGTGGGTTCTGCTCAGGGAAGTCACTCCATTGGAGCCAAGA
GAAGGAAATCTAGTAGGGTGAGAAGGTTCACTAAGGATTCAGCCTTGTCTGAAAACTCTGAACAAGGATTAAAACAAAATGCAGCGACTGCAAGCATTGAGCCTACTGAT
CGAAGAGAACAAATAGGACCCGAGAATCCTAGTTTTTCAGGCCACTCCAGAAATGTGTCTACCATCACAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGTATCAAA
TAACATCCCAGATGTAATCGTAACCTTCTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGTGAATAACAAATTTCTTAAGGCTAACAATCCACATCTGGGGTCAAGCT
TTTCCAAAGTCACTCCTACCACCAACAAACAGCCTTTTCCCCCTCAATATCACTCTTTTGCAACCCAGGTTCTGATTCAGAGAGAGATAATGGGATATACTACTGCAGGA
CCATCACCTGTTGTTCCGTGTATCATAGTTGGATTTCTGGGGATGATAATATTTTGGCCAACTCTTCAATCCATCTGGGAGAGTGTAGAGTCTCTACTTGAACTGGGCAT
TTGGGTTGCAGTGATTCTTCTTTTCCTCTTACTGCTTGTACATTTGCTTTCTATTTTCTTTCCTGTTCTTCATGTTTCATCCACTTTTGCAGTTCAGCATTCCAGCAGCC
CTGGCTACGATGCTGATGGATTTGGTTTCGGGTCAGGAGCATTGTTTCTAGGTCTTCTCTTCCTTGTCCTCTATAATCTGTTGTAA
Protein sequence
MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNDGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKG
QLQYLVKWRGWPETENTWEPVDNLQSCFEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAPLNRKLHCDFPQAPVDSTHEG
GFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADGVVGSAQGSHSIGAKRRKSSRVRRFTKDSALSENSEQGLKQNAATASIEPTD
RREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQPFPPQYHSFATQVLIQREIMGYTTAG
PSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
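
As a consistency check between the CDS and protein sequences listed above, the CDS can be translated with the standard genetic code. The following Python sketch is illustrative only: `translate` is a hypothetical helper, the standard code is assumed to apply, and only the first nine and last three codons of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First nine and last three codons of the CDS shown on this page.
print(translate("ATGGGGAGAGGGAAGAAGAAGGCGGTG"))
print(translate("CTGTTGTAA"))
```

The two fragments translate to MGRGKKKAV and LL*, matching the start and end of the protein sequence, with the final TAA as the stop codon.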