; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G03200 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G03200
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionChromo domain protein LHP1-like
Genome locationClcChr05:2208920..2212826
RNA-Seq ExpressionClc05G03200
SyntenyClc05G03200
Gene Ontology termsGO:0031507 - heterochromatin assembly (biological process)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR017984 - Chromo domain subgroup
IPR023779 - Chromo domain, conserved site
IPR023780 - Chromo domain
IPR044251 - Chromo domain-containing protein LHP1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049056.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]1.1e-22579.3Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNNNG+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEP+DNLQSC EFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
        LN++LH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKADCVVGSAQGSHS GAKRRKSS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKDSALS   EQGLKQNAATV IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P   P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

TYK17507.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]7.4e-23380.77Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNNNG+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSC EFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
        LN++LH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKADCVVGSAQGSHS GAKRRKSS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKDSALS   EQGLKQNAATV IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P   P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

XP_008438194.2 PREDICTED: chromo domain protein LHP1-like [Cucumis melo]5.2e-17882.59Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNNNG+E  IP P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSC EFI+E+E  FC SRSGKQRKRKRKDGDIE++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
        LN++LH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKADCVVGSAQGSHS GAKRRKSS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKDSALS   EQGLKQNAATV IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HL
        HL
Subjt:  HL

XP_038877763.1 probable chromo domain-containing protein LHP1 isoform X1 [Benincasa hispida]1.0e-19486.57Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKAVGSSEPET ALPIPDFTQSTHLNGDSGPSISNNNGNEP+IPSPYPPSSLQNS VQIPLPTD+AGEV+ EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQ+CSEFIDEFE SFC SRSGKQRKRKRKDGDIENQP EEKQLQ++AIDNVTDVVI TVDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGS
        LN +  CD   PQAPVDSTHEG FGSH                VDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSDK+AVASNDL+ VYDVSKADCVVGS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGS

Query:  AQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRVKRFTKDSALSE  EQ LKQNAATVSIEPTDR +Q GPENPS SGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKFLKANNPHL
        KEVTVNNKFLK NNPHL
Subjt:  KEVTVNNKFLKANNPHL

XP_038877764.1 chromo domain protein LHP1-like isoform X2 [Benincasa hispida]7.0e-19185.85Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKAVGSSEPET ALPIPDFTQSTHLNGDSGPSISNNNGNEP+IPSPYPPSSLQNS VQIPLPTD+AGEV+ EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQ+CSEFIDEFE     SRSGKQRKRKRKDGDIENQP EEKQLQ++AIDNVTDVVI TVDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGS
        LN +  CD   PQAPVDSTHEG FGSH                VDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSDK+AVASNDL+ VYDVSKADCVVGS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHT---------------VDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGS

Query:  AQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRVKRFTKDSALSE  EQ LKQNAATVSIEPTDR +Q GPENPS SGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKFLKANNPHL
        KEVTVNNKFLK NNPHL
Subjt:  KEVTVNNKFLKANNPHL

TrEMBL top hitse value%identityAlignment
A0A0A0L6G7 Chromo domain-containing protein5.3e-17682.09Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALPIPDFTQSTHLNGDS PSISNNNG+EP I  P+ PSSL N+ VQIPLP D+AG V+GEDN +PDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSC EFI+E+E  FC SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVIST+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCDFP--QAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
         N++LH D P  Q P+DS HEG      +DGKFDGSRKKDEYDLKLI+  ASIS NMVDS+K+ VASND+S VYDVSKADCVVGSAQ SHS GAKRRKSS
Subjt:  LNRELHCDFP--QAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKDSAL    EQGLKQNAATVSIEP D  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HL
        HL
Subjt:  HL

A0A1S3AVS6 chromo domain protein LHP1-like2.5e-17882.59Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNNNG+E  IP P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSC EFI+E+E  FC SRSGKQRKRKRKDGDIE++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
        LN++LH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKADCVVGSAQGSHS GAKRRKSS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKDSALS   EQGLKQNAATV IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HL
        HL
Subjt:  HL

A0A5A7U472 Chromo domain protein LHP1-like5.5e-22679.3Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNNNG+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEP+DNLQSC EFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
        LN++LH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKADCVVGSAQGSHS GAKRRKSS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKDSALS   EQGLKQNAATV IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P   P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

A0A5D3D0R0 Chromo domain protein LHP1-like3.6e-23380.77Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKA GSSEPETVALP PDFTQSTHLNGDS PSISNNNG+E  I  P+PPSSL N+ VQIPLPTD+AG V+GEDNAVPDVSASERTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEP+DNLQSC EFI+E+E  FC SRSGKQRKRKRKDGD+E++  EEK LQI+AIDNVTDVVI+T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
        LN++LH D   PQ P+DS HEG      +D KFDGSRK+DEYD+KLI+  AS+S NMVDSDK+ VASND+S VYDVSKADCVVGSAQGSHS GAKRRKSS
Subjt:  LNRELHCD--FPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKDSALS   EQGLKQNAATV IEPTD  EQ+GP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI
        HL   FS V    +  P   P    S +  +LI Q +IMGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLLLVH LSI
Subjt:  HLGSSFSKVTPTTNKQP--SPPQYHSFATQVLI-QREIMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSI

Query:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL
        FFPVLH SSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLY LL
Subjt:  FFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL

A0A6J1GGE8 chromo domain-containing protein LHP1-like2.6e-15172.17Show/hide
Query:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE
        MGRGKKKAVGSSE E +ALP+P FT ST +NGDS PS SNNNGNE  I S +P SS+QNS VQ PL   E GEV GE+NAV DV+ASE T LD+GFF VE
Subjt:  MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP
        AIRRKRVRKGQLQYLVKW GWPET NTWEP DNLQSC+EFI+EFE     SRSGKQRKRKRKDGD  NQ  EEKQ +++A DNVT+V +STVDD LSA P
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAP

Query:  LNRELHCDF--PQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS
        LN  +H D   PQ  +DST      +  +  KFDGSR++DEYDLKLIELKA+ISANMVDSDK+A +S DL  VYD SKADC VGS QGSHSIGAKRRKSS
Subjt:  LNRELHCDF--PQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSS

Query:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTK++  SEN +  LKQN   V++EPTD+ EQ+GPENPS SGHSRNV+TI RIIKPVGYSVSVSNNIPDV+VTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNP

Query:  HLGSSF
         L  +F
Subjt:  HLGSSF

SwissProt top hitse value%identityAlignment
P05205 Heterochromatin protein 16.3e-0938.2Show/hide
Query:  EVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKR
        ++D  +++     A E    +E  + VE I  +RVRKG+++Y +KW+G+PETENTWEP +NL  C + I ++EAS          K+ R
Subjt:  EVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKR

P45973 Chromobox protein homolog 52.4e-0839.74Show/hide
Query:  SASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRK
        +A   ++ DE  + VE +  +RV KGQ++YL+KW+G+ E  NTWEP  NL  C E I EF   +   + G+  K + K
Subjt:  SASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRK

Q339W7 Probable chromo domain-containing protein LHP14.0e-3233.53Show/hide
Query:  EDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDI--ENQPHEEK
        E  A   V       L EG++E+E IRR+R+RKG+LQYLVKWRGWPE+ NTWEP++NL +CS+ ID FE    S R G++RKRK     +   N  H ++
Subjt:  EDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDI--ENQPHEEK

Query:  QLQIVAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDG-KFDGSRKKDEYDLKLIELKAS------------ISANMVDSDKQ
            +   + T           + AP  ++L C                S TV G    GS  +++    +++  +S            +S  + D   +
Subjt:  QLQIVAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDG-KFDGSRKKDEYDLKLIELKAS------------ISANMVDSDKQ

Query:  AVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGY
            N  S   ++ K    V  +QG    GAK+RKS  V+RF ++        QG  +  A V  E     E    +     G    V  IT+IIKPV +
Subjt:  AVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIKPVGY

Query:  SVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF
        + +V+N++  V +TF A+RSDG+EV V++K LKANNP L  S+
Subjt:  SVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF

Q944N1 Chromo domain protein LHP11.9e-3736.02Show/hide
Query:  DEAGEVDGEDNAVPD-VSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIE
        DEA EV   +    D V+   +  L EGF+E+E +RR+R  KG++ YL+KWRGWPE+ NTWEP  NL SC++ ID +E S    +SGK R+RKRK G  +
Subjt:  DEAGEVDGEDNAVPD-VSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIE

Query:  NQPHEEKQLQI---VAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDGKFDGS----RKKDEYDLKLIELKASISANMVDSDK
          P  ++Q +    VA  N   V +  +++   + PLN     D     VDS   G   +  VD   +G+    R+++E +LKL ELK + S N    D 
Subjt:  NQPHEEKQLQI---VAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDGKFDGS----RKKDEYDLKLIELKASISANMVDSDK

Query:  QAVASNDLSAVY-DVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKD--SALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIK
          ++ N L+  +  V+ A+      Q     GAK+RKS  V+RF ++  SA+ ++ +  L        ++       +  ++      S++  TIT+++ 
Subjt:  QAVASNDLSAVY-DVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKD--SALSENFEQGLKQNAATVSIEPTDRREQIGPENPSFSGHSRNVSTITRIIK

Query:  PVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF
        PV Y  S SN++ DV VTF+A R+DG  V V+NKFLK NNP L  +F
Subjt:  PVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSF

Q946J8 Chromo domain-containing protein LHP16.4e-4637.23Show/hide
Query:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRK--------
        DE GE +G           ER  LDEGF+E+EAIRRKRVRKG++QYL+KWRGWPET NTWEP++NLQS ++ ID FE S    + G++RKRK        
Subjt:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRK--------

Query:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------
        +K   + +  H+  EK     +++N +   I    D   ++ LNR++  +   A V +  E   GS  +  +      + EYD  L EL+  ++      
Subjt:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------

Query:  ---ANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAA--TVSIEPTDRREQIGPENPSFSGHSR
              + S+   V  N L  VY   + D      + S  IGAKRRKS  VKRF +D + S N      QN      +++   R  ++G E P     + 
Subjt:  ---ANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAA--TVSIEPTDRREQIGPENPSFSGHSR

Query:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP
        N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N+FLKA+NPHL   F +     N+ P
Subjt:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP

Arabidopsis top hitse value%identityAlignment
AT5G17690.1 like heterochromatin protein (LHP1)4.6e-4737.23Show/hide
Query:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRK--------
        DE GE +G           ER  LDEGF+E+EAIRRKRVRKG++QYL+KWRGWPET NTWEP++NLQS ++ ID FE S    + G++RKRK        
Subjt:  DEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRK--------

Query:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------
        +K   + +  H+  EK     +++N +   I    D   ++ LNR++  +   A V +  E   GS  +  +      + EYD  L EL+  ++      
Subjt:  RKDGDIENQPHE--EKQLQIVAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEGGFGSHTVDGKFDGSRKKDEYDLKLIELKASIS------

Query:  ---ANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAA--TVSIEPTDRREQIGPENPSFSGHSR
              + S+   V  N L  VY   + D      + S  IGAKRRKS  VKRF +D + S N      QN      +++   R  ++G E P     + 
Subjt:  ---ANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAA--TVSIEPTDRREQIGPENPSFSGHSR

Query:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP
        N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N+FLKA+NPHL   F +     N+ P
Subjt:  NVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGAGGGAAGAAGAAAGCGGTGGGAAGCTCTGAGCCTGAGACAGTGGCGCTTCCAATCCCTGATTTCACTCAATCTACTCATCTTAATGGAGATTCAGGCCCTTC
CATCTCTAACAACAATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTCTTCACTTCAGAATAGCTATGTGCAAATTCCACTACCCACCGATGAGGCCGGAGAAG
TCGACGGAGAAGATAATGCTGTACCTGATGTTTCTGCTTCCGAGCGAACTAACCTCGACGAAGGCTTCTTCGAAGTCGAAGCTATTCGGCGGAAAAGAGTTCGTAAGGGA
CAGCTTCAGTACCTCGTCAAATGGCGTGGGTGGCCAGAGACAGAAAATACATGGGAACCCGTGGACAATCTCCAATCATGCTCTGAATTTATTGACGAATTTGAAGCAAG
CTTTTGTAGCTCGCGATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGCTATTGATAATG
TCACGGATGTAGTTATCAGTACTGTGGATGATCGTCTATCGGCGGCTCCTTTAAACAGAGAACTTCATTGTGATTTTCCTCAAGCACCGGTAGACTCTACTCATGAAGGA
GGGTTTGGAAGCCATACCGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGACGAATATGATCTGAAACTTATTGAGCTGAAGGCATCAATCTCTGCCAATATGGTTGA
TTCTGATAAACAAGCAGTGGCTTCTAACGATCTTAGCGCTGTTTATGATGTTTCGAAGGCCGATTGCGTGGTGGGTTCTGCTCAGGGAAGTCACTCCATTGGAGCCAAGA
GAAGGAAGTCTAGTAGGGTGAAAAGGTTCACTAAGGATTCAGCCTTGTCTGAAAACTTTGAACAAGGATTAAAACAAAATGCAGCGACTGTAAGCATTGAGCCTACTGAT
CGAAGAGAACAAATAGGACCCGAGAATCCTAGTTTTTCAGGCCACTCCAGAAATGTGTCTACCATCACAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGTATCAAA
TAACATCCCAGATGTAATCGTAACCTTCTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGTGAATAACAAATTTCTTAAGGCTAACAATCCACATCTGGGGTCAAGCT
TTTCCAAAGTCACTCCTACCACCAACAAACAGCCTTCTCCCCCTCAATATCACTCTTTTGCAACCCAGGTTCTGATTCAGAGAGAGATAATGGGATATACTACTGCAGGA
CCATCACCTGTTGTTCCTTGTATCATAGTTGGATTTCTGGGGATGATAATATTTTGGCCAACTCTTCAATCCATCTGGGAGAGTGTAGAGTCTCTACTTGAACTGGGCAT
TTGGGTTGCAGTGATTCTTCTTTTCCTCTTACTGCTTGTACATTTGCTTTCTATTTTCTTTCCTGTTCTTCATGTTTCATCCACTTTTGCAGTTCAGCATTCCAGCAGCC
CTGGCTACGATGCTGATGGATTTGGTTTCGGGTCAGGAGCATTGTTTCTAGGTCTTCTCTTCCTTGTCCTCTATAATCTGTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGAGGGAAGAAGAAAGCGGTGGGAAGCTCTGAGCCTGAGACAGTGGCGCTTCCAATCCCTGATTTCACTCAATCTACTCATCTTAATGGAGATTCAGGCCCTTC
CATCTCTAACAACAATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTCTTCACTTCAGAATAGCTATGTGCAAATTCCACTACCCACCGATGAGGCCGGAGAAG
TCGACGGAGAAGATAATGCTGTACCTGATGTTTCTGCTTCCGAGCGAACTAACCTCGACGAAGGCTTCTTCGAAGTCGAAGCTATTCGGCGGAAAAGAGTTCGTAAGGGA
CAGCTTCAGTACCTCGTCAAATGGCGTGGGTGGCCAGAGACAGAAAATACATGGGAACCCGTGGACAATCTCCAATCATGCTCTGAATTTATTGACGAATTTGAAGCAAG
CTTTTGTAGCTCGCGATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGCTATTGATAATG
TCACGGATGTAGTTATCAGTACTGTGGATGATCGTCTATCGGCGGCTCCTTTAAACAGAGAACTTCATTGTGATTTTCCTCAAGCACCGGTAGACTCTACTCATGAAGGA
GGGTTTGGAAGCCATACCGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGACGAATATGATCTGAAACTTATTGAGCTGAAGGCATCAATCTCTGCCAATATGGTTGA
TTCTGATAAACAAGCAGTGGCTTCTAACGATCTTAGCGCTGTTTATGATGTTTCGAAGGCCGATTGCGTGGTGGGTTCTGCTCAGGGAAGTCACTCCATTGGAGCCAAGA
GAAGGAAGTCTAGTAGGGTGAAAAGGTTCACTAAGGATTCAGCCTTGTCTGAAAACTTTGAACAAGGATTAAAACAAAATGCAGCGACTGTAAGCATTGAGCCTACTGAT
CGAAGAGAACAAATAGGACCCGAGAATCCTAGTTTTTCAGGCCACTCCAGAAATGTGTCTACCATCACAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGTATCAAA
TAACATCCCAGATGTAATCGTAACCTTCTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGTGAATAACAAATTTCTTAAGGCTAACAATCCACATCTGGGGTCAAGCT
TTTCCAAAGTCACTCCTACCACCAACAAACAGCCTTCTCCCCCTCAATATCACTCTTTTGCAACCCAGGTTCTGATTCAGAGAGAGATAATGGGATATACTACTGCAGGA
CCATCACCTGTTGTTCCTTGTATCATAGTTGGATTTCTGGGGATGATAATATTTTGGCCAACTCTTCAATCCATCTGGGAGAGTGTAGAGTCTCTACTTGAACTGGGCAT
TTGGGTTGCAGTGATTCTTCTTTTCCTCTTACTGCTTGTACATTTGCTTTCTATTTTCTTTCCTGTTCTTCATGTTTCATCCACTTTTGCAGTTCAGCATTCCAGCAGCC
CTGGCTACGATGCTGATGGATTTGGTTTCGGGTCAGGAGCATTGTTTCTAGGTCTTCTCTTCCTTGTCCTCTATAATCTGTTGTAA
Protein sequenceShow/hide protein sequence
MGRGKKKAVGSSEPETVALPIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPSSLQNSYVQIPLPTDEAGEVDGEDNAVPDVSASERTNLDEGFFEVEAIRRKRVRKG
QLQYLVKWRGWPETENTWEPVDNLQSCSEFIDEFEASFCSSRSGKQRKRKRKDGDIENQPHEEKQLQIVAIDNVTDVVISTVDDRLSAAPLNRELHCDFPQAPVDSTHEG
GFGSHTVDGKFDGSRKKDEYDLKLIELKASISANMVDSDKQAVASNDLSAVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSENFEQGLKQNAATVSIEPTD
RREQIGPENPSFSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKFLKANNPHLGSSFSKVTPTTNKQPSPPQYHSFATQVLIQREIMGYTTAG
PSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQHSSSPGYDADGFGFGSGALFLGLLFLVLYNLL