; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020861 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020861
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionChromo domain protein LHP1-like
Genome locationchr7:2666838..2670686
RNA-Seq ExpressionLag0020861
SyntenyLag0020861
Gene Ontology termsGO:0031507 - heterochromatin assembly (biological process)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR008251 - Chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR023779 - Chromo domain, conserved site
IPR023780 - Chromo domain
IPR044251 - Chromo domain-containing protein LHP1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049056.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]2.6e-18566.25Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK  GSSEPE +ALP P FT  TH+NGDS PS  NNNG E  IS P P SS+ N+SVQ P+ TD+ G VN       GE+NA   VSASE T LD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        EGFFEVEAIRRKRVRK Q +++   RGWPET NTWEPLDNLQSC EFI+E+E+    S+SGKQRKRKRKDGD+E++ QEEK   I+AI+NVTDV I+T+D
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        DRLS+APLN  +  DLP PQ P+DS  EGE                    +D KFDGSRKRDEYD+KL +  A++S NMVDSDKK+  S D+  VYDVSK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC+VGS QGSHS GAKRRKS+RVKRFT D AL   SEQGLKQN  TV  EPTD +EQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDV+VTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL
        L VRSDGKEVTVNNKFLKANNP L+                             + +IMG+ T+GPSP+VPCIIVGFLG+IIFWPTL SIWES+E LLEL
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL

Query:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL
        GIWVAVILLFLLLLVH LSIFFPVLH SSTFAVQ++SSP +DADGFGFG G LFL LLFLVLY LL
Subjt:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL

KAG6597211.1 Chromo domain protein LHP1, partial [Cucurbita argyrosperma subsp. sororia]1.1e-16474.48Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK VGSSE EA+ALP+PGFTD T VNGDS PSN NNNG+E IISS FP+SS+QNSSVQTP++  EGGEV+       GEENA A V+ASELTKLD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        +GFF VEAIRRKRVRKGQLQYLVKW GWPETANTWEP DNLQSC+EFI+EFEK    S+SGKQRKRKRKDGD  NQ QEEKQ+ ++A +NVT+V +STVD
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        D LS+ PLN  I  DLPTPQV +DS               TGT  VENG +  KFDGSR+RDEYDLKL ELKAAISANMVDSD K+E SKDLG VYD SK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC VGSTQGSHSIGAKRRKS+RVKRFT +   SE+SEQ LKQNG+ V+ EPTDQNEQLGP+NPSLSGHSRNV+TI RIIKPVGYSVSVSNNIPDVVVTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER
        L VRSDGKEVTVNNKFLKANNPLL++ F E+
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER

TYK17507.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]2.2e-19267.67Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK  GSSEPE +ALP P FT  TH+NGDS PS  NNNG E  IS P P SS+ N+SVQ P+ TD+ G VN       GE+NA   VSASE T LD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        EGFFEVEAIRRKRVRKGQLQYLVKWRGWPET NTWEPLDNLQSC EFI+E+E+    S+SGKQRKRKRKDGD+E++ QEEK   I+AI+NVTDV I+T+D
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        DRLS+APLN  +  DLP PQ P+DS  EGE                    +D KFDGSRKRDEYD+KL +  A++S NMVDSDKK+  S D+  VYDVSK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC+VGS QGSHS GAKRRKS+RVKRFT D AL   SEQGLKQN  TV  EPTD +EQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDV+VTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL
        L VRSDGKEVTVNNKFLKANNP L+                             + +IMG+ T+GPSP+VPCIIVGFLG+IIFWPTL SIWES+E LLEL
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL

Query:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL
        GIWVAVILLFLLLLVH LSIFFPVLH SSTFAVQ++SSP +DADGFGFG G LFL LLFLVLY LL
Subjt:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL

XP_038877763.1 probable chromo domain-containing protein LHP1 isoform X1 [Benincasa hispida]1.4e-17877.96Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK VGSSEPE  ALPIP FT  TH+NGDS PS  NNNG+EPII SP+P SS+QNSSVQ P+ TD+ GEVN        E+NA   VSAS  T LD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        EGFFEVEAIRRKRVRKGQLQYLVKWRGWPET NTWEPLDNLQ+CSEFIDEFE+S   S+SGKQRKRKRKDGDIENQP+EEKQ  +LAI+NVTDV I TVD
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        DRLS+APLNG   CDLP PQ PVDST EGEFGSHLNH+KTT T+ VENGHVDGKFDGSRKRDEYDLKL ELKAAISANMVDSDKK+  S DL  VYDVSK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC+VGS Q SHSIGAKRRKS+RVKRFT D ALSE+SEQ LKQN  TVS EPTD+++Q GP+NPSLSGHSRNV TITRIIKPVGYSVSV NNIPDV+VTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER
        L VRSDGKEVTVNNKFLK NNP L++ + E+
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER

XP_038877764.1 chromo domain protein LHP1-like isoform X2 [Benincasa hispida]1.2e-17978.27Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK VGSSEPE  ALPIP FT  TH+NGDS PS  NNNG+EPII SP+P SS+QNSSVQ P+ TD+ GEVN        E+NA   VSAS  T LD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVDDRL
        EGFFEVEAIRRKRVRKGQLQYLVKWRGWPET NTWEPLDNLQ+CSEFIDEFE+ S+SGKQRKRKRKDGDIENQP+EEKQ  +LAI+NVTDV I TVDDRL
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVDDRL

Query:  SSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADC
        S+APLNG   CDLP PQ PVDST EGEFGSHLNH+KTT T+ VENGHVDGKFDGSRKRDEYDLKL ELKAAISANMVDSDKK+  S DL  VYDVSKADC
Subjt:  SSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADC

Query:  MVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVV
        +VGS Q SHSIGAKRRKS+RVKRFT D ALSE+SEQ LKQN  TVS EPTD+++Q GP+NPSLSGHSRNV TITRIIKPVGYSVSV NNIPDV+VTFL V
Subjt:  MVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVV

Query:  RSDGKEVTVNNKFLKANNPLLVLRFRER
        RSDGKEVTVNNKFLK NNP L++ + E+
Subjt:  RSDGKEVTVNNKFLKANNPLLVLRFRER

TrEMBL top hitse value%identityAlignment
A0A5A7U472 Chromo domain protein LHP1-like1.2e-18566.25Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK  GSSEPE +ALP P FT  TH+NGDS PS  NNNG E  IS P P SS+ N+SVQ P+ TD+ G VN       GE+NA   VSASE T LD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        EGFFEVEAIRRKRVRK Q +++   RGWPET NTWEPLDNLQSC EFI+E+E+    S+SGKQRKRKRKDGD+E++ QEEK   I+AI+NVTDV I+T+D
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        DRLS+APLN  +  DLP PQ P+DS  EGE                    +D KFDGSRKRDEYD+KL +  A++S NMVDSDKK+  S D+  VYDVSK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC+VGS QGSHS GAKRRKS+RVKRFT D AL   SEQGLKQN  TV  EPTD +EQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDV+VTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL
        L VRSDGKEVTVNNKFLKANNP L+                             + +IMG+ T+GPSP+VPCIIVGFLG+IIFWPTL SIWES+E LLEL
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL

Query:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL
        GIWVAVILLFLLLLVH LSIFFPVLH SSTFAVQ++SSP +DADGFGFG G LFL LLFLVLY LL
Subjt:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL

A0A5D3D0R0 Chromo domain protein LHP1-like1.1e-19267.67Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK  GSSEPE +ALP P FT  TH+NGDS PS  NNNG E  IS P P SS+ N+SVQ P+ TD+ G VN       GE+NA   VSASE T LD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        EGFFEVEAIRRKRVRKGQLQYLVKWRGWPET NTWEPLDNLQSC EFI+E+E+    S+SGKQRKRKRKDGD+E++ QEEK   I+AI+NVTDV I+T+D
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEK---SSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        DRLS+APLN  +  DLP PQ P+DS  EGE                    +D KFDGSRKRDEYD+KL +  A++S NMVDSDKK+  S D+  VYDVSK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC+VGS QGSHS GAKRRKS+RVKRFT D AL   SEQGLKQN  TV  EPTD +EQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDV+VTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL
        L VRSDGKEVTVNNKFLKANNP L+                             + +IMG+ T+GPSP+VPCIIVGFLG+IIFWPTL SIWES+E LLEL
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLR-----------------------FRERKREIMGH-TAGPSPLVPCIIVGFLGMIIFWPTLQSIWESLESLLEL

Query:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL
        GIWVAVILLFLLLLVH LSIFFPVLH SSTFAVQ++SSP +DADGFGFG G LFL LLFLVLY LL
Subjt:  GIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL

A0A6J1EA84 chromo domain-containing protein LHP1-like isoform X21.7e-15871.26Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGR KKK  GSSEPE + LPI G TD THVNGDS  S  N+NG+EP+I+SP+P+SSVQNSSVQTP+VTDE GEVNG  DG+ GE NAA  VSA E TK D
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVDDRL
        EGFFEVE+I RKRVRKGQLQYLVKW GWP+TANTWEP DNLQSCSE IDEFE+SS+SGKQRKRKRK G +ENQ +E+K+HH LA NNVTD+ ISTVDD +
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVDDRL

Query:  SSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADC
        S+ PLN  I CDLPTPQ PVD                     VENGH++G F GSRKRD++DLKLSELKAA+SANMVDSDKK+  S DL  VYDVSK DC
Subjt:  SSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADC

Query:  MVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVV
        +VGS Q SHSIG+KRRKS+RVKRFT D ALSEDSEQGLK+N  T+S EPTD+NE+L  +NPSLSGHSR VS ITRIIKPVGYSVSVSN IPDV VTFLV+
Subjt:  MVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVV

Query:  RSDGKEVTVNNKFLKANNPLLVLRFRER
        RSDGKEVTV+NKFLK NNP L++ F E+
Subjt:  RSDGKEVTVNNKFLKANNPLLVLRFRER

A0A6J1GGE8 chromo domain-containing protein LHP1-like7.8e-16474.01Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK VGSSE EA+ALP+PGFTD T VNGDS PSN NNNG+E  ISS FP+SS+QNSSVQTP++  EGGEV+       GEENA A V+ASELTKLD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        +GFF VEAIRRKRVRKGQLQYLVKW GWPETANTWEP DNLQSC+EFI+EFEK    S+SGKQRKRKRKDGD  NQ QEEKQ+ ++A +NVT+V +STVD
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        D LS+ PLN  I  DLPTPQV +DS               TGT  VENG +  KFDGSR+RDEYDLKL ELKAAISANMVDSDKK+E SKDLG VYD SK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC VGSTQGSHSIGAKRRKS+RVKRFT +   SE+S+  LKQNG+ V+ EPTDQNEQLGP+NPSLSGHSRNV+TI RIIKPVGYSVSVSNNIPDVVVTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER
        L VRSDGKEVTVNNKFLKANNPLL++ F E+
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER

A0A6J1IGX1 chromo domain-containing protein LHP1-like3.0e-16374.01Show/hide
Query:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD
        MGRGKKK VGSSE EA+ALP+PGFTD T VNGDS PSN NNNG+E +ISS FP+SS+QNSSVQTP++  EGGEVN       GEENA A V+ASELTKLD
Subjt:  MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD
        +GFF VEAIRRKRVRKGQLQYLVKW GWPETANTWEP DNLQSC+EFI+EFEK    S+SGKQRKRKRKDGD  NQ QEEKQ+ ++A +NVT+V +STVD
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKS---SQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVD

Query:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK
        D LS+ PLN  I  DLPTPQV +DS               TGT  VENG +  KFDGSRKRDEYDLKL EL AAISANMVDSD K+E SKDLG VYD SK
Subjt:  DRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSK

Query:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF
        ADC VGSTQGSHSIGAKRRKS+RVKRFT + A +E+SEQ LKQNGVTV+    DQNEQ+GP+NPSLSGHSRNV+TI RIIKPVGYSVSVSNNIPDVVVTF
Subjt:  ADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTF

Query:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER
        L VRSDGKEVTVNNKFLKANNPLL++ F E+
Subjt:  LVVRSDGKEVTVNNKFLKANNPLLVLRFRER

SwissProt top hitse value%identityAlignment
O95931 Chromobox protein homolog 71.8e-0843.66Show/hide
Query:  ELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRKRKR
        EL+ + E  F VE+IR+KRVRKG+++YLVKW+GWP   +TWEP +++      +   EK  +      RKR
Subjt:  ELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRKRKR

P60889 Chromobox protein homolog 78.2e-0942.25Show/hide
Query:  ELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNL---QSCSEFIDEFEKSSQSGKQRK
        EL+ + E  F VE+IR+KRVRKG+++YLVKW+GWP   +TWEP +++   +    + ++ EK   SG +++
Subjt:  ELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNL---QSCSEFIDEFEKSSQSGKQRK

Q339W7 Probable chromo domain-containing protein LHP11.2e-3132.43Show/hide
Query:  EGGEVNGDGDGESG-------EENAAAHVSASELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQ-R
        EGGE +     E         E  AAA V      KL EG++E+E IRR+R+RKG+LQYLVKWRGWPE+ANTWEPL+NL +CS+ ID FE   QS +  R
Subjt:  EGGEVNGDGDGESG-------EENAAAHVSASELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQ-R

Query:  KRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEY
        KRKRK                  I        +    +     L+       P P+      R     +    SKT   +      V  +   +  ++  
Subjt:  KRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEY

Query:  DLKLSEL---KAAISANMVDSDKKSEGSKDLGHVYDVSKADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGP
           +S     +  +S  + D   +        +  ++ K    V  +QG    GAK+RKS  V+RF           QG  + G  V  E     E    
Subjt:  DLKLSEL---KAAISANMVDSDKKSEGSKDLGHVYDVSKADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLGP

Query:  KNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRER
              G    V  IT+IIKPV ++ +V+N++  V +TF  +RSDG+EV V++K LKANNPLL++ + E+
Subjt:  KNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRER

Q944N1 Chromo domain protein LHP16.0e-4437.3Show/hide
Query:  VNGDGDGESGEENAAAHVSASE-----------LTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRK
        V G+G+GE  E++ A  V   E             KL EGF+E+E +RR+R  KG++ YL+KWRGWPE+ANTWEP  NL SC++ ID +E+S +SGK R+
Subjt:  VNGDGDGESGEENAAAHVSASE-----------LTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRK

Query:  RKRKDGDIENQPQEEKQHHI---LAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRD
        RKRK G  +  P  ++Q      +A  N   V +  +++   S PLN   + DL      VDS      GS LN SK           V+G     R+++
Subjt:  RKRKDGDIENQPQEEKQHHI---LAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRD

Query:  EYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADCMVGSTQGSHSIGAKRRKSNRVKRF--TTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLG
        E +LKLSELK A S N    D    G  +      V+ A+      Q     GAK+RKS  V+RF   T  A+ +D++  L    +    +    N  + 
Subjt:  EYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADCMVGSTQGSHSIGAKRRKSNRVKRF--TTDIALSEDSEQGLKQNGVTVSTEPTDQNEQLG

Query:  PKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRE
             ++  S++  TIT+++ PV Y  S SN++ DV VTF+  R+DG  V V+NKFLK NNPLL++ F E
Subjt:  PKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRE

Q946J8 Chromo domain-containing protein LHP11.3e-4336.98Show/hide
Query:  DEGGEVNGDGDGESGEENAAAHVSASELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQ-RKRKRKD
        D G E + +G+GE G+E         E  KLDEGF+E+EAIRRKRVRKG++QYL+KWRGWPETANTWEPL+NLQS ++ ID FE S + GK  RKRKRK 
Subjt:  DEGGEVNGDGDGESGEENAAAHVSASELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQ-RKRKRKD

Query:  GDIENQPQEEKQHHILAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVEN--GHVDGKFDGSRK------RD
            +Q +++++        +T       +   SS  LN +   D+P P        +    S LN        YV N      G    +R+        
Subjt:  GDIENQPQEEKQHHILAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVEN--GHVDGKFDGSRK------RD

Query:  EYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADCMV-----GSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNE
        EYD  L+EL+  +  N  +    S+G   +G   D  + + ++        + S  IGAKRRKS  VKRF  D + S +      QN +T      D   
Subjt:  EYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADCMV-----GSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNE

Query:  QLGPKNPSLSGHSRNVS----------TITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRER
        ++        G   N +           IT+I+KP+ ++ SVS+N+ +V+VTFL +RSDGKE  V+N+FLKA+NP L++ F E+
Subjt:  QLGPKNPSLSGHSRNVS----------TITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRER

Arabidopsis top hitse value%identityAlignment
AT5G17690.1 like heterochromatin protein (LHP1)9.5e-4536.98Show/hide
Query:  DEGGEVNGDGDGESGEENAAAHVSASELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQ-RKRKRKD
        D G E + +G+GE G+E         E  KLDEGF+E+EAIRRKRVRKG++QYL+KWRGWPETANTWEPL+NLQS ++ ID FE S + GK  RKRKRK 
Subjt:  DEGGEVNGDGDGESGEENAAAHVSASELTKLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQ-RKRKRKD

Query:  GDIENQPQEEKQHHILAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVEN--GHVDGKFDGSRK------RD
            +Q +++++        +T       +   SS  LN +   D+P P        +    S LN        YV N      G    +R+        
Subjt:  GDIENQPQEEKQHHILAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPVDSTREGEFGSHLNHSKTTGTVYVEN--GHVDGKFDGSRK------RD

Query:  EYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADCMV-----GSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNE
        EYD  L+EL+  +  N  +    S+G   +G   D  + + ++        + S  IGAKRRKS  VKRF  D + S +      QN +T      D   
Subjt:  EYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADCMV-----GSTQGSHSIGAKRRKSNRVKRFTTDIALSEDSEQGLKQNGVTVSTEPTDQNE

Query:  QLGPKNPSLSGHSRNVS----------TITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRER
        ++        G   N +           IT+I+KP+ ++ SVS+N+ +V+VTFL +RSDGKE  V+N+FLKA+NP L++ F E+
Subjt:  QLGPKNPSLSGHSRNVS----------TITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGAGGGAAGAAGAAGACGGTGGGAAGCTCTGAGCCTGAGGCTTTGGCGCTTCCAATCCCTGGTTTCACTGATCCAACTCACGTTAATGGAGATTCAGTTCCTTC
CAACTGTAACAACAATGGCGACGAACCTATAATTTCATCTCCATTTCCATCTTCTTCGGTTCAGAACAGTTCTGTCCAAACTCCAGTAGTCACCGACGAAGGTGGAGAAG
TCAACGGAGACGGCGATGGCGAAAGCGGAGAAGAGAATGCTGCAGCTCATGTTTCTGCCTCCGAGCTAACAAAGCTTGATGAAGGCTTCTTCGAAGTTGAAGCTATTCGG
CGGAAAAGAGTTCGTAAGGGACAGCTTCAGTACCTCGTCAAATGGCGTGGCTGGCCAGAGACTGCTAATACATGGGAACCCTTGGACAATCTCCAATCATGCTCTGAATT
TATTGATGAATTTGAAAAAAGCTCACAATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACATAGAAAATCAACCTCAGGAGGAAAAACAGCATCACATCTTAG
CTATTAATAATGTCACAGATGTAGATATCAGTACTGTGGATGATCGTCTATCGTCTGCTCCTTTAAACGGCAATATTTCTTGTGATCTTCCTACTCCTCAAGTACCTGTA
GACTCCACTCGTGAAGGAGAGTTTGGCAGCCACCTTAATCATTCCAAAACTACTGGAACAGTTTATGTCGAAAATGGACATGTGGATGGGAAATTTGATGGAAGTAGAAA
GAGAGATGAATATGATCTGAAACTTAGTGAGCTCAAGGCAGCAATATCTGCCAATATGGTTGATTCTGATAAGAAATCAGAGGGTTCTAAAGATCTTGGGCATGTCTATG
ATGTTTCCAAGGCCGATTGCATGGTGGGTTCCACTCAGGGAAGTCACTCCATTGGAGCCAAGAGGAGGAAGTCTAATAGGGTGAAAAGGTTCACTACGGACATAGCCTTG
TCTGAAGACTCTGAACAAGGATTAAAACAAAATGGAGTGACTGTAAGCACTGAGCCGACTGATCAAAACGAACAATTGGGGCCCAAGAATCCTAGTTTGTCAGGCCACTC
CAGAAATGTGTCTACTATCACAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGTATCAAATAACATCCCAGATGTAGTCGTAACCTTCTTGGTTGTGAGGTCTGATG
GAAAAGAAGTGACGGTGAATAACAAATTTCTTAAGGCTAACAATCCACTTCTGGTTCTAAGATTCAGGGAGAGGAAGAGAGAGATTATGGGGCATACTGCAGGGCCATCA
CCTCTTGTTCCTTGTATCATAGTTGGATTTCTAGGGATGATAATATTTTGGCCAACACTTCAATCCATTTGGGAGAGTTTAGAGTCTCTACTTGAACTGGGTATTTGGGT
TGCAGTGATTCTTCTTTTCCTTCTTCTACTTGTGCACTTGCTTTCTATTTTCTTTCCCGTTCTTCATGTTTCATCCACTTTTGCAGTTCAGTATACCAGCAGCCCTAGCC
ATGATGCTGACGGATTCGGTTTCGGGTTAGGAACATTGTTTCTAGTTCTTCTCTTCCTTGTCCTCTATAATCTATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGAGGGAAGAAGAAGACGGTGGGAAGCTCTGAGCCTGAGGCTTTGGCGCTTCCAATCCCTGGTTTCACTGATCCAACTCACGTTAATGGAGATTCAGTTCCTTC
CAACTGTAACAACAATGGCGACGAACCTATAATTTCATCTCCATTTCCATCTTCTTCGGTTCAGAACAGTTCTGTCCAAACTCCAGTAGTCACCGACGAAGGTGGAGAAG
TCAACGGAGACGGCGATGGCGAAAGCGGAGAAGAGAATGCTGCAGCTCATGTTTCTGCCTCCGAGCTAACAAAGCTTGATGAAGGCTTCTTCGAAGTTGAAGCTATTCGG
CGGAAAAGAGTTCGTAAGGGACAGCTTCAGTACCTCGTCAAATGGCGTGGCTGGCCAGAGACTGCTAATACATGGGAACCCTTGGACAATCTCCAATCATGCTCTGAATT
TATTGATGAATTTGAAAAAAGCTCACAATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACATAGAAAATCAACCTCAGGAGGAAAAACAGCATCACATCTTAG
CTATTAATAATGTCACAGATGTAGATATCAGTACTGTGGATGATCGTCTATCGTCTGCTCCTTTAAACGGCAATATTTCTTGTGATCTTCCTACTCCTCAAGTACCTGTA
GACTCCACTCGTGAAGGAGAGTTTGGCAGCCACCTTAATCATTCCAAAACTACTGGAACAGTTTATGTCGAAAATGGACATGTGGATGGGAAATTTGATGGAAGTAGAAA
GAGAGATGAATATGATCTGAAACTTAGTGAGCTCAAGGCAGCAATATCTGCCAATATGGTTGATTCTGATAAGAAATCAGAGGGTTCTAAAGATCTTGGGCATGTCTATG
ATGTTTCCAAGGCCGATTGCATGGTGGGTTCCACTCAGGGAAGTCACTCCATTGGAGCCAAGAGGAGGAAGTCTAATAGGGTGAAAAGGTTCACTACGGACATAGCCTTG
TCTGAAGACTCTGAACAAGGATTAAAACAAAATGGAGTGACTGTAAGCACTGAGCCGACTGATCAAAACGAACAATTGGGGCCCAAGAATCCTAGTTTGTCAGGCCACTC
CAGAAATGTGTCTACTATCACAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGTATCAAATAACATCCCAGATGTAGTCGTAACCTTCTTGGTTGTGAGGTCTGATG
GAAAAGAAGTGACGGTGAATAACAAATTTCTTAAGGCTAACAATCCACTTCTGGTTCTAAGATTCAGGGAGAGGAAGAGAGAGATTATGGGGCATACTGCAGGGCCATCA
CCTCTTGTTCCTTGTATCATAGTTGGATTTCTAGGGATGATAATATTTTGGCCAACACTTCAATCCATTTGGGAGAGTTTAGAGTCTCTACTTGAACTGGGTATTTGGGT
TGCAGTGATTCTTCTTTTCCTTCTTCTACTTGTGCACTTGCTTTCTATTTTCTTTCCCGTTCTTCATGTTTCATCCACTTTTGCAGTTCAGTATACCAGCAGCCCTAGCC
ATGATGCTGACGGATTCGGTTTCGGGTTAGGAACATTGTTTCTAGTTCTTCTCTTCCTTGTCCTCTATAATCTATTGTAA
Protein sequenceShow/hide protein sequence
MGRGKKKTVGSSEPEALALPIPGFTDPTHVNGDSVPSNCNNNGDEPIISSPFPSSSVQNSSVQTPVVTDEGGEVNGDGDGESGEENAAAHVSASELTKLDEGFFEVEAIR
RKRVRKGQLQYLVKWRGWPETANTWEPLDNLQSCSEFIDEFEKSSQSGKQRKRKRKDGDIENQPQEEKQHHILAINNVTDVDISTVDDRLSSAPLNGNISCDLPTPQVPV
DSTREGEFGSHLNHSKTTGTVYVENGHVDGKFDGSRKRDEYDLKLSELKAAISANMVDSDKKSEGSKDLGHVYDVSKADCMVGSTQGSHSIGAKRRKSNRVKRFTTDIAL
SEDSEQGLKQNGVTVSTEPTDQNEQLGPKNPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVVVTFLVVRSDGKEVTVNNKFLKANNPLLVLRFRERKREIMGHTAGPS
PLVPCIIVGFLGMIIFWPTLQSIWESLESLLELGIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQYTSSPSHDADGFGFGLGTLFLVLLFLVLYNLL