; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G011780 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G011780
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionchromo domain-containing protein LHP1-like
Genome locationCmo_Chr06:9003908..9006992
RNA-Seq ExpressionCmoCh06G011780
SyntenyCmoCh06G011780
Gene Ontology termsGO:0031507 - heterochromatin assembly (biological process)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR008251 - Chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR023779 - Chromo domain, conserved site
IPR023780 - Chromo domain
IPR044251 - Chromo domain-containing protein LHP1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597211.1 Chromo domain protein LHP1, partial [Cucurbita argyrosperma subsp. sororia]1.6e-22799.04Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNEL ISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
        LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSD KAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKEAVSSENS+ RLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
        LLLINFYEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

KAG7028682.1 Chromo domain protein LHP1 [Cucurbita argyrosperma subsp. argyrosperma]1.8e-22396.02Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNEL I SRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKW----------HGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMS
        AIRRKRVRKGQLQYLVKW          HGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMS
Subjt:  AIRRKRVRKGQLQYLVKW----------HGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMS

Query:  TVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSH
        TVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSD KAESSKDLGLVYDDSKADC VGSTQGSH
Subjt:  TVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSH

Query:  SIGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTV
        SIGAKRRKSSRVKRFTKEAVSSENS+ RLK NGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTV

Query:  NNKFLKANNPLLLINFYEQHLRYNPTL
        NNKFLKANNPLLLINFYEQHLRYNPTL
Subjt:  NNKFLKANNPLLLINFYEQHLRYNPTL

XP_022950639.1 chromo domain-containing protein LHP1-like [Cucurbita moschata]2.0e-230100Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
        LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
        LLLINFYEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

XP_022974186.1 chromo domain-containing protein LHP1-like [Cucurbita maxima]1.0e-22196.16Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNEL ISSRFPASSIQNSSVQTPLLAGEGGEV+GEENAVADVTASELTKLDDGFFVVE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
        LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSR+RDEYDLKL+EL AAISANMVDSD KAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKEA S+ENS+ RLKQNG+ VNM   DQNEQ+GPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
        LLLINFYEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

XP_023540194.1 chromo domain-containing protein LHP1-like [Cucurbita pepo subsp. pepo]1.3e-22498.08Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNEL ISSRFPA SIQNSSVQTPLLAGEGGEV+GEENAVADVTASELTKLDDGFFVVE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
        LNSIIHYDLPTPQVLIDST TFNVENGDMGEKFDGSR RDEYDLKLIELKAAISANMVDSD KAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKEAVSSENS+ RLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
        LLLINFYEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

TrEMBL top hitse value%identityAlignment
A0A0A0L6G7 Chromo domain-containing protein3.0e-14768.35Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKA GSSE E +ALP+P FT ST +NGDSAPS SNNNG+E KI      SS+ N+SVQ PL   + G V+GE+N + DV+ASE T LD+GFF VE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKW GWPET NTWEP DNLQSC EFIEE+E+R   SRSGKQRKRKRKD D  ++ QEEK  ++IA DNVT+V +ST+DD LSA P
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
         N  +H DLP  Q  +DS     +  G++  KFDGSR++DEYDLKLI+  A+IS NMVDS+KK  +S D+ LVYD SKADC VGS Q SHS GAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTK++   E     LKQN   V++EP D +EQLGP+NPS SGHSRNV+TI RII+PVGYSVSV NNIPDV+VTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
         LLIN+YEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

A0A1S3AVS6 chromo domain protein LHP1-like7.4e-15470.5Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKA GSSE E +ALP P FT ST +NGDSAPS SNNNG+ELKI    P SS+ N+SVQ PL   + G V+GE+NAV DV+ASE T LD+GFF VE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKW GWPET NTWEP DNLQSC EFIEE+E+R   SRSGKQRKRKRKDGD  ++ QEEK  ++IA DNVT+V ++T+DD LSA P
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
        LN  +H DLP PQ  +DS     +  G++ EKFDGSR+RDEYD+KLI+  A++S NMVDSDKK  +S D+ LVYD SKADC VGS QGSHS GAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTK++  SE     LKQN   V +EPTD +EQLGP+NPS SGHSRNV+TI RII+PVGYSVSV NNIPDV+VTFL VRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
         LLIN+YEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

A0A6J1EA84 chromo domain-containing protein LHP1-like isoform X24.0e-14768.79Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVH-------GEENAVADVTASELTKLD
        MGR KKKA GSSE E + LP+ G TDST VNGDS  S  N+NGNE  I+S +PASS+QNSSVQTPL+  E GEV+       GE NA  DV+A E TK D
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVH-------GEENAVADVTASELTKLD

Query:  DGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVD
        +GFF VE+I RKRVRKGQLQYLVKWHGWP+TANTWEP DNLQSC+E I+EFE   E SRSGKQRKRKRK G   NQ +E+K++  +AT+NVT++ +STVD
Subjt:  DGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVD

Query:  DCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIG
        DC+SATPLN  IH DLPTPQ  +D      VENG M   F GSR+RD++DLKL ELKAA+SANMVDSDKKA +S DL LVYD SK DC VGS Q SHSIG
Subjt:  DCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIG

Query:  AKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNK
        +KRRKSSRVKRFTK+A  SE+S+  LK+N   +++EPTD+NE+L  ENPSLSGHSR V+ I RIIKPVGYSVSVSN IPDV VTFL +RSDGKEVTV+NK
Subjt:  AKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNK

Query:  FLKANNPLLLINFYEQHLRYNPT
        FLK NNP LLINFYEQHLRYNPT
Subjt:  FLKANNPLLLINFYEQHLRYNPT

A0A6J1GGE8 chromo domain-containing protein LHP1-like9.8e-231100Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
        LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
        LLLINFYEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

A0A6J1IGX1 chromo domain-containing protein LHP1-like4.9e-22296.16Show/hide
Query:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE
        MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNEL ISSRFPASSIQNSSVQTPLLAGEGGEV+GEENAVADVTASELTKLDDGFFVVE
Subjt:  MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVE

Query:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
        AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP
Subjt:  AIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATP

Query:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
        LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSR+RDEYDLKL+EL AAISANMVDSD KAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS
Subjt:  LNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSS

Query:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
        RVKRFTKEA S+ENS+ RLKQNG+ VNM   DQNEQ+GPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP
Subjt:  RVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNP

Query:  LLLINFYEQHLRYNPTL
        LLLINFYEQHLRYNPTL
Subjt:  LLLINFYEQHLRYNPTL

SwissProt top hitse value%identityAlignment
O95931 Chromobox protein homolog 72.2e-0942.67Show/hide
Query:  ELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRK
        EL+ + +  F VE+IR+KRVRKG+++YLVKW GWP   +TWEP +++      +  +E++ E  R+   RKR  K
Subjt:  ELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRK

Q339W7 Probable chromo domain-containing protein LHP11.4e-3734.09Show/hide
Query:  EENAVADVTASELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQY
        E  A A V      KL +G++ +E IRR+R+RKG+LQYLVKW GWPE+ANTWEP +NL +C++ I+ FE R++  R G++RKRK                
Subjt:  EENAVADVTASELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQY

Query:  RVIATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGD----MGEKFDGSRRRDEYDLKLIE------LKAAISANMVDSDKKAES
          I T  V     S            S      P P+ L   T      N       G    GS  R++    +++      +       +  S +  + 
Subjt:  RVIATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGD----MGEKFDGSRRRDEYDLKLIE------LKAAISANMVDSDKKAES

Query:  SKDLGLVYDDSKAD--CAVGSTQGSHSIGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYS
          +  LV   S ++    V  +QG    GAK+RKS  V+RF +   +         + G  V  E     E    +     G   N   I +IIKPV ++
Subjt:  SKDLGLVYDDSKAD--CAVGSTQGSHSIGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYS

Query:  VSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT
         +V+N++  V +TF A+RSDG+EV V++K LKANNPLLLI++YEQ LRYNPT
Subjt:  VSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT

Q8VDS3 Chromobox protein homolog 72.2e-0942.67Show/hide
Query:  ELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRK
        EL+ + +  F VE+IR+KRVRKG+++YLVKW GWP   +TWEP +++      +  +E++ E  R+   RKR  K
Subjt:  ELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRK

Q944N1 Chromo domain protein LHP11.6e-4437.82Show/hide
Query:  EGGEVHGEENAVADVTASEL-TKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNN
        E  EV   E    D  A ++  KL +GF+ +E +RR+R  KG++ YL+KW GWPE+ANTWEP  NL SC + I+ +E+ +   +SGK R+RKRK G    
Subjt:  EGGEVHGEENAVADVTASEL-TKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNN

Query:  QLQEEKQYRV---IATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGS----RRRDEYDLKLIELKAAISANMVDSD
            ++Q R    +AT N   V +  +++   + PLN +   D      L+DS G  +  N  + E  +G+    R ++E +LKL ELK A S N    D
Subjt:  QLQEEKQYRV---IATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGS----RRRDEYDLKLIELKAAISANMVDSD

Query:  KKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSSRVKRFTKEAVSSENSDHR--LKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIK
                 GL     K + A    Q     GAK+RKS  V+RF +E  S+   D +  L    +A  M+    N  +      ++  S++  TI +++ 
Subjt:  KKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSSRVKRFTKEAVSSENSDHR--LKQNGIAVNMEPTDQNEQLGPENPSLSGHSRNVATIRRIIK

Query:  PVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT
        PV Y  S SN++ DV VTF+A R+DG  V V+NKFLK NNPLLLINFYE+++RY+PT
Subjt:  PVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT

Q946J8 Chromo domain-containing protein LHP15.8e-4736.75Show/hide
Query:  EGGEVHGEENAVADVTASELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQ
        +GG+   EE    +    E  KLD+GF+ +EAIRRKRVRKG++QYL+KW GWPETANTWEP +NLQS A+ I+ FE  ++  + G  RKRKRK    ++Q
Subjt:  EGGEVHGEENAVADVTASELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQ

Query:  LQEEKQYRVIATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKD
        ++++++        +T+      +   S+T LN+    D+P P   +D +G+ ++ N D+  K   +   ++ +     +  A    ++D++K+ + + +
Subjt:  LQEEKQYRVIATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKD

Query:  --LGLVYDDSKADCAVGSTQGSHS----------------------IGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQ--------NEQL
           G V + + A C+ G   GS                        IGAKRRKS  VKRF ++  +S N      QN +  ++   D         NE  
Subjt:  --LGLVYDDSKADCAVGSTQGSHS----------------------IGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQ--------NEQL

Query:  G-PENPSLSGHSR-NVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT
        G  EN +LS  ++     I +I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N+FLKA+NP LLI FYEQHL+YN T
Subjt:  G-PENPSLSGHSR-NVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT

Arabidopsis top hitse value%identityAlignment
AT5G17690.1 like heterochromatin protein (LHP1)4.2e-4836.75Show/hide
Query:  EGGEVHGEENAVADVTASELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQ
        +GG+   EE    +    E  KLD+GF+ +EAIRRKRVRKG++QYL+KW GWPETANTWEP +NLQS A+ I+ FE  ++  + G  RKRKRK    ++Q
Subjt:  EGGEVHGEENAVADVTASELTKLDDGFFVVEAIRRKRVRKGQLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQ

Query:  LQEEKQYRVIATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKD
        ++++++        +T+      +   S+T LN+    D+P P   +D +G+ ++ N D+  K   +   ++ +     +  A    ++D++K+ + + +
Subjt:  LQEEKQYRVIATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTGTFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKD

Query:  --LGLVYDDSKADCAVGSTQGSHS----------------------IGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQ--------NEQL
           G V + + A C+ G   GS                        IGAKRRKS  VKRF ++  +S N      QN +  ++   D         NE  
Subjt:  --LGLVYDDSKADCAVGSTQGSHS----------------------IGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEPTDQ--------NEQL

Query:  G-PENPSLSGHSR-NVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT
        G  EN +LS  ++     I +I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N+FLKA+NP LLI FYEQHL+YN T
Subjt:  G-PENPSLSGHSR-NVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCGAGGGAAGAAGAAGGCGGTGGGAAGCTCTGAGACTGAAGCAATGGCGCTTCCAGTCCCTGGTTTCACAGATTCAACTCTTGTTAATGGAGATTCAGCTCCTTC
CAACTCTAACAACAATGGAAATGAACTTAAAATTTCATCTCGATTTCCAGCTTCTTCGATTCAGAACAGTTCTGTCCAAACTCCACTACTCGCCGGTGAAGGCGGAGAAG
TCCACGGAGAAGAGAATGCTGTCGCTGATGTTACTGCTTCTGAGCTAACAAAGCTTGACGATGGCTTCTTCGTAGTTGAAGCTATTCGAAGGAAACGAGTTCGTAAGGGA
CAGCTTCAGTATCTTGTCAAATGGCATGGCTGGCCAGAAACTGCCAACACATGGGAACCCTGGGACAATCTCCAATCATGCGCTGAATTTATCGAGGAATTTGAGAAAAG
GATGGAAATCTCACGATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACGGTAATAATCAACTTCAGGAGGAAAAACAGTACCGCGTCATAGCTACTGATAATG
TCACAAATGTTTACATGAGTACTGTGGACGATTGTCTATCGGCCACTCCTTTAAACAGCATAATTCATTATGATCTTCCTACTCCTCAAGTACTTATAGACTCTACTGGA
ACATTCAATGTTGAAAATGGAGATATGGGTGAGAAATTTGATGGAAGTAGAAGGAGAGACGAATATGATTTGAAACTTATCGAGCTCAAGGCAGCAATCTCTGCCAATAT
GGTTGATTCTGATAAAAAAGCAGAGTCTTCTAAAGATCTTGGCCTTGTTTATGATGATTCCAAGGCTGATTGCGCGGTGGGTTCCACTCAGGGAAGTCACTCCATTGGCG
CCAAGAGAAGGAAATCTAGTAGGGTGAAAAGGTTCACTAAGGAAGCAGTCTCGTCTGAAAACTCTGACCACAGATTAAAACAAAATGGAATTGCTGTAAACATGGAGCCA
ACTGATCAAAATGAACAATTGGGGCCCGAGAATCCTAGTTTGTCAGGCCACTCCAGAAATGTAGCTACTATAAGAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGT
ATCAAATAACATCCCGGATGTAGTCGTGACCTTCTTGGCTGTGAGGTCTGATGGAAAAGAAGTAACGGTGAATAACAAATTTCTAAAGGCTAACAATCCACTTCTGTTGA
TTAACTTCTATGAGCAACATCTCCGATATAATCCAACATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCGAGGGAAGAAGAAGGCGGTGGGAAGCTCTGAGACTGAAGCAATGGCGCTTCCAGTCCCTGGTTTCACAGATTCAACTCTTGTTAATGGAGATTCAGCTCCTTC
CAACTCTAACAACAATGGAAATGAACTTAAAATTTCATCTCGATTTCCAGCTTCTTCGATTCAGAACAGTTCTGTCCAAACTCCACTACTCGCCGGTGAAGGCGGAGAAG
TCCACGGAGAAGAGAATGCTGTCGCTGATGTTACTGCTTCTGAGCTAACAAAGCTTGACGATGGCTTCTTCGTAGTTGAAGCTATTCGAAGGAAACGAGTTCGTAAGGGA
CAGCTTCAGTATCTTGTCAAATGGCATGGCTGGCCAGAAACTGCCAACACATGGGAACCCTGGGACAATCTCCAATCATGCGCTGAATTTATCGAGGAATTTGAGAAAAG
GATGGAAATCTCACGATCAGGAAAGCAGCGGAAGCGCAAGCGCAAGGATGGAGACGGTAATAATCAACTTCAGGAGGAAAAACAGTACCGCGTCATAGCTACTGATAATG
TCACAAATGTTTACATGAGTACTGTGGACGATTGTCTATCGGCCACTCCTTTAAACAGCATAATTCATTATGATCTTCCTACTCCTCAAGTACTTATAGACTCTACTGGA
ACATTCAATGTTGAAAATGGAGATATGGGTGAGAAATTTGATGGAAGTAGAAGGAGAGACGAATATGATTTGAAACTTATCGAGCTCAAGGCAGCAATCTCTGCCAATAT
GGTTGATTCTGATAAAAAAGCAGAGTCTTCTAAAGATCTTGGCCTTGTTTATGATGATTCCAAGGCTGATTGCGCGGTGGGTTCCACTCAGGGAAGTCACTCCATTGGCG
CCAAGAGAAGGAAATCTAGTAGGGTGAAAAGGTTCACTAAGGAAGCAGTCTCGTCTGAAAACTCTGACCACAGATTAAAACAAAATGGAATTGCTGTAAACATGGAGCCA
ACTGATCAAAATGAACAATTGGGGCCCGAGAATCCTAGTTTGTCAGGCCACTCCAGAAATGTAGCTACTATAAGAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGT
ATCAAATAACATCCCGGATGTAGTCGTGACCTTCTTGGCTGTGAGGTCTGATGGAAAAGAAGTAACGGTGAATAACAAATTTCTAAAGGCTAACAATCCACTTCTGTTGA
TTAACTTCTATGAGCAACATCTCCGATATAATCCAACATTATGAAAGAGCTACAAGTGTATCTGGCCACCATCATGTTTATGGTTTACAAGCTGTAATTTTGTAGCTTTA
GGAGATTTGGTGGAGTATAATATGAATTCTTGGTAAAATTCTCAAGAACTGTGTTAAGAGGAGATCAAACATTTTCTTGCTCTGTTTTATTTCTACATGAGGATAAATGG
AAAGTTATATTCTTTTTGGATATTGAAAATCCAATCTAGCAGTTTAAAGACTTGGTAAATCGAATTATGAACTTCTAAGGTTCTACACACCCCTCCCCCTCCCCCAAAAA
AAAAAACAAAGTCCAAGGAAGGAGGGTAACAGCGTCCACGGCTAGCAATATTGTCTACTTTAGCCTGTTACGTATAGCCATCAACCTCACGGTTTTAAGACGTGTCTATT
AGGGAGGTTTTCATACCCTTATAAGAAATACTTCATTCCCCTCTTCAACCGACGTGAGATCTTACAATCCACTCCCTTGGGGGCCAGCGTCCTCGCTAATATACCATCCA
ATGTCTGGCTCTGATACCATGTTAGATTTTGGTTTTTCCAAAAAAGCCTCATCCAATGGAGATGTATTCCTTACTTATAAACCCATGATCAACCCCTTAATTAGTCGACG
TAGGAGTCCTTTCCCAACAATCCTCAACAAGGTGGTGTATGTTCATGCTTGTATTATAGTGCTAGACTAAATAATCTTGTAGGAAATTGTCACCCATTTCCATTTCCACA
TTTCCTTTACGTTATTTTTGATGTCCAATGGAAAGCTTCATACAGATTCAGGTGGTGGTTATATACGCTTTCTACAGATTCATGTGGTGCTTTTCTGCCCTCCAGATGCT
GAGTGTTCATCAAAGAGCCATGGGATGCTTATTTATCAGGTTTTCTTACTGAACTTTCTTGGTCTCCATTTATATATTTTCTTTTGGTCTCCTATGATGTCATTCAAATA
CATATTTAAACTAAGTGTCATTATCAAA
Protein sequenceShow/hide protein sequence
MGRGKKKAVGSSETEAMALPVPGFTDSTLVNGDSAPSNSNNNGNELKISSRFPASSIQNSSVQTPLLAGEGGEVHGEENAVADVTASELTKLDDGFFVVEAIRRKRVRKG
QLQYLVKWHGWPETANTWEPWDNLQSCAEFIEEFEKRMEISRSGKQRKRKRKDGDGNNQLQEEKQYRVIATDNVTNVYMSTVDDCLSATPLNSIIHYDLPTPQVLIDSTG
TFNVENGDMGEKFDGSRRRDEYDLKLIELKAAISANMVDSDKKAESSKDLGLVYDDSKADCAVGSTQGSHSIGAKRRKSSRVKRFTKEAVSSENSDHRLKQNGIAVNMEP
TDQNEQLGPENPSLSGHSRNVATIRRIIKPVGYSVSVSNNIPDVVVTFLAVRSDGKEVTVNNKFLKANNPLLLINFYEQHLRYNPTL