; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013009 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013009
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionChromo domain protein LHP1-like
Genome locationChr01:26093219..26097060
RNA-Seq ExpressionHG10013009
SyntenyHG10013009
Gene Ontology termsGO:0031507 - heterochromatin assembly (biological process)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR023780 - Chromo domain
IPR044251 - Chromo domain-containing protein LHP1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049056.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]3.2e-22577.94Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  I  P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQKFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAPL
        AIRRKRVRKLQ+FVFDRRGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD D+E++  EEK LQI+AIDNVTDVVI+TLDDRLSAAPL
Subjt:  AIRRKRVRKLQKFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAPL

Query:  NRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHS
        N+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSHS
Subjt:  NRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHS

Query:  IGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVN
         GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVN
Subjt:  IGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVN

Query:  NKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLL
        NK+LKANNPHL                                  MGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLL
Subjt:  NKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLL

Query:  LVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL
        LVH LSIFFPVLH SSTFAVQHSSSP YDADGFGFGSGALFLGLLFLVLY LL
Subjt:  LVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL

TYK17507.1 chromo domain protein LHP1-like [Cucumis melo var. makuwa]1.2e-21976.71Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  I  P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD D+E++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLL
        NNK+LKANNPHL                                  MGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLL
Subjt:  NNKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLL

Query:  LLVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL
        LLVH LSIFFPVLH SSTFAVQHSSSP YDADGFGFGSGALFLGLLFLVLY LL
Subjt:  LLVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL

XP_008438194.2 PREDICTED: chromo domain protein LHP1-like [Cucumis melo]5.0e-17079.18Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  IP P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLM
        NNK+LKANNPHL+
Subjt:  NNKYLKANNPHLM

XP_038877763.1 probable chromo domain-containing protein LHP1 isoform X1 [Benincasa hispida]4.5e-19587.32Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKAVGSSEPET AL IPDFTQSTHLNGDSGPSISNNNGNEP+IPSPYPP SLQNSSVQIPLPTDDAG    EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQ+CSEFIDEFEESFC SRSGKQRKRKRKD DIENQP EEKQLQ++AIDNVTDVVI T+DDRLSAAP
Subjt:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS
        LN K  CDLPIPQ PVDSTHEG FGS     KTTR IDVENGHVDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSD+KAVASN L++VYDVSKADCVVGS
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS

Query:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRVKRFTKDSALSE+SEQ LKQNAATVSIEPTDRS+Q GPENPSLSGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKYLKANNPHLM
        KEVTVNNK+LK NNPHL+
Subjt:  KEVTVNNKYLKANNPHLM

XP_038877764.1 chromo domain protein LHP1-like isoform X2 [Benincasa hispida]3.0e-19186.6Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKAVGSSEPET AL IPDFTQSTHLNGDSGPSISNNNGNEP+IPSPYPP SLQNSSVQIPLPTDDAG    EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQ+CSEFIDEFEE    SRSGKQRKRKRKD DIENQP EEKQLQ++AIDNVTDVVI T+DDRLSAAP
Subjt:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS
        LN K  CDLPIPQ PVDSTHEG FGS     KTTR IDVENGHVDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSD+KAVASN L++VYDVSKADCVVGS
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS

Query:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRVKRFTKDSALSE+SEQ LKQNAATVSIEPTDRS+Q GPENPSLSGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKYLKANNPHLM
        KEVTVNNK+LK NNPHL+
Subjt:  KEVTVNNKYLKANNPHLM

TrEMBL top hitse value%identityAlignment
A0A0A0L6G7 Chromo domain-containing protein6.6e-16878.69Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL IPDFTQSTHLNGDS PSISNNNG+EP I  P+ P SL N+SVQIPLP DDA    GEDN +PDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQSC EFI+E+EE FC SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVISTLDDRLSAAP
Subjt:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
         N+KLH DLPI Q P+DS HE               G +DGKFDGSRKKDEYDLKLI+  ASIS NMVDS++K VASN +S+VYDVSKADCVVGSAQ SH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL    EQGLKQNAATVSIEP D SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLM
        NNK+LKANNPHL+
Subjt:  NNKYLKANNPHLM

A0A1S3AVS6 chromo domain protein LHP1-like2.4e-17079.18Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  IP P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLM
        NNK+LKANNPHL+
Subjt:  NNKYLKANNPHLM

A0A5A7U472 Chromo domain protein LHP1-like1.5e-22577.94Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  I  P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQKFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAPL
        AIRRKRVRKLQ+FVFDRRGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD D+E++  EEK LQI+AIDNVTDVVI+TLDDRLSAAPL
Subjt:  AIRRKRVRKLQKFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAPL

Query:  NRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHS
        N+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSHS
Subjt:  NRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHS

Query:  IGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVN
         GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTVN
Subjt:  IGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVN

Query:  NKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLL
        NK+LKANNPHL                                  MGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLLL
Subjt:  NKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLLL

Query:  LVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL
        LVH LSIFFPVLH SSTFAVQHSSSP YDADGFGFGSGALFLGLLFLVLY LL
Subjt:  LVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL

A0A5D3D0R0 Chromo domain protein LHP1-like5.6e-22076.71Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  I  P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD D+E++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLL
        NNK+LKANNPHL                                  MGYTT+GPSPVVPCIIVGFLG+IIFWPTL SIWES+E LLELGIWVAVILLFLL
Subjt:  NNKYLKANNPHL----------------------------------MGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIWESVESLLELGIWVAVILLFLL

Query:  LLVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL
        LLVH LSIFFPVLH SSTFAVQHSSSP YDADGFGFGSGALFLGLLFLVLY LL
Subjt:  LLVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL

A0A6J1EA84 chromo domain-containing protein LHP1-like isoform X24.3e-14368.81Show/hide
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA-----------GEDNAVPDVSASQRTNLD
        MGR KKKA GSSEPETV L I   T STH+NGDSG SI N+NGNEPLI SPYP  S+QNSSVQ PL TD+A           GE NA  DVSA ++T  D
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA-----------GEDNAVPDVSASQRTNLD

Query:  EGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLD
        EGFFEVE+I RKRVRK Q +++    GWP+T NTWEP DNLQSCSE IDEFEES   SRSGKQRKRKRK   +ENQ  E+K+   +A +NVTD+VIST+D
Subjt:  EGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLD

Query:  DRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVV
        D +SA PLN K+HCDLP PQ PV                DVENGH++G F GSRK+D++DLKL ELKA++SANMVDSD+KAVASN L +VYDVSK DCVV
Subjt:  DRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVV

Query:  GSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRS
        GS Q SHSIG+KRRKSSRVKRFTKD+ALSEDSEQGLK+NA+T+SIEPTDR+E+L  ENPSLSGHSR VS ITRIIKPVGYSVSVSN IPDV VTFL +RS
Subjt:  GSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRS

Query:  DGKEVTVNNKYLKANNPHLM
        DGKEVTV+NK+LK NNPHL+
Subjt:  DGKEVTVNNKYLKANNPHLM

SwissProt top hitse value%identityAlignment
Q339W7 Probable chromo domain-containing protein LHP12.0e-2530.55Show/hide
Query:  EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDI--ENQPHEEK
        E  A   V       L EG++E+E IRR+R+RK + +++   RGWPE+ NTWEPL+NL +CS+ ID FE    + R G++RKRK     +   N  H ++
Subjt:  EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDI--ENQPHEEK

Query:  QLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIP---QTPVDSTHEGV--FGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL---KASISAN
                                  L+ K H   P P   Q P  ++        SKT   +D     V  +   +  ++     +      +  +S  
Subjt:  QLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIP---QTPVDSTHEGV--FGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL---KASISAN

Query:  MVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITR
        + D   +    N  S   ++ K    V  +QG    GAK+RKS  V+RF ++        QG  +  A V  E    +E    +     G    V  IT+
Subjt:  MVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITR

Query:  IIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM
        IIKPV ++ +V+N++  V +TF A+RSDG+EV V++K LKANNP L+
Subjt:  IIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM

Q944N1 Chromo domain protein LHP18.6e-3234.23Show/hide
Query:  VSASQRTNLDEGFFEVEAIRRKR-VRKLQKFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQI---VA
        V+   +  L EGF+E+E +RR+R V+    ++   RGWPE+ NTWEP  NL SC++ ID +EES    +SGK R+RKRK    +  P  ++Q +    VA
Subjt:  VSASQRTNLDEGFFEVEAIRRKR-VRKLQKFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQI---VA

Query:  IDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNAL
          N   V +  +++   + PLN     DL      VDS      GS+    +D     V+G     R+++E +LKL ELK + S N    D   ++ N L
Subjt:  IDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNAL

Query:  SVVY-DVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKD--SALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSV
        +  +  V+ A+      Q     GAK+RKS  V+RF ++  SA+ +D++  L        ++    +  +      ++  S++  TIT+++ PV Y  S 
Subjt:  SVVY-DVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKD--SALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSV

Query:  SNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM
        SN++ DV VTF+A R+DG  V V+NK+LK NNP L+
Subjt:  SNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM

Q946J8 Chromo domain-containing protein LHP11.0e-3734.32Show/hide
Query:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R
        DD G   ++    +    +R  LDEGF+E+EAIRRKRVRK + +++   RGWPET NTWEPL+NLQS ++ ID FE S    + G++RKRK        +
Subjt:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R

Query:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL
        K   + +  H+  EK     +++N +   I    D   ++ LNR +          V++    V  ++  R ID E               EYD  L EL
Subjt:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL

Query:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG
        +  ++            + S+   V  N L  VY   + D      + S  IGAKRRKS  VKRF +D + S +      QN      +++   R  ++G
Subjt:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG

Query:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM
         E P +   + N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N++LKA+NPHL+
Subjt:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM

Arabidopsis top hitse value%identityAlignment
AT5G17690.1 like heterochromatin protein (LHP1)7.4e-3934.32Show/hide
Query:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R
        DD G   ++    +    +R  LDEGF+E+EAIRRKRVRK + +++   RGWPET NTWEPL+NLQS ++ ID FE S    + G++RKRK        +
Subjt:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKLQ-KFVFDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R

Query:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL
        K   + +  H+  EK     +++N +   I    D   ++ LNR +          V++    V  ++  R ID E               EYD  L EL
Subjt:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL

Query:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG
        +  ++            + S+   V  N L  VY   + D      + S  IGAKRRKS  VKRF +D + S +      QN      +++   R  ++G
Subjt:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG

Query:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM
         E P +   + N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N++LKA+NPHL+
Subjt:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGAGGGAAGAAGAAAGCGGTGGGAAGCTCTGAGCCGGAGACAGTGGCGCTTCGAATCCCTGATTTCACTCAATCTACTCATCTGAATGGAGATTCTGGCCCTTC
CATCTCTAACAACAATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTATTCACTTCAGAACAGCTCTGTGCAAATTCCACTACCCACCGACGACGCGGGAGAAG
ATAATGCTGTTCCTGATGTTTCTGCTTCTCAGCGAACTAACCTCGATGAAGGCTTCTTCGAAGTCGAAGCTATTAGGCGGAAAAGAGTTCGTAAGCTTCAGAAATTTGTT
TTTGACAGGCGTGGCTGGCCAGAGACAGAAAACACATGGGAACCCTTGGACAATCTCCAATCATGCTCTGAATTTATTGATGAATTTGAAGAAAGCTTTTGTAACTCGCG
ATCAGGAAAGCAACGGAAGCGCAAGCGCAAGGATGTAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGCCATTGATAATGTCACGGATGTAGTTA
TCAGTACTTTGGATGATCGTCTATCGGCCGCTCCTTTAAACAGGAAACTTCATTGTGATCTTCCTATTCCTCAAACGCCGGTAGACTCTACTCATGAAGGAGTGTTTGGA
AGCAAGACTACTCGAGCAATTGATGTTGAAAATGGTCATGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGATGAATATGATCTGAAACTTATTGAGCTCAAGGCATC
AATCTCTGCCAATATGGTTGATTCTGATCAAAAAGCAGTGGCTTCTAACGCTCTCAGCGTTGTTTATGATGTTTCCAAGGCCGATTGTGTGGTGGGTTCTGCTCAGGGAA
GTCACTCCATTGGAGCCAAGAGAAGGAAGTCTAGTAGGGTGAAAAGGTTCACTAAGGATTCAGCCTTGTCTGAAGATTCTGAACAAGGATTAAAACAAAATGCAGCGACT
GTAAGCATCGAACCTACTGATCGAAGCGAACAATTAGGACCCGAGAACCCTAGTTTGTCAGGCCACTCCAGAAATGTGTCTACTATCACAAGGATTATCAAGCCTGTTGG
TTATTCAGTTTCAGTATCAAATAACATCCCAGATGTAATCGTCACCTTTTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGTGAATAACAAATATCTTAAGGCTAACA
ATCCACATCTGATGGGATATACTACTGCAGGACCATCACCTGTTGTACCTTGTATCATAGTTGGATTTCTGGGGATGATAATATTTTGGCCAACTCTTCAATCCATCTGG
GAGAGTGTAGAGTCTCTACTTGAACTGGGCATTTGGGTTGCAGTGATTCTTCTTTTCCTCTTACTGCTTGTACACTTGCTTTCTATTTTCTTTCCTGTTCTTCACGTTTC
ATCCACTTTTGCAGTTCAGCATTCCAGCAGCCCTTGCTACGATGCTGATGGATTCGGTTTCGGGTCAGGAGCATTGTTTCTAGGTCTTCTCTTCCTTGTCCTCTATAATC
TGTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGAGGGAAGAAGAAAGCGGTGGGAAGCTCTGAGCCGGAGACAGTGGCGCTTCGAATCCCTGATTTCACTCAATCTACTCATCTGAATGGAGATTCTGGCCCTTC
CATCTCTAACAACAATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTATTCACTTCAGAACAGCTCTGTGCAAATTCCACTACCCACCGACGACGCGGGAGAAG
ATAATGCTGTTCCTGATGTTTCTGCTTCTCAGCGAACTAACCTCGATGAAGGCTTCTTCGAAGTCGAAGCTATTAGGCGGAAAAGAGTTCGTAAGCTTCAGAAATTTGTT
TTTGACAGGCGTGGCTGGCCAGAGACAGAAAACACATGGGAACCCTTGGACAATCTCCAATCATGCTCTGAATTTATTGATGAATTTGAAGAAAGCTTTTGTAACTCGCG
ATCAGGAAAGCAACGGAAGCGCAAGCGCAAGGATGTAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGCCATTGATAATGTCACGGATGTAGTTA
TCAGTACTTTGGATGATCGTCTATCGGCCGCTCCTTTAAACAGGAAACTTCATTGTGATCTTCCTATTCCTCAAACGCCGGTAGACTCTACTCATGAAGGAGTGTTTGGA
AGCAAGACTACTCGAGCAATTGATGTTGAAAATGGTCATGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGATGAATATGATCTGAAACTTATTGAGCTCAAGGCATC
AATCTCTGCCAATATGGTTGATTCTGATCAAAAAGCAGTGGCTTCTAACGCTCTCAGCGTTGTTTATGATGTTTCCAAGGCCGATTGTGTGGTGGGTTCTGCTCAGGGAA
GTCACTCCATTGGAGCCAAGAGAAGGAAGTCTAGTAGGGTGAAAAGGTTCACTAAGGATTCAGCCTTGTCTGAAGATTCTGAACAAGGATTAAAACAAAATGCAGCGACT
GTAAGCATCGAACCTACTGATCGAAGCGAACAATTAGGACCCGAGAACCCTAGTTTGTCAGGCCACTCCAGAAATGTGTCTACTATCACAAGGATTATCAAGCCTGTTGG
TTATTCAGTTTCAGTATCAAATAACATCCCAGATGTAATCGTCACCTTTTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGTGAATAACAAATATCTTAAGGCTAACA
ATCCACATCTGATGGGATATACTACTGCAGGACCATCACCTGTTGTACCTTGTATCATAGTTGGATTTCTGGGGATGATAATATTTTGGCCAACTCTTCAATCCATCTGG
GAGAGTGTAGAGTCTCTACTTGAACTGGGCATTTGGGTTGCAGTGATTCTTCTTTTCCTCTTACTGCTTGTACACTTGCTTTCTATTTTCTTTCCTGTTCTTCACGTTTC
ATCCACTTTTGCAGTTCAGCATTCCAGCAGCCCTTGCTACGATGCTGATGGATTCGGTTTCGGGTCAGGAGCATTGTTTCTAGGTCTTCTCTTCCTTGTCCTCTATAATC
TGTTGTAA
Protein sequenceShow/hide protein sequence
MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAGEDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKLQKFV
FDRRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFG
SKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAAT
VSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLMGYTTAGPSPVVPCIIVGFLGMIIFWPTLQSIW
ESVESLLELGIWVAVILLFLLLLVHLLSIFFPVLHVSSTFAVQHSSSPCYDADGFGFGSGALFLGLLFLVLYNLL