CuGenDBv2

Lsi05G019400 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G019400
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: chromo domain protein LHP1-like
Genome location: chr05:26573226..26575798
RNA-Seq Expression: Lsi05G019400
Synteny: Lsi05G019400
Gene Ontology terms:
GO:0031507 - heterochromatin assembly (biological process)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
IPR000953 - Chromo/chromo shadow domain
IPR008251 - Chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR017984 - Chromo domain subgroup
IPR023779 - Chromo domain, conserved site
IPR023780 - Chromo domain
IPR044251 - Chromo domain-containing protein LHP1
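The genome location string in the record can be split into chromosome, start, and end for downstream use. The following is a minimal Python sketch; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr05:26573226..26575798")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2573 bp on chr05.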


Homology
GenBank top hits (e-value, % identity, alignment)
KGN56653.2 hypothetical protein Csa_009693 [Cucumis sativus] (e-value: 1.9e-183; identity: 81.46%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL IPDFTQSTHLNGDS PSISNNNG+EP I  P+ P SL N+SVQIPLP DDA    GEDN +PDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSC EFI+E+EE FC SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVISTLDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
         N+KLH DLPI Q P+DS HE               G +DGKFDGSRKKDEYDLKLI+  ASIS NMVDS++K VASN +S+VYDVSKADCVVGSAQ SH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL    EQGLKQNAATVSIEP D SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLLINYYEQHLRYNPT
        NNK+LKANNPHLLINYYEQHLRYNPT
Subjt:  NNKYLKANNPHLLINYYEQHLRYNPT
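In these alignment blocks the middle line carries a letter where query and subject are identical, a `+` for a conservative substitution, and a space otherwise. The following Python sketch counts identities and positives for a single block under that reading; `alignment_stats` is a hypothetical helper, and it covers only the block it is given, not the whole-hit percentage reported in the table.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, aligned_length) for one alignment block."""
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline.ljust(len(query))
    # Letters on the midline mark identical residues.
    identical = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Final block of the first GenBank hit, copied from the alignment above.
query = "NNKYLKANNPHLLINYYEQHLRYNPT"
midline = "NNK+LKANNPHLLINYYEQHLRYNPT"
identical, positives, length = alignment_stats(query, midline)
print(f"{identical}/{length} identical, {positives}/{length} positives")
```

Summing these counts over every block of a hit, gap columns included, would be one way to approximate the reported identity value, though the exact denominator the site uses is not stated.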

XP_008438194.2 PREDICTED: chromo domain protein LHP1-like [Cucumis melo] (e-value: 6.9e-186; identity: 81.92%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  IP P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLLINYYEQHLRYNPT
        NNK+LKANNPHLLINYYEQHLRYNPT
Subjt:  NNKYLKANNPHLLINYYEQHLRYNPT

XP_011650798.1 chromo domain-containing protein LHP1 isoform X1 [Cucumis sativus] (e-value: 1.9e-183; identity: 81.46%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL IPDFTQSTHLNGDS PSISNNNG+EP I  P+ P SL N+SVQIPLP DDA    GEDN +PDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSC EFI+E+EE FC SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVISTLDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
         N+KLH DLPI Q P+DS HE               G +DGKFDGSRKKDEYDLKLI+  ASIS NMVDS++K VASN +S+VYDVSKADCVVGSAQ SH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL    EQGLKQNAATVSIEP D SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLLINYYEQHLRYNPT
        NNK+LKANNPHLLINYYEQHLRYNPT
Subjt:  NNKYLKANNPHLLINYYEQHLRYNPT

XP_038877763.1 probable chromo domain-containing protein LHP1 isoform X1 [Benincasa hispida] (e-value: 4.8e-211; identity: 89.79%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKAVGSSEPET AL IPDFTQSTHLNGDSGPSISNNNGNEP+IPSPYPP SLQNSSVQIPLPTDDAG    EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQ+CSEFIDEFEESFC SRSGKQRKRKRKD DIENQP EEKQLQ++AIDNVTDVVI T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS
        LN K  CDLPIPQ PVDSTHEG FGS     KTTR IDVENGHVDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSD+KAVASN L++VYDVSKADCVVGS
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS

Query:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRVKRFTKDSALSE+SEQ LKQNAATVSIEPTDRS+Q GPENPSLSGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKYLKANNPHLLINYYEQHLRYNPT
        KEVTVNNK+LK NNPHLLINYYEQHLRYNPT
Subjt:  KEVTVNNKYLKANNPHLLINYYEQHLRYNPT

XP_038877764.1 chromo domain protein LHP1-like isoform X2 [Benincasa hispida] (e-value: 4.2e-207; identity: 89.1%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKAVGSSEPET AL IPDFTQSTHLNGDSGPSISNNNGNEP+IPSPYPP SLQNSSVQIPLPTDDAG    EDNAVPDVSAS RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAG----EDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQ+CSEFIDEFEE    SRSGKQRKRKRKD DIENQP EEKQLQ++AIDNVTDVVI T+DDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS
        LN K  CDLPIPQ PVDSTHEG FGS     KTTR IDVENGHVDGKFDGSRK+DEYDLKL+ELKA+ISANMVDSD+KAVASN L++VYDVSKADCVVGS
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGS-----KTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGS

Query:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG
        AQ SHSIGAKRRKSSRVKRFTKDSALSE+SEQ LKQNAATVSIEPTDRS+Q GPENPSLSGHSRNV TITRIIKPVGYSVSV NNIPDVIVTFLAVRSDG
Subjt:  AQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDG

Query:  KEVTVNNKYLKANNPHLLINYYEQHLRYNPT
        KEVTVNNK+LK NNPHLLINYYEQHLRYNPT
Subjt:  KEVTVNNKYLKANNPHLLINYYEQHLRYNPT

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L6G7 Chromo domain-containing protein (e-value: 9.1e-184; identity: 81.46%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL IPDFTQSTHLNGDS PSISNNNG+EP I  P+ P SL N+SVQIPLP DDA    GEDN +PDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSC EFI+E+EE FC SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVISTLDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
         N+KLH DLPI Q P+DS HE               G +DGKFDGSRKKDEYDLKLI+  ASIS NMVDS++K VASN +S+VYDVSKADCVVGSAQ SH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL    EQGLKQNAATVSIEP D SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLLINYYEQHLRYNPT
        NNK+LKANNPHLLINYYEQHLRYNPT
Subjt:  NNKYLKANNPHLLINYYEQHLRYNPT

A0A1S3AVS6 chromo domain protein LHP1-like (e-value: 3.4e-186; identity: 81.92%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  IP P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD DIE++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFL VRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLLINYYEQHLRYNPT
        NNK+LKANNPHLLINYYEQHLRYNPT
Subjt:  NNKYLKANNPHLLINYYEQHLRYNPT

A0A5A7U472 Chromo domain protein LHP1-like (e-value: 7.5e-170; identity: 78.55%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  I  P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRK Q +++   RGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD D+E++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLLIN
        NNK+LKANNPHL+ +
Subjt:  NNKYLKANNPHLLIN

A0A5D3D0R0 Chromo domain protein LHP1-like (e-value: 4.9e-177; identity: 80.48%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE
        MGRGKKKA GSSEPETVAL  PDFTQSTHLNGDS PSISNNNG+E  I  P+PP SL N+SVQIPLPTDDA    GEDNAVPDVSAS+RTNLDEGFFEVE
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA----GEDNAVPDVSASQRTNLDEGFFEVE

Query:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP
        AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSC EFI+E+EE FC+SRSGKQRKRKRKD D+E++  EEK LQI+AIDNVTDVVI+TLDDRLSAAP
Subjt:  AIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAP

Query:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH
        LN+KLH DLPIPQ P+DS HE               G +D KFDGSRK+DEYD+KLI+  AS+S NMVDSD+K VASN +S+VYDVSKADCVVGSAQGSH
Subjt:  LNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSH

Query:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV
        S GAKRRKSSRVKRFTKDSAL   SEQGLKQNAATV IEPTD SEQLGP+NPS SGHSRNVSTITRII+PVGYSVSV NNIPDVIVTFLAVRSDGKEVTV
Subjt:  SIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTV

Query:  NNKYLKANNPHLLIN
        NNK+LKANNPHL+ +
Subjt:  NNKYLKANNPHLLIN

A0A6J1EA84 chromo domain-containing protein LHP1-like isoform X2 (e-value: 4.6e-159; identity: 71.66%)
Query:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA-----------GEDNAVPDVSASQRTNLD
        MGR KKKA GSSEPETV L I   T STH+NGDSG SI N+NGNEPLI SPYP  S+QNSSVQ PL TD+A           GE NA  DVSA ++T  D
Subjt:  MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDA-----------GEDNAVPDVSASQRTNLD

Query:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLD
        EGFFEVE+I RKRVRKGQLQYLVKW GWP+T NTWEP DNLQSCSE IDEFEES   SRSGKQRKRKRK   +ENQ  E+K+   +A +NVTD+VIST+D
Subjt:  EGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLD

Query:  DRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVV
        D +SA PLN K+HCDLP PQ PV                DVENGH++G F GSRK+D++DLKL ELKA++SANMVDSD+KAVASN L +VYDVSK DCVV
Subjt:  DRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVV

Query:  GSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRS
        GS Q SHSIG+KRRKSSRVKRFTKD+ALSEDSEQGLK+NA+T+SIEPTDR+E+L  ENPSLSGHSR VS ITRIIKPVGYSVSVSN IPDV VTFL +RS
Subjt:  GSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRS

Query:  DGKEVTVNNKYLKANNPHLLINYYEQHLRYNPTS
        DGKEVTV+NK+LK NNPHLLIN+YEQHLRYNPTS
Subjt:  DGKEVTVNNKYLKANNPHLLINYYEQHLRYNPTS

SwissProt top hits (e-value, % identity, alignment)
P05205 Heterochromatin protein 1 (e-value: 1.4e-08; identity: 43.48%)
Query:  DEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKR
        +E  + VE I  +RVRKG+++Y +KW+G+PETENTWEP +NL  C + I ++E S  +       K+ R
Subjt:  DEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKR

P45973 Chromobox protein homolog 5 (e-value: 1.9e-08; identity: 39.74%)
Query:  SASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRK
        +A   ++ DE  + VE +  +RV KGQ++YL+KW+G+ E  NTWEP  NL  C E I EF + +   + G+  K + K
Subjt:  SASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRK

Q339W7 Probable chromo domain-containing protein LHP1 (e-value: 5.4e-40; identity: 35.18%)
Query:  EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDI--ENQPHEEK
        E  A   V       L EG++E+E IRR+R+RKG+LQYLVKWRGWPE+ NTWEPL+NL +CS+ ID FE    + R G++RKRK     +   N  H ++
Subjt:  EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDI--ENQPHEEK

Query:  QLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIP---QTPVDSTHEGV--FGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL---KASISAN
                                  L+ K H   P P   Q P  ++        SKT   +D     V  +   +  ++     +      +  +S  
Subjt:  QLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIP---QTPVDSTHEGV--FGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL---KASISAN

Query:  MVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITR
        + D   +    N  S   ++ K    V  +QG    GAK+RKS  V+RF ++        QG  +  A V  E    +E    +     G    V  IT+
Subjt:  MVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITR

Query:  IIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPTS
        IIKPV ++ +V+N++  V +TF A+RSDG+EV V++K LKANNP LLI+YYEQ LRYNPTS
Subjt:  IIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPTS

Q944N1 Chromo domain protein LHP1 (e-value: 3.6e-44; identity: 36.96%)
Query:  VSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQI---VA
        V+   +  L EGF+E+E +RR+R  KG++ YL+KWRGWPE+ NTWEP  NL SC++ ID +EES    +SGK R+RKRK    +  P  ++Q +    VA
Subjt:  VSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQI---VA

Query:  IDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNAL
          N   V +  +++   + PLN     DL      VDS      GS+    +D     V+G     R+++E +LKL ELK + S N    D   ++ N L
Subjt:  IDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNAL

Query:  SVVY-DVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKD--SALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSV
        +  +  V+ A+      Q     GAK+RKS  V+RF ++  SA+ +D++  L        ++    +  +      ++  S++  TIT+++ PV Y  S 
Subjt:  SVVY-DVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKD--SALSEDSEQGLKQNAATVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSV

Query:  SNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPT
        SN++ DV VTF+A R+DG  V V+NK+LK NNP LLIN+YE+++RY+PT
Subjt:  SNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPT

Q946J8 Chromo domain-containing protein LHP1 (e-value: 1.5e-50; identity: 37.31%)
Query:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R
        DD G   ++    +    +R  LDEGF+E+EAIRRKRVRKG++QYL+KWRGWPET NTWEPL+NLQS ++ ID FE S    + G++RKRK        +
Subjt:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R

Query:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL
        K   + +  H+  EK     +++N +   I    D   ++ LNR +          V++    V  ++  R ID E               EYD  L EL
Subjt:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL

Query:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG
        +  ++            + S+   V  N L  VY   + D      + S  IGAKRRKS  VKRF +D + S +      QN      +++   R  ++G
Subjt:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG

Query:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPT
         E P +   + N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N++LKA+NPHLLI +YEQHL+YN T
Subjt:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPT

Arabidopsis top hits (e-value, % identity, alignment)
AT5G17690.1 like heterochromatin protein (LHP1) (e-value: 1.1e-51; identity: 37.31%)
Query:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R
        DD G   ++    +    +R  LDEGF+E+EAIRRKRVRKG++QYL+KWRGWPET NTWEPL+NLQS ++ ID FE S    + G++RKRK        +
Subjt:  DDAG---EDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKGQLQYLVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRK--------R

Query:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL
        K   + +  H+  EK     +++N +   I    D   ++ LNR +          V++    V  ++  R ID E               EYD  L EL
Subjt:  KDVDIENQPHE--EKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVFGSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIEL

Query:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG
        +  ++            + S+   V  N L  VY   + D      + S  IGAKRRKS  VKRF +D + S +      QN      +++   R  ++G
Subjt:  KASIS---------ANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA--TVSIEPTDRSEQLG

Query:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPT
         E P +   + N+S         IT+I+KP+ ++ SVS+N+ +V+VTFLA+RSDGKE  V+N++LKA+NPHLLI +YEQHL+YN T
Subjt:  PENPSLSGHSRNVS--------TITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPT


Sequences
CDS sequence
ATGGGGAGAGGGAAGAAGAAAGCGGTGGGAAGCTCTGAGCCGGAGACAGTGGCGCTTCGAATCCCTGATTTCACTCAATCTACTCATCTGAATGGAGATTCTGGCCCTTC
CATCTCTAACAACAATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTATTCACTTCAGAACAGCTCTGTGCAAATTCCACTACCCACCGACGACGCGGGAGAAG
ATAATGCTGTTCCTGATGTTTCTGCTTCTCAGCGAACTAACCTCGATGAAGGCTTCTTCGAAGTCGAAGCTATTAGGCGGAAAAGAGTTCGTAAGGGACAGCTTCAGTAC
CTCGTCAAATGGCGTGGCTGGCCAGAGACAGAAAACACATGGGAACCCTTGGACAATCTCCAATCATGCTCTGAATTTATTGATGAATTTGAAGAAAGCTTTTGTAACTC
GCGATCAGGAAAGCAACGGAAGCGCAAGCGCAAGGATGTAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGCCATTGATAATGTCACGGATGTAG
TTATCAGTACTTTGGATGATCGTCTATCGGCCGCTCCTTTAAACAGGAAACTTCATTGTGATCTTCCTATTCCTCAAACGCCGGTAGACTCTACTCATGAAGGAGTGTTT
GGAAGCAAGACTACTCGAGCAATTGATGTTGAAAATGGTCATGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGATGAATATGATCTGAAACTTATTGAGCTCAAGGC
ATCAATCTCTGCCAATATGGTTGATTCTGATCAAAAAGCAGTGGCTTCTAACGCTCTCAGCGTTGTTTATGATGTTTCCAAGGCCGATTGTGTGGTGGGTTCTGCTCAGG
GAAGTCACTCCATTGGAGCCAAGAGAAGGAAGTCTAGTAGGGTGAAAAGGTTCACTAAGGATTCAGCCTTGTCTGAAGATTCTGAACAAGGATTAAAACAAAATGCAGCG
ACTGTAAGCATCGAACCTACTGATCGAAGCGAACAATTAGGACCCGAGAACCCTAGTTTGTCAGGCCACTCCAGAAATGTGTCTACTATCACAAGGATTATCAAGCCTGT
TGGTTATTCAGTTTCAGTATCAAATAACATCCCAGATGTAATCGTCACCTTTTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGTGAATAACAAATATCTTAAGGCTA
ACAATCCACATCTGTTGATTAACTACTATGAGCAACATCTCCGATATAATCCAACATCATGA
mRNA sequence
GGAAAAGGAGGGATAAAACGAAAATGGGGAGAGGGAAGAAGAAAGCGGTGGGAAGCTCTGAGCCGGAGACAGTGGCGCTTCGAATCCCTGATTTCACTCAATCTACTCAT
CTGAATGGAGATTCTGGCCCTTCCATCTCTAACAACAATGGTAATGAACCTTTAATTCCATCTCCATATCCACCTTATTCACTTCAGAACAGCTCTGTGCAAATTCCACT
ACCCACCGACGACGCGGGAGAAGATAATGCTGTTCCTGATGTTTCTGCTTCTCAGCGAACTAACCTCGATGAAGGCTTCTTCGAAGTCGAAGCTATTAGGCGGAAAAGAG
TTCGTAAGGGACAGCTTCAGTACCTCGTCAAATGGCGTGGCTGGCCAGAGACAGAAAACACATGGGAACCCTTGGACAATCTCCAATCATGCTCTGAATTTATTGATGAA
TTTGAAGAAAGCTTTTGTAACTCGCGATCAGGAAAGCAACGGAAGCGCAAGCGCAAGGATGTAGACATTGAAAATCAACCTCATGAGGAAAAACAGCTCCAAATTGTAGC
CATTGATAATGTCACGGATGTAGTTATCAGTACTTTGGATGATCGTCTATCGGCCGCTCCTTTAAACAGGAAACTTCATTGTGATCTTCCTATTCCTCAAACGCCGGTAG
ACTCTACTCATGAAGGAGTGTTTGGAAGCAAGACTACTCGAGCAATTGATGTTGAAAATGGTCATGTGGATGGGAAATTTGATGGAAGTAGAAAGAAAGATGAATATGAT
CTGAAACTTATTGAGCTCAAGGCATCAATCTCTGCCAATATGGTTGATTCTGATCAAAAAGCAGTGGCTTCTAACGCTCTCAGCGTTGTTTATGATGTTTCCAAGGCCGA
TTGTGTGGTGGGTTCTGCTCAGGGAAGTCACTCCATTGGAGCCAAGAGAAGGAAGTCTAGTAGGGTGAAAAGGTTCACTAAGGATTCAGCCTTGTCTGAAGATTCTGAAC
AAGGATTAAAACAAAATGCAGCGACTGTAAGCATCGAACCTACTGATCGAAGCGAACAATTAGGACCCGAGAACCCTAGTTTGTCAGGCCACTCCAGAAATGTGTCTACT
ATCACAAGGATTATCAAGCCTGTTGGTTATTCAGTTTCAGTATCAAATAACATCCCAGATGTAATCGTCACCTTTTTGGCTGTGAGGTCGGATGGAAAAGAAGTGACGGT
GAATAACAAATATCTTAAGGCTAACAATCCACATCTGTTGATTAACTACTATGAGCAACATCTCCGATATAATCCAACATCATGA
Protein sequence
MGRGKKKAVGSSEPETVALRIPDFTQSTHLNGDSGPSISNNNGNEPLIPSPYPPYSLQNSSVQIPLPTDDAGEDNAVPDVSASQRTNLDEGFFEVEAIRRKRVRKGQLQY
LVKWRGWPETENTWEPLDNLQSCSEFIDEFEESFCNSRSGKQRKRKRKDVDIENQPHEEKQLQIVAIDNVTDVVISTLDDRLSAAPLNRKLHCDLPIPQTPVDSTHEGVF
GSKTTRAIDVENGHVDGKFDGSRKKDEYDLKLIELKASISANMVDSDQKAVASNALSVVYDVSKADCVVGSAQGSHSIGAKRRKSSRVKRFTKDSALSEDSEQGLKQNAA
TVSIEPTDRSEQLGPENPSLSGHSRNVSTITRIIKPVGYSVSVSNNIPDVIVTFLAVRSDGKEVTVNNKYLKANNPHLLINYYEQHLRYNPTS
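The CDS and protein sequences above can be cross-checked by translating the CDS. The following Python sketch is one way to do it, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper written for illustration, not a database or library function. It builds the 64-codon table from the conventional TCAG ordering and translates the first 30 and last 18 nucleotides of the CDS, copied from the listing above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 18 nucleotides of the CDS shown above.
cds_start = "ATGGGGAGAGGGAAGAAGAAAGCGGTGGGA"
cds_end = "TATAATCCAACATCATGA"
print(translate(cds_start))  # should match the start of the protein sequence
print(translate(cds_end))    # should match the protein's end plus a stop
```

Joining the CDS lines into one string and passing the result to `translate` would give the full protein followed by `*`, which can then be compared with the protein sequence listed here.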