; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033308 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033308
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionChlorophyllase
Genome locationchr11:42663747..42667281
RNA-Seq ExpressionLag0033308
SyntenyLag0033308
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0052689 - carboxylic ester hydrolase activity (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606413.1 Chlorophyllase type 0, partial [Cucurbita argyrosperma subsp. sororia]6.1e-21755.61Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        LY N KRGNWK V+  YE+ PE AQ LKLTR GDTALHLAV+DN+E +V++LV  ++ S    +LL   ND E N LHLAA  GSA MC+AIASAH  LV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS
        + RNK  ETPL+LAA  G+R+ F+CLY+FCR    ++  NCR  ++G+TVLHSALRN++FDLAF ++H+NN+A++WV   G TPLH LA+KPTAF+SG+ 
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS

Query:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVN-EESEIVDI
        I+G  NI YY T V++LKPQ I++L + W   +  TNT+T  FPANY  CI FFT +WDGFLK      +    +   +KK  +D  +++  E  E  + 
Subjt:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVN-EESEIVDI

Query:  IDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQT
         + ++ +D  +L+KA   ++ EASITN P NY TCI+    VFSAI++ LG+GS   +KI+ +K+KH+WS+QVME LL+ +  D+YD +G  P +STSQT
Subjt:  IDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQT

Query:  YDKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQT
         D+ + T PY   D+  V FS ++ +KP  +    + Q  E+ MLLAAKNGV+E+VKG++  FPL+I+D   +KKNVVLLA EY QPDVYRFLL+     
Subjt:  YDKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQT

Query:  KSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIAT
        ++LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K G+WL  TS+SCSVV  LI T
Subjt:  KSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIAT

Query:  VAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRL
        VAF S  +IPGG N  DG+  L   E FF + + SLIALCLSSTSV +FLAILT RFDA DF +NLPWKL IG SSL+ SII+ LISFCAGHYFL+   +
Subjt:  VAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRL

Query:  QNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
         + A LLYT+  +PVALIF +++LPLY D++QAI K+V KRSA VV +
Subjt:  QNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

XP_022931012.1 uncharacterized protein LOC111437338 isoform X1 [Cucurbita moschata]2.2e-21454.53Show/hide
Query:  NNNKNANSGECLYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMC
        N    A   + LY N KR  W+ V+  YE+ PE AQ LKLTR GDTALHLAV+DN+E +V++LV  ++ S    +LL   ND + N LHLAA  GSA MC
Subjt:  NNNKNANSGECLYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMC

Query:  HAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLA
        +AIASAH  LV+ RNK  ETPL+LAA  G+R+ F+CLY+FCR     +  NCR  ++G+TVLHSALRN++FDLAF ++H++N+A++WV   G TPLH LA
Subjt:  HAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLA

Query:  NKPTAFRSGTSIKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKH
        +KPTAF+SG+ I+G  NI YY T VD+L PQ I++L + W   +   NT+T  FPANYETCI FFT +WDGFLK      +    +   +KK  +D  ++
Subjt:  NKPTAFRSGTSIKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKH

Query:  VN-EESEIVDIIDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHD
        +  E  E  +  + ++ +D  +L+KA   ++ EASITN P NY TCI+    VFSAI++ LG+GS   +KI+ +K+KH+WS+QVME LL+ +  D+YD +
Subjt:  VN-EESEIVDIIDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHD

Query:  GYTPKESTSQTYDKT-ETSPYDIGDEGRVGFSDNVATKPNPSN--NSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQP
        G  P +STSQT D+   T PY   D+  V FS ++ +KP  +      + QA E+ MLLAAKNGV+E+VKG++  FPL+I+D   +KKNVVLLA EY QP
Subjt:  GYTPKESTSQTYDKT-ETSPYDIGDEGRVGFSDNVATKPNPSN--NSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQP

Query:  DVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNT
        DVYRFLL+ +   ++LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K G+WL  T
Subjt:  DVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNT

Query:  SESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLIS
        S+SCSVV  LI TVAF S  +IPGG N  DG+  L   E FF + + SLIALCLSSTSV +FLAILT RFDA DF +NLPWKL IG SSL+ SII+ LIS
Subjt:  SESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLIS

Query:  FCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        FCAGHYFL+   + + A LLYT+  +PVALIF +++LPLY D++QAI K+V KRSA VV +
Subjt:  FCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

XP_022995620.1 uncharacterized protein LOC111491104 isoform X1 [Cucurbita maxima]3.4e-22857.16Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        LY N KRG W+ V+  YEE PE AQ+LKLTR GDTALHLAV+DN+E VV++LV  ++ +  Y +LL   ND E   LHLAA  GSA MC+AIASAH+ LV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS
        D+RNK  ETPL+LAA  G+R+ F+CLY+FCR+N  ++  NCR T +G+TVLHSALRN++FDLAFQ++H+NN+A++WV   G TPLH LA+KPTAF+SG+ 
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS

Query:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII
        I+G  NI YY T VD+LKPQ I++L + W   +   NT+T  FPANY TCI FFT +WDGFLK      +    E   +KK  + +   + E  E ++  
Subjt:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII

Query:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY
        +S++ +D   L+KA   ++ EASITN P NY TCI+    VFSAIM+ LG GS   +KI+ +K+KH+WS+QVME LL+ +P D+Y+ +G TP +STSQT 
Subjt:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY

Query:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK
        D+ E T PY +   G V  S+++ +KP  +    NVQA E+AMLLAAKNGV+E+VKG++  FPL+I D   +KKNVVLLA EY QPDVYRFLL+ +   +
Subjt:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK

Query:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV
        +LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K GEWL  TS+SCSVV TLI TV
Subjt:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV

Query:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ
        AF S  +IPGG N  DG+  L   + FF + + SLIALCLSSTSV MFLAILT RFDA DF +NLPWKL IG SSL+ SII+ L+SFCAGHYFL+   + 
Subjt:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ

Query:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        + A LLYT+  +PVALIF +++LPLY D++QAI K+V  RSA VV +
Subjt:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

XP_022995621.1 uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima]2.5e-22657.03Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        LY N KRG W+ V+  YEE PE AQ+LKLTR GDTALHLAV+DN+E VV++LV  ++ +  Y +LL   ND E   LHLAA  GSA MC+AIASAH+ LV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS
        D+RNK  ETPL+LAA  G+R+ F+CLY+FCR+N  ++  NCR T +G+TVLHSALRN++FDLAFQ++H+NN+A++WV   G TPLH LA+KPTAF+SG+ 
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS

Query:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII
        I+G  NI YY T VD+LKPQ I++L + W   +   NT+T  FPANY TCI FFT +WDGFLK      +    E   +KK  + +   + E  E ++  
Subjt:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII

Query:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY
        +S++ +D   L+KA   ++ EASITN P NY TCI+    VFSAIM+ LG GS   +KI+ +K+KH+WS+QVME LL+ +P D+Y+ +G TP +STSQT 
Subjt:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY

Query:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK
        D+ E T PY +   G V  S+++ +KP  +    NVQ  E+AMLLAAKNGV+E+VKG++  FPL+I D   +KKNVVLLA EY QPDVYRFLL+ +   +
Subjt:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK

Query:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV
        +LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K GEWL  TS+SCSVV TLI TV
Subjt:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV

Query:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ
        AF S  +IPGG N  DG+  L   + FF + + SLIALCLSSTSV MFLAILT RFDA DF +NLPWKL IG SSL+ SII+ L+SFCAGHYFL+   + 
Subjt:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ

Query:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        + A LLYT+  +PVALIF +++LPLY D++QAI K+V  RSA VV +
Subjt:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

XP_022995622.1 uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima]3.4e-22857.16Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        LY N KRG W+ V+  YEE PE AQ+LKLTR GDTALHLAV+DN+E VV++LV  ++ +  Y +LL   ND E   LHLAA  GSA MC+AIASAH+ LV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS
        D+RNK  ETPL+LAA  G+R+ F+CLY+FCR+N  ++  NCR T +G+TVLHSALRN++FDLAFQ++H+NN+A++WV   G TPLH LA+KPTAF+SG+ 
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS

Query:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII
        I+G  NI YY T VD+LKPQ I++L + W   +   NT+T  FPANY TCI FFT +WDGFLK      +    E   +KK  + +   + E  E ++  
Subjt:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII

Query:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY
        +S++ +D   L+KA   ++ EASITN P NY TCI+    VFSAIM+ LG GS   +KI+ +K+KH+WS+QVME LL+ +P D+Y+ +G TP +STSQT 
Subjt:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY

Query:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK
        D+ E T PY +   G V  S+++ +KP  +    NVQA E+AMLLAAKNGV+E+VKG++  FPL+I D   +KKNVVLLA EY QPDVYRFLL+ +   +
Subjt:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK

Query:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV
        +LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K GEWL  TS+SCSVV TLI TV
Subjt:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV

Query:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ
        AF S  +IPGG N  DG+  L   + FF + + SLIALCLSSTSV MFLAILT RFDA DF +NLPWKL IG SSL+ SII+ L+SFCAGHYFL+   + 
Subjt:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ

Query:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        + A LLYT+  +PVALIF +++LPLY D++QAI K+V  RSA VV +
Subjt:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

TrEMBL top hitse value%identityAlignment
A0A6J1ESA5 uncharacterized protein LOC111437338 isoform X11.0e-21454.53Show/hide
Query:  NNNKNANSGECLYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMC
        N    A   + LY N KR  W+ V+  YE+ PE AQ LKLTR GDTALHLAV+DN+E +V++LV  ++ S    +LL   ND + N LHLAA  GSA MC
Subjt:  NNNKNANSGECLYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMC

Query:  HAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLA
        +AIASAH  LV+ RNK  ETPL+LAA  G+R+ F+CLY+FCR     +  NCR  ++G+TVLHSALRN++FDLAF ++H++N+A++WV   G TPLH LA
Subjt:  HAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLA

Query:  NKPTAFRSGTSIKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKH
        +KPTAF+SG+ I+G  NI YY T VD+L PQ I++L + W   +   NT+T  FPANYETCI FFT +WDGFLK      +    +   +KK  +D  ++
Subjt:  NKPTAFRSGTSIKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKH

Query:  VN-EESEIVDIIDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHD
        +  E  E  +  + ++ +D  +L+KA   ++ EASITN P NY TCI+    VFSAI++ LG+GS   +KI+ +K+KH+WS+QVME LL+ +  D+YD +
Subjt:  VN-EESEIVDIIDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHD

Query:  GYTPKESTSQTYDKT-ETSPYDIGDEGRVGFSDNVATKPNPSN--NSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQP
        G  P +STSQT D+   T PY   D+  V FS ++ +KP  +      + QA E+ MLLAAKNGV+E+VKG++  FPL+I+D   +KKNVVLLA EY QP
Subjt:  GYTPKESTSQTYDKT-ETSPYDIGDEGRVGFSDNVATKPNPSN--NSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQP

Query:  DVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNT
        DVYRFLL+ +   ++LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K G+WL  T
Subjt:  DVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNT

Query:  SESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLIS
        S+SCSVV  LI TVAF S  +IPGG N  DG+  L   E FF + + SLIALCLSSTSV +FLAILT RFDA DF +NLPWKL IG SSL+ SII+ LIS
Subjt:  SESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLIS

Query:  FCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        FCAGHYFL+   + + A LLYT+  +PVALIF +++LPLY D++QAI K+V KRSA VV +
Subjt:  FCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

A0A6J1ET45 Chlorophyllase1.0e-21454.53Show/hide
Query:  NNNKNANSGECLYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMC
        N    A   + LY N KR  W+ V+  YE+ PE AQ LKLTR GDTALHLAV+DN+E +V++LV  ++ S    +LL   ND + N LHLAA  GSA MC
Subjt:  NNNKNANSGECLYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMC

Query:  HAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLA
        +AIASAH  LV+ RNK  ETPL+LAA  G+R+ F+CLY+FCR     +  NCR  ++G+TVLHSALRN++FDLAF ++H++N+A++WV   G TPLH LA
Subjt:  HAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCR-RTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLA

Query:  NKPTAFRSGTSIKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKH
        +KPTAF+SG+ I+G  NI YY T VD+L PQ I++L + W   +   NT+T  FPANYETCI FFT +WDGFLK      +    +   +KK  +D  ++
Subjt:  NKPTAFRSGTSIKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKH

Query:  VN-EESEIVDIIDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHD
        +  E  E  +  + ++ +D  +L+KA   ++ EASITN P NY TCI+    VFSAI++ LG+GS   +KI+ +K+KH+WS+QVME LL+ +  D+YD +
Subjt:  VN-EESEIVDIIDSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHD

Query:  GYTPKESTSQTYDKT-ETSPYDIGDEGRVGFSDNVATKPNPSN--NSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQP
        G  P +STSQT D+   T PY   D+  V FS ++ +KP  +      + QA E+ MLLAAKNGV+E+VKG++  FPL+I+D   +KKNVVLLA EY QP
Subjt:  GYTPKESTSQTYDKT-ETSPYDIGDEGRVGFSDNVATKPNPSN--NSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQP

Query:  DVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNT
        DVYRFLL+ +   ++LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K G+WL  T
Subjt:  DVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNT

Query:  SESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLIS
        S+SCSVV  LI TVAF S  +IPGG N  DG+  L   E FF + + SLIALCLSSTSV +FLAILT RFDA DF +NLPWKL IG SSL+ SII+ LIS
Subjt:  SESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLIS

Query:  FCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        FCAGHYFL+   + + A LLYT+  +PVALIF +++LPLY D++QAI K+V KRSA VV +
Subjt:  FCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X31.7e-22857.16Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        LY N KRG W+ V+  YEE PE AQ+LKLTR GDTALHLAV+DN+E VV++LV  ++ +  Y +LL   ND E   LHLAA  GSA MC+AIASAH+ LV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS
        D+RNK  ETPL+LAA  G+R+ F+CLY+FCR+N  ++  NCR T +G+TVLHSALRN++FDLAFQ++H+NN+A++WV   G TPLH LA+KPTAF+SG+ 
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS

Query:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII
        I+G  NI YY T VD+LKPQ I++L + W   +   NT+T  FPANY TCI FFT +WDGFLK      +    E   +KK  + +   + E  E ++  
Subjt:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII

Query:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY
        +S++ +D   L+KA   ++ EASITN P NY TCI+    VFSAIM+ LG GS   +KI+ +K+KH+WS+QVME LL+ +P D+Y+ +G TP +STSQT 
Subjt:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY

Query:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK
        D+ E T PY +   G V  S+++ +KP  +    NVQA E+AMLLAAKNGV+E+VKG++  FPL+I D   +KKNVVLLA EY QPDVYRFLL+ +   +
Subjt:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK

Query:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV
        +LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K GEWL  TS+SCSVV TLI TV
Subjt:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV

Query:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ
        AF S  +IPGG N  DG+  L   + FF + + SLIALCLSSTSV MFLAILT RFDA DF +NLPWKL IG SSL+ SII+ L+SFCAGHYFL+   + 
Subjt:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ

Query:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        + A LLYT+  +PVALIF +++LPLY D++QAI K+V  RSA VV +
Subjt:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X21.2e-22657.03Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        LY N KRG W+ V+  YEE PE AQ+LKLTR GDTALHLAV+DN+E VV++LV  ++ +  Y +LL   ND E   LHLAA  GSA MC+AIASAH+ LV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS
        D+RNK  ETPL+LAA  G+R+ F+CLY+FCR+N  ++  NCR T +G+TVLHSALRN++FDLAFQ++H+NN+A++WV   G TPLH LA+KPTAF+SG+ 
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS

Query:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII
        I+G  NI YY T VD+LKPQ I++L + W   +   NT+T  FPANY TCI FFT +WDGFLK      +    E   +KK  + +   + E  E ++  
Subjt:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII

Query:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY
        +S++ +D   L+KA   ++ EASITN P NY TCI+    VFSAIM+ LG GS   +KI+ +K+KH+WS+QVME LL+ +P D+Y+ +G TP +STSQT 
Subjt:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY

Query:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK
        D+ E T PY +   G V  S+++ +KP  +    NVQ  E+AMLLAAKNGV+E+VKG++  FPL+I D   +KKNVVLLA EY QPDVYRFLL+ +   +
Subjt:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK

Query:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV
        +LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K GEWL  TS+SCSVV TLI TV
Subjt:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV

Query:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ
        AF S  +IPGG N  DG+  L   + FF + + SLIALCLSSTSV MFLAILT RFDA DF +NLPWKL IG SSL+ SII+ L+SFCAGHYFL+   + 
Subjt:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ

Query:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        + A LLYT+  +PVALIF +++LPLY D++QAI K+V  RSA VV +
Subjt:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X11.7e-22857.16Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        LY N KRG W+ V+  YEE PE AQ+LKLTR GDTALHLAV+DN+E VV++LV  ++ +  Y +LL   ND E   LHLAA  GSA MC+AIASAH+ LV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS
        D+RNK  ETPL+LAA  G+R+ F+CLY+FCR+N  ++  NCR T +G+TVLHSALRN++FDLAFQ++H+NN+A++WV   G TPLH LA+KPTAF+SG+ 
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRT-DGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTS

Query:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII
        I+G  NI YY T VD+LKPQ I++L + W   +   NT+T  FPANY TCI FFT +WDGFLK      +    E   +KK  + +   + E  E ++  
Subjt:  IKGCLNIVYYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLK------VISSLEPLVNKKSDEDLEKHVNEESEIVDII

Query:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY
        +S++ +D   L+KA   ++ EASITN P NY TCI+    VFSAIM+ LG GS   +KI+ +K+KH+WS+QVME LL+ +P D+Y+ +G TP +STSQT 
Subjt:  DSNEHMDMVRLVKADDPSESEASITNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTY

Query:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK
        D+ E T PY +   G V  S+++ +KP  +    NVQA E+AMLLAAKNGV+E+VKG++  FPL+I D   +KKNVVLLA EY QPDVYRFLL+ +   +
Subjt:  DKTE-TSPYDIGDEGRVGFSDNVATKPNPSNNSTNVQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTK

Query:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV
        +LF+AVD  GNSALHLAA      +WRITGAALQMQWE+KWYK+VEES+PL FFAHYNKEGK A AIFHE+H  LV K GEWL  TS+SCSVV TLI TV
Subjt:  SLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATV

Query:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ
        AF S  +IPGG N  DG+  L   + FF + + SLIALCLSSTSV MFLAILT RFDA DF +NLPWKL IG SSL+ SII+ L+SFCAGHYFL+   + 
Subjt:  AFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQ

Query:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN
        + A LLYT+  +PVALIF +++LPLY D++QAI K+V  RSA VV +
Subjt:  NIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFN

SwissProt top hitse value%identityAlignment
O15084 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit A5.3e-0630.16Show/hide
Query:  GDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRS
        G+T LH+A  + Q+ VV  L+           ++  +N+     LH AA +    +C  +   +   V++++KDG+TPL + ALHG          F RS
Subjt:  GDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRS

Query:  NLY---QLAINCRRTDGNTVLHSALR
                 I+C   +GNT LH A R
Subjt:  NLY---QLAINCRRTDGNTVLHSALR

Q5F478 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit B7.0e-0632.54Show/hide
Query:  GDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRS
        G+TALH+A  + Q+ VV  L+        Y   +   N+     LH AA +    +C  +   +   V++++KDG++PL + A+HG          F RS
Subjt:  GDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRS

Query:  -NLYQLA--INCRRTDGNTVLHSALR
          L Q    I+C   DGNT LH A R
Subjt:  -NLYQLA--INCRRTDGNTVLHSALR

Q5ZIJ9 E3 ubiquitin-protein ligase MIB27.0e-0628.76Show/hide
Query:  GDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRS
        GDT LH A+  + + ++E L  +       Q   G       N LH +A+ G+      I +    LVD + +DG T L LAAL+ H+ V   L    R 
Subjt:  GDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRS

Query:  NLYQLAINCRRTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLH
        +     +N +     T LH A+   +  L   L+   +D +N  D  G T +H
Subjt:  NLYQLAINCRRTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLH

Q80YE7 Death-associated protein kinase 11.2e-0526.45Show/hide
Query:  RQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFC
        + G+TALH+A       VV+ L         +      ++  E   LH AA +G   +  A+       V+++N++GETPL  A+  G+ ++  CL    
Subjt:  RQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLYYFC

Query:  RSNLYQLAINCRRTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLH
            +   +N    DG+  LH A+R    ++   L+  +   +++ D  G TPLH
Subjt:  RSNLYQLAINCRRTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLH

Q91974 NF-kappa-B inhibitor alpha8.2e-0728.48Show/hide
Query:  KLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLY
        +LT  GDT LHLA++ ++E+ +   VI  R++      L  +N+     LHLA +   A +   +  A   L DVR+  G TPL +A   G       L 
Subjt:  KLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLVDVRNKDGETPLFLAALHGHRNVFYCLY

Query:  YFCRSNLYQLAINCRRTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLH
          C+ +     +     +G+T LH A    Y  +   L+ +  D        G T LH
Subjt:  YFCRSNLYQLAINCRRTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLH

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein7.3e-3533.11Show/hide
Query:  AMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAVDHIGNSALHLAA-NMKDHKLWRITGAALQMQWEVK
        A+  A +NG+VE ++ +  H+P  +   N    N+   AV  RQ  ++  + N   +   L    D   N+ LH AA      +L  I GAALQMQ E++
Subjt:  AMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAVDHIGNSALHLAA-NMKDHKLWRITGAALQMQWEVK

Query:  WYKYVEESMPLDFFAHYN-KEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALC
        W+K VE+ +        N K+ KT  A+F + H+ LV +G +W+  T+ SC+VVA LI T+ F+S   +PGG   SDG  +   +  F I+ IS  I+L 
Subjt:  WYKYVEESMPLDFFAHYN-KEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALC

Query:  LSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQA
         S  S++MFL IL SR+  +DF  +LP KL++GL +L+ S+   +++F      L+ +++  ++     L  +P+ + F + Q P+  ++ +A
Subjt:  LSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQA

AT3G18670.1 Ankyrin repeat family protein1.5e-1133.07Show/hide
Query:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV
        L+ N   G  +A     + +PEA   + LT  GDT +H AV+    ++VE    ++R+    +Q+L I+ND    AL  AA  G  R+   + +   GLV
Subjt:  LYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHEGLV

Query:  DVRNKDGETPLFLAALHGHRNVFYCLY
         VRN     P+ +A+L+GH+++   LY
Subjt:  DVRNKDGETPLFLAALHGHRNVFYCLY

AT5G04690.1 Ankyrin repeat family protein4.2e-3034.75Show/hide
Query:  QATESAMLLAAKNG----VVEMVKG----LWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQA-VDHIGNSALHLAA-NMKDHKLW
        ++ + A+L A + G    +VEM+K     LW          +     +   AV+ RQ  V+  LL      K LF A  D  GNS LHLA     ++KL 
Subjt:  QATESAMLLAAKNG----VVEMVKG----LWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQA-VDHIGNSALHLAA-NMKDHKLW

Query:  RITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEG
         +  A LQMQ E++W+K +E  +P       N E  T   IF + HE +  +  +W+ +T+ SCS+VA LI TV FA+   +PGGT+D+ G     + E 
Subjt:  RITGAALQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEG

Query:  FF-IYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPL
         F I+ +S LI+   + TSV++FL ILT+R+   DF  +LP  ++ GLS+L+ SI A L++F +   F I +    +A  ++   F   AL+F + Q PL
Subjt:  FF-IYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPL

Query:  YFDLL
          +L+
Subjt:  YFDLL

AT5G04700.1 Ankyrin repeat family protein5.8e-3235.02Show/hide
Query:  QATESAMLLAAKNGVVE-MVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAVDHIGNSALHLAA-NMKDHKLWRITGAALQ
        +  + A+L A + G V+ +V+ +  +  L           + LLAVE+RQ  V+  L     +   L    D  GN  LHLA       KL  + GA LQ
Subjt:  QATESAMLLAAKNGVVE-MVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAVDHIGNSALHLAA-NMKDHKLWRITGAALQ

Query:  MQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTND-SDGTAILGKEEGFFIYTIS
        +Q E++W+K VE   P       N E +T   IF + H+ L  +  +W+ +T+ SCS+VA LI TV FA+   +PGGT+D S G     ++  F I+ +S
Subjt:  MQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTND-SDGTAILGKEEGFFIYTIS

Query:  SLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLL
         LI+   S TSV++FL ILT+R+   DF   LP K++ GLS L+ SI A LI+F +  + ++    + I         LP AL+F L Q PL  +++
Subjt:  SLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLL

AT5G04730.1 Ankyrin-repeat containing protein9.3e-3031.86Show/hide
Query:  AMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEK-KNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMK-DHKLWRITGAALQMQWEV
        A+L AAK+G  +    + +     +  +NP   +N+  LAVE+++  ++  +     +  +L ++ D   N+ LH+A  +    +L +I+GAAL+MQ E 
Subjt:  AMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEK-KNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMK-DHKLWRITGAALQMQWEV

Query:  KWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALC
        +W+K VE  +        NK+ KT   IF   HE L  +G EW+  T+ +CS VA LIATV F +   +PGG + + G+ ++  +  F  +  +  +A  
Subjt:  KWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALC

Query:  LSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAII
         S  SV++FL+ILTSR+   DF  +LP K+++G S L+ SI + L++F       +   +++   L+Y L   P+A    L  L L + LL+ +I
Subjt:  LSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAII

AT5G35810.1 Ankyrin repeat family protein9.9e-3232.79Show/hide
Query:  SAMLL--AAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAV-----DHIGNSALHLAANM-KDHKLWRITGAA
        S MLL  AA++G +E++  L   +P  I  ++ + +++  +A   R   ++    N  Y+  ++   +         ++ LHL A +   ++L  ++GAA
Subjt:  SAMLL--AAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAV-----DHIGNSALHLAANM-KDHKLWRITGAA

Query:  LQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILG-----KEEGF
        LQMQ E+ WYK V+E +P  +    NK+ + A+ +F + H+ L  +G +W+  T+ +C +V+TLIATV FA+   +PGG + S     LG     KE  F
Subjt:  LQMQWEVKWYKYVEESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILG-----KEEGF

Query:  FIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYF
         ++ IS  +AL  S TS+++FL+ILTSR+    F + LP KL++GL +L+ SII+ +++F A    LI D+    +++L        AL F +    L+F
Subjt:  FIYTISSLIALCLSSTSVIMFLAILTSRFDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYF

Query:  DLLQA
        D L++
Subjt:  DLLQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAACAATAATAAGAATGCAAATTCAGGAGAGTGTCTGTATGCAAATGCAAAGAGAGGGAATTGGAAGGCAGTGGTGGCCACGTACGAGGAAGACCCAGAAGCAGC
TCAGAAGCTGAAGCTGACCCGACAAGGCGACACGGCGCTGCACCTGGCGGTCGTCGACAATCAAGAACGAGTAGTTGAAAGGCTCGTGATACTCGTTAGAAAATCCATGC
ATTACCAGCAGCTTCTTGGAATTAGAAATGACGCAGAGAACAACGCTCTCCACCTGGCCGCCGTGAACGGAAGCGCCAGAATGTGCCACGCCATTGCTTCGGCCCATGAG
GGCTTGGTGGATGTGAGAAACAAAGACGGGGAAACGCCTCTGTTCTTGGCGGCTCTCCATGGCCACAGAAATGTCTTCTACTGCCTTTACTATTTTTGCAGATCAAACCT
CTATCAACTCGCCATCAACTGTAGACGAACTGATGGAAATACCGTACTGCATTCTGCCCTCAGAAACGAGTATTTTGATTTGGCATTTCAATTAATTCACATCAACAATG
ATGCTATAAACTGGGTGGATGGGCAAGGCTTCACCCCTCTCCATGGTCTAGCAAATAAGCCAACTGCCTTCAGAAGTGGAACCTCCATCAAGGGCTGCCTCAACATCGTC
TATTACTTCACATGTGTGGATCGACTAAAGCCTCAATCAATAGAAACCCTCGGCAAAGCGTGGAAGAAAAGTGTTCCCAAAACAAACACGGCTACTTCCTATTTTCCAGC
TAATTACGAGACATGCATTCACTTCTTTACGAGGTTATGGGATGGATTTTTAAAAGTCATCAGCTCTTTGGAACCACTGGTCAACAAGAAAAGCGATGAAGATTTGGAGA
AACATGTTAACGAGGAAAGTGAAATTGTTGACATCATTGATTCCAACGAGCACATGGACATGGTTAGACTTGTTAAAGCCGATGATCCATCAGAAAGCGAGGCCTCGATT
ACAAATTTCCCGGAAAACTATACTACCTGCATCAACATTCTTCACGCTGTTTTCTCGGCCATTATGGTCATTCTTGGGTTCGGATCCATGGCAATCAGAAAGATACAAGA
GGAGAAAAAGAAACATAGTTGGTCACTTCAAGTGATGGAGATACTCTTAAAACTTTCTCCCATCGATAAATATGACCACGATGGATATACTCCCAAGGAATCGACATCTC
AAACCTACGACAAAACTGAAACATCTCCTTACGACATCGGAGACGAAGGTCGAGTCGGTTTCAGTGATAACGTAGCAACGAAACCAAACCCCTCCAACAACTCCACAAAT
GTCCAAGCCACGGAGTCAGCAATGTTATTAGCAGCAAAGAATGGTGTGGTTGAGATGGTGAAGGGACTGTGGGAACACTTTCCACTGGCGATCCATGATATGAACCCAGA
GAAGAAGAATGTGGTGCTTTTGGCTGTGGAGTACAGGCAGCCCGACGTCTACAGGTTTTTACTCAATAACAGATATCAGACCAAAAGCCTCTTTCAAGCCGTCGATCATA
TTGGCAACAGCGCCTTGCATCTCGCAGCTAACATGAAAGATCATAAGCTTTGGCGCATCACCGGAGCTGCACTGCAGATGCAATGGGAAGTTAAATGGTATAAGTACGTG
GAAGAGTCGATGCCATTGGATTTCTTTGCCCACTACAACAAGGAAGGGAAAACTGCAAATGCAATCTTCCATGAATCCCACGAGACGCTAGTGACAAAGGGTGGGGAATG
GCTTTACAACACCTCAGAGTCATGCTCCGTGGTGGCTACTCTGATCGCAACGGTGGCCTTTGCATCGACGGTCGCCATCCCGGGGGGCACCAACGACAGCGACGGTACCG
CAATACTTGGAAAAGAGGAAGGCTTTTTTATCTACACCATCTCTTCTCTCATTGCCCTCTGCCTCTCTTCAACCTCAGTCATCATGTTTTTGGCCATCCTGACCTCGAGG
TTCGACGCCAAAGACTTCGGATCAAACTTGCCCTGGAAACTGCTCATCGGCTTATCCTCACTCTACTTTTCCATCATCGCCACCTTGATTTCATTTTGTGCAGGCCACTA
CTTTCTCATCATTGATCGCCTTCAAAACATCGCCATTCTTCTCTACACACTTACTTTTCTCCCCGTCGCTTTGATTTTCGGCTTAGCACAACTTCCTCTCTACTTCGATT
TGCTCCAGGCTATTATCAAAGTAGTGACTAAAAGGAGCGCTGATGTTGTTTTTAATACTACCGATGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAAACAATAATAAGAATGCAAATTCAGGAGAGTGTCTGTATGCAAATGCAAAGAGAGGGAATTGGAAGGCAGTGGTGGCCACGTACGAGGAAGACCCAGAAGCAGC
TCAGAAGCTGAAGCTGACCCGACAAGGCGACACGGCGCTGCACCTGGCGGTCGTCGACAATCAAGAACGAGTAGTTGAAAGGCTCGTGATACTCGTTAGAAAATCCATGC
ATTACCAGCAGCTTCTTGGAATTAGAAATGACGCAGAGAACAACGCTCTCCACCTGGCCGCCGTGAACGGAAGCGCCAGAATGTGCCACGCCATTGCTTCGGCCCATGAG
GGCTTGGTGGATGTGAGAAACAAAGACGGGGAAACGCCTCTGTTCTTGGCGGCTCTCCATGGCCACAGAAATGTCTTCTACTGCCTTTACTATTTTTGCAGATCAAACCT
CTATCAACTCGCCATCAACTGTAGACGAACTGATGGAAATACCGTACTGCATTCTGCCCTCAGAAACGAGTATTTTGATTTGGCATTTCAATTAATTCACATCAACAATG
ATGCTATAAACTGGGTGGATGGGCAAGGCTTCACCCCTCTCCATGGTCTAGCAAATAAGCCAACTGCCTTCAGAAGTGGAACCTCCATCAAGGGCTGCCTCAACATCGTC
TATTACTTCACATGTGTGGATCGACTAAAGCCTCAATCAATAGAAACCCTCGGCAAAGCGTGGAAGAAAAGTGTTCCCAAAACAAACACGGCTACTTCCTATTTTCCAGC
TAATTACGAGACATGCATTCACTTCTTTACGAGGTTATGGGATGGATTTTTAAAAGTCATCAGCTCTTTGGAACCACTGGTCAACAAGAAAAGCGATGAAGATTTGGAGA
AACATGTTAACGAGGAAAGTGAAATTGTTGACATCATTGATTCCAACGAGCACATGGACATGGTTAGACTTGTTAAAGCCGATGATCCATCAGAAAGCGAGGCCTCGATT
ACAAATTTCCCGGAAAACTATACTACCTGCATCAACATTCTTCACGCTGTTTTCTCGGCCATTATGGTCATTCTTGGGTTCGGATCCATGGCAATCAGAAAGATACAAGA
GGAGAAAAAGAAACATAGTTGGTCACTTCAAGTGATGGAGATACTCTTAAAACTTTCTCCCATCGATAAATATGACCACGATGGATATACTCCCAAGGAATCGACATCTC
AAACCTACGACAAAACTGAAACATCTCCTTACGACATCGGAGACGAAGGTCGAGTCGGTTTCAGTGATAACGTAGCAACGAAACCAAACCCCTCCAACAACTCCACAAAT
GTCCAAGCCACGGAGTCAGCAATGTTATTAGCAGCAAAGAATGGTGTGGTTGAGATGGTGAAGGGACTGTGGGAACACTTTCCACTGGCGATCCATGATATGAACCCAGA
GAAGAAGAATGTGGTGCTTTTGGCTGTGGAGTACAGGCAGCCCGACGTCTACAGGTTTTTACTCAATAACAGATATCAGACCAAAAGCCTCTTTCAAGCCGTCGATCATA
TTGGCAACAGCGCCTTGCATCTCGCAGCTAACATGAAAGATCATAAGCTTTGGCGCATCACCGGAGCTGCACTGCAGATGCAATGGGAAGTTAAATGGTATAAGTACGTG
GAAGAGTCGATGCCATTGGATTTCTTTGCCCACTACAACAAGGAAGGGAAAACTGCAAATGCAATCTTCCATGAATCCCACGAGACGCTAGTGACAAAGGGTGGGGAATG
GCTTTACAACACCTCAGAGTCATGCTCCGTGGTGGCTACTCTGATCGCAACGGTGGCCTTTGCATCGACGGTCGCCATCCCGGGGGGCACCAACGACAGCGACGGTACCG
CAATACTTGGAAAAGAGGAAGGCTTTTTTATCTACACCATCTCTTCTCTCATTGCCCTCTGCCTCTCTTCAACCTCAGTCATCATGTTTTTGGCCATCCTGACCTCGAGG
TTCGACGCCAAAGACTTCGGATCAAACTTGCCCTGGAAACTGCTCATCGGCTTATCCTCACTCTACTTTTCCATCATCGCCACCTTGATTTCATTTTGTGCAGGCCACTA
CTTTCTCATCATTGATCGCCTTCAAAACATCGCCATTCTTCTCTACACACTTACTTTTCTCCCCGTCGCTTTGATTTTCGGCTTAGCACAACTTCCTCTCTACTTCGATT
TGCTCCAGGCTATTATCAAAGTAGTGACTAAAAGGAGCGCTGATGTTGTTTTTAATACTACCGATGTGTAG
Protein sequenceShow/hide protein sequence
MSNNNKNANSGECLYANAKRGNWKAVVATYEEDPEAAQKLKLTRQGDTALHLAVVDNQERVVERLVILVRKSMHYQQLLGIRNDAENNALHLAAVNGSARMCHAIASAHE
GLVDVRNKDGETPLFLAALHGHRNVFYCLYYFCRSNLYQLAINCRRTDGNTVLHSALRNEYFDLAFQLIHINNDAINWVDGQGFTPLHGLANKPTAFRSGTSIKGCLNIV
YYFTCVDRLKPQSIETLGKAWKKSVPKTNTATSYFPANYETCIHFFTRLWDGFLKVISSLEPLVNKKSDEDLEKHVNEESEIVDIIDSNEHMDMVRLVKADDPSESEASI
TNFPENYTTCINILHAVFSAIMVILGFGSMAIRKIQEEKKKHSWSLQVMEILLKLSPIDKYDHDGYTPKESTSQTYDKTETSPYDIGDEGRVGFSDNVATKPNPSNNSTN
VQATESAMLLAAKNGVVEMVKGLWEHFPLAIHDMNPEKKNVVLLAVEYRQPDVYRFLLNNRYQTKSLFQAVDHIGNSALHLAANMKDHKLWRITGAALQMQWEVKWYKYV
EESMPLDFFAHYNKEGKTANAIFHESHETLVTKGGEWLYNTSESCSVVATLIATVAFASTVAIPGGTNDSDGTAILGKEEGFFIYTISSLIALCLSSTSVIMFLAILTSR
FDAKDFGSNLPWKLLIGLSSLYFSIIATLISFCAGHYFLIIDRLQNIAILLYTLTFLPVALIFGLAQLPLYFDLLQAIIKVVTKRSADVVFNTTDV