; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033313 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033313
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionChlorophyllase
Genome locationchr11:42688748..42691883
RNA-Seq ExpressionLag0033313
SyntenyLag0033313
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0052689 - carboxylic ester hydrolase activity (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022995620.1 uncharacterized protein LOC111491104 isoform X1 [Cucurbita maxima]7.0e-23758.3Show/hide
Query:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN
        LR+FL  NT+R KW  V++KYEEYPE AQ+LKL+R GDTALHLAV+DN+EE V+KLV  I    +  +LL+TTN+R   PLHLAA  GSA MC+AIASA+
Subjt:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN

Query:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FCRN++++I  NCR  +NGDTVLH ALR++HF+LAFQ++H+NN+AM WV + G+TPLHVLAS PT+FK
Subjt:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK

Query:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET
        SGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NY TCI+FFT + D   K   L  +    +    KKD  + R    E  E+
Subjt:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET

Query:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA
        +E D S +R             E  ITN P+NY TC+  FQIVF+ +MI LG GS E KKI +KK+KHTWSVQVMEKL  ++  PD+Y  +G  P D  +
Subjt:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA

Query:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD
           + +E T PY++   +V  S N ++ K  + E+ K+V  P++AMLLAAKNGV+E+V+G+  RFP +I + R DKKNVVLLAAEY QP VYRFL + K 
Subjt:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD

Query:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI
          E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+VK  G+WLT+TSKSCSVVGTLI
Subjt:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI

Query:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH
        VTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S L+SFCAGHYFL+  
Subjt:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH

Query:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK
         +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP+RSA VV     PP T ++    K
Subjt:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK

XP_022995621.1 uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima]1.8e-23758.32Show/hide
Query:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN
        LR+FL  NT+R KW  V++KYEEYPE AQ+LKL+R GDTALHLAV+DN+EE V+KLV  I    +  +LL+TTN+R   PLHLAA  GSA MC+AIASA+
Subjt:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN

Query:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FCRN++++I  NCR  +NGDTVLH ALR++HF+LAFQ++H+NN+AM WV + G+TPLHVLAS PT+FK
Subjt:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK

Query:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET
        SGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NY TCI+FFT + D   K   L  +    +    KKD  + R    E  E+
Subjt:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET

Query:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA
        +E D S +R             E  ITN P+NY TC+  FQIVF+ +MI LG GS E KKI +KK+KHTWSVQVMEKL  ++  PD+Y  +G  P D  +
Subjt:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA

Query:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDL
           + +E T PY++   +V  S N ++ K  + E+ K+V ++AMLLAAKNGV+E+V+G+  RFP +I + R DKKNVVLLAAEY QP VYRFL + K   
Subjt:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDL

Query:  ESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVT
        E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+VK  G+WLT+TSKSCSVVGTLIVT
Subjt:  ESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVT

Query:  VAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRL
        VAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S L+SFCAGHYFL+   +
Subjt:  VAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRL

Query:  RSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK
           A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP+RSA VV     PP T ++    K
Subjt:  RSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK

XP_022995622.1 uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima]7.0e-23758.3Show/hide
Query:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN
        LR+FL  NT+R KW  V++KYEEYPE AQ+LKL+R GDTALHLAV+DN+EE V+KLV  I    +  +LL+TTN+R   PLHLAA  GSA MC+AIASA+
Subjt:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN

Query:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FCRN++++I  NCR  +NGDTVLH ALR++HF+LAFQ++H+NN+AM WV + G+TPLHVLAS PT+FK
Subjt:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK

Query:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET
        SGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NY TCI+FFT + D   K   L  +    +    KKD  + R    E  E+
Subjt:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET

Query:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA
        +E D S +R             E  ITN P+NY TC+  FQIVF+ +MI LG GS E KKI +KK+KHTWSVQVMEKL  ++  PD+Y  +G  P D  +
Subjt:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA

Query:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD
           + +E T PY++   +V  S N ++ K  + E+ K+V  P++AMLLAAKNGV+E+V+G+  RFP +I + R DKKNVVLLAAEY QP VYRFL + K 
Subjt:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD

Query:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI
          E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+VK  G+WLT+TSKSCSVVGTLI
Subjt:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI

Query:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH
        VTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S L+SFCAGHYFL+  
Subjt:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH

Query:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK
         +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP+RSA VV     PP T ++    K
Subjt:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK

XP_023534305.1 uncharacterized protein LOC111795904 isoform X1 [Cucurbita pepo subsp. pepo]6.1e-23357.26Show/hide
Query:  DDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGS
        ++  N +H+  LR+FL +NT+R +W  V++KYE++PE A +LKL+R GDTALHLAV+DN+E+ V+KLV  I +  +  +LLETTN+R  NPLHLAA  GS
Subjt:  DDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGS

Query:  ARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPL
        A MC+AIASA   LV  RN  DETPL+LAAA+G+ DAF+CLY+FCRN +++I  NCR  +NGDTVLH ALR+ HF+LAFQ++H+NN+AM WV + G+TPL
Subjt:  ARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPL

Query:  HVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDM
        HVLAS PT+FKSGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NYETCI+FFT + D   K   L  +    K    KKD D 
Subjt:  HVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDM

Query:  ERENL-----ETVEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKY
           N+     E++E D   +R             E  +TN P+NY TC+  FQIVF+ ++I LG GS E KKI ++K+KHTWSVQVMEKL  ++  PD+Y
Subjt:  ERENL-----ETVEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKY

Query:  GDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQ
          +G IP      T  +  +T PY+ +++ V FS++ ++ K  + E+ KD  ++ MLLAAKNGV+E+V+G+ +RFP +I++   DKKNVVLLAAEY QP+
Subjt:  GDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQ

Query:  VYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTS
        VYRFL +     E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+V+  GDWLT+TS
Subjt:  VYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTS

Query:  KSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISF
        KSCSVVG LIVTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S LISF
Subjt:  KSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISF

Query:  CAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV
        CAGHYFL+   +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP RSA VV
Subjt:  CAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV

XP_023534306.1 uncharacterized protein LOC111795904 isoform X2 [Cucurbita pepo subsp. pepo]6.1e-23357.26Show/hide
Query:  DDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGS
        ++  N +H+  LR+FL +NT+R +W  V++KYE++PE A +LKL+R GDTALHLAV+DN+E+ V+KLV  I +  +  +LLETTN+R  NPLHLAA  GS
Subjt:  DDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGS

Query:  ARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPL
        A MC+AIASA   LV  RN  DETPL+LAAA+G+ DAF+CLY+FCRN +++I  NCR  +NGDTVLH ALR+ HF+LAFQ++H+NN+AM WV + G+TPL
Subjt:  ARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPL

Query:  HVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDM
        HVLAS PT+FKSGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NYETCI+FFT + D   K   L  +    K    KKD D 
Subjt:  HVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDM

Query:  ERENL-----ETVEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKY
           N+     E++E D   +R             E  +TN P+NY TC+  FQIVF+ ++I LG GS E KKI ++K+KHTWSVQVMEKL  ++  PD+Y
Subjt:  ERENL-----ETVEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKY

Query:  GDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQ
          +G IP      T  +  +T PY+ +++ V FS++ ++ K  + E+ KD  ++ MLLAAKNGV+E+V+G+ +RFP +I++   DKKNVVLLAAEY QP+
Subjt:  GDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQ

Query:  VYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTS
        VYRFL +     E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+V+  GDWLT+TS
Subjt:  VYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTS

Query:  KSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISF
        KSCSVVG LIVTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S LISF
Subjt:  KSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISF

Query:  CAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV
        CAGHYFL+   +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP RSA VV
Subjt:  CAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV

TrEMBL top hitse value%identityAlignment
A0A6J1ESA5 uncharacterized protein LOC111437338 isoform X15.8e-22956.6Show/hide
Query:  MSDDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQ
        + ++  N  H+  LR+FL +NT+R +W  V++KYE++PE AQ LKL+R GDTALHLAV+DN+EE V+KLV  I +  +  +LLETTN+R  NPLHLAA  
Subjt:  MSDDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQ

Query:  GSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLT
        GSA MC+AIASA+  LV+ RN  DETPL+LAAA G+ DAF+CLY+FCR+  + I  NCR  +NGDTVLH ALR++HF+LAF ++H++N+AM WV + G+T
Subjt:  GSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLT

Query:  PLHVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDV
        PLHVLAS PT+FKSGS IRGW+ I YYCT VD+L PQPI++L + W  R+         FP NYETCI+FFT + D   K   L  +    K    KKD 
Subjt:  PLHVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDV

Query:  DMERENLETVEPDNSSE-------------------REVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTP
        D    N+  VE   SSE                   +E  ITN P+NY TC+  FQIVF+ ++I LG+GS E KKI ++K+KHTWSVQVMEKL  ++  P
Subjt:  DMERENLETVEPDNSSE-------------------REVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTP

Query:  DKYGDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDKKQNDPEETK----DVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLA
        D+Y  +G IP      T  +   T PY+ +++ V FS++ ++ K  + E+ K      P++ MLLAAKNGV+E+V+G+  RFP +I++   DKKNVVLLA
Subjt:  DKYGDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDKKQNDPEETK----DVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLA

Query:  AEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGG
        AEY QP VYRFL + K   E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+V+  G
Subjt:  AEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGG

Query:  DWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSI
        DWL +TSKSCSVVG LIVTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV +FLAILT RF+A DF ++LPWKLFIG SSLF SI
Subjt:  DWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSI

Query:  VSTLISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV
        +S LISFCAGHYFL+   +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP RSA VV
Subjt:  VSTLISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV

A0A6J1EX58 uncharacterized protein LOC111437338 isoform X22.6e-22956.69Show/hide
Query:  MSDDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQ
        + ++  N  H+  LR+FL +NT+R +W  V++KYE++PE AQ LKL+R GDTALHLAV+DN+EE V+KLV  I +  +  +LLETTN+R  NPLHLAA  
Subjt:  MSDDKNNADHQ--LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQ

Query:  GSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLT
        GSA MC+AIASA+  LV+ RN  DETPL+LAAA G+ DAF+CLY+FCR+  + I  NCR  +NGDTVLH ALR++HF+LAF ++H++N+AM WV + G+T
Subjt:  GSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLT

Query:  PLHVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDV
        PLHVLAS PT+FKSGS IRGW+ I YYCT VD+L PQPI++L + W  R+         FP NYETCI+FFT + D   K   L  +    K    KKD 
Subjt:  PLHVLASYPTSFKSGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDV

Query:  DMERENLETVEPDNSSE-------------------REVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTP
        D    N+  VE   SSE                   +E  ITN P+NY TC+  FQIVF+ ++I LG+GS E KKI ++K+KHTWSVQVMEKL  ++  P
Subjt:  DMERENLETVEPDNSSE-------------------REVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTP

Query:  DKYGDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDK-KQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEY
        D+Y  +G IP      T  +   T PY+ +++ V FS+++  K  + +  + KD  ++ MLLAAKNGV+E+V+G+  RFP +I++   DKKNVVLLAAEY
Subjt:  DKYGDDGKIP--DGMATGHEDSETFPYAVEENKVNFSINVVDK-KQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEY

Query:  RQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWL
         QP VYRFL + K   E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+V+  GDWL
Subjt:  RQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWL

Query:  TQTSKSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVST
         +TSKSCSVVG LIVTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV +FLAILT RF+A DF ++LPWKLFIG SSLF SI+S 
Subjt:  TQTSKSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVST

Query:  LISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV
        LISFCAGHYFL+   +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP RSA VV
Subjt:  LISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X33.4e-23758.3Show/hide
Query:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN
        LR+FL  NT+R KW  V++KYEEYPE AQ+LKL+R GDTALHLAV+DN+EE V+KLV  I    +  +LL+TTN+R   PLHLAA  GSA MC+AIASA+
Subjt:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN

Query:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FCRN++++I  NCR  +NGDTVLH ALR++HF+LAFQ++H+NN+AM WV + G+TPLHVLAS PT+FK
Subjt:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK

Query:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET
        SGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NY TCI+FFT + D   K   L  +    +    KKD  + R    E  E+
Subjt:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET

Query:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA
        +E D S +R             E  ITN P+NY TC+  FQIVF+ +MI LG GS E KKI +KK+KHTWSVQVMEKL  ++  PD+Y  +G  P D  +
Subjt:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA

Query:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD
           + +E T PY++   +V  S N ++ K  + E+ K+V  P++AMLLAAKNGV+E+V+G+  RFP +I + R DKKNVVLLAAEY QP VYRFL + K 
Subjt:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD

Query:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI
          E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+VK  G+WLT+TSKSCSVVGTLI
Subjt:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI

Query:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH
        VTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S L+SFCAGHYFL+  
Subjt:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH

Query:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK
         +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP+RSA VV     PP T ++    K
Subjt:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X28.9e-23858.32Show/hide
Query:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN
        LR+FL  NT+R KW  V++KYEEYPE AQ+LKL+R GDTALHLAV+DN+EE V+KLV  I    +  +LL+TTN+R   PLHLAA  GSA MC+AIASA+
Subjt:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN

Query:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FCRN++++I  NCR  +NGDTVLH ALR++HF+LAFQ++H+NN+AM WV + G+TPLHVLAS PT+FK
Subjt:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK

Query:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET
        SGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NY TCI+FFT + D   K   L  +    +    KKD  + R    E  E+
Subjt:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET

Query:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA
        +E D S +R             E  ITN P+NY TC+  FQIVF+ +MI LG GS E KKI +KK+KHTWSVQVMEKL  ++  PD+Y  +G  P D  +
Subjt:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA

Query:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDL
           + +E T PY++   +V  S N ++ K  + E+ K+V ++AMLLAAKNGV+E+V+G+  RFP +I + R DKKNVVLLAAEY QP VYRFL + K   
Subjt:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDL

Query:  ESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVT
        E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+VK  G+WLT+TSKSCSVVGTLIVT
Subjt:  ESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVT

Query:  VAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRL
        VAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S L+SFCAGHYFL+   +
Subjt:  VAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRL

Query:  RSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK
           A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP+RSA VV     PP T ++    K
Subjt:  RSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X13.4e-23758.3Show/hide
Query:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN
        LR+FL  NT+R KW  V++KYEEYPE AQ+LKL+R GDTALHLAV+DN+EE V+KLV  I    +  +LL+TTN+R   PLHLAA  GSA MC+AIASA+
Subjt:  LREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASAN

Query:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FCRN++++I  NCR  +NGDTVLH ALR++HF+LAFQ++H+NN+AM WV + G+TPLHVLAS PT+FK
Subjt:  GLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFK

Query:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET
        SGS IRGW+ I YYCT VD+LKPQPI++L + W  R+         FP NY TCI+FFT + D   K   L  +    +    KKD  + R    E  E+
Subjt:  SGSHIRGWQEIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRK----KKDVDMER----ENLET

Query:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA
        +E D S +R             E  ITN P+NY TC+  FQIVF+ +MI LG GS E KKI +KK+KHTWSVQVMEKL  ++  PD+Y  +G  P D  +
Subjt:  VEPDNSSER-------------EVPITNFPKNYTTCVDIFQIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIP-DGMA

Query:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD
           + +E T PY++   +V  S N ++ K  + E+ K+V  P++AMLLAAKNGV+E+V+G+  RFP +I + R DKKNVVLLAAEY QP VYRFL + K 
Subjt:  TGHEDSE-TFPYAVEENKVNFSINVVDKKQNDPEETKDV--PDSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKD

Query:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI
          E+LFRAVD +GN ALHLAAAASKSM+WRITG ALQMQWE+KWYKF+EES+PL+FF HYNKE K A+ IFHETHMD+VK  G+WLT+TSKSCSVVGTLI
Subjt:  DLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLI

Query:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH
        VTVAFTS  +IPGG++  +G PFL+  ++FF FA+ SLIALCLSS SV MFLAILT RF+A DF ++LPWKLFIG SSLF SI+S L+SFCAGHYFL+  
Subjt:  VTVAFTSATTIPGGYDT-EGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDH

Query:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK
         +   A LLYT+  +PVA IF+ISKLPLYIDVV+A+   VP+RSA VV     PP T ++    K
Subjt:  RLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVV-----PPSTGNLLNLSK

SwissProt top hitse value%identityAlignment
B2RXR6 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit B3.4e-0829.19Show/hide
Query:  QGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAI--ASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYF
        +G TAL+LA      E VE LV       +   +    N     PLH +   G       +   + N  +VD+++   +TPL LA AYGH+DA   L   
Subjt:  QGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAI--ASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYF

Query:  CRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLAS
          N  A   + C       T LHR +   H E    L+      +   + +G TPLH  A+
Subjt:  CRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLAS

O15084 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit A6.8e-0929.94Show/hide
Query:  GDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRN
        G+T LH+A  + Q+  V +L+       +   ++   NE+   PLH AAA     +C  +   NG  V+M++ D +TPL + A +G       +      
Subjt:  GDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRN

Query:  DSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLA
          +   I+C +K NG+T LH A R  H  L   LI    D        G+ PLH+ A
Subjt:  DSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLA

Q01484 Ankyrin-27.5e-0828.49Show/hide
Query:  VVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLF
        V+E   +Y  S Q   ++  G T +H+A            + I+    +N    + TN R    LH+AA  G   +   +   NG LVD R  +++TPL 
Subjt:  VVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLF

Query:  LAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASY
        +A+  G  +    L     +  A          NG T LH + R+   ++A  L+     A S   +KG TPLHV A Y
Subjt:  LAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASY

Q505D1 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit A6.8e-0931.21Show/hide
Query:  GDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRN
        G+T LH+A  + Q+  V +L+   C    N++     NE+   PLH AAA     +C  +   NG  V+M++ D +TPL + A +G       +      
Subjt:  GDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRN

Query:  DSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLA
          +   I+C +K NG+T LH A R  H  L   LI    D        G+ PLH+ A
Subjt:  DSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLA

Q8C8R3 Ankyrin-27.5e-0828.49Show/hide
Query:  VVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLF
        V+E   +Y  S Q   ++  G T +H+A            + I+    +N    + TN R    LH+AA  G   +   +   NG LVD R  +++TPL 
Subjt:  VVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLF

Query:  LAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASY
        +A+  G  +    L     +  A          NG T LH + R+   ++A  L+     A S   +KG TPLHV A Y
Subjt:  LAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASY

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein2.5e-3832.31Show/hide
Query:  DSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAA-AASKSMVWRITGDALQMQWE
        + A+  A +NG++E +  + + +P  +    +   N+   A   RQ +++  ++N       L    D+  N  LH AA  A  S +  I G ALQMQ E
Subjt:  DSAMLLAAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAA-AASKSMVWRITGDALQMQWE

Query:  VKWYKFIEESM-PLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYDTEGVPFLEKEQSFFIFAVSSLIAL
        ++W+K +E+ + P H      K++KT   +F + H D+V+ G  W+ +T+ SC+VV  LI T+ F+SA T+PGGY ++G+P    +  F IF +S  I+L
Subjt:  VKWYKFIEESM-PLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYDTEGVPFLEKEQSFFIFAVSSLIAL

Query:  CLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRA
          S  S+ MFL IL SR+  +DF   LP KL +GL +LF S+ + +++F      L+  ++  V+     L  +P+  +F++ + P+ +++ RA
Subjt:  CLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRA

AT3G18670.1 Ankyrin repeat family protein8.5e-0724.56Show/hide
Query:  LSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYY
        L+  GDT +H AV+    + VE++++   R  + +Q+L+  N+     L  AA  G  R+   + +    LV +RN  +  P+ +A+ YGH      LY 
Subjt:  LSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASANGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYY

Query:  FCRNDSAQIAINCREK--NNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFKS
                   +  E    NG  ++   + D  + +A  LI          +    T +  LA  P +F S
Subjt:  FCRNDSAQIAINCREK--NNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFKS

AT5G04680.1 Ankyrin repeat family protein3.8e-3136.53Show/hide
Query:  EETKDVPDSAMLLAAKNG----VLEMVRGLSQRFPPAIHETRNDKKNVV-LLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAAS-KSMVW
        +E  +  D A+L A + G    ++EM+R  S+     +  TR    + + LLA E+RQ +V+  L+   D    L    D DGNG LHLA   S  S + 
Subjt:  EETKDVPDSAMLLAAKNG----VLEMVRGLSQRFPPAIHETRNDKKNVV-LLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAAS-KSMVW

Query:  RITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIV---------TVAFTSATTIPGGYD--TE
         +    L+MQ E++W+K +E   P       N EE+T   IF + H  + +    W+  T+ SCS+V  LIV         TV F +  T+ GG D  +E
Subjt:  RITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIV---------TVAFTSATTIPGGYD--TE

Query:  GVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISF
        G PF   EQ F IF VS LI+   +  +V +FL ILT+R+   DF   LP K+  GLS LF SI + LI+F
Subjt:  GVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISF

AT5G04700.1 Ankyrin repeat family protein1.9e-3534.75Show/hide
Query:  EETKDVPDSAMLLAAKNG----VLEMVRGLSQRFPPAIHETRNDKKNVV-LLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAAS-KSMVW
        +E  +  D A+L A + G    ++EM+R  S+     +  TR    + + LLA E+RQ +V+  L+   D    L    D DGNG LHLA   S  S + 
Subjt:  EETKDVPDSAMLLAAKNG----VLEMVRGLSQRFPPAIHETRNDKKNVV-LLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAAS-KSMVW

Query:  RITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYD--TEGVPFLEKEQ
         + G  LQ+Q E++W+K +E   P       N EE+T   IF + H  + +    W+  T+ SCS+V  LIVTV F +  T+PGG D  ++G PF  +++
Subjt:  RITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYD--TEGVPFLEKEQ

Query:  SFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPL
         F IF VS LI+   S  SV +FL ILT+R+   DF   LP K+  GLS LF SI + LI+F +  + ++    + +         LP A +F++ + PL
Subjt:  SFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPL

Query:  YIDVV
          +++
Subjt:  YIDVV

AT5G04730.1 Ankyrin-repeat containing protein1.9e-3032.45Show/hide
Query:  KDVPDSAMLLAAKNG----VLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAAS-KSMVWRITG
        K+    A+L AAK+G     +E+++  SQ     ++ T    +N+  LA E+++ +++  +H   D   +L R+ D   N  LH+A   S    + +I+G
Subjt:  KDVPDSAMLLAAKNG----VLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAAS-KSMVWRITG

Query:  DALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIF
         AL+MQ E +W+K +E  +        NK+ KT   IF   H  + K G +W+  T+ +CS V  LI TV F +  T+PGG D T G P +  +  F  F
Subjt:  DALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYD-TEGVPFLEKEQSFFIF

Query:  AVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLRSVALLLYTLTFLP--VAFIFLISKLPLYID
          +  +A   S  SV +FL+ILTSR+   DF   LP K+ +G S LF SI S L++F       +   +R    L+Y L  L    + +FL+ + PL  +
Subjt:  AVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLRSVALLLYTLTFLP--VAFIFLISKLPLYID

Query:  VV
        ++
Subjt:  VV

AT5G35810.1 Ankyrin repeat family protein2.4e-3333.12Show/hide
Query:  EETKDVPDSAMLL--AAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLH--NKKDDLESLFRAVDVDGNGALHLAA-AASKSMVWR
        E ++ V  S MLL  AA++G LE++  L + +P  I    +  +++  +AA  R  +++  ++      DL ++++  + + N  LHL A     + +  
Subjt:  EETKDVPDSAMLL--AAKNGVLEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLH--NKKDDLESLFRAVDVDGNGALHLAA-AASKSMVWR

Query:  ITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYDTE------GVPFLE
        ++G ALQMQ E+ WYK ++E +P  +    NK+E+ A  +F + H ++ K G  W+ +T+ +C +V TLI TV F +A T+PGG DT       G P   
Subjt:  ITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEEKTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYDTE------GVPFLE

Query:  KEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLR-SVALLLYTLTFLPVAFIFLIS
        KE  F +F +S  +AL  S  S+ +FL+ILTSR+    F + LP KL +GL +LF SI+S +++F A    + D   + S+ LL+Y  +   ++F+ L  
Subjt:  KEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIGLSSLFFSIVSTLISFCAGHYFLIDHRLR-SVALLLYTLTFLPVAFIFLIS

Query:  KLPLYIDVVRA
           L+ D +R+
Subjt:  KLPLYIDVVRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGATGATAAGAATAATGCAGATCATCAATTACGAGAGTTTCTTGAGATGAACACGAGGAGAAGGAAATGGAATGCAGTGGTCGAAAAGTACGAGGAATACCCAGA
ATCAGCTCAGAAGCTGAAGCTGAGCCGACAGGGCGACACGGCTCTGCATTTGGCTGTCATCGACAACCAAGAAGAAACAGTTGAAAAGCTTGTGAAGATCATTTGCAGAT
ACCCAGAAAACAAGCAACTTCTCGAGACTACAAATGAGAGAAACAATAACCCTCTCCATCTCGCCGCCGCGCAGGGCAGCGCCAGAATGTGCCACGCCATTGCTTCAGCC
AATGGGCTCTTAGTGGACATGAGAAACGTAGACGACGAAACGCCGCTCTTCTTGGCGGCGGCGTACGGCCACATGGACGCCTTTTATTGCCTTTACTACTTTTGCCGAAA
CGACTCGGCTCAAATCGCCATCAACTGCAGAGAGAAGAACAATGGAGACACCGTGCTACACCGCGCCCTCAGAGACAACCACTTTGAATTGGCATTTCAGTTAATTCACA
TGAACAATGATGCTATGAGTTGGGTGAACCAGAAAGGCTTAACCCCACTCCATGTTCTAGCAAGCTACCCAACTTCCTTCAAAAGTGGATCCCATATTAGGGGATGGCAG
GAAATCGTCTATTACTGCACATGTGTGGATAAACTAAAGCCACAACCAATAGAAACCCTTAGCAAAGCGTGGACGAAAAGGTTGCCCAAAGCAAAGAGAATTCCTTACTC
TTTTCCAGAGAACTACGAGACCTGCATTGAATTCTTTACGACGTTGAAGGATGCCATTTTTAAAGTCATCACTTTGAATGGACTACTACTTTTCCGCAAGAAGAAAGATG
TTGACATGGAAAGGGAAAATTTGGAGACCGTTGAACCTGATAATTCATCGGAAAGGGAGGTCCCGATTACAAATTTCCCGAAAAACTATACTACCTGCGTGGATATTTTT
CAAATAGTTTTCACGGGCTTGATGATCATCCTTGGGTTCGGATCTAAGGAAATAAAAAAGATAGTAAAGAAGAAACAGAAACACACTTGGTCAGTTCAAGTGATGGAGAA
ACTGTTTGATCAATTTGGTACACCCGACAAATATGGCGACGATGGAAAAATTCCCGATGGTATGGCAACCGGTCATGAAGATTCTGAAACATTTCCTTACGCCGTTGAAG
AAAACAAAGTCAATTTCAGTATTAATGTGGTAGACAAGAAACAAAACGACCCAGAGGAGACAAAGGATGTCCCAGATTCGGCGATGTTATTAGCAGCGAAGAACGGCGTT
CTTGAGATGGTGAGGGGACTGTCCCAAAGATTTCCACCGGCGATCCACGAAACAAGGAATGACAAGAAGAATGTGGTGCTTTTGGCTGCGGAGTACAGGCAGCCGCAAGT
CTACAGGTTTTTACACAACAAAAAGGATGACTTAGAAAGCCTGTTTCGAGCTGTCGATGTCGATGGAAACGGCGCCTTGCATCTCGCAGCCGCCGCCTCAAAGTCTATGG
TTTGGCGCATCACCGGAGATGCACTGCAGATGCAATGGGAAGTTAAATGGTATAAGTTCATTGAGGAGTCGATGCCACTCCATTTCTTCACCCACTATAACAAGGAAGAA
AAAACTGCAAGTACAATCTTCCATGAAACCCACATGGACATGGTGAAGAACGGCGGCGACTGGCTTACTCAAACCTCCAAGTCATGCTCGGTGGTGGGCACCCTGATTGT
AACAGTGGCTTTTACTTCCGCCACCACTATCCCCGGCGGCTACGACACGGAGGGCGTCCCATTCCTTGAAAAGGAGCAAAGCTTTTTCATCTTCGCCGTGTCTTCCCTCA
TTGCCCTCTGCCTCTCTTCAGCCTCCGTCGCCATGTTTCTGGCCATCTTGACCTCCAGGTTCGAAGCCAAAGACTTCGGCTCCGACTTGCCTTGGAAACTCTTCATAGGC
CTTTCCTCTCTTTTCTTTTCCATCGTCTCCACCTTGATTTCCTTCTGCGCCGGTCACTACTTCCTCATCGACCACCGCCTTCGCAGCGTCGCCCTTCTTCTCTACACACT
CACTTTTCTCCCCGTCGCATTCATCTTCCTCATCTCCAAGCTTCCTCTCTACATCGATGTCGTGCGCGCCCTTGTCGCAACGGTGCCTAGCAGGAGCGCCGACGTCGTCC
CCCCGTCGACCGGAAACCTTCTAAACCTCTCCAAAAAGGAAAATTTGAGGTTTCTTCGATCAGGGCTGAGACCGACTCCGACGAAGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGATGATAAGAATAATGCAGATCATCAATTACGAGAGTTTCTTGAGATGAACACGAGGAGAAGGAAATGGAATGCAGTGGTCGAAAAGTACGAGGAATACCCAGA
ATCAGCTCAGAAGCTGAAGCTGAGCCGACAGGGCGACACGGCTCTGCATTTGGCTGTCATCGACAACCAAGAAGAAACAGTTGAAAAGCTTGTGAAGATCATTTGCAGAT
ACCCAGAAAACAAGCAACTTCTCGAGACTACAAATGAGAGAAACAATAACCCTCTCCATCTCGCCGCCGCGCAGGGCAGCGCCAGAATGTGCCACGCCATTGCTTCAGCC
AATGGGCTCTTAGTGGACATGAGAAACGTAGACGACGAAACGCCGCTCTTCTTGGCGGCGGCGTACGGCCACATGGACGCCTTTTATTGCCTTTACTACTTTTGCCGAAA
CGACTCGGCTCAAATCGCCATCAACTGCAGAGAGAAGAACAATGGAGACACCGTGCTACACCGCGCCCTCAGAGACAACCACTTTGAATTGGCATTTCAGTTAATTCACA
TGAACAATGATGCTATGAGTTGGGTGAACCAGAAAGGCTTAACCCCACTCCATGTTCTAGCAAGCTACCCAACTTCCTTCAAAAGTGGATCCCATATTAGGGGATGGCAG
GAAATCGTCTATTACTGCACATGTGTGGATAAACTAAAGCCACAACCAATAGAAACCCTTAGCAAAGCGTGGACGAAAAGGTTGCCCAAAGCAAAGAGAATTCCTTACTC
TTTTCCAGAGAACTACGAGACCTGCATTGAATTCTTTACGACGTTGAAGGATGCCATTTTTAAAGTCATCACTTTGAATGGACTACTACTTTTCCGCAAGAAGAAAGATG
TTGACATGGAAAGGGAAAATTTGGAGACCGTTGAACCTGATAATTCATCGGAAAGGGAGGTCCCGATTACAAATTTCCCGAAAAACTATACTACCTGCGTGGATATTTTT
CAAATAGTTTTCACGGGCTTGATGATCATCCTTGGGTTCGGATCTAAGGAAATAAAAAAGATAGTAAAGAAGAAACAGAAACACACTTGGTCAGTTCAAGTGATGGAGAA
ACTGTTTGATCAATTTGGTACACCCGACAAATATGGCGACGATGGAAAAATTCCCGATGGTATGGCAACCGGTCATGAAGATTCTGAAACATTTCCTTACGCCGTTGAAG
AAAACAAAGTCAATTTCAGTATTAATGTGGTAGACAAGAAACAAAACGACCCAGAGGAGACAAAGGATGTCCCAGATTCGGCGATGTTATTAGCAGCGAAGAACGGCGTT
CTTGAGATGGTGAGGGGACTGTCCCAAAGATTTCCACCGGCGATCCACGAAACAAGGAATGACAAGAAGAATGTGGTGCTTTTGGCTGCGGAGTACAGGCAGCCGCAAGT
CTACAGGTTTTTACACAACAAAAAGGATGACTTAGAAAGCCTGTTTCGAGCTGTCGATGTCGATGGAAACGGCGCCTTGCATCTCGCAGCCGCCGCCTCAAAGTCTATGG
TTTGGCGCATCACCGGAGATGCACTGCAGATGCAATGGGAAGTTAAATGGTATAAGTTCATTGAGGAGTCGATGCCACTCCATTTCTTCACCCACTATAACAAGGAAGAA
AAAACTGCAAGTACAATCTTCCATGAAACCCACATGGACATGGTGAAGAACGGCGGCGACTGGCTTACTCAAACCTCCAAGTCATGCTCGGTGGTGGGCACCCTGATTGT
AACAGTGGCTTTTACTTCCGCCACCACTATCCCCGGCGGCTACGACACGGAGGGCGTCCCATTCCTTGAAAAGGAGCAAAGCTTTTTCATCTTCGCCGTGTCTTCCCTCA
TTGCCCTCTGCCTCTCTTCAGCCTCCGTCGCCATGTTTCTGGCCATCTTGACCTCCAGGTTCGAAGCCAAAGACTTCGGCTCCGACTTGCCTTGGAAACTCTTCATAGGC
CTTTCCTCTCTTTTCTTTTCCATCGTCTCCACCTTGATTTCCTTCTGCGCCGGTCACTACTTCCTCATCGACCACCGCCTTCGCAGCGTCGCCCTTCTTCTCTACACACT
CACTTTTCTCCCCGTCGCATTCATCTTCCTCATCTCCAAGCTTCCTCTCTACATCGATGTCGTGCGCGCCCTTGTCGCAACGGTGCCTAGCAGGAGCGCCGACGTCGTCC
CCCCGTCGACCGGAAACCTTCTAAACCTCTCCAAAAAGGAAAATTTGAGGTTTCTTCGATCAGGGCTGAGACCGACTCCGACGAAGAAATAA
Protein sequenceShow/hide protein sequence
MSDDKNNADHQLREFLEMNTRRRKWNAVVEKYEEYPESAQKLKLSRQGDTALHLAVIDNQEETVEKLVKIICRYPENKQLLETTNERNNNPLHLAAAQGSARMCHAIASA
NGLLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCRNDSAQIAINCREKNNGDTVLHRALRDNHFELAFQLIHMNNDAMSWVNQKGLTPLHVLASYPTSFKSGSHIRGWQ
EIVYYCTCVDKLKPQPIETLSKAWTKRLPKAKRIPYSFPENYETCIEFFTTLKDAIFKVITLNGLLLFRKKKDVDMERENLETVEPDNSSEREVPITNFPKNYTTCVDIF
QIVFTGLMIILGFGSKEIKKIVKKKQKHTWSVQVMEKLFDQFGTPDKYGDDGKIPDGMATGHEDSETFPYAVEENKVNFSINVVDKKQNDPEETKDVPDSAMLLAAKNGV
LEMVRGLSQRFPPAIHETRNDKKNVVLLAAEYRQPQVYRFLHNKKDDLESLFRAVDVDGNGALHLAAAASKSMVWRITGDALQMQWEVKWYKFIEESMPLHFFTHYNKEE
KTASTIFHETHMDMVKNGGDWLTQTSKSCSVVGTLIVTVAFTSATTIPGGYDTEGVPFLEKEQSFFIFAVSSLIALCLSSASVAMFLAILTSRFEAKDFGSDLPWKLFIG
LSSLFFSIVSTLISFCAGHYFLIDHRLRSVALLLYTLTFLPVAFIFLISKLPLYIDVVRALVATVPSRSADVVPPSTGNLLNLSKKENLRFLRSGLRPTPTKK