; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy02g018660 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy02g018660
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionChlorophyllase
Genome locationChr02:45827024..45839274
RNA-Seq ExpressionLcy02g018660
SyntenyLcy02g018660
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606412.1 NF-kappa-B inhibitor alpha, partial [Cucurbita argyrosperma subsp. sororia]6.2e-19853.53Show/hide
Query:  DKNNADNQLREFLEMKTRRRKWNAVVEKYEEYPESAQKL--KLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSA
        DK     ++ E  + K   +K+    E +E+Y E  +K   KL R+GDT LHLAVIDNQE  VE+LV+ +       ++LE TNER NNPLHLAA  GS 
Subjt:  DKNNADNQLREFLEMKTRRRKWNAVVEKYEEYPESAQKL--KLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSA

Query:  RMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLH
        RMC  IAS +  LVD RN DDETPLFLAAAYG+ DAF+CLY FC    +RI+ NCR   +GDTVLH+A+ NE+FDLAFQLI++  ++  W+N+  LTP+H
Subjt:  RMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLH

Query:  ILASKPASFKSGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEW-------KKLSRSERNTD-TFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVS
        +LASKP SFKSGSHIKGW+ ++Y C  VK L+P+ I+ L +  +        K L +  R+ D   FP NY+TCI F   ++D  +      C      +
Subjt:  ILASKPASFKSGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEW-------KKLSRSERNTD-TFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVS

Query:  TLVNKKKSSDEANKDK---DLEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDY
           + KK+  + + DK   DLEE   L +++      T+FP NY+TC+DFF I  SAI+I+L  G GS  I+KI ++K+KHTWSVQVME+LL       Y
Subjt:  TLVNKKKSSDEANKDK---DLEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDY

Query:  EDSKMGGSEMEDASHTFEEDQIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYR
         +   G S M+   H  E   + +              +   K+ E  +DV   ES MLLAARNGVIE+VK  +ER P+AI +TRK DKKNVV LAAE+R
Subjt:  EDSKMGGSEMEDASHTFEEDQIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYR

Query:  QSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLY
        Q +VY+FL +K   + +LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WY  VK SVPLHFFP +N QGKTAS IF +TH +L  KG  WLY
Subjt:  QSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLY

Query:  STSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAML
        STS+SCS+VATLIATVAFATAATIPGGN + G A L + + F IF  SSL+ALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ L  LYFSIIAML
Subjt:  STSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAML

Query:  VSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD
        VSFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L K +P  + EV LCD
Subjt:  VSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]3.3e-19954.17Show/hide
Query:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR
        ++ E+  + ++   V+EKY+E  E     KL R+GDT LHLAVIDNQE IVE+LVKI+       ++LE TNE  NN LHLAA  GS RMC  IAS +  
Subjt:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR

Query:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG
        LVD RN DD+TPLFLAAAYG+ DAF+CLY FC +D++RI+ NCR   +GDTVLH A+ +E+FDLAFQLI++  ++  W+N+  LTP+H+LA KP SFKSG
Subjt:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG

Query:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D
        SHIKGW+ ++Y C  VK L+P+  E L +  E       E    + FP NY+TCI+FF  ++D  +      C      +   + KK+  + + DK   D
Subjt:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D

Query:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED
        LEE   L +++      T+FP NY+TC+DFF I  SAI+I+L  G GS  I+KI ++K+KHTWSVQVM +LL       Y +   G S M+      EE+
Subjt:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED

Query:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR
         + +                  K+ E  +DV   ES MLLAARNGVIE+VK   ER P+AI +TRK DKKNVV LAAE+RQ +VY+ L +K   + +LFR
Subjt:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR

Query:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT
        AVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK VK SVPLHFFP +N QGKTAS IF +TH +L  KG  WLYSTS+SCS+VATLIATVAFAT
Subjt:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT

Query:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL
        AATIPGGN + G A L + + F IF  SSL+ALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+
Subjt:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL

Query:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD
        +LYTLTF PVA +FGIVQLPLYFDLL+ L K +P  + EV LCD
Subjt:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]3.7e-19854.17Show/hide
Query:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR
        ++ E+  + ++   V+EKY+E  E     KL R+GDT LHLAVIDNQE IVE+LVKI+       ++LE TNE  NN LHLAA  GS RMC  IAS +  
Subjt:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR

Query:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG
        LVD RN DD+TPLFLAAAYG+ DAF+CLY FC +D++RI+ NCR   +GDTVLH A+ +E+FDLAFQLI++  ++  W+N+  LTP+H+LA KP SFKSG
Subjt:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG

Query:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D
        SHIKGW+ ++Y C  VK L+P+  E L +  E       E    + FP NY+TCI+FF  ++D  +      C      +   + KK+  + + DK   D
Subjt:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D

Query:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED
        LEE   L ++     + T+FP NY+TC+DFF I  SAI+I+L  G GS  I+KI ++K+KHTWSVQVM +LL       Y +   G S M+      EE+
Subjt:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED

Query:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR
         + +                  K+ E  +DV   ES MLLAARNGVIE+VK   ER P+AI +TRK DKKNVV LAAE+RQ +VY+ L +K   + +LFR
Subjt:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR

Query:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT
        AVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK VK SVPLHFFP +N QGKTAS IF +TH +L  KG  WLYSTS+SCS+VATLIATVAFAT
Subjt:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT

Query:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL
        AATIPGGN + G A L + + F IF  SSL+ALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+
Subjt:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL

Query:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD
        +LYTLTF PVA +FGIVQLPLYFDLL+ L K +P  + EV LCD
Subjt:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD

XP_022995621.1 uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima]6.0e-19350.79Show/hide
Query:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN
        LR+FL   T+R KW  V++KYEEYPE AQ+LKL R GDTALHLAV+DN+EE+V+KLV  I      ++LL+TTN+R   PLHLAA  GSA MC+ IASA+
Subjt:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN

Query:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FC N+++RI  NCR  +NGDTVLH A+ N++FDLAFQ++HL  ++  W+ +  +TPLH+LASKP +FK
Subjt:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK

Query:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK
        SGS I+GWR + Y C  V  LKPQPI++L R  +W     +   +   FP NY TCI FFT + D           GFLK S L  +     +DE+ KD 
Subjt:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK

Query:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL
         +               E H  L +E          ++ITN P+NY+TC+ FFQI+ SAI+I L  GLGS   +KI  +K+KHTWSVQVME+LL      
Subjt:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL

Query:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEY
        +YE +  GG+ M+  S T ++ ++ +  S        +++I    K++E  K+V+E+ MLLAA+NGVIE+VK ++ R P++I + RK DKKNVV LAAEY
Subjt:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEY

Query:  RQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWL
         Q +VY+FL       ++LFRAVD NGNSALHLAAA S   +W+ITGAALQ+Q E++WYK V+ESVPL+FF  YNK+GK A+ IF +TH+DL +K  +WL
Subjt:  RQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWL

Query:  YSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIA
          TS+SCSVV TLI TVAF + A+IPGG   H G   L++ K F  F + SL+ALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SII+
Subjt:  YSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIA

Query:  MLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP
        MLVSFC+GHYFL+ + + + A +LYT+  +PVA +F I +LPLY D+++ +FKI+P
Subjt:  MLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP

XP_023534309.1 uncharacterized protein LOC111795905 [Cucurbita pepo subsp. pepo]4.6e-19352.58Show/hide
Query:  DDKNNADNQLREFLEMKTRRRKWNAVVEKYEEYPE------------------SAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGN-------
        +D+N A +  RE L    +R KW  V+EKY+E  +                   A+  KL R+GDT LHLAVIDNQE  VE+LV+ I   P N       
Subjt:  DDKNNADNQLREFLEMKTRRRKWNAVVEKYEEYPE------------------SAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGN-------

Query:  -------------EQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDT
                     + +LETTNER NNPLHLAA  GS RMC  I S +  LVDMRN DDETPLFLAAAYG+ DAF+CLY FC +D++RI+ NCR   +GDT
Subjt:  -------------EQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDT

Query:  VLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNY
        VLH A+ NE+ DLAFQLI++  ++  W+N+  LTP+HILASKP SFKSGSHIKGW+ ++Y C  VK LK    + LH      K +R E ++ + FP NY
Subjt:  VLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNY

Query:  ETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---DLEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTI
         TCI F+  +RD  +  +               KK++  + + DK   DLEE   L ++     + TNFP NY+TC+DFF I  SAI+I+L  G GS  I
Subjt:  ETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---DLEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTI

Query:  RKIYERKQKHTWSVQVMEELLN----GGYSLDYE---DSKMGGSEMEDAS--HTFEEDQIKISTSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEM
        +KI ++K+KHTWSVQVME+LL       Y+ D     DSK    E    +  + F +++++ S      +   K+ E  +DV   ES MLLAA+NGVIE+
Subjt:  RKIYERKQKHTWSVQVMEELLN----GGYSLDYE---DSKMGGSEMEDAS--HTFEEDQIKISTSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEM

Query:  VKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHF
        VK   ER P AI +T K DKKNVV LAAE+RQ +VY+F  ++   +D LFRAVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WY  VK SVPLHF
Subjt:  VKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHF

Query:  FPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTA
        FP +N QGKTAS IF +TH +L  KG  WLYSTS+SCS+VATLIATVAFATAATIPGGN + G A L + + F IF  SSL+ALCLS+TSVIMFL+I+T+
Subjt:  FPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTA

Query:  RFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD
        RF+I++ F   LPWKLL+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L K +P  + EV LCD
Subjt:  RFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD

TrEMBL top hitse value%identityAlignment
A0A6J1ET50 uncharacterized protein LOC111437341 isoform X21.8e-19854.17Show/hide
Query:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR
        ++ E+  + ++   V+EKY+E  E     KL R+GDT LHLAVIDNQE IVE+LVKI+       ++LE TNE  NN LHLAA  GS RMC  IAS +  
Subjt:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR

Query:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG
        LVD RN DD+TPLFLAAAYG+ DAF+CLY FC +D++RI+ NCR   +GDTVLH A+ +E+FDLAFQLI++  ++  W+N+  LTP+H+LA KP SFKSG
Subjt:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG

Query:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D
        SHIKGW+ ++Y C  VK L+P+  E L +  E       E    + FP NY+TCI+FF  ++D  +      C      +   + KK+  + + DK   D
Subjt:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D

Query:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED
        LEE   L ++     + T+FP NY+TC+DFF I  SAI+I+L  G GS  I+KI ++K+KHTWSVQVM +LL       Y +   G S M+      EE+
Subjt:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED

Query:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR
         + +                  K+ E  +DV   ES MLLAARNGVIE+VK   ER P+AI +TRK DKKNVV LAAE+RQ +VY+ L +K   + +LFR
Subjt:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR

Query:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT
        AVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK VK SVPLHFFP +N QGKTAS IF +TH +L  KG  WLYSTS+SCS+VATLIATVAFAT
Subjt:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT

Query:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL
        AATIPGGN + G A L + + F IF  SSL+ALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+
Subjt:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL

Query:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD
        +LYTLTF PVA +FGIVQLPLYFDLL+ L K +P  + EV LCD
Subjt:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X11.6e-19954.17Show/hide
Query:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR
        ++ E+  + ++   V+EKY+E  E     KL R+GDT LHLAVIDNQE IVE+LVKI+       ++LE TNE  NN LHLAA  GS RMC  IAS +  
Subjt:  EFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGR

Query:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG
        LVD RN DD+TPLFLAAAYG+ DAF+CLY FC +D++RI+ NCR   +GDTVLH A+ +E+FDLAFQLI++  ++  W+N+  LTP+H+LA KP SFKSG
Subjt:  LVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSG

Query:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D
        SHIKGW+ ++Y C  VK L+P+  E L +  E       E    + FP NY+TCI+FF  ++D  +      C      +   + KK+  + + DK   D
Subjt:  SHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDK---D

Query:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED
        LEE   L +++      T+FP NY+TC+DFF I  SAI+I+L  G GS  I+KI ++K+KHTWSVQVM +LL       Y +   G S M+      EE+
Subjt:  LEEHVNLGSENFDPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEED

Query:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR
         + +                  K+ E  +DV   ES MLLAARNGVIE+VK   ER P+AI +TRK DKKNVV LAAE+RQ +VY+ L +K   + +LFR
Subjt:  QIKI-------STSTHTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFR

Query:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT
        AVD  GNSALHLAA    PKLW+ITGAALQ+Q EV+WYK VK SVPLHFFP +N QGKTAS IF +TH +L  KG  WLYSTS+SCS+VATLIATVAFAT
Subjt:  AVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFAT

Query:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL
        AATIPGGN + G A L + + F IF  SSL+ALCLS+TSVIMFL+I+T+RF+I++ F   LPWKLL+ L  LYFSIIAMLVSFCSGHYFL+ +RL N+A+
Subjt:  AATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQAL

Query:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD
        +LYTLTF PVA +FGIVQLPLYFDLL+ L K +P  + EV LCD
Subjt:  VLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP-TTVEVDLCD

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X39.4e-19250.66Show/hide
Query:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN
        LR+FL   T+R KW  V++KYEEYPE AQ+LKL R GDTALHLAV+DN+EE+V+KLV  I      ++LL+TTN+R   PLHLAA  GSA MC+ IASA+
Subjt:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN

Query:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FC N+++RI  NCR  +NGDTVLH A+ N++FDLAFQ++HL  ++  W+ +  +TPLH+LASKP +FK
Subjt:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK

Query:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK
        SGS I+GWR + Y C  V  LKPQPI++L R  +W     +   +   FP NY TCI FFT + D           GFLK S L  +     +DE+ KD 
Subjt:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK

Query:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL
         +               E H  L +E          ++ITN P+NY+TC+ FFQI+ SAI+I L  GLGS   +KI  +K+KHTWSVQVME+LL      
Subjt:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL

Query:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAA
        +YE +  GG+ M+  S T ++ ++ +  S        +++I    K++E  K+V+  E+ MLLAA+NGVIE+VK ++ R P++I + RK DKKNVV LAA
Subjt:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAA

Query:  EYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKD
        EY Q +VY+FL       ++LFRAVD NGNSALHLAAA S   +W+ITGAALQ+Q E++WYK V+ESVPL+FF  YNK+GK A+ IF +TH+DL +K  +
Subjt:  EYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKD

Query:  WLYSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSI
        WL  TS+SCSVV TLI TVAF + A+IPGG   H G   L++ K F  F + SL+ALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SI
Subjt:  WLYSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSI

Query:  IAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP
        I+MLVSFC+GHYFL+ + + + A +LYT+  +PVA +F I +LPLY D+++ +FKI+P
Subjt:  IAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X22.9e-19350.79Show/hide
Query:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN
        LR+FL   T+R KW  V++KYEEYPE AQ+LKL R GDTALHLAV+DN+EE+V+KLV  I      ++LL+TTN+R   PLHLAA  GSA MC+ IASA+
Subjt:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN

Query:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FC N+++RI  NCR  +NGDTVLH A+ N++FDLAFQ++HL  ++  W+ +  +TPLH+LASKP +FK
Subjt:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK

Query:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK
        SGS I+GWR + Y C  V  LKPQPI++L R  +W     +   +   FP NY TCI FFT + D           GFLK S L  +     +DE+ KD 
Subjt:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK

Query:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL
         +               E H  L +E          ++ITN P+NY+TC+ FFQI+ SAI+I L  GLGS   +KI  +K+KHTWSVQVME+LL      
Subjt:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL

Query:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEY
        +YE +  GG+ M+  S T ++ ++ +  S        +++I    K++E  K+V+E+ MLLAA+NGVIE+VK ++ R P++I + RK DKKNVV LAAEY
Subjt:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEY

Query:  RQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWL
         Q +VY+FL       ++LFRAVD NGNSALHLAAA S   +W+ITGAALQ+Q E++WYK V+ESVPL+FF  YNK+GK A+ IF +TH+DL +K  +WL
Subjt:  RQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWL

Query:  YSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIA
          TS+SCSVV TLI TVAF + A+IPGG   H G   L++ K F  F + SL+ALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SII+
Subjt:  YSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIA

Query:  MLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP
        MLVSFC+GHYFL+ + + + A +LYT+  +PVA +F I +LPLY D+++ +FKI+P
Subjt:  MLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X19.4e-19250.66Show/hide
Query:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN
        LR+FL   T+R KW  V++KYEEYPE AQ+LKL R GDTALHLAV+DN+EE+V+KLV  I      ++LL+TTN+R   PLHLAA  GSA MC+ IASA+
Subjt:  LREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASAN

Query:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK
          LVD+RN  DETPL+LAAA G+ DAF+CLY+FC N+++RI  NCR  +NGDTVLH A+ N++FDLAFQ++HL  ++  W+ +  +TPLH+LASKP +FK
Subjt:  GRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFK

Query:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK
        SGS I+GWR + Y C  V  LKPQPI++L R  +W     +   +   FP NY TCI FFT + D           GFLK S L  +     +DE+ KD 
Subjt:  SGSHIKGWRKLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTL--VNKKKSSDEANKDK

Query:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL
         +               E H  L +E          ++ITN P+NY+TC+ FFQI+ SAI+I L  GLGS   +KI  +K+KHTWSVQVME+LL      
Subjt:  DL---------------EEHVNLGSENF------DPSTITNFPKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSL

Query:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAA
        +YE +  GG+ M+  S T ++ ++ +  S        +++I    K++E  K+V+  E+ MLLAA+NGVIE+VK ++ R P++I + RK DKKNVV LAA
Subjt:  DYEDSKMGGSEMEDASHTFEEDQIKISTS--------THTIVVHKKDSEDFKDVR--ESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAA

Query:  EYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKD
        EY Q +VY+FL       ++LFRAVD NGNSALHLAAA S   +W+ITGAALQ+Q E++WYK V+ESVPL+FF  YNK+GK A+ IF +TH+DL +K  +
Subjt:  EYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKD

Query:  WLYSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSI
        WL  TS+SCSVV TLI TVAF + A+IPGG   H G   L++ K F  F + SL+ALCLS+TSV MFL+ILT RF+  ++ T+ LPWKL +  SSL+ SI
Subjt:  WLYSTSQSCSVVATLIATVAFATAATIPGGNGDH-GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSI

Query:  IAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP
        I+MLVSFC+GHYFL+ + + + A +LYT+  +PVA +F I +LPLY D+++ +FKI+P
Subjt:  IAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMP

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR41.6e-1025.35Show/hide
Query:  IKISTSTHTIVVH-KKDSEDF------KDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKK--NVVHLAAEYRQSNVYKFLQRKDYSLDSLFRA
        I  +T  HT VV    + +DF      KD  ++ +  AAR G +E+VK + E+ P      R+ DKK    +H+A +    +V + L   D ++  L   
Subjt:  IKISTSTHTIVVH-KKDSEDF------KDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKK--NVVHLAAEYRQSNVYKFLQRKDYSLDSLFRA

Query:  VDLNGNSALHLAAAGSAPKLWQITGAALQ-----IQLEVEWYKL---VKESVPL-----HFFPLYNKQG---------------KTASRIFEDTHVDLEQ
         D NGN+ALH+A   +  K  +I    L+     +      +K    + E++PL         + ++ G               KT + I +D H  LEQ
Subjt:  VDLNGNSALHLAAAGSAPKLWQITGAALQ-----IQLEVEWYKL---VKESVPL-----HFFPLYNKQG---------------KTASRIFEDTHVDLEQ

Query:  KGK-----------------DWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREY
          K                 + + + + S +VVA L ATVAFA   T+PGGN ++GVA + +   F IF I + +AL  S   V++ ++++    +    
Subjt:  KGK-----------------DWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREY

Query:  FTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFG
           ++          ++ + +   +SF +  Y +L R  Q  AL++  +  + +A + G
Subjt:  FTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFG

O15084 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit A1.8e-0627.39Show/hide
Query:  GDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGN
        G+T LH+A  + Q+ +V +L+           ++   NE+   PLH AAA     +C  +   NG  V+M++ D +TPL + A +G       +      
Subjt:  GDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGN

Query:  DSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILA
          +    +C E  NG+T LH A    +  L   LI    D+ +      + PLH+ A
Subjt:  DSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILA

O44997 Death-associated protein kinase dapk-13.1e-0632.1Show/hide
Query:  GDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGN
        GDT LHLA       IV  LV   C  P     L   N R   PLH+AAA+G       +  AN   +D    D +T L +A   G++D           
Subjt:  GDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGN

Query:  DSTRIATNC---REGNNGDTVLHYAIGNEYFDLAFQLIH--LTVDSRRWMNKKHLTPLHILA
         S  I   C      ++GDT LH A  +        L H  +TVDS   +N    T LH+ A
Subjt:  DSTRIATNC---REGNNGDTVLHYAIGNEYFDLAFQLIH--LTVDSRRWMNKKHLTPLHILA

Q08353 NF-kappa-B inhibitor alpha1.6e-0730.3Show/hide
Query:  TALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANG-----RLVDMRNVDDETPLFLAAAYGHMDAFYCLYYF
        T LHLAVI NQ EI E L++  C +P      E  + R N PLHLA  QG      V+    G      ++   N +  T L LA+ +G++     L   
Subjt:  TALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANG-----RLVDMRNVDDETPLFLAAAYGHMDAFYCLYYF

Query:  CGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPAS
            S     N +E  NG T LH A+  +  DL   L+    D  R +  +  +P  +   +P++
Subjt:  CGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPAS

Q505D1 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit A1.4e-0628.66Show/hide
Query:  GDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGN
        G+T LH+A  + Q+ +V +L+   C    N++     NE+   PLH AAA     +C  +   NG  V+M++ D +TPL + A +G       +      
Subjt:  GDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASANGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGN

Query:  DSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILA
          +    +C E  NG+T LH A    +  L   LI    D+ +      + PLH+ A
Subjt:  DSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILA

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein1.4e-3031.85Show/hide
Query:  AARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAA-AGSAPKLWQITGAALQIQLEVEWYK
        A  NG++E ++++    P  ++ ++     N+   A   RQ  ++  +       + L    D+  N+ LH AA    A +L  I GAALQ+Q E++W+K
Subjt:  AARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAA-AGSAPKLWQITGAALQIQLEVEWYK

Query:  LVKESV-PLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTT
         V++ V P H   +  KQ KT   +F D H DL ++G+ W+  T+ SC+VVA LI T+ F++A T+PGG    G+        F IF+IS  ++L  S  
Subjt:  LVKESV-PLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTT

Query:  SVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLF
        S++MFL IL +R+   E F   LP KL++ L +L+ S+  M+V+F      L+  ++   +     L  +P+  MF ++Q P+  ++ R  +
Subjt:  SVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLF

AT3G54070.1 Ankyrin repeat family protein4.5e-2932.92Show/hide
Query:  VVHKKDSEDFKDVRESPMLL-AARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNG-NSALHLAAAGSA
        + HK     F+ V  S +L  AA  G +E++  I  R  + +      + + + H+AA YR  N++  +       D +    +    ++ LHL A    
Subjt:  VVHKKDSEDFKDVRESPMLL-AARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNG-NSALHLAAAGSA

Query:  PKLWQI-TGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAA--
            Q+ +GAAL +Q E+ W+K VKE VP  +    N +G+ A  IF + H +L ++G+ W+  T+ +C + ATLIATV FA A TIPGGN D G  A  
Subjt:  PKLWQI-TGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAA--

Query:  -----LQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLP-
              ++   F IF +S  VAL  S  S+++FLSI T+R+   E F   LP KL+  LS+L+ SII+M+++F    + ++  R++  +L L  ++ L  
Subjt:  -----LQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLP-

Query:  -VAFMFGIVQLPLYFDLLRVLF
          A  F  +   L+F+ LR ++
Subjt:  -VAFMFGIVQLPLYFDLLRVLF

AT5G04700.1 Ankyrin repeat family protein2.3e-3332.92Show/hide
Query:  QIKISTSTHTIVVHKKDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSA
        ++ +  S  T+ +  K+  +  D     +L A R G ++ + ++       ++ TR      +  LA E+RQ  V+  L   D     L    D +GN  
Subjt:  QIKISTSTHTIVVHKKDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSA

Query:  LHLAAAGSAP-KLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGN
        LHLA   S P KL  + GA LQ+Q E++W+K V+   P       N + +T   IF   H  L Q+ + W+  T+ SCS+VA LI TV FA   T+PGG 
Subjt:  LHLAAAGSAP-KLWQITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGN

Query:  GDH--GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLT
         D+  G       + F IF++S L++   S TSV++FL ILTAR+   + F   LP K++  LS L+ SI AML++F S  + ++ +  + + +V  T+ 
Subjt:  GDH--GVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLT

Query:  F--LPVAFMFGIVQLPLYFDLL
        F  LP A +F ++Q PL  +++
Subjt:  F--LPVAFMFGIVQLPLYFDLL

AT5G04730.1 Ankyrin-repeat containing protein3.1e-3031.48Show/hide
Query:  KDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAP-KLWQ
        K+   +K+     +L AA++G  +   +I +     ++       +N+  LA E+++  ++  +   D    +L R+ D   N+ LH+A   S P +L +
Subjt:  KDSEDFKDVRESPMLLAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAP-KLWQ

Query:  ITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGG-NGDHGVAALQEHKPF
        I+GAAL++Q E +W+K V+  V        NK  KT  +IFE  H  L ++G++W+  T+ +CS VA LIATV F    T+PGG +G  G   +     F
Subjt:  ITGAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGG-NGDHGVAALQEHKPF

Query:  SIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLY
          F+ +  +A   S  SV++FLSILT+R+   ++  S LP K++L  S L+ SI +MLV+F +     L+  ++++  ++Y L   P+A    ++ L L 
Subjt:  SIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLY

Query:  FDLLR
        + LL+
Subjt:  FDLLR

AT5G35810.1 Ankyrin repeat family protein5.2e-3336.36Show/hide
Query:  VRESPMLL--AARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQ----SNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAP-KLWQIT
        V  SPMLL  AA++G +E++  +    P  I+ T     +++ H+AA  R     + +Y+    KD  L ++++  + N N  LHL A    P +L  ++
Subjt:  VRESPMLL--AARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQ----SNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAP-KLWQIT

Query:  GAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGN---GD---HGVAALQEH
        GAALQ+Q E+ WYK VKE VP  +    NK+ + A  +F   H +L ++G+ W+  T+ +C +V+TLIATV FA A T+PGGN   GD    G    ++ 
Subjt:  GAALQIQLEVEWYKLVKESVPLHFFPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGN---GD---HGVAALQEH

Query:  KPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQL
          F +F+IS  VAL  S TS+++FLSILT+R+     F + LP KL+L L +L+ SII+M+++F +    L+  +    +L+L        A  F ++  
Subjt:  KPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTSMLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQL

Query:  PLYFDLLR
         L+FD LR
Subjt:  PLYFDLLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGATGATAAGAATAATGCAGATAATCAATTACGAGAGTTTCTTGAGATGAAAACGAGGAGAAGGAAATGGAATGCAGTGGTCGAGAAGTACGAGGAATACCCAGA
ATCGGCTCAGAAGCTGAAGCTGCGCCGACAGGGCGACACGGCTCTGCATTTGGCTGTCATCGACAACCAAGAAGAAATAGTTGAAAAGCTTGTGAAGATCATTTGCAGAA
ACCCAGGAAACGAGCAACTTCTCGAGACTACAAATGAGAGACACAATAACCCTCTCCATCTCGCCGCCGCGCAGGGGAGCGCCAGAATGTGCCACGTCATTGCTTCAGCC
AATGGGCGCTTAGTGGACATGAGAAACGTAGACGACGAAACGCCGCTCTTCCTTGCGGCGGCGTATGGCCACATGGATGCCTTTTATTGCCTTTACTACTTTTGCGGAAA
CGACTCGACTCGAATCGCCACCAACTGCAGAGAGGGGAACAATGGAGACACCGTGCTACATTATGCCATCGGAAACGAGTACTTTGATTTGGCCTTTCAATTAATTCACT
TGACCGTTGACAGTAGGAGATGGATGAATAAGAAACACTTAACCCCTCTTCATATTCTAGCTAGCAAGCCAGCTTCCTTCAAAAGTGGATCCCACATCAAGGGATGGCGG
AAGCTCATCTATCAATGCATACTTGTGAAGAATCTGAAGCCTCAACCCATTGAAACCCTTCACCGTCACGAAGAGTGGAAGAAATTAAGTCGGTCCGAAAGAAACACAGA
TACATTTTTTCCACCCAACTACGAGACATGCATTTACTTCTTTACGAGGTTGCGGGATGCATGCATTTACTTCTTTACGAGTTTGTGTGGTGGATTTTTAAAAGTCAGCA
CTTTGGTGAACAAGAAGAAAAGTAGTGATGAAGCCAACAAGGATAAAGATTTGGAGGAACATGTTAACTTGGGAAGTGAAAATTTTGATCCATCGACGATTACAAATTTC
CCGAAAAACTATTCTACCTGCGTCGACTTTTTCCAAATAATTTTATCGGCCATCATAATCCTTCTTGCGTACGGTTTAGGATCTAGGACGATCAGAAAGATTTATGAGAG
GAAACAAAAACACACTTGGTCAGTTCAAGTGATGGAGGAACTTCTTAATGGCGGCTATTCGCTCGATTATGAAGATTCTAAGATGGGAGGTTCTGAGATGGAAGATGCTT
CACACACCTTTGAAGAAGACCAAATCAAGATCAGTACTAGTACTCATACTATTGTGGTACACAAGAAAGACTCAGAGGATTTCAAGGATGTCCGAGAGTCGCCGATGTTA
TTAGCAGCGAGGAACGGCGTGATTGAGATGGTGAAGAAAATTTACGAACGTTTGCCAATGGCAATATACGAGACTAGGAAACCAGATAAGAAGAATGTAGTGCATTTGGC
TGCAGAGTACAGGCAGTCGAACGTGTACAAGTTTTTACAAAGGAAAGATTATTCCTTAGACAGCCTGTTTCGAGCCGTGGATCTCAACGGCAACAGCGCCTTACATCTGG
CGGCTGCCGGATCGGCTCCTAAGCTTTGGCAAATCACTGGAGCTGCATTGCAGATCCAATTGGAAGTTGAGTGGTATAAGTTGGTGAAGGAGTCTGTGCCACTCCATTTT
TTTCCCCTCTATAATAAGCAAGGAAAAACTGCAAGTAGAATCTTCGAAGATACCCACGTGGATCTGGAGCAAAAAGGCAAGGATTGGCTTTACAGCACCTCGCAGTCATG
CTCTGTGGTGGCCACTCTGATTGCAACCGTGGCTTTTGCAACTGCAGCCACCATCCCAGGCGGCAACGGCGACCACGGTGTCGCAGCACTTCAGGAACACAAACCCTTTT
CTATCTTCGTCATCTCTTCCCTCGTTGCTCTATGCCTCTCCACAACCTCTGTCATCATGTTTCTTTCCATCTTGACCGCAAGGTTCGAAATCAGAGAGTACTTCACATCA
ATGTTGCCCTGGAAACTGCTCCTCGCCTTATCCTCTCTTTACTTTTCCATCATCGCCATGTTGGTTTCCTTTTGCAGCGGCCACTACTTTCTCCTCACTCGCCGCCTCCA
AAATCAAGCCCTTGTGCTCTACACACTTACTTTTCTCCCTGTCGCATTCATGTTCGGTATAGTGCAGCTTCCTCTCTACTTCGATCTCCTAAGGGTTCTTTTCAAAATCA
TGCCTACCACGGTCGAAGTCGATCTATGCGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGATGATAAGAATAATGCAGATAATCAATTACGAGAGTTTCTTGAGATGAAAACGAGGAGAAGGAAATGGAATGCAGTGGTCGAGAAGTACGAGGAATACCCAGA
ATCGGCTCAGAAGCTGAAGCTGCGCCGACAGGGCGACACGGCTCTGCATTTGGCTGTCATCGACAACCAAGAAGAAATAGTTGAAAAGCTTGTGAAGATCATTTGCAGAA
ACCCAGGAAACGAGCAACTTCTCGAGACTACAAATGAGAGACACAATAACCCTCTCCATCTCGCCGCCGCGCAGGGGAGCGCCAGAATGTGCCACGTCATTGCTTCAGCC
AATGGGCGCTTAGTGGACATGAGAAACGTAGACGACGAAACGCCGCTCTTCCTTGCGGCGGCGTATGGCCACATGGATGCCTTTTATTGCCTTTACTACTTTTGCGGAAA
CGACTCGACTCGAATCGCCACCAACTGCAGAGAGGGGAACAATGGAGACACCGTGCTACATTATGCCATCGGAAACGAGTACTTTGATTTGGCCTTTCAATTAATTCACT
TGACCGTTGACAGTAGGAGATGGATGAATAAGAAACACTTAACCCCTCTTCATATTCTAGCTAGCAAGCCAGCTTCCTTCAAAAGTGGATCCCACATCAAGGGATGGCGG
AAGCTCATCTATCAATGCATACTTGTGAAGAATCTGAAGCCTCAACCCATTGAAACCCTTCACCGTCACGAAGAGTGGAAGAAATTAAGTCGGTCCGAAAGAAACACAGA
TACATTTTTTCCACCCAACTACGAGACATGCATTTACTTCTTTACGAGGTTGCGGGATGCATGCATTTACTTCTTTACGAGTTTGTGTGGTGGATTTTTAAAAGTCAGCA
CTTTGGTGAACAAGAAGAAAAGTAGTGATGAAGCCAACAAGGATAAAGATTTGGAGGAACATGTTAACTTGGGAAGTGAAAATTTTGATCCATCGACGATTACAAATTTC
CCGAAAAACTATTCTACCTGCGTCGACTTTTTCCAAATAATTTTATCGGCCATCATAATCCTTCTTGCGTACGGTTTAGGATCTAGGACGATCAGAAAGATTTATGAGAG
GAAACAAAAACACACTTGGTCAGTTCAAGTGATGGAGGAACTTCTTAATGGCGGCTATTCGCTCGATTATGAAGATTCTAAGATGGGAGGTTCTGAGATGGAAGATGCTT
CACACACCTTTGAAGAAGACCAAATCAAGATCAGTACTAGTACTCATACTATTGTGGTACACAAGAAAGACTCAGAGGATTTCAAGGATGTCCGAGAGTCGCCGATGTTA
TTAGCAGCGAGGAACGGCGTGATTGAGATGGTGAAGAAAATTTACGAACGTTTGCCAATGGCAATATACGAGACTAGGAAACCAGATAAGAAGAATGTAGTGCATTTGGC
TGCAGAGTACAGGCAGTCGAACGTGTACAAGTTTTTACAAAGGAAAGATTATTCCTTAGACAGCCTGTTTCGAGCCGTGGATCTCAACGGCAACAGCGCCTTACATCTGG
CGGCTGCCGGATCGGCTCCTAAGCTTTGGCAAATCACTGGAGCTGCATTGCAGATCCAATTGGAAGTTGAGTGGTATAAGTTGGTGAAGGAGTCTGTGCCACTCCATTTT
TTTCCCCTCTATAATAAGCAAGGAAAAACTGCAAGTAGAATCTTCGAAGATACCCACGTGGATCTGGAGCAAAAAGGCAAGGATTGGCTTTACAGCACCTCGCAGTCATG
CTCTGTGGTGGCCACTCTGATTGCAACCGTGGCTTTTGCAACTGCAGCCACCATCCCAGGCGGCAACGGCGACCACGGTGTCGCAGCACTTCAGGAACACAAACCCTTTT
CTATCTTCGTCATCTCTTCCCTCGTTGCTCTATGCCTCTCCACAACCTCTGTCATCATGTTTCTTTCCATCTTGACCGCAAGGTTCGAAATCAGAGAGTACTTCACATCA
ATGTTGCCCTGGAAACTGCTCCTCGCCTTATCCTCTCTTTACTTTTCCATCATCGCCATGTTGGTTTCCTTTTGCAGCGGCCACTACTTTCTCCTCACTCGCCGCCTCCA
AAATCAAGCCCTTGTGCTCTACACACTTACTTTTCTCCCTGTCGCATTCATGTTCGGTATAGTGCAGCTTCCTCTCTACTTCGATCTCCTAAGGGTTCTTTTCAAAATCA
TGCCTACCACGGTCGAAGTCGATCTATGCGATTAG
Protein sequenceShow/hide protein sequence
MSDDKNNADNQLREFLEMKTRRRKWNAVVEKYEEYPESAQKLKLRRQGDTALHLAVIDNQEEIVEKLVKIICRNPGNEQLLETTNERHNNPLHLAAAQGSARMCHVIASA
NGRLVDMRNVDDETPLFLAAAYGHMDAFYCLYYFCGNDSTRIATNCREGNNGDTVLHYAIGNEYFDLAFQLIHLTVDSRRWMNKKHLTPLHILASKPASFKSGSHIKGWR
KLIYQCILVKNLKPQPIETLHRHEEWKKLSRSERNTDTFFPPNYETCIYFFTRLRDACIYFFTSLCGGFLKVSTLVNKKKSSDEANKDKDLEEHVNLGSENFDPSTITNF
PKNYSTCVDFFQIILSAIIILLAYGLGSRTIRKIYERKQKHTWSVQVMEELLNGGYSLDYEDSKMGGSEMEDASHTFEEDQIKISTSTHTIVVHKKDSEDFKDVRESPML
LAARNGVIEMVKKIYERLPMAIYETRKPDKKNVVHLAAEYRQSNVYKFLQRKDYSLDSLFRAVDLNGNSALHLAAAGSAPKLWQITGAALQIQLEVEWYKLVKESVPLHF
FPLYNKQGKTASRIFEDTHVDLEQKGKDWLYSTSQSCSVVATLIATVAFATAATIPGGNGDHGVAALQEHKPFSIFVISSLVALCLSTTSVIMFLSILTARFEIREYFTS
MLPWKLLLALSSLYFSIIAMLVSFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLFKIMPTTVEVDLCD