; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033309 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033309
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionChlorophyllase
Genome locationchr11:42669116..42673185
RNA-Seq ExpressionLag0033309
SyntenyLag0033309
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0052689 - carboxylic ester hydrolase activity (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606412.1 NF-kappa-B inhibitor alpha, partial [Cucurbita argyrosperma subsp. sororia]8.9e-24558.82Show/hide
Query:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE  VE LV+ +      
Subjt:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNER++NPLHLAA MGSVRMC AIAS H GL+D+RN  DETPLFLAA YG +DAF+CLY FC   P RIS NCR+K +GDTV+H+ALRN+HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETL---SDEALRKSR-----SKAKTTD-AYFPKNYETCYD
        A+QLI+++ EA++WVNE G TP+HVLA+KPTSFKSGSHI+GW++++Y+C FV  L+P++I+ L   SD +++K+R      KA+  D   FP NY+TC D
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETL---SDEALRKSR-----SKAKTTD-AYFPKNYETCYD

Query:  FFTSLKVTFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKK
        F   +K   L V       NN  N+  +  K++ K+  ++              + T+   T+FP NY TC++FF   FSAIMI+LGFGS EI+KI +KK
Subjt:  FFTSLKVTFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKK

Query:  QKHTWSVQVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRF
        +KHTWSVQVM+KLLE A PDKY +DG +PM  K   DEA G+T PYN VD+EV+FS   +   +PK+ +  ++V+ASES MLLAARNGVIEIV+G ++RF
Subjt:  QKHTWSVQVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRF

Query:  PLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEK
        PLAI + RKDKKNVVLLAAE+RQPDVYRFLL K +++++LFRAVD  GNSALHLAA A  PKLW ITGAALQMQWE KWY YVK SVPLHFF  +N + K
Subjt:  PLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEK

Query:  TASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGS
        TAS IF+ETH  L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSL+ALCLSSTSVIMFLAI+TSR D KDFG 
Subjt:  TASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGS

Query:  ILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
         LPWKLLIGL  LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  ILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]3.2e-24258.84Show/hide
Query:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN  D+TPLFLAA YG +DAF+CLY+FC DD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++++Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV
          L         NN  N+  +  K++ K+  ++              + T+   T+FP NY TC++FF   FSAIMI+LGFGS EI+KI +KK+KHTWSV
Subjt:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV

Query:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA
        QVM KLLE A PDKY +DG +PM SK + DE  G+T PYN VD+EV+FS+ F    +PK+ +  ++V+A ES MLLAARNGVIEIV+G  +RFPLAI + 
Subjt:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA

Query:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ
        RKDKKNVVLLAAE+RQPDVYR LL K  ++++LFRAVD  GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+
Subjt:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ

Query:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL
        ETH  L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSL+ALCLSSTSVIMFLAI+TSR D KDFG  LPWKLL
Subjt:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL

Query:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        IGL  LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]2.1e-24159.35Show/hide
Query:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN  D+TPLFLAA YG +DAF+CLY+FC DD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++++Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV
          L         NN  N+  +  K++ K+ + +  N D   L+  +   T S  T+FP NY TC++FF   FSAIMI+LGFGS EI+KI +KK+KHTWSV
Subjt:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV

Query:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA
        QVM KLLE A PDKY +DG +PM SK + DE  G+T PYN VD+EV+FS+ F    +PK+ +  ++V+A ES MLLAARNGVIEIV+G  +RFPLAI + 
Subjt:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA

Query:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ
        RKDKKNVVLLAAE+RQPDVYR LL K  ++++LFRAVD  GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+
Subjt:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ

Query:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL
        ETH  L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSL+ALCLSSTSVIMFLAI+TSR D KDFG  LPWKLL
Subjt:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL

Query:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        IGL  LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_022995620.1 uncharacterized protein LOC111491104 isoform X1 [Cucurbita maxima]7.1e-24259.65Show/hide
Query:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG
        LR+FLY +TKR KW+ VI+KY ++PEAQ LKLTR G+TALHLAV+ N+EE+V+ LV  I     Y E L+TTN+R+  PLHLAA MGS  MC+AIASAH 
Subjt:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG

Query:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS
         L+D RN  DETPL+LAA  G RDAF+CLY FC ++  RI+ NCRL +NGDTV+H ALRN HF+LA+Q++H+N+EAM+WV E G TPLHVLA+KPT+FKS
Subjt:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS

Query:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD
        GS IRGWRN+ YYC  V++L+PQ I++L  + + +  S   T+   FP NY TC DFFT +   FLK SGL R+ ++ +ND++              G+ 
Subjt:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD

Query:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS
        LE D  E  +  D+  LKA   A+ ++ ITN PRNY TC++FFQ VFSAIMI LG GS E +KI  KK+KHTWSVQVM+KLLE+A PD+Y  +G TPM S
Subjt:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS

Query:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN
          Q  +   +T PY+LV  EV+ S++   + KPK+ +  KNVQA E+ MLLAA+NGVIEIV+G++ RFPL+I +ARKDKKNVVLLAAEY QPDVYRFLL+
Subjt:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN

Query:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA
         +   E+LFRAVDDNGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LVK +G WL  TS+SCSVV 
Subjt:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA

Query:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL
        TLI TVAF + A+IPGG++   G+  L+D   FF FA+ SL+ALCLSSTSV MFLAILT R D  DF + LPWKL IG SSL+ SII+ML+SFC+GHYFL
Subjt:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL

Query:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
          + + + A LLYT+  +PVALIF I +LPLY D++QAI KIVP RSA VV
Subjt:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_022995622.1 uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima]7.1e-24259.65Show/hide
Query:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG
        LR+FLY +TKR KW+ VI+KY ++PEAQ LKLTR G+TALHLAV+ N+EE+V+ LV  I     Y E L+TTN+R+  PLHLAA MGS  MC+AIASAH 
Subjt:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG

Query:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS
         L+D RN  DETPL+LAA  G RDAF+CLY FC ++  RI+ NCRL +NGDTV+H ALRN HF+LA+Q++H+N+EAM+WV E G TPLHVLA+KPT+FKS
Subjt:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS

Query:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD
        GS IRGWRN+ YYC  V++L+PQ I++L  + + +  S   T+   FP NY TC DFFT +   FLK SGL R+ ++ +ND++              G+ 
Subjt:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD

Query:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS
        LE D  E  +  D+  LKA   A+ ++ ITN PRNY TC++FFQ VFSAIMI LG GS E +KI  KK+KHTWSVQVM+KLLE+A PD+Y  +G TPM S
Subjt:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS

Query:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN
          Q  +   +T PY+LV  EV+ S++   + KPK+ +  KNVQA E+ MLLAA+NGVIEIV+G++ RFPL+I +ARKDKKNVVLLAAEY QPDVYRFLL+
Subjt:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN

Query:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA
         +   E+LFRAVDDNGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LVK +G WL  TS+SCSVV 
Subjt:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA

Query:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL
        TLI TVAF + A+IPGG++   G+  L+D   FF FA+ SL+ALCLSSTSV MFLAILT R D  DF + LPWKL IG SSL+ SII+ML+SFC+GHYFL
Subjt:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL

Query:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
          + + + A LLYT+  +PVALIF I +LPLY D++QAI KIVP RSA VV
Subjt:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

TrEMBL top hitse value%identityAlignment
A0A6J1ET50 uncharacterized protein LOC111437341 isoform X21.0e-24159.35Show/hide
Query:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN  D+TPLFLAA YG +DAF+CLY+FC DD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++++Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV
          L         NN  N+  +  K++ K+ + +  N D   L+  +   T S  T+FP NY TC++FF   FSAIMI+LGFGS EI+KI +KK+KHTWSV
Subjt:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV

Query:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA
        QVM KLLE A PDKY +DG +PM SK + DE  G+T PYN VD+EV+FS+ F    +PK+ +  ++V+A ES MLLAARNGVIEIV+G  +RFPLAI + 
Subjt:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA

Query:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ
        RKDKKNVVLLAAE+RQPDVYR LL K  ++++LFRAVD  GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+
Subjt:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ

Query:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL
        ETH  L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSL+ALCLSSTSVIMFLAI+TSR D KDFG  LPWKLL
Subjt:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL

Query:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        IGL  LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X11.5e-24258.84Show/hide
Query:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYAHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN  D+TPLFLAA YG +DAF+CLY+FC DD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++++Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV
          L         NN  N+  +  K++ K+  ++              + T+   T+FP NY TC++FF   FSAIMI+LGFGS EI+KI +KK+KHTWSV
Subjt:  TFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSV

Query:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA
        QVM KLLE A PDKY +DG +PM SK + DE  G+T PYN VD+EV+FS+ F    +PK+ +  ++V+A ES MLLAARNGVIEIV+G  +RFPLAI + 
Subjt:  QVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEA

Query:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ
        RKDKKNVVLLAAE+RQPDVYR LL K  ++++LFRAVD  GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+
Subjt:  RKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQ

Query:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL
        ETH  L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSL+ALCLSSTSVIMFLAI+TSR D KDFG  LPWKLL
Subjt:  ETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLL

Query:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        IGL  LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  IGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X33.4e-24259.65Show/hide
Query:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG
        LR+FLY +TKR KW+ VI+KY ++PEAQ LKLTR G+TALHLAV+ N+EE+V+ LV  I     Y E L+TTN+R+  PLHLAA MGS  MC+AIASAH 
Subjt:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG

Query:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS
         L+D RN  DETPL+LAA  G RDAF+CLY FC ++  RI+ NCRL +NGDTV+H ALRN HF+LA+Q++H+N+EAM+WV E G TPLHVLA+KPT+FKS
Subjt:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS

Query:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD
        GS IRGWRN+ YYC  V++L+PQ I++L  + + +  S   T+   FP NY TC DFFT +   FLK SGL R+ ++ +ND++              G+ 
Subjt:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD

Query:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS
        LE D  E  +  D+  LKA   A+ ++ ITN PRNY TC++FFQ VFSAIMI LG GS E +KI  KK+KHTWSVQVM+KLLE+A PD+Y  +G TPM S
Subjt:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS

Query:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN
          Q  +   +T PY+LV  EV+ S++   + KPK+ +  KNVQA E+ MLLAA+NGVIEIV+G++ RFPL+I +ARKDKKNVVLLAAEY QPDVYRFLL+
Subjt:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN

Query:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA
         +   E+LFRAVDDNGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LVK +G WL  TS+SCSVV 
Subjt:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA

Query:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL
        TLI TVAF + A+IPGG++   G+  L+D   FF FA+ SL+ALCLSSTSV MFLAILT R D  DF + LPWKL IG SSL+ SII+ML+SFC+GHYFL
Subjt:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL

Query:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
          + + + A LLYT+  +PVALIF I +LPLY D++QAI KIVP RSA VV
Subjt:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X22.5e-24059.52Show/hide
Query:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG
        LR+FLY +TKR KW+ VI+KY ++PEAQ LKLTR G+TALHLAV+ N+EE+V+ LV  I     Y E L+TTN+R+  PLHLAA MGS  MC+AIASAH 
Subjt:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG

Query:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS
         L+D RN  DETPL+LAA  G RDAF+CLY FC ++  RI+ NCRL +NGDTV+H ALRN HF+LA+Q++H+N+EAM+WV E G TPLHVLA+KPT+FKS
Subjt:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS

Query:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD
        GS IRGWRN+ YYC  V++L+PQ I++L  + + +  S   T+   FP NY TC DFFT +   FLK SGL R+ ++ +ND++              G+ 
Subjt:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD

Query:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS
        LE D  E  +  D+  LKA   A+ ++ ITN PRNY TC++FFQ VFSAIMI LG GS E +KI  KK+KHTWSVQVM+KLLE+A PD+Y  +G TPM S
Subjt:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS

Query:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN
          Q  +   +T PY+LV  EV+ S++   + KPK+ +  KNVQ  E+ MLLAA+NGVIEIV+G++ RFPL+I +ARKDKKNVVLLAAEY QPDVYRFLL+
Subjt:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN

Query:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA
         +   E+LFRAVDDNGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LVK +G WL  TS+SCSVV 
Subjt:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA

Query:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL
        TLI TVAF + A+IPGG++   G+  L+D   FF FA+ SL+ALCLSSTSV MFLAILT R D  DF + LPWKL IG SSL+ SII+ML+SFC+GHYFL
Subjt:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL

Query:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
          + + + A LLYT+  +PVALIF I +LPLY D++QAI KIVP RSA VV
Subjt:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X13.4e-24259.65Show/hide
Query:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG
        LR+FLY +TKR KW+ VI+KY ++PEAQ LKLTR G+TALHLAV+ N+EE+V+ LV  I     Y E L+TTN+R+  PLHLAA MGS  MC+AIASAH 
Subjt:  LREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG

Query:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS
         L+D RN  DETPL+LAA  G RDAF+CLY FC ++  RI+ NCRL +NGDTV+H ALRN HF+LA+Q++H+N+EAM+WV E G TPLHVLA+KPT+FKS
Subjt:  GLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS

Query:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD
        GS IRGWRN+ YYC  V++L+PQ I++L  + + +  S   T+   FP NY TC DFFT +   FLK SGL R+ ++ +ND++              G+ 
Subjt:  GSHIRGWRNLIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKA------------NNGKD

Query:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS
        LE D  E  +  D+  LKA   A+ ++ ITN PRNY TC++FFQ VFSAIMI LG GS E +KI  KK+KHTWSVQVM+KLLE+A PD+Y  +G TPM S
Subjt:  LEKDVNEESENFDSRFLKAGQSAETDSLITNFPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGS

Query:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN
          Q  +   +T PY+LV  EV+ S++   + KPK+ +  KNVQA E+ MLLAA+NGVIEIV+G++ RFPL+I +ARKDKKNVVLLAAEY QPDVYRFLL+
Subjt:  KIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLN

Query:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA
         +   E+LFRAVDDNGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LVK +G WL  TS+SCSVV 
Subjt:  KRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVA

Query:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL
        TLI TVAF + A+IPGG++   G+  L+D   FF FA+ SL+ALCLSSTSV MFLAILT R D  DF + LPWKL IG SSL+ SII+ML+SFC+GHYFL
Subjt:  TLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL

Query:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
          + + + A LLYT+  +PVALIF I +LPLY D++QAI KIVP RSA VV
Subjt:  TARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

SwissProt top hitse value%identityAlignment
Q08353 NF-kappa-B inhibitor alpha3.7e-0728.92Show/hide
Query:  ETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG-----GLLDKRNIADETPLFLAAVYGKRDAFYCLYS
        +T LHLAVITNQ EI E L++  C         E  + R + PLHLA   G +     +    G      +L   N    T L LA+++G       L S
Subjt:  ETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG-----GLLDKRNIADETPLFLAAVYGKRDAFYCLYS

Query:  FCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTS
           D       N +   NG T +H A+  Q+ +L   L+   ++ +N V   G +P  +   +P++
Subjt:  FCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTS

Q25338 Delta-latroinsectotoxin-Lt1a1.5e-0525.48Show/hide
Query:  NPTSNNNNNELREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVR
        +P    N  E   F   HT       V+     HP+      +  G T  HLA+I   +E+ E+LV+           L   +     P+H AA MGS++
Subjt:  NPTSNNNNNELREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVR

Query:  MCHAIASAHGGLLDKRNI------ADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDG
        M   + S    + DK +I       + TPL  A  + K DA   L      D + ++    + +   TV+H A+      +  +L+   S       E G
Subjt:  MCHAIASAHGGLLDKRNI------ADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDG

Query:  CTPLHVLA
         T LH+ A
Subjt:  CTPLHVLA

Q4UMH6 Putative ankyrin repeat protein RF_03811.5e-0828.08Show/hide
Query:  TRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSF
        T  GET LH AV  N  ++V  L+        Y   +    +     LH A + G++ +   + S HG  ++ +  + ET L+ A  YG  D  Y L ++
Subjt:  TRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSF

Query:  CGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMN
          D   +        +NG+TV+HYA+ + + +L   LIH  +   N
Subjt:  CGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMN

Q8BZ25 Ankyrin repeat and protein kinase domain-containing protein 16.2e-0727.55Show/hide
Query:  LREFLYAHTKRAKWDAV--IQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASA
        LR  L+   +R K  A+  + K    P+A    L   G + LH+A    ++ I + L++       Y   LE   ++   PLHLA + G + + H +A +
Subjt:  LREFLYAHTKRAKWDAV--IQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASA

Query:  HGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANK
        H   LD       TPL LAA  G+      L   CG +P         + +G T +H A+    F     L+   ++ ++  N+ G TP H+ A K
Subjt:  HGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANK

Q8NFD2 Ankyrin repeat and protein kinase domain-containing protein 12.8e-0729.29Show/hide
Query:  LREFLYAHTKRAKWDAV--IQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASA
        LR  L+   +R K  A+  + K    P+A    L + G   LH A    +  I + L++       Y   LE    +   PLHLAA+ G + + H +A +
Subjt:  LREFLYAHTKRAKWDAV--IQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASA

Query:  HGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLI--HINSEAMNWVNEDGCTPLHVLANK
        H   +      + TPL LAA +G+      L   CG DP         + +G T +H A++   F     L+  H N  A N V   G TP H+ A K
Subjt:  HGGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLI--HINSEAMNWVNEDGCTPLHVLANK

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein4.7e-3432.25Show/hide
Query:  PKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAA-AARAPKL
        PK D   +        +  A  NG++E +  + + +P  ++       N+   A   RQ  ++  + N       L    D   N+ LH AA  A A +L
Subjt:  PKQDQPAKNVQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAA-AARAPKL

Query:  WLITGAALQMQWETKWYKYVKESV-PLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGY-DKGAAVLEDED
         LI GAALQMQ E +W+K V++ V P H      K++KT   +F + H  LV+    W++ T+ SC+VVA LI T+ F++A  +PGGY   G  +   + 
Subjt:  WLITGAALQMQWETKWYKYVKESV-PLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGY-DKGAAVLEDED

Query:  GFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPL
         F IF IS  ++L  S  S++MFL IL SR   +DF   LP KL++GL +L+LS+  M+++F      L   ++  V+     L  +P+ + F ++Q P+
Subjt:  GFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPL

Query:  YFDLLQA
          ++ +A
Subjt:  YFDLLQA

AT3G54070.1 Ankyrin repeat family protein1.5e-2732.46Show/hide
Query:  ASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKL---WLITGAALQ
        AS  ++  AA  G +EI+  L +     ++    + + +  +AA YR  +++  L+ +   ++ L  +  +  +    L   AR P +    + +GAAL 
Subjt:  ASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKL---WLITGAALQ

Query:  MQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK--------GAAVLEDEDGFF
        MQ E  W+K VKE VP  +    N + + A  IF E H  L K    W++ T+ +C + ATLIATV FA A  IPGG D         G         F 
Subjt:  MQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK--------GAAVLEDEDGFF

Query:  IFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFD
        IF +S  VAL  S  S+++FL+I TSR   +DF   LP KL+ GLS+L++SII+M+++F      +   +     VL+  L  L  AL F  +   L+F+
Subjt:  IFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFD

Query:  LLQAI
         L+++
Subjt:  LLQAI

AT5G04700.1 Ankyrin repeat family protein6.1e-3435.03Show/hide
Query:  ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV-LLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAP-KLWLITGAALQMQW
        +  +L A R G ++ +  + +     ++  R    + + LLA E+RQ  V+  L         L    D +GN  LHLA     P KL  + GA LQ+Q 
Subjt:  ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV-LLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAP-KLWLITGAALQMQW

Query:  ETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD---KGAAVLEDEDGFFIFAISSLV
        E +W+K V+   P       N EE+T   IF + H GL + A  W++ T+ SCS+VA LI TV FA    +PGG D   KG      +  F IF +S L+
Subjt:  ETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD---KGAAVLEDEDGFFIFAISSLV

Query:  ALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLL
        +   S TSV++FL ILT+R    DF   LP K++ GLS L++SI AMLI+F S  + +  +  + +         LP AL+F ++Q PL  +++
Subjt:  ALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLL

AT5G04730.1 Ankyrin-repeat containing protein1.1e-2732.82Show/hide
Query:  KNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAP-KLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH
        +N+  LA E+++  ++  +        +L R+ D   N+ LH+A     P +L  I+GAAL+MQ E++W+K V+  V        NK+ KT   IF+  H
Subjt:  KNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAP-KLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH

Query:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIG
          L K    W++ T+ +CS VA LIATV F     +PGG D   G+ ++ ++  F  F  +  +A   S  SV++FL+ILTSR    DF   LP K+++G
Subjt:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIG

Query:  LSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLP--VALIFGIVQLPLYFDLLQA
         S L++SI +ML++F +      +  +++   L+Y L  L    +L+F ++Q PL  +++ +
Subjt:  LSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLP--VALIFGIVQLPLYFDLLQA

AT5G35810.1 Ankyrin repeat family protein1.6e-3434.43Show/hide
Query:  VQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYR--FLLNKRHDLESLFRAVDDNGNSALHLAAAARAP-KLWLITGAA
        V +S  ++  AA++G +E++  L + +P  I+      +++  +AA  R   ++   + L    DL ++++  + N N  LHL A    P +L +++GAA
Subjt:  VQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYR--FLLNKRHDLESLFRAVDDNGNSALHLAAAARAP-KLWLITGAA

Query:  LQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-------GAAVLEDEDGF
        LQMQ E  WYK VKE VP  +    NK+E+ A  +F + H  L K    W++ T+ +C +V+TLIATV FA A  +PGG D        G      E  F
Subjt:  LQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-------GAAVLEDEDGF

Query:  FIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYF
         +F IS  VAL  S TS+++FL+ILTSR     F + LP KL++GL +L++SII+M+++F +    L   +    +++L        AL F ++   L+F
Subjt:  FIFAISSLVALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYF

Query:  DLLQA
        D L++
Subjt:  DLLQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCAACGTCCAACAATAACAACAACGAATTACGAGAGTTTCTTTATGCACACACCAAGAGAGCAAAATGGGATGCAGTAATTCAAAAGTACGCCCAACACCCAGA
AGCTCAGGGGTTGAAGCTGACACGAAAGGGCGAGACGGCGCTGCATTTGGCTGTCATTACCAACCAAGAAGAAATAGTCGAAACGCTTGTCAAACTCATCTGCAAATTCC
CAGGATACAGAGAACGTCTTGAGACTACGAACGAGAGGAAAAGCAACCCTCTCCACCTTGCCGCATTTATGGGAAGCGTCAGAATGTGCCACGCCATTGCTTCAGCCCAT
GGGGGCTTACTGGACAAGAGAAACATCGCCGACGAAACGCCGCTCTTCTTGGCGGCGGTCTACGGCAAAAGGGACGCCTTTTATTGCCTTTACTCCTTCTGTGGAGACGA
CCCAGTTCGAATCTCCGGCAACTGCAGACTCAAGAACAATGGCGACACCGTGATACATTATGCCCTCAGAAACCAGCACTTTGAGTTGGCATATCAATTAATTCACATTA
ACAGCGAGGCTATGAATTGGGTGAATGAGGATGGTTGTACCCCTCTCCATGTTCTAGCAAATAAGCCAACTTCCTTCAAAAGTGGTTCCCACATCCGGGGATGGCGTAAC
CTCATCTATTACTGCATATTTGTGAATAAACTAGAGCCTCAAACAATTGAAACTCTTAGCGATGAAGCGTTGAGAAAAAGCAGGTCCAAAGCAAAGACGACTGATGCCTA
TTTTCCAAAAAACTATGAGACATGCTATGACTTCTTTACGAGTCTGAAGGTTACATTTTTAAAAGTCAGCGGTTTGAATCGACTGCTCAACAATAAGAAAAATGATAAAG
CCAACAACGGTAAAGATTTGGAGAAAGATGTTAACGAGGAAAGTGAAAATTTTGACAGTAGATTTCTTAAAGCTGGTCAATCTGCAGAAACAGATTCCCTAATTACAAAT
TTCCCGAGAAACTATGCTACCTGTGTCAACTTTTTTCAATTTGTTTTCTCGGCCATCATGATCGTTCTTGGGTTCGGATCTATGGAAATCAGAAAGATATATGAGAAGAA
ACAAAAGCACACTTGGTCAGTTCAAGTGATGGATAAACTTCTTGAATTTGCTACACCTGATAAATATTGCGACGACGGAAAAACTCCGATGGGTTCGAAAATTCAAGCCG
ACGAAGCGGAGGGAATTACGACTCCTTACAACTTGGTAGATGATGAAGTCAAGTTCAGTGATAACTTTAATACTGATCATAAACCAAAACAAGACCAGCCGGCCAAAAAT
GTTCAAGCATCAGAATCAGTGATGTTGTTAGCAGCAAGGAATGGTGTGATTGAAATAGTGAGGGGACTGTACAAACGTTTTCCCCTGGCGATCTACGAAGCTCGGAAAGA
TAAGAAGAATGTGGTGCTTTTGGCTGCGGAGTACAGACAGCCCGACGTCTACCGCTTTTTACTCAACAAAAGACATGACTTAGAAAGCCTCTTTCGCGCTGTCGATGACA
ATGGCAACAGCGCCTTACATCTCGCCGCCGCCGCCAGAGCTCCCAAGCTTTGGCTCATCACCGGAGCTGCACTGCAGATGCAATGGGAAACCAAGTGGTATAAGTACGTG
AAGGAGTCTGTGCCACTCCATTTCTTTGCCCCCTATAACAAGGAAGAAAAAACTGCAAGTACAATCTTCCAGGAAACCCACATGGGTCTGGTGAAGAGCGCCGGGAATTG
GCTTCGGGGCACCTCCGAGTCATGCTCTGTGGTGGCCACTCTCATTGCAACGGTGGCCTTTGCAACTGCGGCCAACATCCCGGGGGGCTACGACAAAGGCGCCGCAGTAC
TGGAAGATGAGGATGGCTTTTTTATCTTCGCCATCTCTTCCCTGGTTGCCCTTTGCCTCTCTTCAACATCGGTCATCATGTTTCTCGCCATCTTGACCTCGCGCCTCGAT
ACCAAAGACTTCGGCTCCATCTTGCCCTGGAAACTGCTCATCGGCTTATCCTCTCTCTACCTTTCCATCATCGCCATGTTGATTTCCTTTTGCAGCGGCCACTACTTTCT
CACCGCTCGCCGCCTCCAAAACGTCGCCGTTCTGCTCTACACCCTTACTTTTCTCCCCGTCGCTTTGATCTTCGGCATAGTCCAGCTTCCTCTCTACTTCGATTTGCTGC
AGGCTATTCTCAAAATAGTGCCTAAAAGGAGCGCCGCCGTCGTCTGCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCCAACGTCCAACAATAACAACAACGAATTACGAGAGTTTCTTTATGCACACACCAAGAGAGCAAAATGGGATGCAGTAATTCAAAAGTACGCCCAACACCCAGA
AGCTCAGGGGTTGAAGCTGACACGAAAGGGCGAGACGGCGCTGCATTTGGCTGTCATTACCAACCAAGAAGAAATAGTCGAAACGCTTGTCAAACTCATCTGCAAATTCC
CAGGATACAGAGAACGTCTTGAGACTACGAACGAGAGGAAAAGCAACCCTCTCCACCTTGCCGCATTTATGGGAAGCGTCAGAATGTGCCACGCCATTGCTTCAGCCCAT
GGGGGCTTACTGGACAAGAGAAACATCGCCGACGAAACGCCGCTCTTCTTGGCGGCGGTCTACGGCAAAAGGGACGCCTTTTATTGCCTTTACTCCTTCTGTGGAGACGA
CCCAGTTCGAATCTCCGGCAACTGCAGACTCAAGAACAATGGCGACACCGTGATACATTATGCCCTCAGAAACCAGCACTTTGAGTTGGCATATCAATTAATTCACATTA
ACAGCGAGGCTATGAATTGGGTGAATGAGGATGGTTGTACCCCTCTCCATGTTCTAGCAAATAAGCCAACTTCCTTCAAAAGTGGTTCCCACATCCGGGGATGGCGTAAC
CTCATCTATTACTGCATATTTGTGAATAAACTAGAGCCTCAAACAATTGAAACTCTTAGCGATGAAGCGTTGAGAAAAAGCAGGTCCAAAGCAAAGACGACTGATGCCTA
TTTTCCAAAAAACTATGAGACATGCTATGACTTCTTTACGAGTCTGAAGGTTACATTTTTAAAAGTCAGCGGTTTGAATCGACTGCTCAACAATAAGAAAAATGATAAAG
CCAACAACGGTAAAGATTTGGAGAAAGATGTTAACGAGGAAAGTGAAAATTTTGACAGTAGATTTCTTAAAGCTGGTCAATCTGCAGAAACAGATTCCCTAATTACAAAT
TTCCCGAGAAACTATGCTACCTGTGTCAACTTTTTTCAATTTGTTTTCTCGGCCATCATGATCGTTCTTGGGTTCGGATCTATGGAAATCAGAAAGATATATGAGAAGAA
ACAAAAGCACACTTGGTCAGTTCAAGTGATGGATAAACTTCTTGAATTTGCTACACCTGATAAATATTGCGACGACGGAAAAACTCCGATGGGTTCGAAAATTCAAGCCG
ACGAAGCGGAGGGAATTACGACTCCTTACAACTTGGTAGATGATGAAGTCAAGTTCAGTGATAACTTTAATACTGATCATAAACCAAAACAAGACCAGCCGGCCAAAAAT
GTTCAAGCATCAGAATCAGTGATGTTGTTAGCAGCAAGGAATGGTGTGATTGAAATAGTGAGGGGACTGTACAAACGTTTTCCCCTGGCGATCTACGAAGCTCGGAAAGA
TAAGAAGAATGTGGTGCTTTTGGCTGCGGAGTACAGACAGCCCGACGTCTACCGCTTTTTACTCAACAAAAGACATGACTTAGAAAGCCTCTTTCGCGCTGTCGATGACA
ATGGCAACAGCGCCTTACATCTCGCCGCCGCCGCCAGAGCTCCCAAGCTTTGGCTCATCACCGGAGCTGCACTGCAGATGCAATGGGAAACCAAGTGGTATAAGTACGTG
AAGGAGTCTGTGCCACTCCATTTCTTTGCCCCCTATAACAAGGAAGAAAAAACTGCAAGTACAATCTTCCAGGAAACCCACATGGGTCTGGTGAAGAGCGCCGGGAATTG
GCTTCGGGGCACCTCCGAGTCATGCTCTGTGGTGGCCACTCTCATTGCAACGGTGGCCTTTGCAACTGCGGCCAACATCCCGGGGGGCTACGACAAAGGCGCCGCAGTAC
TGGAAGATGAGGATGGCTTTTTTATCTTCGCCATCTCTTCCCTGGTTGCCCTTTGCCTCTCTTCAACATCGGTCATCATGTTTCTCGCCATCTTGACCTCGCGCCTCGAT
ACCAAAGACTTCGGCTCCATCTTGCCCTGGAAACTGCTCATCGGCTTATCCTCTCTCTACCTTTCCATCATCGCCATGTTGATTTCCTTTTGCAGCGGCCACTACTTTCT
CACCGCTCGCCGCCTCCAAAACGTCGCCGTTCTGCTCTACACCCTTACTTTTCTCCCCGTCGCTTTGATCTTCGGCATAGTCCAGCTTCCTCTCTACTTCGATTTGCTGC
AGGCTATTCTCAAAATAGTGCCTAAAAGGAGCGCCGCCGTCGTCTGCAATTAG
Protein sequenceShow/hide protein sequence
MNPTSNNNNNELREFLYAHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEEIVETLVKLICKFPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAH
GGLLDKRNIADETPLFLAAVYGKRDAFYCLYSFCGDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRN
LIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDKANNGKDLEKDVNEESENFDSRFLKAGQSAETDSLITN
FPRNYATCVNFFQFVFSAIMIVLGFGSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGSKIQADEAEGITTPYNLVDDEVKFSDNFNTDHKPKQDQPAKN
VQASESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHDLESLFRAVDDNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYV
KESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDKGAAVLEDEDGFFIFAISSLVALCLSSTSVIMFLAILTSRLD
TKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVVCN