; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038287 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038287
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionChlorophyllase
Genome locationscaffold12:43022922..43027318
RNA-Seq ExpressionSpg038287
SyntenySpg038287
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606412.1 NF-kappa-B inhibitor alpha, partial [Cucurbita argyrosperma subsp. sororia]2.1e-22757.12Show/hide
Query:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE  VE LV+ +      
Subjt:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNER++NPLHLAA MGSVRMC AIAS H GL+D+RN DDETPLFLAA YG +DAF+CLY FCR  P RIS NCR+K +GDTV+H+ALRN+HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETL---SDEALRKSR-----SKAKTTD-AYFPKNYETCYD
        A+QLI+++ EA++WVNE G TP+HVLA+KPTSFKSGSHI+GW++ +Y+C FV  L+P++I+ L   SD +++K+R      KA+  D   FP NY+TC D
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETL---SDEALRKSR-----SKAKTTD-AYFPKNYETCYD

Query:  FFTSLKVTFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR-------------------FLKAGQSA-------GSMEIRKIYEKK
        F   +K   L V       NN  N++ +  K++ K+         EE +  +++                   F     SA       GS EI+KI +KK
Subjt:  FFTSLKVTFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR-------------------FLKAGQSA-------GSMEIRKIYEKK

Query:  QKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFT-----------------SESVMLLAARNGVIEIVRGLYKRFPLA
        +KHTWSVQVM+KLLE A PDKY +DG +PM  K   DEA G+T PYN VD+EV+F+                 SES MLLAARNGVIEIV+G ++RFPLA
Subjt:  QKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFT-----------------SESVMLLAARNGVIEIVRGLYKRFPLA

Query:  IYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTAS
        I + RKDKKNVVLLAAE+RQPDVYRFLL K + +++LFRAVDH GNSALHLAA A  PKLW ITGAALQMQWE KWY YVK SVPLHFF  +N + KTAS
Subjt:  IYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTAS

Query:  TIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILP
         IF+ETH  L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSLIALCLSSTSVIMFLAI+TSR D KDFG  LP
Subjt:  TIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILP

Query:  WKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        WKLLIGL  LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  WKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_022931013.1 uncharacterized protein LOC111437338 isoform X2 [Cucurbita moschata]1.7e-21855.95Show/hide
Query:  SNNNNNE--LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRM
        S N  +E  LR+FLY++TKR +W+ VI+KY +HPEAQGLKLTR G+TALHLAV+ N+E++V+ LV  I K     E LETTN+RK NPLHLAA MGS  M
Subjt:  SNNNNNE--LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRM

Query:  CHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVL
        C+AIASAH  L++KRN  DETPL+LAA  G RDAF+CLY FCRD    I+ NCRL +NGDTV+H ALRN HF+LA+ ++H+++EAM+WV +DG TPLHVL
Subjt:  CHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVL

Query:  ANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEANN-----G
        A+KPT+FKSGS IRGWRN  YYC  V++L PQ I++L  + + +  S   T+   FP NYETC DFFT +   FLK SGL R+ ++ KNDE+       G
Subjt:  ANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEANN-----G

Query:  KDLEKDVNEESE------NFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDD
        +++  +  E SE        D++ LKA   A                                GS E +KI  +K+KHTWSVQVM+KLLE+A PD+Y  +
Subjt:  KDLEKDVNEESE------NFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDD

Query:  GKTPMGPKIQADEAEGITTPYNLVDDEVKFT------------------SESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVY
        G  PM    Q  +  G+T PY+  DD+V F+                   E+ MLLAA+NGVIEIV+G++ RFPL+IY+A KDKKNVVLLAAEY QPDVY
Subjt:  GKTPMGPKIQADEAEGITTPYNLVDDEVKFT------------------SESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVY

Query:  RFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSES
        RFLL+ + + E+LFRAVD NGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LV+ +G+WL  TS+S
Subjt:  RFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSES

Query:  CSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCS
        CSVV  LI TVAF + A+IPGG++   G+  L+D + FF FA+ SLIALCLSSTSV +FLAILT R D  DF + LPWKL IG SSL+ SII+MLISFC+
Subjt:  CSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCS

Query:  GHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        GHYFL  R + + A LLYT+  +PVALIF I +LPLY D++QAI KIVPKRSA VV
Subjt:  GHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]1.1e-22356.87Show/hide
Query:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN DD+TPLFLAA YG +DAF+CLY+FCRDD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++ +Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR-------------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSV
          L         NN  N++ +  K++ K+         EE +  +++                   F     SA       GS EI+KI +KK+KHTWSV
Subjt:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR-------------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSV

Query:  QVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKD
        QVM KLLE A PDKY +DG +PM  K + DE  G+T PYN VD+EV+F++                 ES MLLAARNGVIEIV+G  +RFPLAI + RKD
Subjt:  QVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKD

Query:  KKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH
        KKNVVLLAAE+RQPDVYR LL K   +++LFRAVDH GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+ETH
Subjt:  KKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH

Query:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGL
          L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSLIALCLSSTSVIMFLAI+TSR D KDFG  LPWKLLIGL
Subjt:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGL

Query:  SSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
          LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  SSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]2.8e-22457.24Show/hide
Query:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN DD+TPLFLAA YG +DAF+CLY+FCRDD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++ +Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR--------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSVQVMDK
          L         NN  N++ +  K++ K+         EE +  +++              F     SA       GS EI+KI +KK+KHTWSVQVM K
Subjt:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR--------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSVQVMDK

Query:  LLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV
        LLE A PDKY +DG +PM  K + DE  G+T PYN VD+EV+F++                 ES MLLAARNGVIEIV+G  +RFPLAI + RKDKKNVV
Subjt:  LLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV

Query:  LLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVK
        LLAAE+RQPDVYR LL K   +++LFRAVDH GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+ETH  L  
Subjt:  LLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVK

Query:  SAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYL
          G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSLIALCLSSTSVIMFLAI+TSR D KDFG  LPWKLLIGL  LY 
Subjt:  SAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYL

Query:  SIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  SIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

XP_023534309.1 uncharacterized protein LOC111795905 [Cucurbita pepo subsp. pepo]1.2e-22256.5Show/hide
Query:  NNNNNELREFLYVHTKRAKWDAVIQKYAQ-------------------HPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYP------------
        N+ N      LY + KR KW+ VI+KY +                   +P A+  KL R+G+T LHLAVI NQE  VE LV+ I + P            
Subjt:  NNNNNELREFLYVHTKRAKWDAVIQKYAQ-------------------HPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYP------------

Query:  --------GYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIH
                 Y++ LETTNER +NPLHLAA MGSVRMC AI S H GL+D RN DDETPLFLAA YG +DAF+CLY+FC+DD  RIS NCR+K +GDTV+H
Subjt:  --------GYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIH

Query:  YALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTD--AYFPKNYET
         ALRN+H +LA+QLI+++ EA++WVNE G TP+H+LA+KPTSFKSGSHI+GW++ +Y+C FV  L+         + L  S  K +  D  + FP NY T
Subjt:  YALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTD--AYFPKNYET

Query:  CYDFFTSLKVTFLKVSGLNRLLNN--------KKNDEANNGKDLEKDVNEE-------SENFDSR------FLKAGQSA-------GSMEIRKIYEKKQK
        C DF+  ++   L V    R   N        K+ND  N+G     D+ E+       + NF         F     SA       GS EI+KI +KK+K
Subjt:  CYDFFTSLKVTFLKVSGLNRLLNN--------KKNDEANNGKDLEKDVNEE-------SENFDSR------FLKAGQSA-------GSMEIRKIYEKKQK

Query:  HTWSVQVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFT-----------------SESVMLLAARNGVIEIVRGLYKRFPLAIY
        HTWSVQVM+KLLE A PDKY +DG +PM  K Q DEA G+T PYN VD+EV+F+                  ES MLLAA+NGVIEIV+G  +RFP AI 
Subjt:  HTWSVQVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFT-----------------SESVMLLAARNGVIEIVRGLYKRFPLAIY

Query:  EARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTI
        +  KDKKNVVLLAAE+RQPDVYRF L +R+ ++ LFRAVDH GNSALHLAA A  PKLW ITGAALQMQWE KWY YVK SVPLHFF  +N + KTAS I
Subjt:  EARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTI

Query:  FQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWK
        F+ETH  L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSLIALCLSSTSVIMFLAI+TSR D KDFG  LPWK
Subjt:  FQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWK

Query:  LLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        LLIGL  LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  LLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

TrEMBL top hitse value%identityAlignment
A0A6J1ET50 uncharacterized protein LOC111437341 isoform X21.3e-22457.24Show/hide
Query:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN DD+TPLFLAA YG +DAF+CLY+FCRDD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++ +Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR--------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSVQVMDK
          L         NN  N++ +  K++ K+         EE +  +++              F     SA       GS EI+KI +KK+KHTWSVQVM K
Subjt:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR--------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSVQVMDK

Query:  LLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV
        LLE A PDKY +DG +PM  K + DE  G+T PYN VD+EV+F++                 ES MLLAARNGVIEIV+G  +RFPLAI + RKDKKNVV
Subjt:  LLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV

Query:  LLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVK
        LLAAE+RQPDVYR LL K   +++LFRAVDH GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+ETH  L  
Subjt:  LLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVK

Query:  SAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYL
          G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSLIALCLSSTSVIMFLAI+TSR D KDFG  LPWKLLIGL  LY 
Subjt:  SAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYL

Query:  SIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  SIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1EX58 uncharacterized protein LOC111437338 isoform X28.4e-21955.95Show/hide
Query:  SNNNNNE--LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRM
        S N  +E  LR+FLY++TKR +W+ VI+KY +HPEAQGLKLTR G+TALHLAV+ N+E++V+ LV  I K     E LETTN+RK NPLHLAA MGS  M
Subjt:  SNNNNNE--LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRM

Query:  CHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVL
        C+AIASAH  L++KRN  DETPL+LAA  G RDAF+CLY FCRD    I+ NCRL +NGDTV+H ALRN HF+LA+ ++H+++EAM+WV +DG TPLHVL
Subjt:  CHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVL

Query:  ANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEANN-----G
        A+KPT+FKSGS IRGWRN  YYC  V++L PQ I++L  + + +  S   T+   FP NYETC DFFT +   FLK SGL R+ ++ KNDE+       G
Subjt:  ANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEANN-----G

Query:  KDLEKDVNEESE------NFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDD
        +++  +  E SE        D++ LKA   A                                GS E +KI  +K+KHTWSVQVM+KLLE+A PD+Y  +
Subjt:  KDLEKDVNEESE------NFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDD

Query:  GKTPMGPKIQADEAEGITTPYNLVDDEVKFT------------------SESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVY
        G  PM    Q  +  G+T PY+  DD+V F+                   E+ MLLAA+NGVIEIV+G++ RFPL+IY+A KDKKNVVLLAAEY QPDVY
Subjt:  GKTPMGPKIQADEAEGITTPYNLVDDEVKFT------------------SESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVY

Query:  RFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSES
        RFLL+ + + E+LFRAVD NGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LV+ +G+WL  TS+S
Subjt:  RFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSES

Query:  CSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCS
        CSVV  LI TVAF + A+IPGG++   G+  L+D + FF FA+ SLIALCLSSTSV +FLAILT R D  DF + LPWKL IG SSL+ SII+MLISFC+
Subjt:  CSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCS

Query:  GHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
        GHYFL  R + + A LLYT+  +PVALIF I +LPLY D++QAI KIVPKRSA VV
Subjt:  GHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X15.1e-22456.87Show/hide
Query:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY
        N+ N      LY H KR +W  VI KY ++ E                                 KL R+G+T LHLAVI NQE IVE LVK++      
Subjt:  NNNNNELREFLYVHTKRAKWDAVIQKYAQHPEA-----------------------------QGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGY

Query:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL
         E LE TNE  +N LHLAA MGSVRMC AIAS H GL+D+RN DD+TPLFLAA YG +DAF+CLY+FCRDD  RIS NCR+K +GDTV+H AL ++HF+L
Subjt:  RERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFEL

Query:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV
        A+QLI+++ EA++WVNE G TP+HVLA KPTSFKSGSHI+GW++ +Y+C FV  L+P++ E L  E+  +S  KAK T   + FP NY+TC  FF  +K 
Subjt:  AYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTT--DAYFPKNYETCYDFFTSLKV

Query:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR-------------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSV
          L         NN  N++ +  K++ K+         EE +  +++                   F     SA       GS EI+KI +KK+KHTWSV
Subjt:  TFLKVSGLNRLLNNKKNDEANNGKDLEKDVN-------EESENFDSR-------------------FLKAGQSA-------GSMEIRKIYEKKQKHTWSV

Query:  QVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKD
        QVM KLLE A PDKY +DG +PM  K + DE  G+T PYN VD+EV+F++                 ES MLLAARNGVIEIV+G  +RFPLAI + RKD
Subjt:  QVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTS-----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKD

Query:  KKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH
        KKNVVLLAAE+RQPDVYR LL K   +++LFRAVDH GNSALHLAA A  PKLW ITGAALQMQWE KWYKYVK SVPLHFF  +N + KTAS IF+ETH
Subjt:  KKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH

Query:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGL
          L    G WL  TSESCS+VATLIATVAFATAA IPGG D+ G A L DE GF IF+ SSLIALCLSSTSVIMFLAI+TSR D KDFG  LPWKLLIGL
Subjt:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-GAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGL

Query:  SSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
          LY SIIAML+SFCSGHYFL  +RL N A+LLYTLTF PVALIFGIVQLPLYFDLLQA++K VP+RSA VV
Subjt:  SSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X21.1e-21856.03Show/hide
Query:  LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG
        LR+FLY +TKR KW+ VI+KY ++PEAQ LKLTR G+TALHLAV+ N+E++V+ LV  I     Y E L+TTN+R+  PLHLAA MGS  MC+AIASAH 
Subjt:  LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG

Query:  GLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS
         L+D RN  DETPL+LAA  G RDAF+CLY FCR++  RI+ NCRL +NGDTV+H ALRN HF+LA+Q++H+N+EAM+WV E G TPLHVLA+KPT+FKS
Subjt:  GLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS

Query:  GSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEA------------NNGKD
        GS IRGWRN  YYC  V++L+PQ I++L  + + +  S   T+   FP NY TC DFFT +   FLK SGL R+ ++ +NDE+              G+ 
Subjt:  GSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEA------------NNGKD

Query:  LEKDVNEESENFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPK
        LE D  E  +  D+  LKA   A                                GS E +KI  KK+KHTWSVQVM+KLLE+A PD+Y  +G TPM   
Subjt:  LEKDVNEESENFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPK

Query:  IQADEAEGITTPYNLVDDEVKFTS----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHL
         Q  +   +T PY+LV  EV+ ++                E+ MLLAA+NGVIEIV+G++ RFPL+I +ARKDKKNVVLLAAEY QPDVYRFLL+ + + 
Subjt:  IQADEAEGITTPYNLVDDEVKFTS----------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHL

Query:  ESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIAT
        E+LFRAVD NGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LVK +G WL  TS+SCSVV TLI T
Subjt:  ESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIAT

Query:  VAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRL
        VAF + A+IPGG++   G+  L+D   FF FA+ SLIALCLSSTSV MFLAILT R D  DF + LPWKL IG SSL+ SII+ML+SFC+GHYFL  + +
Subjt:  VAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRL

Query:  QNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
         + A LLYT+  +PVALIF I +LPLY D++QAI KIVP RSA VV
Subjt:  QNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X11.9e-21855.88Show/hide
Query:  LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG
        LR+FLY +TKR KW+ VI+KY ++PEAQ LKLTR G+TALHLAV+ N+E++V+ LV  I     Y E L+TTN+R+  PLHLAA MGS  MC+AIASAH 
Subjt:  LREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHG

Query:  GLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS
         L+D RN  DETPL+LAA  G RDAF+CLY FCR++  RI+ NCRL +NGDTV+H ALRN HF+LA+Q++H+N+EAM+WV E G TPLHVLA+KPT+FKS
Subjt:  GLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKS

Query:  GSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEA------------NNGKD
        GS IRGWRN  YYC  V++L+PQ I++L  + + +  S   T+   FP NY TC DFFT +   FLK SGL R+ ++ +NDE+              G+ 
Subjt:  GSHIRGWRNFIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEA------------NNGKD

Query:  LEKDVNEESENFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPK
        LE D  E  +  D+  LKA   A                                GS E +KI  KK+KHTWSVQVM+KLLE+A PD+Y  +G TPM   
Subjt:  LEKDVNEESENFDSRFLKAGQSA--------------------------------GSMEIRKIYEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPK

Query:  IQADEAEGITTPYNLVDDEVKFTS------------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRH
         Q  +   +T PY+LV  EV+ ++                  E+ MLLAA+NGVIEIV+G++ RFPL+I +ARKDKKNVVLLAAEY QPDVYRFLL+ + 
Subjt:  IQADEAEGITTPYNLVDDEVKFTS------------------ESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRH

Query:  HLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLI
        + E+LFRAVD NGNSALHLAAAA    +W ITGAALQMQWE KWYK+V+ESVPL+FFA YNKE K A+ IF ETHM LVK +G WL  TS+SCSVV TLI
Subjt:  HLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLI

Query:  ATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTAR
         TVAF + A+IPGG++   G+  L+D   FF FA+ SLIALCLSSTSV MFLAILT R D  DF + LPWKL IG SSL+ SII+ML+SFC+GHYFL  +
Subjt:  ATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTAR

Query:  RLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV
         + + A LLYT+  +PVALIF I +LPLY D++QAI KIVP RSA VV
Subjt:  RLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVV

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR47.1e-0524Show/hide
Query:  KFTSESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAAR-----APKLWLITG
        K   ++ +  AAR G +EIV+ L ++ P       K  +  + +A +    DV R L++      ++    D NGN+ALH+A   +     A  L L   
Subjt:  KFTSESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAAR-----APKLWLITG

Query:  AALQMQWETKWYKYVKESVPL--------------------HFFAPYNKEEKTASTIFQETHMGLV-----------------KSAGNWLRGTSESCSVV
            +  + K    + E++PL                        P ++  KT + I ++ H  L                  K     +   + S +VV
Subjt:  AALQMQWETKWYKYVKESVPL--------------------HFFAPYNKEEKTASTIFQETHMGLV-----------------KSAGNWLRGTSESCSVV

Query:  ATLIATVAFATAANIPGG-YDKGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL
        A L ATVAFA    +PGG  + G AV+     F IF I + IAL  S   V++ + ++  R +TK    ++     +    ++L+ +   ISF +  Y +
Subjt:  ATLIATVAFATAANIPGG-YDKGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFL

Query:  TARRLQNVAVLLYTLTFLPVALIFG
          R  Q  A+L+  +  + +A + G
Subjt:  TARRLQNVAVLLYTLTFLPVALIFG

Q01484 Ankyrin-22.9e-0625.6Show/hide
Query:  MNPTSNNNNNELREFLYVH----TKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAF
        ++  +N N   L  F  +H      R K   ++ KY    +A    +T  G T +H+A       IV  L++            + TN R    LH+AA 
Subjt:  MNPTSNNNNNELREFLYVH----TKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAF

Query:  MGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGC
         G V +   +   +G L+D R  +++TPL +A+  GK +    L           +       NG T +H + R    ++A  L+   + A +   + G 
Subjt:  MGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGC

Query:  TPLHVLA
        TPLHV A
Subjt:  TPLHVLA

Q4UMH6 Putative ankyrin repeat protein RF_03813.1e-0828.08Show/hide
Query:  TRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSF
        T  GET LH AV  N   +V  L+        Y   +    +     LH A + G++ +   + S HG  ++ +    ET L+ A  YG  D  Y L ++
Subjt:  TRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSF

Query:  CRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMN
          D   +        +NG+TV+HYA+ + + +L   LIH  +   N
Subjt:  CRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMN

Q8C8R3 Ankyrin-22.9e-0625.6Show/hide
Query:  MNPTSNNNNNELREFLYVH----TKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAF
        ++  +N N   L  F  +H      R K   ++ KY    +A    +T  G T +H+A       IV  L++            + TN R    LH+AA 
Subjt:  MNPTSNNNNNELREFLYVH----TKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAF

Query:  MGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGC
         G V +   +   +G L+D R  +++TPL +A+  GK +    L           +       NG T +H + R    ++A  L+   + A +   + G 
Subjt:  MGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGC

Query:  TPLHVLA
        TPLHV A
Subjt:  TPLHVLA

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein4.7e-3623.82Show/hide
Query:  QHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGK
        ++PEA    LT  G+T +H AV++   +IVE +++ I       + L+  N+     L  AA  G VR+   + +   GL+  RN  +  P+ +A++YG 
Subjt:  QHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAHGGLLDKRNIDDETPLFLAAVYGK

Query:  RDAFYCLYS---FCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNK
        +     LYS       DP   S   + K NG  ++   + +  + +A  LI    +     + D  T +  LA  P +F S   I               
Subjt:  RDAFYCLYS---FCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRNFIYYCIFVNK

Query:  LEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEANNGKDLEKDVNEESENFDSRFLKAGQSAGSMEIRKI
                                                                                                        IR++
Subjt:  LEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEANNGKDLEKDVNEESENFDSRFLKAGQSAGSMEIRKI

Query:  YEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTSESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVL
        Y+ K  H  + +++D + +                PK  A + +                    +  A  NG++E +  + + +P  ++       N+  
Subjt:  YEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTSESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVL

Query:  LAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAA-AARAPKLWLITGAALQMQWETKWYKYVKESV-PLHFFAPYNKEEKTASTIFQETHMGLV
         A   RQ  ++  + N       L    D   N+ LH AA  A A +L LI GAALQMQ E +W+K V++ V P H      K++KT   +F + H  LV
Subjt:  LAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAA-AARAPKLWLITGAALQMQWETKWYKYVKESV-PLHFFAPYNKEEKTASTIFQETHMGLV

Query:  KSAGNWLRGTSESCSVVATLIATVAFATAANIPGGY-DKGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLY
        +    W++ T+ SC+VVA LI T+ F++A  +PGGY   G  +   +  F IF IS  I+L  S  S++MFL IL SR   +DF   LP KL++GL +L+
Subjt:  KSAGNWLRGTSESCSVVATLIATVAFATAANIPGGY-DKGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLY

Query:  LSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQA
        LS+  M+++F      L   ++  V+     L  +P+ + F ++Q P+  ++ +A
Subjt:  LSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQA

AT3G54070.1 Ankyrin repeat family protein2.6e-2632.32Show/hide
Query:  AARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKL---WLITGAALQMQWETKWY
        AA  G +EI+  L +     ++    + + +  +AA YR  +++  L+ +   ++ L  +     +    L   AR P +    + +GAAL MQ E  W+
Subjt:  AARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKL---WLITGAALQMQWETKWY

Query:  KYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK--------GAAVLEDEDGFFIFAISSLI
        K VKE VP  +    N + + A  IF E H  L K    W++ T+ +C + ATLIATV FA A  IPGG D         G         F IF +S  +
Subjt:  KYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK--------GAAVLEDEDGFFIFAISSLI

Query:  ALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAI
        AL  S  S+++FL+I TSR   +DF   LP KL+ GLS+L++SII+M+++F      +   +     VL+  L  L  AL F  +   L+F+ L+++
Subjt:  ALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLLQAI

AT5G04700.1 Ankyrin repeat family protein3.3e-3435.47Show/hide
Query:  TSESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV-LLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAP-KLWLITGAALQM
        T +  +L A R G ++ +  + +     ++  R    + + LLA E+RQ  V+  L         L    D +GN  LHLA     P KL  + GA LQ+
Subjt:  TSESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVV-LLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAP-KLWLITGAALQM

Query:  QWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD---KGAAVLEDEDGFFIFAISS
        Q E +W+K V+   P       N EE+T   IF + H GL + A  W++ T+ SCS+VA LI TV FA    +PGG D   KG      +  F IF +S 
Subjt:  QWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD---KGAAVLEDEDGFFIFAISS

Query:  LIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLL
        LI+   S TSV++FL ILT+R    DF   LP K++ GLS L++SI AMLI+F S  + +  +  + +         LP AL+F ++Q PL  +++
Subjt:  LIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGIVQLPLYFDLL

AT5G04730.1 Ankyrin-repeat containing protein1.0e-2732.82Show/hide
Query:  KNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAP-KLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH
        +N+  LA E+++  ++  +        +L R+ D   N+ LH+A     P +L  I+GAAL+MQ E++W+K V+  V        NK+ KT   IF+  H
Subjt:  KNVVLLAAEYRQPDVYRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAP-KLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETH

Query:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIG
          L K    W++ T+ +CS VA LIATV F     +PGG D   G+ ++ ++  F  F  +  +A   S  SV++FL+ILTSR    DF   LP K+++G
Subjt:  MGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYD--KGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIG

Query:  LSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLP--VALIFGIVQLPLYFDLLQA
         S L++SI +ML++F +      +  +++   L+Y L  L    +L+F ++Q PL  +++ +
Subjt:  LSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLP--VALIFGIVQLPLYFDLLQA

AT5G35810.1 Ankyrin repeat family protein5.7e-3434.29Show/hide
Query:  VDDEVKFTSESVMLL--AARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYR--FLLNKRHHLESLFRAVDHNGNSALHLAAAARAP-KL
        V++  +F   S MLL  AA++G +E++  L + +P  I+      +++  +AA  R   ++   + L     L ++++  + N N  LHL A    P +L
Subjt:  VDDEVKFTSESVMLL--AARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDVYR--FLLNKRHHLESLFRAVDHNGNSALHLAAAARAP-KL

Query:  WLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-------GAAV
         +++GAALQMQ E  WYK VKE VP  +    NK+E+ A  +F + H  L K    W++ T+ +C +V+TLIATV FA A  +PGG D        G   
Subjt:  WLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIATVAFATAANIPGGYDK-------GAAV

Query:  LEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGI
           E  F +F IS  +AL  S TS+++FL+ILTSR     F + LP KL++GL +L++SII+M+++F +    L   +    +++L        AL F +
Subjt:  LEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLTFLPVALIFGI

Query:  VQLPLYFDLLQA
        +   L+FD L++
Subjt:  VQLPLYFDLLQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCAACGTCCAACAATAACAACAACGAATTACGAGAGTTTCTTTATGTACACACCAAGAGAGCAAAATGGGATGCTGTGATTCAAAAGTACGCCCAACACCCAGA
AGCTCAGGGGTTGAAGCTGACACGAAAGGGCGAGACGGCGCTGCATTTGGCTGTCATTACCAACCAAGAACAAATAGTTGAAACGCTTGTCAAACTCATTTGCAAATACC
CAGGATACAGAGAACGTCTTGAGACTACCAACGAGAGGAAAAGCAACCCTCTCCACCTTGCCGCATTTATGGGAAGCGTCAGAATGTGCCACGCCATTGCTTCAGCCCAT
GGGGGCTTACTGGACAAGAGAAACATCGACGACGAAACGCCGCTCTTCTTGGCGGCGGTCTACGGCAAGAGGGACGCCTTTTATTGCCTTTACTCCTTCTGTAGAGACGA
CCCAGTTCGAATCTCCGGCAACTGCAGACTCAAGAACAATGGCGACACCGTGATACATTATGCCCTCAGAAACCAGCACTTTGAGTTGGCATATCAATTAATTCACATTA
ACAGCGAGGCTATGAATTGGGTAAACGAGGATGGCTGTACCCCTCTCCATGTTCTAGCAAATAAGCCAACTTCCTTCAAAAGTGGTTCCCACATCCGGGGATGGCGTAAC
TTCATCTATTACTGCATATTTGTGAATAAACTAGAGCCTCAAACAATTGAAACTCTTAGCGATGAAGCGTTGAGAAAAAGCAGGTCCAAAGCAAAGACGACTGATGCCTA
TTTTCCAAAAAACTACGAGACATGCTATGACTTCTTTACGAGTCTGAAGGTTACATTTTTAAAAGTCAGCGGTTTGAATCGACTGCTCAACAATAAGAAAAATGATGAAG
CCAACAACGGTAAAGATTTGGAGAAAGATGTTAACGAGGAAAGTGAAAATTTTGACAGTAGATTTCTTAAAGCTGGTCAATCTGCAGGATCTATGGAAATCAGAAAGATA
TATGAGAAGAAACAAAAGCACACTTGGTCAGTTCAAGTGATGGATAAACTTCTTGAATTTGCTACACCTGATAAATATTGCGACGACGGAAAAACTCCGATGGGTCCGAA
AATTCAAGCCGACGAAGCGGAGGGAATTACGACTCCTTACAACTTGGTAGATGATGAAGTCAAGTTCACATCAGAATCAGTGATGTTGTTAGCAGCAAGGAATGGTGTGA
TTGAAATAGTGAGGGGACTGTACAAACGTTTTCCCCTGGCGATCTACGAAGCTCGGAAAGATAAGAAGAATGTGGTGCTTTTGGCTGCGGAGTACAGGCAGCCCGACGTC
TACCGCTTTTTACTCAACAAAAGGCATCACTTAGAAAGCCTCTTTCGCGCTGTCGATCACAATGGCAACAGCGCCTTACATCTCGCCGCCGCCGCCAGAGCTCCCAAGCT
TTGGCTCATCACCGGAGCTGCACTGCAGATGCAATGGGAAACCAAGTGGTATAAGTACGTGAAGGAGTCTGTGCCACTCCATTTCTTTGCCCCCTATAACAAGGAAGAAA
AAACTGCAAGTACAATCTTCCAGGAAACCCACATGGGTCTGGTGAAGAGCGCCGGGAATTGGCTTCGGGGCACCTCGGAGTCATGCTCTGTGGTGGCCACTCTGATTGCA
ACGGTGGCCTTTGCAACTGCGGCCAACATCCCGGGGGGCTACGACAAAGGCGCCGCAGTACTGGAAGATGAGGATGGCTTTTTTATCTTCGCCATCTCTTCCCTCATTGC
CCTTTGCCTCTCTTCAACATCGGTCATCATGTTTCTCGCCATCTTGACCTCGCGCCTCGATACCAAAGACTTCGGCTCCATCTTGCCCTGGAAACTGCTCATCGGCTTAT
CCTCTCTCTACCTTTCCATCATCGCCATGTTGATTTCCTTTTGCAGCGGCCACTACTTTCTCACCGCTCGCCGCCTCCAAAACGTCGCCGTTCTGCTCTACACCCTTACT
TTTCTCCCCGTCGCTTTGATCTTCGGCATAGTCCAGCTTCCTCTCTACTTCGATTTGCTGCAGGCTATTCTCAAAATAGTGCCTAAAAGGAGCGCCGCCGTCGTCTGCAA
TTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCCAACGTCCAACAATAACAACAACGAATTACGAGAGTTTCTTTATGTACACACCAAGAGAGCAAAATGGGATGCTGTGATTCAAAAGTACGCCCAACACCCAGA
AGCTCAGGGGTTGAAGCTGACACGAAAGGGCGAGACGGCGCTGCATTTGGCTGTCATTACCAACCAAGAACAAATAGTTGAAACGCTTGTCAAACTCATTTGCAAATACC
CAGGATACAGAGAACGTCTTGAGACTACCAACGAGAGGAAAAGCAACCCTCTCCACCTTGCCGCATTTATGGGAAGCGTCAGAATGTGCCACGCCATTGCTTCAGCCCAT
GGGGGCTTACTGGACAAGAGAAACATCGACGACGAAACGCCGCTCTTCTTGGCGGCGGTCTACGGCAAGAGGGACGCCTTTTATTGCCTTTACTCCTTCTGTAGAGACGA
CCCAGTTCGAATCTCCGGCAACTGCAGACTCAAGAACAATGGCGACACCGTGATACATTATGCCCTCAGAAACCAGCACTTTGAGTTGGCATATCAATTAATTCACATTA
ACAGCGAGGCTATGAATTGGGTAAACGAGGATGGCTGTACCCCTCTCCATGTTCTAGCAAATAAGCCAACTTCCTTCAAAAGTGGTTCCCACATCCGGGGATGGCGTAAC
TTCATCTATTACTGCATATTTGTGAATAAACTAGAGCCTCAAACAATTGAAACTCTTAGCGATGAAGCGTTGAGAAAAAGCAGGTCCAAAGCAAAGACGACTGATGCCTA
TTTTCCAAAAAACTACGAGACATGCTATGACTTCTTTACGAGTCTGAAGGTTACATTTTTAAAAGTCAGCGGTTTGAATCGACTGCTCAACAATAAGAAAAATGATGAAG
CCAACAACGGTAAAGATTTGGAGAAAGATGTTAACGAGGAAAGTGAAAATTTTGACAGTAGATTTCTTAAAGCTGGTCAATCTGCAGGATCTATGGAAATCAGAAAGATA
TATGAGAAGAAACAAAAGCACACTTGGTCAGTTCAAGTGATGGATAAACTTCTTGAATTTGCTACACCTGATAAATATTGCGACGACGGAAAAACTCCGATGGGTCCGAA
AATTCAAGCCGACGAAGCGGAGGGAATTACGACTCCTTACAACTTGGTAGATGATGAAGTCAAGTTCACATCAGAATCAGTGATGTTGTTAGCAGCAAGGAATGGTGTGA
TTGAAATAGTGAGGGGACTGTACAAACGTTTTCCCCTGGCGATCTACGAAGCTCGGAAAGATAAGAAGAATGTGGTGCTTTTGGCTGCGGAGTACAGGCAGCCCGACGTC
TACCGCTTTTTACTCAACAAAAGGCATCACTTAGAAAGCCTCTTTCGCGCTGTCGATCACAATGGCAACAGCGCCTTACATCTCGCCGCCGCCGCCAGAGCTCCCAAGCT
TTGGCTCATCACCGGAGCTGCACTGCAGATGCAATGGGAAACCAAGTGGTATAAGTACGTGAAGGAGTCTGTGCCACTCCATTTCTTTGCCCCCTATAACAAGGAAGAAA
AAACTGCAAGTACAATCTTCCAGGAAACCCACATGGGTCTGGTGAAGAGCGCCGGGAATTGGCTTCGGGGCACCTCGGAGTCATGCTCTGTGGTGGCCACTCTGATTGCA
ACGGTGGCCTTTGCAACTGCGGCCAACATCCCGGGGGGCTACGACAAAGGCGCCGCAGTACTGGAAGATGAGGATGGCTTTTTTATCTTCGCCATCTCTTCCCTCATTGC
CCTTTGCCTCTCTTCAACATCGGTCATCATGTTTCTCGCCATCTTGACCTCGCGCCTCGATACCAAAGACTTCGGCTCCATCTTGCCCTGGAAACTGCTCATCGGCTTAT
CCTCTCTCTACCTTTCCATCATCGCCATGTTGATTTCCTTTTGCAGCGGCCACTACTTTCTCACCGCTCGCCGCCTCCAAAACGTCGCCGTTCTGCTCTACACCCTTACT
TTTCTCCCCGTCGCTTTGATCTTCGGCATAGTCCAGCTTCCTCTCTACTTCGATTTGCTGCAGGCTATTCTCAAAATAGTGCCTAAAAGGAGCGCCGCCGTCGTCTGCAA
TTAG
Protein sequenceShow/hide protein sequence
MNPTSNNNNNELREFLYVHTKRAKWDAVIQKYAQHPEAQGLKLTRKGETALHLAVITNQEQIVETLVKLICKYPGYRERLETTNERKSNPLHLAAFMGSVRMCHAIASAH
GGLLDKRNIDDETPLFLAAVYGKRDAFYCLYSFCRDDPVRISGNCRLKNNGDTVIHYALRNQHFELAYQLIHINSEAMNWVNEDGCTPLHVLANKPTSFKSGSHIRGWRN
FIYYCIFVNKLEPQTIETLSDEALRKSRSKAKTTDAYFPKNYETCYDFFTSLKVTFLKVSGLNRLLNNKKNDEANNGKDLEKDVNEESENFDSRFLKAGQSAGSMEIRKI
YEKKQKHTWSVQVMDKLLEFATPDKYCDDGKTPMGPKIQADEAEGITTPYNLVDDEVKFTSESVMLLAARNGVIEIVRGLYKRFPLAIYEARKDKKNVVLLAAEYRQPDV
YRFLLNKRHHLESLFRAVDHNGNSALHLAAAARAPKLWLITGAALQMQWETKWYKYVKESVPLHFFAPYNKEEKTASTIFQETHMGLVKSAGNWLRGTSESCSVVATLIA
TVAFATAANIPGGYDKGAAVLEDEDGFFIFAISSLIALCLSSTSVIMFLAILTSRLDTKDFGSILPWKLLIGLSSLYLSIIAMLISFCSGHYFLTARRLQNVAVLLYTLT
FLPVALIFGIVQLPLYFDLLQAILKIVPKRSAAVVCN