; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041292 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041292
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionAnkyrin repeat-containing protein
Genome locationchr13:15121513..15124761
RNA-Seq ExpressionLag0041292
SyntenyLag0041292
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606412.1 NF-kappa-B inhibitor alpha, partial [Cucurbita argyrosperma subsp. sororia]7.2e-16348.56Show/hide
Query:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL
        V EKY +  E    KL  +GDT LHLAVIDNQE  VE LV+ +       + LE  N+R NNPLHLAA +GS +MC  IAS H  LV+ +N   +TPLFL
Subjt:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL

Query:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS
        AA YG+KDAF+CLY FCR    + S N +   +GDTVLH+AL+NEHFDLAFQLI+MHK+A  WV+  G  P+HVLA KP SFKSG+HI GW+ IVY    
Subjt:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS

Query:  VDQLKPQSIETLSKAWKKMIKK--------------------------------------------KNSDEANKD----------------KDLEEHVDV
        V  L+P+SI+ L +   + +KK                                             N+++ N D                 DLEE   +
Subjt:  VDQLKPQSIETLSKAWKKMIKK--------------------------------------------KNSDEANKD----------------KDLEEHVDV

Query:  ETDQS--------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP-------TPRSGRIV--NF---
         T  +                                G  +I KIR+ K+KHTWSVQVMEKLLE    +     G +P          SG  +  NF   
Subjt:  ETDQS--------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP-------TPRSGRIV--NF---

Query:  -----------ETLARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHL
                   + L + Q V      MLLAA+ GV+E+V   F+RF   I DT +D KN+VLLAAE RQ  VYRFLL+K +EI++LFRAVD++GNSALHL
Subjt:  -----------ETLARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHL

Query:  AAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHG
        AA A   +   ITGAALQMQWEV+WY YV+ S P  FF  +N +GKT+S+IF +TH+ L  KG  WLY TS+SCS+VATLIATVAFATAATIPGGND+ G
Subjt:  AAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHG

Query:  AAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFI
         A L  EQGF IFS SSLIALCLS+TS+IMFL+I+TSRF  +DFG  LP K+LIGL  LYFSI+AML+SFCS HYFLI  R  N  +LLYTLTF P+  I
Subjt:  AAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFI

Query:  FGILQLPLFFDLLKPILEIVPSGGAEDVV
        FGI+QLPL+FDLL+ +++ VP   AE V+
Subjt:  FGILQLPLFFDLLKPILEIVPSGGAEDVV

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]8.0e-16248.54Show/hide
Query:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL
        V+EKY +  E    KL  +GDT LHLAVIDNQE IVE LVK++       + LE  N+  NN LHLAA +GS +MC  IAS H  LV+ +N   +TPLFL
Subjt:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL

Query:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS
        AA YG+KDAF+CLY FCR+D  + S N +   +GDTVLH AL +EHFDLAFQLI+MHK+A  WV+  G  P+HVLA KP SFKSG+HI GW+ IVY    
Subjt:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS

Query:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---
        V  L+P+S E L +  ++  +K                                     N+++ N D                 DLEE   + T  +   
Subjt:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---

Query:  -----------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF-
                                     G  +I KIR+ K+KHTWSVQVM KLLE    +     G +P     R                     NF 
Subjt:  -----------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF-

Query:  ---ETLARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSR
           + L + Q V      MLLAA+ GV+E+V    +RF   I DT +D KN+VLLAAE RQ  VYR LL+K  EI++LFRAVD++GNSALHLAA A   +
Subjt:  ---ETLARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSR

Query:  HLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQ
           ITGAALQMQWEV+WYKYV+ S P  FF  +N +GKT+S+IF +TH+ L  KG  WLY TS+SCS+VATLIATVAFATAATIPGGND+ G A L  EQ
Subjt:  HLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQ

Query:  GFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPL
        GF IFS SSLIALCLS+TS+IMFL+I+TSRF  +DFG  LP K+LIGL  LYFSI+AML+SFCS HYFLI  R  N  +LLYTLTF P+  IFGI+QLPL
Subjt:  GFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPL

Query:  FFDLLKPILEIVPSGGAEDVV
        +FDLL+ +++ VP   AE V+
Subjt:  FFDLLKPILEIVPSGGAEDVV

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]2.1e-16248.88Show/hide
Query:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL
        V+EKY +  E    KL  +GDT LHLAVIDNQE IVE LVK++       + LE  N+  NN LHLAA +GS +MC  IAS H  LV+ +N   +TPLFL
Subjt:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL

Query:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS
        AA YG+KDAF+CLY FCR+D  + S N +   +GDTVLH AL +EHFDLAFQLI+MHK+A  WV+  G  P+HVLA KP SFKSG+HI GW+ IVY    
Subjt:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS

Query:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---
        V  L+P+S E L +  ++  +K                                     N+++ N D                 DLEE   + T  +   
Subjt:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---

Query:  ------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF----ET
                                G  +I KIR+ K+KHTWSVQVM KLLE    +     G +P     R                     NF    + 
Subjt:  ------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF----ET

Query:  LARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLGIT
        L + Q V      MLLAA+ GV+E+V    +RF   I DT +D KN+VLLAAE RQ  VYR LL+K  EI++LFRAVD++GNSALHLAA A   +   IT
Subjt:  LARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLGIT

Query:  GAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQGFFIF
        GAALQMQWEV+WYKYV+ S P  FF  +N +GKT+S+IF +TH+ L  KG  WLY TS+SCS+VATLIATVAFATAATIPGGND+ G A L  EQGF IF
Subjt:  GAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQGFFIF

Query:  SISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLL
        S SSLIALCLS+TS+IMFL+I+TSRF  +DFG  LP K+LIGL  LYFSI+AML+SFCS HYFLI  R  N  +LLYTLTF P+  IFGI+QLPL+FDLL
Subjt:  SISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLL

Query:  KPILEIVPSGGAEDVV
        + +++ VP   AE V+
Subjt:  KPILEIVPSGGAEDVV

XP_022995622.1 uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima]1.2e-16044.91Show/hide
Query:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NTKRG WE V++KY + PEA + KL   GDTALHLAV+DN+EE+V++LV  I  ++ Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE

Query:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS
        +LV+ +N + +TPL+LAA  G++DAF+CLY+FCRN+  + + N +  +NGDTVLH AL+N+HFDLAFQ++H++ +A  WV   G  PLHVLA KP +FKS
Subjt:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS

Query:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV
        G+ I GWR I Y    VDQLKPQ I++L + W                                      K++     +DE+ KD  +  ++       +
Subjt:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV

Query:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T
        ETD+S                                               G  +  KIR  K+KHTWSVQVMEKLLE+   +  E  GGTP      T
Subjt:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T

Query:  PRSGRIVNFETLARG---------------------QGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIK
        P    +    +L  G                     Q     MLLAAK GV+E+V  +F RF  +I D  +D KN+VLLAAE  Q  VYRFLL  K   +
Subjt:  PRSGRIVNFETLARG---------------------QGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIK

Query:  SLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATV
        +LFRAVD++GNSALHLAAAA  S    ITGAALQMQWE++WYK+VE+S P  FFA YNKEGK +++IF +TH  LV+K  +WL KTS+SCSVV TLI TV
Subjt:  SLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATV

Query:  AFATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQ
        AF + A+IPGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+ HYFL+     
Subjt:  AFATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQ

Query:  NVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV
        +   LLYT+  +P+  IF I +LPL+ D+++ I +IVP+  A  V+
Subjt:  NVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV

XP_023534309.1 uncharacterized protein LOC111795905 [Cucurbita pepo subsp. pepo]9.4e-16347.31Show/hide
Query:  LRANTKRGNWEAVVEKYAK-------------------EPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICR--------------------STDY
        L A  KRG WE V+EKY +                    P A   KL  +GDT LHLAVIDNQE  VE LV+ I                      + +Y
Subjt:  LRANTKRGNWEAVVEKYAK-------------------EPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICR--------------------STDY

Query:  KQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDL
        K  LE  N+R NNPLHLAA +GS +MC  I S H  LV+ +N   +TPLFLAA YG+KDAF+CLY FC++D  + S N +   +GDTVLH AL+NEH DL
Subjt:  KQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDL

Query:  AFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLK-------------------PQSIETLSKAWKKM-----------
        AFQLI+MHK+A  WV+  G  P+H+LA KP SFKSG+HI GW+ IVY    V  LK                   P +  T    ++ M           
Subjt:  AFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLK-------------------PQSIETLSKAWKKM-----------

Query:  -----------IKKKNSDEANKDK---DLEEHVDVETDQS---------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTET
                    KK+N  + + DK   DLEE   + T  +                           G  +I KIR+ K+KHTWSVQVMEKLLE  L E 
Subjt:  -----------IKKKNSDEANKDK---DLEEHVDVETDQS---------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTET

Query:  PEVPGGTPTPRSGRIVNFET-------------------------LARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESR
         +      +P   +  N E                          L + Q V      MLLAAK GV+E+V    +RF   I DTT+D KN+VLLAAE R
Subjt:  PEVPGGTPTPRSGRIVNFET-------------------------LARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESR

Query:  QLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLY
        Q  VYRF L++++EI  LFRAVD++GNSALHLAA A   +   ITGAALQMQWEV+WY YV+ S P  FF  +N +GKT+S+IF +TH+ L  KG  WLY
Subjt:  QLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLY

Query:  KTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLI
         TS+SCS+VATLIATVAFATAATIPGGND+ G A L  EQGF IFS SSLIALCLS+TS+IMFL+I+TSRF  +DFG  LP K+LIGL  LYFSI+AML+
Subjt:  KTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLI

Query:  SFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV
        SFCS HYFLI  R  N  +LLYTLTF P+  IFGI+QLPL+FDLL+ +++ VP   AE V+
Subjt:  SFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV

TrEMBL top hitse value%identityAlignment
A0A6J1ET50 uncharacterized protein LOC111437341 isoform X21.0e-16248.88Show/hide
Query:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL
        V+EKY +  E    KL  +GDT LHLAVIDNQE IVE LVK++       + LE  N+  NN LHLAA +GS +MC  IAS H  LV+ +N   +TPLFL
Subjt:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL

Query:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS
        AA YG+KDAF+CLY FCR+D  + S N +   +GDTVLH AL +EHFDLAFQLI+MHK+A  WV+  G  P+HVLA KP SFKSG+HI GW+ IVY    
Subjt:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS

Query:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---
        V  L+P+S E L +  ++  +K                                     N+++ N D                 DLEE   + T  +   
Subjt:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---

Query:  ------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF----ET
                                G  +I KIR+ K+KHTWSVQVM KLLE    +     G +P     R                     NF    + 
Subjt:  ------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF----ET

Query:  LARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLGIT
        L + Q V      MLLAA+ GV+E+V    +RF   I DT +D KN+VLLAAE RQ  VYR LL+K  EI++LFRAVD++GNSALHLAA A   +   IT
Subjt:  LARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLGIT

Query:  GAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQGFFIF
        GAALQMQWEV+WYKYV+ S P  FF  +N +GKT+S+IF +TH+ L  KG  WLY TS+SCS+VATLIATVAFATAATIPGGND+ G A L  EQGF IF
Subjt:  GAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQGFFIF

Query:  SISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLL
        S SSLIALCLS+TS+IMFL+I+TSRF  +DFG  LP K+LIGL  LYFSI+AML+SFCS HYFLI  R  N  +LLYTLTF P+  IFGI+QLPL+FDLL
Subjt:  SISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLL

Query:  KPILEIVPSGGAEDVV
        + +++ VP   AE V+
Subjt:  KPILEIVPSGGAEDVV

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X13.9e-16248.54Show/hide
Query:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL
        V+EKY +  E    KL  +GDT LHLAVIDNQE IVE LVK++       + LE  N+  NN LHLAA +GS +MC  IAS H  LV+ +N   +TPLFL
Subjt:  VVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFL

Query:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS
        AA YG+KDAF+CLY FCR+D  + S N +   +GDTVLH AL +EHFDLAFQLI+MHK+A  WV+  G  P+HVLA KP SFKSG+HI GW+ IVY    
Subjt:  AALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS

Query:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---
        V  L+P+S E L +  ++  +K                                     N+++ N D                 DLEE   + T  +   
Subjt:  VDQLKPQSIETLSKAWKKMIKK------------------------------------KNSDEANKD----------------KDLEEHVDVETDQS---

Query:  -----------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF-
                                     G  +I KIR+ K+KHTWSVQVM KLLE    +     G +P     R                     NF 
Subjt:  -----------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGR-------------------IVNF-

Query:  ---ETLARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSR
           + L + Q V      MLLAA+ GV+E+V    +RF   I DT +D KN+VLLAAE RQ  VYR LL+K  EI++LFRAVD++GNSALHLAA A   +
Subjt:  ---ETLARGQGV---MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSR

Query:  HLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQ
           ITGAALQMQWEV+WYKYV+ S P  FF  +N +GKT+S+IF +TH+ L  KG  WLY TS+SCS+VATLIATVAFATAATIPGGND+ G A L  EQ
Subjt:  HLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQ

Query:  GFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPL
        GF IFS SSLIALCLS+TS+IMFL+I+TSRF  +DFG  LP K+LIGL  LYFSI+AML+SFCS HYFLI  R  N  +LLYTLTF P+  IFGI+QLPL
Subjt:  GFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPL

Query:  FFDLLKPILEIVPSGGAEDVV
        +FDLL+ +++ VP   AE V+
Subjt:  FFDLLKPILEIVPSGGAEDVV

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X35.6e-16144.91Show/hide
Query:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NTKRG WE V++KY + PEA + KL   GDTALHLAV+DN+EE+V++LV  I  ++ Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE

Query:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS
        +LV+ +N + +TPL+LAA  G++DAF+CLY+FCRN+  + + N +  +NGDTVLH AL+N+HFDLAFQ++H++ +A  WV   G  PLHVLA KP +FKS
Subjt:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS

Query:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV
        G+ I GWR I Y    VDQLKPQ I++L + W                                      K++     +DE+ KD  +  ++       +
Subjt:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV

Query:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T
        ETD+S                                               G  +  KIR  K+KHTWSVQVMEKLLE+   +  E  GGTP      T
Subjt:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T

Query:  PRSGRIVNFETLARG---------------------QGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIK
        P    +    +L  G                     Q     MLLAAK GV+E+V  +F RF  +I D  +D KN+VLLAAE  Q  VYRFLL  K   +
Subjt:  PRSGRIVNFETLARG---------------------QGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIK

Query:  SLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATV
        +LFRAVD++GNSALHLAAAA  S    ITGAALQMQWE++WYK+VE+S P  FFA YNKEGK +++IF +TH  LV+K  +WL KTS+SCSVV TLI TV
Subjt:  SLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATV

Query:  AFATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQ
        AF + A+IPGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+ HYFL+     
Subjt:  AFATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQ

Query:  NVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV
        +   LLYT+  +P+  IF I +LPL+ D+++ I +IVP+  A  V+
Subjt:  NVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X27.3e-16144.89Show/hide
Query:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NTKRG WE V++KY + PEA + KL   GDTALHLAV+DN+EE+V++LV  I  ++ Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE

Query:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS
        +LV+ +N + +TPL+LAA  G++DAF+CLY+FCRN+  + + N +  +NGDTVLH AL+N+HFDLAFQ++H++ +A  WV   G  PLHVLA KP +FKS
Subjt:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS

Query:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV
        G+ I GWR I Y    VDQLKPQ I++L + W                                      K++     +DE+ KD  +  ++       +
Subjt:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV

Query:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T
        ETD+S                                               G  +  KIR  K+KHTWSVQVMEKLLE+   +  E  GGTP      T
Subjt:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T

Query:  PRSGRIVNFETLARGQGVM-------------------PVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSL
        P    +    +L  G+  +                     MLLAAK GV+E+V  +F RF  +I D  +D KN+VLLAAE  Q  VYRFLL  K   ++L
Subjt:  PRSGRIVNFETLARGQGVM-------------------PVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSL

Query:  FRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAF
        FRAVD++GNSALHLAAAA  S    ITGAALQMQWE++WYK+VE+S P  FFA YNKEGK +++IF +TH  LV+K  +WL KTS+SCSVV TLI TVAF
Subjt:  FRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAF

Query:  ATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNV
         + A+IPGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+ HYFL+     + 
Subjt:  ATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNV

Query:  PVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV
          LLYT+  +P+  IF I +LPL+ D+++ I +IVP+  A  V+
Subjt:  PVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X15.6e-16144.91Show/hide
Query:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NTKRG WE V++KY + PEA + KL   GDTALHLAV+DN+EE+V++LV  I  ++ Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHE

Query:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS
        +LV+ +N + +TPL+LAA  G++DAF+CLY+FCRN+  + + N +  +NGDTVLH AL+N+HFDLAFQ++H++ +A  WV   G  PLHVLA KP +FKS
Subjt:  DLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTK-NNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKS

Query:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV
        G+ I GWR I Y    VDQLKPQ I++L + W                                      K++     +DE+ KD  +  ++       +
Subjt:  GAHIWGWRKIVYDWISVDQLKPQSIETLSKAW--------------------------------------KKMIKKKNSDEANKDKDLEEHV------DV

Query:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T
        ETD+S                                               G  +  KIR  K+KHTWSVQVMEKLLE+   +  E  GGTP      T
Subjt:  ETDQS-----------------------------------------------GFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTP------T

Query:  PRSGRIVNFETLARG---------------------QGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIK
        P    +    +L  G                     Q     MLLAAK GV+E+V  +F RF  +I D  +D KN+VLLAAE  Q  VYRFLL  K   +
Subjt:  PRSGRIVNFETLARG---------------------QGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIK

Query:  SLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATV
        +LFRAVD++GNSALHLAAAA  S    ITGAALQMQWE++WYK+VE+S P  FFA YNKEGK +++IF +TH  LV+K  +WL KTS+SCSVV TLI TV
Subjt:  SLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATV

Query:  AFATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQ
        AF + A+IPGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+ HYFL+     
Subjt:  AFATAATIPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQ

Query:  NVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV
        +   LLYT+  +P+  IF I +LPL+ D+++ I +IVP+  A  V+
Subjt:  NVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVV

SwissProt top hitse value%identityAlignment
P16157 Ankyrin-12.0e-0624.66Show/hide
Query:  NSNLVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIA-
        N N +  L   +K G+ + VVE   K  E   +    KG+TALH+A +  Q+E+V  LV       +Y   + A++++   PL++AA     ++   +  
Subjt:  NSNLVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIA-

Query:  -SAHEDLVNEKNTLGQTPLFLAALYGHKDA--------------FYCLYYFCRND---------LDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMH
          A++++  E    G TPL +A   GH++                  L+   RND          +  + +  +  G T LH A   E+ ++A QL+   
Subjt:  -SAHEDLVNEKNTLGQTPLFLAALYGHKDA--------------FYCLYYFCRND---------LDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMH

Query:  KDAARWVDRQGFIPLHVLACKPN
          +  +  + G  PLH+ + + N
Subjt:  KDAARWVDRQGFIPLHVLACKPN

P50086 Probable 26S proteasome regulatory subunit p285.3e-0733.33Show/hide
Query:  GNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQ
        GN E V   Y +  +    K+  +G T LHLAV     E+ + L+       +    +  K+K N  PLH AA++GS K+  ++    +  VN ++  G 
Subjt:  GNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQ

Query:  TPLFLAALYGHKDA
        TPLF A   GH DA
Subjt:  TPLFLAALYGHKDA

Q02357 Ankyrin-12.0e-0624.66Show/hide
Query:  NSNLVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIA-
        N N +  L   +K G+ + VVE   K  E   +    KG+TALH+A +  Q+E+V  LV       +Y   + A++++   PL++AA     ++   +  
Subjt:  NSNLVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIA-

Query:  -SAHEDLVNEKNTLGQTPLFLAALYGHKDA--------------FYCLYYFCRND---------LDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMH
          A++++  E    G TPL +A   GH++                  L+   RND          +  + +  +  G T LH A   E+ ++A QL+   
Subjt:  -SAHEDLVNEKNTLGQTPLFLAALYGHKDA--------------FYCLYYFCRND---------LDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMH

Query:  KDAARWVDRQGFIPLHVLACKPN
          +  +  + G  PLH+ + + N
Subjt:  KDAARWVDRQGFIPLHVLACKPN

Q4UMH6 Putative ankyrin repeat protein RF_03811.9e-0932.3Show/hide
Query:  KRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTL
        K GN   V  K+  E +A        G+T LH AV  N  ++V  L+        Y   + AK       LH A   G+  +  ++ S   D VN K   
Subjt:  KRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTL

Query:  GQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIH
        G+T L+ A  YG  D  Y L  +          N K +NG+TVLHYA+++ + DL   LIH
Subjt:  GQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIH

Q8LPS2 Protein ACCELERATED CELL DEATH 61.6e-0823.39Show/hide
Query:  KNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKN---------TKNNNGDTVLHYALKNEHF
        K+   ++ LH+AA  G  ++   I      L+ E+N+  QTPL +A   GH      L     + L   S            K+ +G+T L+YA++  + 
Subjt:  KNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKN---------TKNNNGDTVLHYALKNEHF

Query:  DLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWKKMIKKKNSDEANKDKDLEEHVDVETDQSGF
        ++A  L++  KDA    + +G   L+      N F                   + L    ++T      + ++K N D   +      HV ++    G 
Subjt:  DLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWKKMIKKKNSDEANKDKDLEEHVDVETDQSGF

Query:  LDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGRIVNFET----LARGQGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIV
        LD+     +  ++     +M++  E G T                I+N  T    +    G  P+   AAK    E++ E  KR   + +     G+NI+
Subjt:  LDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGRIVNFET----LARGQGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIV

Query:  LLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA--------AARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQF------------FALY
         +AA++        L+  KD  K L    D DGN+ LHLA             SR+  I    L+ +  +R     E    P +            +A++
Subjt:  LLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA--------AARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQF------------FALY

Query:  NKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGG------NDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSIL
        +   ++  S+   +  +  +K   ++     +  VVA L+ATV FA   TIPGG        + G A L      FIF +  ++A+    +S+    +++
Subjt:  NKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGG------NDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSIL

Query:  TSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLT------FLPIIFIFG
         ++ G  D    L S + + L  L FS++ M ++F       IAH    V  LL T++      FL  IFI G
Subjt:  TSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLT------FLPIIFIFG

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein3.7e-4826.72Show/hide
Query:  NTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKN
        N   G  EA  +   + PEA    L   GDT +H AV+    +IVE +++   R  D +Q L+ KN      L  AA  G  ++   + +    LV+ +N
Subjt:  NTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKN

Query:  TLGQTPLFLAALYGHKDAFYCLY-YFCRNDLDQC--SKNTKNNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIW
             P+ +A+LYGHK     LY +   +DLD C  S   K  NG  ++   + +  + +A  LI  +   A   D      +  LA  P +F S   I 
Subjt:  TLGQTPLFLAALYGHKDAFYCLY-YFCRNDLDQC--SKNTKNNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIW

Query:  GWRKIVYDWISVDQLKPQSIETLSKAWKKMIKKKNSDEANKDKDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPR
                                                                    I ++ ++K  H  + ++++ + +       E+P       
Subjt:  GWRKIVYDWISVDQLKPQSIETLSKAWKKMIKKKNSDEANKDKDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPR

Query:  SGRIVNFETL-ARGQGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA-AAR
              F+    +  G+   +  A + G++E + E+ + +   +      G NI   A   RQ  ++  +     +   L    D   N+ LH AA  A 
Subjt:  SGRIVNFETL-ARGQGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA-AAR

Query:  TSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYN-KEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVL
         SR   I GAALQMQ E++W+K VEK   P+   + N K+ KT  ++F D H+ LV++GE W+ +T+ SC+VVA LI T+ F++A T+PGG    G  + 
Subjt:  TSRHLGITGAALQMQWEVRWYKYVEKSAPPQFFALYN-KEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVL

Query:  EKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGIL
          +  F IF IS  I+L  S  S++MFL IL SR+   DF  +LP+K+++GL +L+ S+  M+++F      L+  +   V      L  +P + +F +L
Subjt:  EKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGIL

Query:  QLPLFFDLLK
        Q P+  ++ +
Subjt:  QLPLFFDLLK

AT5G04680.1 Ankyrin repeat family protein3.2e-3628.36Show/hide
Query:  DTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRN---NNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFC
        +T L  A    + EIV+   +L+ R T   +Q+  K  +N   +  L + A  G+ ++   + + +  L+      GQ P+ +A    +       Y + 
Subjt:  DTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRN---NNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFC

Query:  RNDLDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQ-GFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWK
        R  + Q        +G  +   A+     D+A  L +M +  A     Q   IP+ VLA KP+ F    ++    + +Y WI V       + TL K   
Subjt:  RNDLDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQ-GFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWK

Query:  KMIKKKNSDEANKD-KDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLE-----FGLTETPEVPGGTPTPRSGRI-VNFETLA-----RGQGV
                  ANKD K     +     +S ++ + K+R  K    +   +M KLL+      G+ E   +       +   + ++ ETL      R + V
Subjt:  KMIKKKNSDEANKD-KDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLE-----FGLTETPEVPGGTPTPRSGRI-VNFETLA-----RGQGV

Query:  MPVMLLAAKYGVMEMVMELFKRFRTTIHDT-TEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA-AARTSRHLGITGAALQMQW
           +L A +YG ++ ++E+ +     +  T T     + LLA E RQ  V+  L    D    L    D+DGN  LHLA   +  S+   +  A L+MQ 
Subjt:  MPVMLLAAKYGVMEMVMELFKRFRTTIHDT-TEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA-AARTSRHLGITGAALQMQW

Query:  EVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIA---------TVAFATAATIPGGNDDH--GAAVLEKEQGF
        E++W+K VE+ AP       N E +T   IF   H+ L Q+ E W+  T+ SCS+VA LI          TV FA   T+ GG+DD+  G      EQ F
Subjt:  EVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIA---------TVAFATAATIPGGNDDH--GAAVLEKEQGF

Query:  FIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFF
         IF +S LI+   + T++ +FL ILT+R+   DF   LP+KM+ GLS L+ SI AMLI+F S     + ++ + +         LP + +F +LQ PL  
Subjt:  FIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFF

Query:  DLL
        +++
Subjt:  DLL

AT5G04690.1 Ankyrin repeat family protein4.7e-3535.37Show/hide
Query:  VNFETLA-----RGQGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRA-VDNDGNSALHLAAAAR
        ++ ETLA     R + V   +L A +YG ++ ++E+ K     +  T      +   A + RQ  V+  L    D  K LF A  D+DGNS LHLA    
Subjt:  VNFETLA-----RGQGVMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRA-VDNDGNSALHLAAAAR

Query:  TSRHLG-ITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDH--GAA
         +  L  +  A LQMQ E++W+K +E+  P       N E  T   IF   HE +  + E W+  T+ SCS+VA LI TV FA   T+PGG DD+  G  
Subjt:  TSRHLG-ITGAALQMQWEVRWYKYVEKSAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDH--GAA

Query:  VLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFG
            E+ F IF +S LI+   + TS+++FL ILT+R+   DF  +LP+ M+ GLS+L+ SI AML++F SA + +    +   P + +   F  ++F+  
Subjt:  VLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFG

Query:  ILQLPLFFDLL
        ++Q PL  +L+
Subjt:  ILQLPLFFDLL

AT5G04700.1 Ankyrin repeat family protein1.7e-4528.42Show/hide
Query:  DTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRND
        +T L  A    + EIV+ L++ +  + +      ++N   + PL + A  G+ ++   + + +  L+      GQ P+ +A    +       Y + R  
Subjt:  DTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQTPLFLAALYGHKDAFYCLYYFCRND

Query:  LDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQ-GFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWKKMI
        + Q   +    +G  +   A+  +  D+A  L +M +  A     Q   IP+ VLA KP+ F  G ++    + +Y WI V                K+ 
Subjt:  LDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQ-GFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWKKMI

Query:  KKKNSDEANKDKDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGRIVNFETLA-----RGQGVMPVMLLAAKY
               +N+D+       +    S +  I ++  +K  H     +  K L  G++E                   ETLA     R + V   +L A +Y
Subjt:  KKKNSDEANKDKDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGRIVNFETLA-----RGQGVMPVMLLAAKY

Query:  GVMEMVMELFKRFRTTIHDT-TEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA-AARTSRHLGITGAALQMQWEVRWYKYVEK
        G ++ ++E+ +     +  T T     + LLA E RQ  V+  L    D    L    D DGN  LHLA   +  S+   + GA LQ+Q E++W+K VE+
Subjt:  GVMEMVMELFKRFRTTIHDT-TEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAA-AARTSRHLGITGAALQMQWEVRWYKYVEK

Query:  SAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDH--GAAVLEKEQGFFIFSISSLIALCLSTTSII
         AP       N E +T   IF   H+ L Q+ E W+  T+ SCS+VA LI TV FA   T+PGG DD+  G     +++ F IF +S LI+   S TS++
Subjt:  SAPPQFFALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDH--GAAVLEKEQGFFIFSISSLIALCLSTTSII

Query:  MFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLL
        +FL ILT+R+   DF   LP+KM+ GLS L+ SI AMLI+F SA + ++    + +         LP + +F +LQ PL  +++
Subjt:  MFLSILTSRFGARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLL

AT5G04730.1 Ankyrin-repeat containing protein8.5e-3727.34Show/hide
Query:  VERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQ----TPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTKNN
        +E + KL+      + Q+E      +  L +AA  G+      +   H +    +N   +     P+  A+  GHK     LYY   N   +       N
Subjt:  VERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNEKNTLGQ----TPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTKNN

Query:  NGDT-----VLHYALKNEHFDLAFQLI-HMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWKKMIKKKNSD
        + D      +L  A+     D+A  +I H+   A     RQ       +A KP+ F+S  +   WR ++Y  I V +                     + 
Subjt:  NGDT-----VLHYALKNEHFDLAFQLI-HMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWISVDQLKPQSIETLSKAWKKMIKKKNSD

Query:  EANKDKDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGRIVNFETLARGQGVMPVMLLAAKYGVMEMVMELFK
          N+D  L   + +      +  I +  ++KK+H+ +    +KLL+   T   +            I+    +   + V   +L AAK G  +  +E+ K
Subjt:  EANKDKDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGRIVNFETLARGQGVMPVMLLAAKYGVMEMVMELFK

Query:  ---RFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLG-ITGAALQMQWEVRWYKYVEKSAPPQFFAL
           +    ++ T+  G+N+  LA E ++  ++  +    D   +L R+ D   N+ LH+A    T   L  I+GAAL+MQ E +W+K VE     +    
Subjt:  ---RFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLG-ITGAALQMQWEVRWYKYVEKSAPPQFFAL

Query:  YNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGND-DHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRF
         NK+ KT   IFE  HE L ++GE+W+  T+ +CS VA LIATV F    T+PGG D   G+ ++  +  F  F  +  +A   S  S+++FLSILTSR+
Subjt:  YNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGND-DHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRF

Query:  GARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPIL
           DF  +LP KM++G S L+ SI +ML++F ++    + H+    P L+Y L   P+     +L L L + LLK ++
Subjt:  GARDFGSNLPSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAACTCAAATTTAGTAGAGTTTCTACGTGCAAACACGAAGAGAGGGAACTGGGAAGCAGTGGTTGAAAAGTATGCGAAAGAGCCTGAAGCTCCAAAGCAGAAGCT
GGCCCCAAAAGGAGACACAGCACTGCATTTGGCTGTTATTGACAATCAAGAAGAAATAGTCGAAAGGCTTGTCAAACTCATTTGCAGATCCACAGATTACAAGCAACAGC
TTGAAGCTAAAAACAAGAGAAATAACAATCCTCTCCACCTCGCTGCGGCACTGGGGAGTGCCAAAATGTGCCACATCATTGCTTCAGCCCATGAGGACTTAGTGAATGAG
AAAAACACCCTCGGCCAAACGCCTCTCTTCTTGGCGGCTCTGTATGGCCACAAGGACGCCTTTTATTGTCTTTACTACTTTTGCAGAAATGACCTCGATCAATGCTCCAA
GAATACAAAGAATAATAATGGAGACACCGTACTACATTATGCCCTAAAAAACGAGCATTTTGATTTGGCATTTCAATTAATTCACATGCACAAAGACGCTGCTCGTTGGG
TGGATAGGCAAGGCTTTATCCCTCTCCATGTTCTAGCATGTAAGCCAAATTCCTTCAAAAGTGGAGCACACATCTGGGGATGGCGGAAAATCGTTTATGACTGGATATCT
GTGGATCAACTAAAGCCTCAATCAATAGAAACCCTCAGTAAAGCGTGGAAGAAAATGATCAAAAAGAAGAATAGTGATGAAGCCAACAAGGATAAAGATTTGGAGGAACA
TGTTGATGTGGAAACTGATCAATCGGGATTTCTTGATATCGGAAAGATACGAGAAATGAAAAAGAAACACACTTGGTCGGTTCAAGTGATGGAAAAACTTCTAGAATTTG
GTCTCACAGAAACTCCCGAAGTACCCGGCGGAACACCAACACCTCGTAGTGGTCGAATAGTAAACTTTGAGACACTCGCACGTGGCCAAGGCGTGATGCCAGTGATGTTA
CTAGCAGCAAAGTATGGTGTGATGGAGATGGTGATGGAACTGTTCAAACGCTTTCGGACGACGATCCATGATACTACGGAAGATGGGAAAAATATAGTGCTTTTGGCTGC
AGAGTCTAGACAGCTACACGTGTACAGGTTTTTACTGGAGAAAAAAGATGAGATAAAAAGCCTGTTTCGAGCCGTCGATAATGATGGCAATAGCGCCTTGCATCTCGCAG
CAGCCGCCCGAACTTCAAGGCATTTGGGCATCACTGGAGCCGCACTACAGATGCAATGGGAAGTTAGGTGGTATAAGTACGTGGAGAAGTCTGCGCCGCCTCAATTTTTT
GCCCTCTATAATAAGGAAGGAAAAACTTCAAGTTCAATCTTTGAAGATACCCACGAGGTTCTAGTACAAAAGGGCGAGGATTGGCTTTACAAAACTTCACAGTCATGCTC
AGTGGTAGCTACTCTGATTGCAACAGTGGCTTTTGCAACCGCAGCTACTATCCCAGGCGGCAACGATGACCATGGCGCCGCAGTACTCGAAAAAGAGCAAGGCTTTTTTA
TCTTCTCCATCTCTTCCCTCATTGCTCTATGCCTCTCTACAACCTCAATCATCATGTTTCTTTCCATCTTGACCTCCAGGTTCGGTGCCAGAGACTTCGGATCAAACTTG
CCTTCCAAAATGCTCATTGGCTTATCCTCTCTTTACTTTTCCATCGTCGCCATGTTAATTTCCTTTTGTAGTGCCCATTACTTTCTCATCGCTCACCGTTTTCAAAATGT
GCCTGTTCTACTCTACACACTTACTTTTCTTCCCATCATATTCATCTTTGGAATATTGCAGCTTCCTCTCTTCTTCGATTTGCTGAAGCCTATTCTTGAAATAGTGCCTA
GTGGGGGCGCCGAGGACGTCGTATTCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTAACTCAAATTTAGTAGAGTTTCTACGTGCAAACACGAAGAGAGGGAACTGGGAAGCAGTGGTTGAAAAGTATGCGAAAGAGCCTGAAGCTCCAAAGCAGAAGCT
GGCCCCAAAAGGAGACACAGCACTGCATTTGGCTGTTATTGACAATCAAGAAGAAATAGTCGAAAGGCTTGTCAAACTCATTTGCAGATCCACAGATTACAAGCAACAGC
TTGAAGCTAAAAACAAGAGAAATAACAATCCTCTCCACCTCGCTGCGGCACTGGGGAGTGCCAAAATGTGCCACATCATTGCTTCAGCCCATGAGGACTTAGTGAATGAG
AAAAACACCCTCGGCCAAACGCCTCTCTTCTTGGCGGCTCTGTATGGCCACAAGGACGCCTTTTATTGTCTTTACTACTTTTGCAGAAATGACCTCGATCAATGCTCCAA
GAATACAAAGAATAATAATGGAGACACCGTACTACATTATGCCCTAAAAAACGAGCATTTTGATTTGGCATTTCAATTAATTCACATGCACAAAGACGCTGCTCGTTGGG
TGGATAGGCAAGGCTTTATCCCTCTCCATGTTCTAGCATGTAAGCCAAATTCCTTCAAAAGTGGAGCACACATCTGGGGATGGCGGAAAATCGTTTATGACTGGATATCT
GTGGATCAACTAAAGCCTCAATCAATAGAAACCCTCAGTAAAGCGTGGAAGAAAATGATCAAAAAGAAGAATAGTGATGAAGCCAACAAGGATAAAGATTTGGAGGAACA
TGTTGATGTGGAAACTGATCAATCGGGATTTCTTGATATCGGAAAGATACGAGAAATGAAAAAGAAACACACTTGGTCGGTTCAAGTGATGGAAAAACTTCTAGAATTTG
GTCTCACAGAAACTCCCGAAGTACCCGGCGGAACACCAACACCTCGTAGTGGTCGAATAGTAAACTTTGAGACACTCGCACGTGGCCAAGGCGTGATGCCAGTGATGTTA
CTAGCAGCAAAGTATGGTGTGATGGAGATGGTGATGGAACTGTTCAAACGCTTTCGGACGACGATCCATGATACTACGGAAGATGGGAAAAATATAGTGCTTTTGGCTGC
AGAGTCTAGACAGCTACACGTGTACAGGTTTTTACTGGAGAAAAAAGATGAGATAAAAAGCCTGTTTCGAGCCGTCGATAATGATGGCAATAGCGCCTTGCATCTCGCAG
CAGCCGCCCGAACTTCAAGGCATTTGGGCATCACTGGAGCCGCACTACAGATGCAATGGGAAGTTAGGTGGTATAAGTACGTGGAGAAGTCTGCGCCGCCTCAATTTTTT
GCCCTCTATAATAAGGAAGGAAAAACTTCAAGTTCAATCTTTGAAGATACCCACGAGGTTCTAGTACAAAAGGGCGAGGATTGGCTTTACAAAACTTCACAGTCATGCTC
AGTGGTAGCTACTCTGATTGCAACAGTGGCTTTTGCAACCGCAGCTACTATCCCAGGCGGCAACGATGACCATGGCGCCGCAGTACTCGAAAAAGAGCAAGGCTTTTTTA
TCTTCTCCATCTCTTCCCTCATTGCTCTATGCCTCTCTACAACCTCAATCATCATGTTTCTTTCCATCTTGACCTCCAGGTTCGGTGCCAGAGACTTCGGATCAAACTTG
CCTTCCAAAATGCTCATTGGCTTATCCTCTCTTTACTTTTCCATCGTCGCCATGTTAATTTCCTTTTGTAGTGCCCATTACTTTCTCATCGCTCACCGTTTTCAAAATGT
GCCTGTTCTACTCTACACACTTACTTTTCTTCCCATCATATTCATCTTTGGAATATTGCAGCTTCCTCTCTTCTTCGATTTGCTGAAGCCTATTCTTGAAATAGTGCCTA
GTGGGGGCGCCGAGGACGTCGTATTCAATTAG
Protein sequenceShow/hide protein sequence
MANSNLVEFLRANTKRGNWEAVVEKYAKEPEAPKQKLAPKGDTALHLAVIDNQEEIVERLVKLICRSTDYKQQLEAKNKRNNNPLHLAAALGSAKMCHIIASAHEDLVNE
KNTLGQTPLFLAALYGHKDAFYCLYYFCRNDLDQCSKNTKNNNGDTVLHYALKNEHFDLAFQLIHMHKDAARWVDRQGFIPLHVLACKPNSFKSGAHIWGWRKIVYDWIS
VDQLKPQSIETLSKAWKKMIKKKNSDEANKDKDLEEHVDVETDQSGFLDIGKIREMKKKHTWSVQVMEKLLEFGLTETPEVPGGTPTPRSGRIVNFETLARGQGVMPVML
LAAKYGVMEMVMELFKRFRTTIHDTTEDGKNIVLLAAESRQLHVYRFLLEKKDEIKSLFRAVDNDGNSALHLAAAARTSRHLGITGAALQMQWEVRWYKYVEKSAPPQFF
ALYNKEGKTSSSIFEDTHEVLVQKGEDWLYKTSQSCSVVATLIATVAFATAATIPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGARDFGSNL
PSKMLIGLSSLYFSIVAMLISFCSAHYFLIAHRFQNVPVLLYTLTFLPIIFIFGILQLPLFFDLLKPILEIVPSGGAEDVVFN