CuGenDBv2

Spg028797 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg028797
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ankyrin repeat family protein
Genome location: scaffold7:15713982..15718672
RNA-Seq Expression: Spg028797
Synteny: Spg028797
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002110 - Ankyrin repeat
                  IPR026961 - PGG domain
                  IPR036770 - Ankyrin repeat-containing domain superfamily
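
The genome location in this record has the form seqid:start..end. The following is a minimal sketch of parsing that string and computing the gene span. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names introduced here for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


# Matches strings such as "scaffold7:15713982..15718672".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("scaffold7:15713982..15718672")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for Spg028797 comes out at 4691 bp; with a 0-based, half-open convention it would be 4690 bp.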


Homology
GenBank top hits (e value, % identity, alignment)
XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]    e value: 9.7e-163    % identity: 47.57
Query:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS
        N D  E L+   E    + V+EKY E  E    +L   GDT LHLAVIDNQE IVE LVK++       ++L   N+S N+ LHLAA +GS RMC  IAS
Subjt:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS

Query:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS
         +  LV+  N   +TPLFLAA YG+KDAF+CLY FCR + ++ + NC  + + DTVLH AL +EHFDLAFQLI+MHK+A  WV+  G TP+HVLA KP+S
Subjt:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS

Query:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTL--------------KPLVKKKNSDE
        FKSG+HI GW  IVY+  +V  L+P+S E L +  ++S    K     S FP NY TCIHFF  + D IL                   K  + K + D+
Subjt:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTL--------------KPLVKKKNSDE

Query:  ANKDKDLEEHVDVE--TDP--------------------SSINLVLPMPRQRMELL--------------------------------------------
         N D + ++ ++ +  T+P                    S+I ++L      ++ +                                            
Subjt:  ANKDKDLEEHVDVE--TDP--------------------SSINLVLPMPRQRMELL--------------------------------------------

Query:  --------------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRA
                            E  + E P+  +A    MLLA + GV+E+V    +RF   I DT +D KN+VLLAAE+RQ  VYR LL+K  EI++LFRA
Subjt:  --------------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRA

Query:  VDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATA
        VD  GNSALHLAATA   K   ITGAALQMQWEV+WYKYV+ SVP   F  +N +GKTAS IF ETH  +  KG  WLY TS+SCS+VATLIATVAFATA
Subjt:  VDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATA

Query:  ATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLL
        AT+PGGND+ G A L  EQGF IFS SSLIALCLS+TS+IMFL+I+TSRF  KDFG  LP K+LIGL  LYFSI+AML+SFCSGHYFLI+ RL N A LL
Subjt:  ATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLL

Query:  YTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        YTLTF P+  IFGI+QLPL+FDLL+ + + VP   A+ VL
Subjt:  YTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]    e value: 4.4e-163    % identity: 48.44
Query:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS
        N D  E L+   E    + V+EKY E  E    +L   GDT LHLAVIDNQE IVE LVK++       ++L   N+S N+ LHLAA +GS RMC  IAS
Subjt:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS

Query:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS
         +  LV+  N   +TPLFLAA YG+KDAF+CLY FCR + ++ + NC  + + DTVLH AL +EHFDLAFQLI+MHK+A  WV+  G TP+HVLA KP+S
Subjt:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS

Query:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVK---------KKNSDEANKDK
        FKSG+HI GW  IVY+  +V  L+P+S E L +  ++S    K     S FP NY TCIHFF  + D IL     K             KKN  + + DK
Subjt:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVK---------KKNSDEANKDK

Query:  ---DLEEHVDVETDP-------------------SSINLVLPMPRQRMELL-------------------------------------------------
           DLEE   + T                     S+I ++L      ++ +                                                 
Subjt:  ---DLEEHVDVETDP-------------------SSINLVLPMPRQRMELL-------------------------------------------------

Query:  ---------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYG
                       E  + E P+  +A    MLLA + GV+E+V    +RF   I DT +D KN+VLLAAE+RQ  VYR LL+K  EI++LFRAVD  G
Subjt:  ---------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYG

Query:  NSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPG
        NSALHLAATA   K   ITGAALQMQWEV+WYKYV+ SVP   F  +N +GKTAS IF ETH  +  KG  WLY TS+SCS+VATLIATVAFATAAT+PG
Subjt:  NSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPG

Query:  GNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTF
        GND+ G A L  EQGF IFS SSLIALCLS+TS+IMFL+I+TSRF  KDFG  LP K+LIGL  LYFSI+AML+SFCSGHYFLI+ RL N A LLYTLTF
Subjt:  GNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTF

Query:  LPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
         P+  IFGI+QLPL+FDLL+ + + VP   A+ VL
Subjt:  LPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

XP_022995620.1 uncharacterized protein LOC111491104 isoform X1 [Cucurbita maxima]    e value: 1.0e-164    % identity: 46.26
Query:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE
        L +FL  NT+RG WE V++KY E PEAQ L+LT NGDT LHLAV+DN+EE+V++LV  I  +  Y +LL   N    +PLHLAA +GSA MC+ IASA++
Subjt:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE

Query:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS
        +LV+  N   ETPL+LAA  G++DAF+CLY+FCR+N ++ T NC   SN DTVLH AL+N+HFDLAFQ++H++ +A HWV   G TPLHVLASKP++FKS
Subjt:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS

Query:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------
        G+ I GW  I Y   +VD+LKPQ I++L   W   +     NT+T  FPANY TCI FFT +WDG LK S LK +     +DE+ KD  +  ++      
Subjt:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------

Query:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF
         +ETD S   L               +  +PR                                         Q ME LLE                   
Subjt:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF

Query:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE
        +TP+  +  +P                                 MLLA K GV+E+V  ++ RF  +I D  +D KN+VLLAAEY Q  VYRFLL  K  
Subjt:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE

Query:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA
         E+LFRAVD  GNSALHLAA A  S    ITGAALQMQWE++WYK+VE+SVP   FA YNK+GK A+ IF ETHM +V+K  +WL KTS+SCSVV TLI 
Subjt:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA

Query:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR
        TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   
Subjt:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR

Query:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        + + A LLYT+  +P+  IF I +LPL+ D+++ I +IVP   A  VL
Subjt:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

XP_022995621.1 uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima]    e value: 6.1e-165    % identity: 46.38
Query:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE
        L +FL  NT+RG WE V++KY E PEAQ L+LT NGDT LHLAV+DN+EE+V++LV  I  +  Y +LL   N    +PLHLAA +GSA MC+ IASA++
Subjt:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE

Query:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS
        +LV+  N   ETPL+LAA  G++DAF+CLY+FCR+N ++ T NC   SN DTVLH AL+N+HFDLAFQ++H++ +A HWV   G TPLHVLASKP++FKS
Subjt:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS

Query:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------
        G+ I GW  I Y   +VD+LKPQ I++L   W   +     NT+T  FPANY TCI FFT +WDG LK S LK +     +DE+ KD  +  ++      
Subjt:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------

Query:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF
         +ETD S   L               +  +PR                                         Q ME LLE                   
Subjt:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF

Query:  ETPEYGQALMP------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIE
        +TP+  +  +P                               MLLA K GV+E+V  ++ RF  +I D  +D KN+VLLAAEY Q  VYRFLL  K   E
Subjt:  ETPEYGQALMP------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIE

Query:  SLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATV
        +LFRAVD  GNSALHLAA A  S    ITGAALQMQWE++WYK+VE+SVP   FA YNK+GK A+ IF ETHM +V+K  +WL KTS+SCSVV TLI TV
Subjt:  SLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATV

Query:  AFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQ
        AF + A++PGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + 
Subjt:  AFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQ

Query:  NVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        + A LLYT+  +P+  IF I +LPL+ D+++ I +IVP   A  VL
Subjt:  NVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

XP_022995622.1 uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima]    e value: 1.0e-164    % identity: 46.26
Query:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE
        L +FL  NT+RG WE V++KY E PEAQ L+LT NGDT LHLAV+DN+EE+V++LV  I  +  Y +LL   N    +PLHLAA +GSA MC+ IASA++
Subjt:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE

Query:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS
        +LV+  N   ETPL+LAA  G++DAF+CLY+FCR+N ++ T NC   SN DTVLH AL+N+HFDLAFQ++H++ +A HWV   G TPLHVLASKP++FKS
Subjt:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS

Query:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------
        G+ I GW  I Y   +VD+LKPQ I++L   W   +     NT+T  FPANY TCI FFT +WDG LK S LK +     +DE+ KD  +  ++      
Subjt:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------

Query:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF
         +ETD S   L               +  +PR                                         Q ME LLE                   
Subjt:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF

Query:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE
        +TP+  +  +P                                 MLLA K GV+E+V  ++ RF  +I D  +D KN+VLLAAEY Q  VYRFLL  K  
Subjt:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE

Query:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA
         E+LFRAVD  GNSALHLAA A  S    ITGAALQMQWE++WYK+VE+SVP   FA YNK+GK A+ IF ETHM +V+K  +WL KTS+SCSVV TLI 
Subjt:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA

Query:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR
        TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   
Subjt:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR

Query:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        + + A LLYT+  +P+  IF I +LPL+ D+++ I +IVP   A  VL
Subjt:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

TrEMBL top hits (e value, % identity, alignment)
A0A6J1ET50 uncharacterized protein LOC111437341 isoform X2    e value: 2.1e-163    % identity: 48.44
Query:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS
        N D  E L+   E    + V+EKY E  E    +L   GDT LHLAVIDNQE IVE LVK++       ++L   N+S N+ LHLAA +GS RMC  IAS
Subjt:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS

Query:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS
         +  LV+  N   +TPLFLAA YG+KDAF+CLY FCR + ++ + NC  + + DTVLH AL +EHFDLAFQLI+MHK+A  WV+  G TP+HVLA KP+S
Subjt:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS

Query:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVK---------KKNSDEANKDK
        FKSG+HI GW  IVY+  +V  L+P+S E L +  ++S    K     S FP NY TCIHFF  + D IL     K             KKN  + + DK
Subjt:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVK---------KKNSDEANKDK

Query:  ---DLEEHVDVETDP-------------------SSINLVLPMPRQRMELL-------------------------------------------------
           DLEE   + T                     S+I ++L      ++ +                                                 
Subjt:  ---DLEEHVDVETDP-------------------SSINLVLPMPRQRMELL-------------------------------------------------

Query:  ---------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYG
                       E  + E P+  +A    MLLA + GV+E+V    +RF   I DT +D KN+VLLAAE+RQ  VYR LL+K  EI++LFRAVD  G
Subjt:  ---------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYG

Query:  NSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPG
        NSALHLAATA   K   ITGAALQMQWEV+WYKYV+ SVP   F  +N +GKTAS IF ETH  +  KG  WLY TS+SCS+VATLIATVAFATAAT+PG
Subjt:  NSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPG

Query:  GNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTF
        GND+ G A L  EQGF IFS SSLIALCLS+TS+IMFL+I+TSRF  KDFG  LP K+LIGL  LYFSI+AML+SFCSGHYFLI+ RL N A LLYTLTF
Subjt:  GNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTF

Query:  LPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
         P+  IFGI+QLPL+FDLL+ + + VP   A+ VL
Subjt:  LPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X1    e value: 4.7e-163    % identity: 47.57
Query:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS
        N D  E L+   E    + V+EKY E  E    +L   GDT LHLAVIDNQE IVE LVK++       ++L   N+S N+ LHLAA +GS RMC  IAS
Subjt:  NADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIAS

Query:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS
         +  LV+  N   +TPLFLAA YG+KDAF+CLY FCR + ++ + NC  + + DTVLH AL +EHFDLAFQLI+MHK+A  WV+  G TP+HVLA KP+S
Subjt:  AYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS

Query:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTL--------------KPLVKKKNSDE
        FKSG+HI GW  IVY+  +V  L+P+S E L +  ++S    K     S FP NY TCIHFF  + D IL                   K  + K + D+
Subjt:  FKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTL--------------KPLVKKKNSDE

Query:  ANKDKDLEEHVDVE--TDP--------------------SSINLVLPMPRQRMELL--------------------------------------------
         N D + ++ ++ +  T+P                    S+I ++L      ++ +                                            
Subjt:  ANKDKDLEEHVDVE--TDP--------------------SSINLVLPMPRQRMELL--------------------------------------------

Query:  --------------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRA
                            E  + E P+  +A    MLLA + GV+E+V    +RF   I DT +D KN+VLLAAE+RQ  VYR LL+K  EI++LFRA
Subjt:  --------------------EAIDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRA

Query:  VDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATA
        VD  GNSALHLAATA   K   ITGAALQMQWEV+WYKYV+ SVP   F  +N +GKTAS IF ETH  +  KG  WLY TS+SCS+VATLIATVAFATA
Subjt:  VDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATA

Query:  ATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLL
        AT+PGGND+ G A L  EQGF IFS SSLIALCLS+TS+IMFL+I+TSRF  KDFG  LP K+LIGL  LYFSI+AML+SFCSGHYFLI+ RL N A LL
Subjt:  ATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLL

Query:  YTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        YTLTF P+  IFGI+QLPL+FDLL+ + + VP   A+ VL
Subjt:  YTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X3    e value: 5.0e-165    % identity: 46.26
Query:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE
        L +FL  NT+RG WE V++KY E PEAQ L+LT NGDT LHLAV+DN+EE+V++LV  I  +  Y +LL   N    +PLHLAA +GSA MC+ IASA++
Subjt:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE

Query:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS
        +LV+  N   ETPL+LAA  G++DAF+CLY+FCR+N ++ T NC   SN DTVLH AL+N+HFDLAFQ++H++ +A HWV   G TPLHVLASKP++FKS
Subjt:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS

Query:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------
        G+ I GW  I Y   +VD+LKPQ I++L   W   +     NT+T  FPANY TCI FFT +WDG LK S LK +     +DE+ KD  +  ++      
Subjt:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------

Query:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF
         +ETD S   L               +  +PR                                         Q ME LLE                   
Subjt:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF

Query:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE
        +TP+  +  +P                                 MLLA K GV+E+V  ++ RF  +I D  +D KN+VLLAAEY Q  VYRFLL  K  
Subjt:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE

Query:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA
         E+LFRAVD  GNSALHLAA A  S    ITGAALQMQWE++WYK+VE+SVP   FA YNK+GK A+ IF ETHM +V+K  +WL KTS+SCSVV TLI 
Subjt:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA

Query:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR
        TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   
Subjt:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR

Query:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        + + A LLYT+  +P+  IF I +LPL+ D+++ I +IVP   A  VL
Subjt:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X2    e value: 2.9e-165    % identity: 46.38
Query:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE
        L +FL  NT+RG WE V++KY E PEAQ L+LT NGDT LHLAV+DN+EE+V++LV  I  +  Y +LL   N    +PLHLAA +GSA MC+ IASA++
Subjt:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE

Query:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS
        +LV+  N   ETPL+LAA  G++DAF+CLY+FCR+N ++ T NC   SN DTVLH AL+N+HFDLAFQ++H++ +A HWV   G TPLHVLASKP++FKS
Subjt:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS

Query:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------
        G+ I GW  I Y   +VD+LKPQ I++L   W   +     NT+T  FPANY TCI FFT +WDG LK S LK +     +DE+ KD  +  ++      
Subjt:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------

Query:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF
         +ETD S   L               +  +PR                                         Q ME LLE                   
Subjt:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF

Query:  ETPEYGQALMP------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIE
        +TP+  +  +P                               MLLA K GV+E+V  ++ RF  +I D  +D KN+VLLAAEY Q  VYRFLL  K   E
Subjt:  ETPEYGQALMP------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIE

Query:  SLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATV
        +LFRAVD  GNSALHLAA A  S    ITGAALQMQWE++WYK+VE+SVP   FA YNK+GK A+ IF ETHM +V+K  +WL KTS+SCSVV TLI TV
Subjt:  SLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATV

Query:  AFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQ
        AF + A++PGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + 
Subjt:  AFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQ

Query:  NVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        + A LLYT+  +P+  IF I +LPL+ D+++ I +IVP   A  VL
Subjt:  NVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X1    e value: 5.0e-165    % identity: 46.26
Query:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE
        L +FL  NT+RG WE V++KY E PEAQ L+LT NGDT LHLAV+DN+EE+V++LV  I  +  Y +LL   N    +PLHLAA +GSA MC+ IASA++
Subjt:  LLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYE

Query:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS
        +LV+  N   ETPL+LAA  G++DAF+CLY+FCR+N ++ T NC   SN DTVLH AL+N+HFDLAFQ++H++ +A HWV   G TPLHVLASKP++FKS
Subjt:  DLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCS-RSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKS

Query:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------
        G+ I GW  I Y   +VD+LKPQ I++L   W   +     NT+T  FPANY TCI FFT +WDG LK S LK +     +DE+ KD  +  ++      
Subjt:  GAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHV------

Query:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF
         +ETD S   L               +  +PR                                         Q ME LLE                   
Subjt:  DVETDPSSINL---------------VLPMPR-----------------------------------------QRME-LLEAI---------------DF

Query:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE
        +TP+  +  +P                                 MLLA K GV+E+V  ++ RF  +I D  +D KN+VLLAAEY Q  VYRFLL  K  
Subjt:  ETPEYGQALMP--------------------------------VMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDE

Query:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA
         E+LFRAVD  GNSALHLAA A  S    ITGAALQMQWE++WYK+VE+SVP   FA YNK+GK A+ IF ETHM +V+K  +WL KTS+SCSVV TLI 
Subjt:  IESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA

Query:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR
        TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS+TS+ MFL+ILT RF A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   
Subjt:  TVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHR

Query:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL
        + + A LLYT+  +P+  IF I +LPL+ D+++ I +IVP   A  VL
Subjt:  LQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVL

SwissProt top hits (e value, % identity, alignment)
G5E8K5 Ankyrin-3    e value: 1.3e-05    % identity: 32.91
Query:  GDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRH
        G+T LH+A    Q E+V  LV      +D  Q + A+ K   +PLH++A LG A +   +        N    SG TPL LAA  GH+D      +   H
Subjt:  GDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRH

Query:  NLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIH--MHKDAAHWVDWNGFTPLHVLA
            A+ + +     T LH A K    ++A  L+      DAA     +G TPLHV A
Subjt:  NLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIH--MHKDAAHWVDWNGFTPLHVLA

P50086 Probable 26S proteasome regulatory subunit p28    e value: 8.6e-05    % identity: 31.58
Query:  GNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGE
        GN E V   Y    +    ++T+ G T LHLAV     E+ + L++     R        ++K    PLH AA++GS ++  ++    +  VN  +  G 
Subjt:  GNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGE

Query:  TPLFLAALYGHKDA
        TPLF A   GH DA
Subjt:  TPLFLAALYGHKDA

Q5ZLC8 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit C    e value: 9.5e-04    % identity: 27.27
Query:  LTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYY
        L  N  T LH AVI+NQ+   E LV+ +       +++ +R+    +PLH AA   +     ++   ++  V+  +  G TPL +A+  GH  A   L Y
Subjt:  LTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYY

Query:  FCRHNLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIHMHKD
          + N+          N++T LH A    H   A  ++   +D
Subjt:  FCRHNLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIHMHKD

Q91974 NF-kappa-B inhibitor alpha    e value: 3.5e-06    % identity: 26.74
Query:  PEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKD
        P A   +LT +GDT LHLA+I  ++ +    +++I ++      L  +N    +PLHLA     A +   +  A  DL +  +  G TPL +A   G   
Subjt:  PEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKD

Query:  AFYCLYYFCRHNLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS
        +   L   C+ +   A    +  N  T LH A    +  +   L+ +  D       NG T LH+     +S
Subjt:  AFYCLYYFCRHNLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS

Q9C7A2 Ankyrin repeat-containing protein ITN1    e value: 6.4e-08    % identity: 27.89
Query:  EGVVEKYAEDPEAQRLR------LTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNA
        EG++     D E   +R      +   G+T L  A      ++V+ L+K   R     + +A +N+S   PLH+AA  G   +  V+      L      
Subjt:  EGVVEKYAEDPEAQRLR------LTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNA

Query:  SGETPLFLAALYGHKDAFYCLYYFCRHNLAQATN--NCSRSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS
        S  TPL  AA+ GH +    L       L++A N    SRSN    LH A +  H ++   L+      A  +D  G T LH+     SS
Subjt:  SGETPLFLAALYGHKDAFYCLYYFCRHNLAQATN--NCSRSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSS

Q9C7A2 Ankyrin repeat-containing protein ITN1    e value: 1.1e-02    % identity: 33
Query:  SQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISF
        + S +VVA L ATVAFA   TVPGG+++ G+AV+     F IF I + +AL  S   +++ ++++     A+     +  K L+ L+S+  S+  +  S+
Subjt:  SQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISF

Arabidopsis top hits (e value, % identity, alignment)
AT3G18670.1 Ankyrin repeat family protein    e value: 5.1e-45    % identity: 26.43
Query:  NTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNN
        N + G  E   +    +PEA    LTSNGDT +H AV+    +IVE +++   R  D +Q+L  +N +  + L  AA  G  R+   + +    LV+  N
Subjt:  NTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNN

Query:  ASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCSRS------NRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKSGA
        A    P+ +A+LYGHK   + + Y   H      + C  S      N   ++   + +  + +A  LI  +   A+  D +  T +  LA  P +F S  
Subjt:  ASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCSRS------NRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKSGA

Query:  HICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSS
         I                                                             I +V  LK                   H         
Subjt:  HICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSS

Query:  INLVLPMPRQRMELLEAIDFETPEYGQA------LMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFR
                 Q  E+L+ I  E P++  A      L   +  A + G++E + E+ + +   +      G NI   A   RQ  ++  +     +   L  
Subjt:  INLVLPMPRQRMELLEAIDFETPEYGQA------LMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFR

Query:  AVDKYGNSALHLAATARTSKHLG-ITGAALQMQWEVRWYKYVEKSVPHQIFALYN-KKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAF
          D + N+ LH AA    +  L  I GAALQMQ E++W+K VEK V  +   + N K+ KT   +F + H  +V++GE W+ +T+ SC+VVA LI T+ F
Subjt:  AVDKYGNSALHLAATARTSKHLG-ITGAALQMQWEVRWYKYVEKSVPHQIFALYN-KKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAF

Query:  ATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVA
        ++A TVPGG    G  +   +  F IF IS  I+L  S  S++MFL IL SR+  +DF  +LPTK+++GL +L+ S+  M+++F      L+  ++  V+
Subjt:  ATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVA

Query:  TLLYTLTFLPIIFIFGILQLPLFFDLLK
             L  +P + +F +LQ P+  ++ +
Subjt:  TLLYTLTFLPIIFIFGILQLPLFFDLLK

AT3G54070.1 Ankyrin repeat family protein    e value: 1.3e-35    % identity: 24.92
Query:  RLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLY
        ++T N +  LH+AV    ++ V  L+    R  D    L+ +NK  N+PL  AAALG      ++ +   DL + +N    TP+ +AALYGH +     Y
Subjt:  RLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLY

Query:  YFCRHNLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIH----MHKDAAHWVDWNGFTPLHVLASKPSSFKSGAHICGWWKIVYNWIYVDRLKPQSIET
         F + ++    +    +   T++   +     D+   ++       K+ A + + N    LH+LA K S+    + +  + ++  +W+  D  +  ++E 
Subjt:  YFCRHNLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIH----MHKDAAHWVDWNGFTPLHVLASKPSSFKSGAHICGWWKIVYNWIYVDRLKPQSIET

Query:  LSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSSINLVLPMPRQRMELLEAIDFETPEY
                                                                                     +++ + R  ++LL  +D      
Subjt:  LSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSSINLVLPMPRQRMELLEAIDFETPEY

Query:  GQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEK---KDEIESLFRAVDKYGNSALHLAATARTSKHLGI-TGA
                                   RT  H           +AA YR  +++  + E    KD I S      K  ++ LHL A         + +GA
Subjt:  GQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEK---KDEIESLFRAVDKYGNSALHLAATARTSKHLGI-TGA

Query:  ALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAV-------LEKEQ
        AL MQ E+ W+K V++ VP       N KG+ A  IF E H  + ++GE W+ +T+ +C + ATLIATV FA A T+PGGNDD G            K  
Subjt:  ALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAV-------LEKEQ

Query:  GFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTFLPII--FIFGILQL
         F IF++S  +AL  S  SI++FLSI TSR+  +DF  +LPTK++ GLS+L+ SI++M+++F    + +I+ R++  +  L  ++ L  +    F  L  
Subjt:  GFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTFLPII--FIFGILQL

Query:  PLFFDLLKPI
         L+F+ L+ +
Subjt:  PLFFDLLKPI

AT5G04680.1 Ankyrin repeat family protein (e-value: 9.7e-36; identity: 26.83%)
Query:  DTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHN
        +T L  A    + EIV+ L+  +   +   ++  ++N S ++ L + A  G+  +   + +    L+     +G+ P+ +A      +    LY      
Subjt:  DTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHN

Query:  LAQATNNCSRSNRDTVLHF-ALKNEHFDLAFQLIHMHKDAAHWVDWN-GFTPLHVLASKPSSFKSGAHICGWWKIVYNWIYV-----------DRLKPQS
        +  A +        T+L   A+     D+A  L +M +  A          P+ VLASKP  F    ++    + +Y+WI V           ++     
Subjt:  LAQATNNCSRSNRDTVLHF-ALKNEHFDLAFQLIHMHKDAAHWVDWN-GFTPLHVLASKPSSFKSGAHICGWWKIVYNWIYV-----------DRLKPQS

Query:  IETLSEAWKKSIH--PKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSSINLVLPMPRQRMELLEAIDF
           + + +KKSI+   KK   +   FP      +    S W GI +V  LK +                 H+  +       L+L +        E +  
Subjt:  IETLSEAWKKSIH--PKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSSINLVLPMPRQRMELLEAIDF

Query:  ETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDT-TEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAA-TARTSKHLGI
           E  + +   +L A +YG ++ ++E+ +     +  T T     + LLA E+RQ  V+  L    D    L    D  GN  LHLA   +  SK   +
Subjt:  ETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDT-TEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAA-TARTSKHLGI

Query:  TGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA---------TVAFATAATVPGGNDDH--GA
          A L+MQ E++W+K VE+  P       N + +T   IF + H G+ Q+ E W+  T+ SCS+VA LI          TV FA   TV GG+DD+  G 
Subjt:  TGAALQMQWEVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIA---------TVAFATAATVPGGNDDH--GA

Query:  AVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTFLPI---I
             EQ F IF +S LI+   + T++ +FL ILT+R+   DF   LPTKM+ GLS L+ SI AMLI+F      L++  + N    +   T L      
Subjt:  AVLEKEQGFFIFSISSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTFLPI---I

Query:  FIFGILQLPLFFDLL
         +F +LQ PL  +++
Subjt:  FIFGILQLPLFFDLL

AT5G04700.1 Ankyrin repeat family protein (e-value: 6.9e-42; identity: 27.95%)
Query:  DTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHN
        +T L  A    + EIV+ L++ +   +   ++  ++N S ++PL + A  G+  +   + +    L+     +G+ P+ +A      +    LY      
Subjt:  DTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASAYEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHN

Query:  LAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWN-GFTPLHVLASKPSSFKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSI
          Q   +    +   +   A+  +  D+A  L +M +  A          P+ VLASKP  F  G ++    + +Y+WI V       + TL +      
Subjt:  LAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWN-GFTPLHVLASKPSSFKSGAHICGWWKIVYNWIYVDRLKPQSIETLSEAWKKSI

Query:  HPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSSINLVLPMPRQRMELLEAIDFET-----PEYGQAL
         P ++N                        L    LK L K    DE  + K                    M  Q  +LL  I  ET      E  + +
Subjt:  HPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSSINLVLPMPRQRMELLEAIDFET-----PEYGQAL

Query:  MPVMLLATKYGVMELVMELYKRFRTTIHDT-TEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAA-TARTSKHLGITGAALQMQW
           +L A +YG ++ ++E+ +     +  T T     + LLA E+RQ  V+  L    D    L    D  GN  LHLA   +  SK   + GA LQ+Q 
Subjt:  MPVMLLATKYGVMELVMELYKRFRTTIHDT-TEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAA-TARTSKHLGITGAALQMQW

Query:  EVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH--GAAVLEKEQGFFIFSISSLI
        E++W+K VE+  P       N + +T   IF + H G+ Q+ E W+  T+ SCS+VA LI TV FA   TVPGG DD+  G     +++ F IF +S LI
Subjt:  EVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH--GAAVLEKEQGFFIFSISSLI

Query:  ALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTFLPIIFIFGILQLPLFFDLL
        +   S TS+++FL ILT+R+   DF   LPTKM+ GLS L+ SI AMLI+F S  + ++    + +         LP + +F +LQ PL  +++
Subjt:  ALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTFLPIIFIFGILQLPLFFDLL

AT5G35810.1 Ankyrin repeat family protein (e-value: 2.6e-33; identity: 34.65%)
Query:  PVMLL-ATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYGN-SALHLAATARTSKHLG-ITGAALQMQW
        P++L  A + G +EL++ L + +   I       +++  +AA  R   ++  + E     + +    +K  N + LHL A       L  ++GAALQMQ 
Subjt:  PVMLL-ATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYGN-SALHLAATARTSKHLG-ITGAALQMQW

Query:  EVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH------GAAVLEKEQGFFIFSI
        E+ WYK V++ VP       NKK + A  +F + H  + ++GE W+ +T+ +C +V+TLIATV FA A T+PGGND        G     KE  F +F I
Subjt:  EVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH------GAAVLEKEQGFFIFSI

Query:  SSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQ----NVATLLYTLTFLPIIFIFGILQLPLFFD
        S  +AL  S TSI++FLSILTSR+    F + LPTK+++GL +L+ SI++M+++F +    LI+ R Q    ++  L+Y  +   + F+  +L   L+FD
Subjt:  SSLIALCLSTTSIIMFLSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQ----NVATLLYTLTFLPIIFIFGILQLPLFFD

Query:  LLK
         L+
Subjt:  LLK
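
The identity figure listed with each hit can be roughly cross-checked against an alignment block. In BLAST-style output, a letter on the middle line marks an identical residue and `+` marks a conservative substitution. The sketch below counts both for one block. The function name `midline_stats` is hypothetical, and it assumes the middle line has already been stripped of the 8-character `Query:  ` label prefix and is column-aligned with the query, which may not hold for text copied from a web page.

```python
def midline_stats(midline: str, length: int):
    """Count identities and positives on a BLAST-style middle line.

    midline: the match line with the label prefix already removed
             (assumption: columns line up with the query residues).
    length:  number of aligned columns in the block, gaps included.
    """
    # Pad or trim so trailing spaces lost in extraction do not matter.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = identities + midline.count("+")       # '+' = conservative substitution
    percent_identity = 100.0 * identities / length
    return identities, positives, percent_identity


# Last block of the AT5G35810.1 alignment: query "LLK", middle line " L+".
print(midline_stats(" L+", len("LLK")))
```

Summing identities and column counts over all blocks of a hit, rather than averaging per-block percentages, gives a figure comparable to the reported identity.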


Sequences
CDS sequence
AGAGAGAGAAAAGTGAAGACAATGGCTAATGCAGATTTACTAGAGTTTCTACGTGCAAACACAGAGAGAGGGAATTGGGAAGGAGTGGTTGAAAAGTATGCGGAAGATCC
TGAAGCTCAGCGACTGAGGCTGACCTCGAATGGAGACACGACACTGCATTTGGCTGTTATTGACAATCAAGAAGAAATAGTCGAAAGGCTTGTCAAACTCATTTGCAGAT
CCAGAGATTACAAGCAACTTCTTGCGGCTAGAAATAAAAGTATTAATAGCCCTCTCCACCTCGCTGCGGCGCTGGGGAGCGCCAGAATGTGTCATGTCATTGCTTCAGCC
TATGAGGACTTAGTGAATGGGAACAACGCCTCCGGTGAAACACCTCTCTTCTTGGCAGCTCTGTATGGCCACAAGGACGCCTTTTATTGCCTTTACTACTTTTGCAGACA
CAACCTCGCTCAAGCCACCAACAATTGCTCACGTTCTAATAGAGACACCGTACTACATTTTGCCCTAAAAAACGAGCATTTTGATTTGGCATTTCAATTAATTCACATGC
ACAAGGACGCTGCTCACTGGGTGGATTGGAATGGCTTTACCCCTCTCCATGTTCTAGCAAGTAAGCCAAGTTCCTTCAAAAGTGGAGCCCACATCTGTGGATGGTGGAAA
ATCGTCTATAACTGGATATATGTGGATCGACTAAAGCCTCAATCGATAGAAACTCTCAGTGAAGCGTGGAAGAAAAGTATCCACCCAAAAAAAACAAACACAACTACTTC
CTATTTTCCAGCTAACTACCACACATGCATCCACTTCTTTACCAGTTTGTGGGATGGAATTTTAAAAGTCAGCACTTTGAAACCACTAGTCAAAAAGAAGAATAGTGATG
AAGCCAACAAGGATAAAGATTTGGAAGAACATGTTGATGTGGAAACTGATCCATCGAGTATAAATTTAGTTTTGCCAATGCCAAGGCAACGCATGGAACTGCTAGAAGCA
ATAGACTTTGAGACACCGGAATATGGCCAAGCCTTGATGCCAGTGATGTTACTAGCAACAAAGTACGGTGTGATGGAGCTGGTGATGGAACTTTACAAACGCTTCCGAAC
AACGATCCATGATACTACGGAAGATGGGAAAAATATAGTGCTTTTGGCTGCAGAGTATAGGCAGCTACACGTGTACAGGTTTCTACTGGAGAAAAAAGATGAGATAGAAA
GCCTGTTTCGAGCTGTGGATAAATATGGCAATAGCGCCTTGCATCTCGCAGCAACTGCCCGAACTTCAAAGCATTTGGGTATCACTGGAGCCGCACTACAGATGCAATGG
GAAGTTAGGTGGTATAAGTACGTGGAGAAGTCTGTGCCGCACCAAATTTTTGCCCTCTATAATAAGAAAGGAAAAACTGCAAGTACAATCTTTGAAGAAACCCACATGGG
TGTAGTGCAAAAGGGCGAGGATTGGCTTTACAAAACCTCACAGTCATGCTCTGTGGTAGCTACTCTGATTGCAACAGTGGCTTTTGCAACCGCAGCCACTGTCCCAGGCG
GCAATGACGACCATGGCGCCGCAGTACTCGAAAAAGAGCAAGGCTTTTTTATCTTTTCCATCTCTTCCCTCATTGCTCTATGCCTCTCTACAACCTCAATCATCATGTTT
CTTTCCATCTTGACCTCCAGGTTCGGTGCCAAAGATTTCGGATCAAACTTGCCTACCAAAATGCTCATTGGTTTATCCTCTCTTTACTTTTCCATCGTCGCCATGTTAAT
TTCCTTCTGTAGTGGCCATTACTTTCTCATCGTTCACCGCCTTCAAAATGTGGCTACTCTACTCTACACACTTACTTTTCTTCCCATCATATTCATCTTTGGAATATTGC
AACTTCCTCTCTTCTTCGATTTGCTGAAGCCTATTCGCGAAATAGTGCCTGGAGGGGGCGCCAAGGACGTCCTATTCAATTAG
mRNA sequence
AGAGAGAGAAAAGTGAAGACAATGGCTAATGCAGATTTACTAGAGTTTCTACGTGCAAACACAGAGAGAGGGAATTGGGAAGGAGTGGTTGAAAAGTATGCGGAAGATCC
TGAAGCTCAGCGACTGAGGCTGACCTCGAATGGAGACACGACACTGCATTTGGCTGTTATTGACAATCAAGAAGAAATAGTCGAAAGGCTTGTCAAACTCATTTGCAGAT
CCAGAGATTACAAGCAACTTCTTGCGGCTAGAAATAAAAGTATTAATAGCCCTCTCCACCTCGCTGCGGCGCTGGGGAGCGCCAGAATGTGTCATGTCATTGCTTCAGCC
TATGAGGACTTAGTGAATGGGAACAACGCCTCCGGTGAAACACCTCTCTTCTTGGCAGCTCTGTATGGCCACAAGGACGCCTTTTATTGCCTTTACTACTTTTGCAGACA
CAACCTCGCTCAAGCCACCAACAATTGCTCACGTTCTAATAGAGACACCGTACTACATTTTGCCCTAAAAAACGAGCATTTTGATTTGGCATTTCAATTAATTCACATGC
ACAAGGACGCTGCTCACTGGGTGGATTGGAATGGCTTTACCCCTCTCCATGTTCTAGCAAGTAAGCCAAGTTCCTTCAAAAGTGGAGCCCACATCTGTGGATGGTGGAAA
ATCGTCTATAACTGGATATATGTGGATCGACTAAAGCCTCAATCGATAGAAACTCTCAGTGAAGCGTGGAAGAAAAGTATCCACCCAAAAAAAACAAACACAACTACTTC
CTATTTTCCAGCTAACTACCACACATGCATCCACTTCTTTACCAGTTTGTGGGATGGAATTTTAAAAGTCAGCACTTTGAAACCACTAGTCAAAAAGAAGAATAGTGATG
AAGCCAACAAGGATAAAGATTTGGAAGAACATGTTGATGTGGAAACTGATCCATCGAGTATAAATTTAGTTTTGCCAATGCCAAGGCAACGCATGGAACTGCTAGAAGCA
ATAGACTTTGAGACACCGGAATATGGCCAAGCCTTGATGCCAGTGATGTTACTAGCAACAAAGTACGGTGTGATGGAGCTGGTGATGGAACTTTACAAACGCTTCCGAAC
AACGATCCATGATACTACGGAAGATGGGAAAAATATAGTGCTTTTGGCTGCAGAGTATAGGCAGCTACACGTGTACAGGTTTCTACTGGAGAAAAAAGATGAGATAGAAA
GCCTGTTTCGAGCTGTGGATAAATATGGCAATAGCGCCTTGCATCTCGCAGCAACTGCCCGAACTTCAAAGCATTTGGGTATCACTGGAGCCGCACTACAGATGCAATGG
GAAGTTAGGTGGTATAAGTACGTGGAGAAGTCTGTGCCGCACCAAATTTTTGCCCTCTATAATAAGAAAGGAAAAACTGCAAGTACAATCTTTGAAGAAACCCACATGGG
TGTAGTGCAAAAGGGCGAGGATTGGCTTTACAAAACCTCACAGTCATGCTCTGTGGTAGCTACTCTGATTGCAACAGTGGCTTTTGCAACCGCAGCCACTGTCCCAGGCG
GCAATGACGACCATGGCGCCGCAGTACTCGAAAAAGAGCAAGGCTTTTTTATCTTTTCCATCTCTTCCCTCATTGCTCTATGCCTCTCTACAACCTCAATCATCATGTTT
CTTTCCATCTTGACCTCCAGGTTCGGTGCCAAAGATTTCGGATCAAACTTGCCTACCAAAATGCTCATTGGTTTATCCTCTCTTTACTTTTCCATCGTCGCCATGTTAAT
TTCCTTCTGTAGTGGCCATTACTTTCTCATCGTTCACCGCCTTCAAAATGTGGCTACTCTACTCTACACACTTACTTTTCTTCCCATCATATTCATCTTTGGAATATTGC
AACTTCCTCTCTTCTTCGATTTGCTGAAGCCTATTCGCGAAATAGTGCCTGGAGGGGGCGCCAAGGACGTCCTATTCAATTAG
Protein sequence
RERKVKTMANADLLEFLRANTERGNWEGVVEKYAEDPEAQRLRLTSNGDTTLHLAVIDNQEEIVERLVKLICRSRDYKQLLAARNKSINSPLHLAAALGSARMCHVIASA
YEDLVNGNNASGETPLFLAALYGHKDAFYCLYYFCRHNLAQATNNCSRSNRDTVLHFALKNEHFDLAFQLIHMHKDAAHWVDWNGFTPLHVLASKPSSFKSGAHICGWWK
IVYNWIYVDRLKPQSIETLSEAWKKSIHPKKTNTTTSYFPANYHTCIHFFTSLWDGILKVSTLKPLVKKKNSDEANKDKDLEEHVDVETDPSSINLVLPMPRQRMELLEA
IDFETPEYGQALMPVMLLATKYGVMELVMELYKRFRTTIHDTTEDGKNIVLLAAEYRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAATARTSKHLGITGAALQMQW
EVRWYKYVEKSVPHQIFALYNKKGKTASTIFEETHMGVVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSTTSIIMF
LSILTSRFGAKDFGSNLPTKMLIGLSSLYFSIVAMLISFCSGHYFLIVHRLQNVATLLYTLTFLPIIFIFGILQLPLFFDLLKPIREIVPGGGAKDVLFN
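
The protein sequence above appears to be a frame-1 translation of the CDS from its very first base (it begins `RERKVKTM...` rather than at the first `ATG`). A minimal sketch for checking this, assuming the standard genetic code and plain-Python string handling, is shown below. The names `CODON_TABLE` and `translate` are illustrative and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the CDS listed above.
print(translate("AGAGAGAGAAAAGTGAAGACAATGGCTAAT"))  # RERKVKTMAN
print(translate("GTCCTATTCAATTAG"))                 # VLFN
```

Passing the full CDS (all lines joined) to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.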