; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028720 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028720
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionAnkyrin repeat family protein
Genome locationscaffold7:15808966..15817803
RNA-Seq ExpressionSpg028720
SyntenySpg028720
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022931013.1 uncharacterized protein LOC111437338 isoform X2 [Cucurbita moschata]2.1e-14141.51Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT R  WE V++KY + PEAQ L LT  GDTALHLA++DN+EE+V++LV  I  S+   + LE  N R  NPLHLAA +GSA MC+ IASAH 
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV K+N + ETPL+LAA  G++DAF+ LY+FCR   +  T NC   SNGDTVLH  L+N+HFDLAF ++H+H +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------
                  A+ C  VDQL PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  K+DE+KKD D             
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------

Query:  -------------------LEEDVNVET------------------------------------------------------------------------
                           L ++ ++                                                                          
Subjt:  -------------------LEEDVNVET------------------------------------------------------------------------

Query:  --DRSGITI-------------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEI
          D++G+T+                                  MLLAAK GV+E+V  +F RF  +I+D  +D+KN+VLLAAE  Q  VYRFLL  K   
Subjt:  --DRSGITI-------------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEI

Query:  ESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFA
        E+LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA
Subjt:  ESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFA

Query:  AYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSR
         YNK+GK +++IF ETH+DLVQK  DWL KTS+SCSVV  LI TVAF + A++PGG N   G+  L+  + FF F++ SLIALCLS  S+ +FL+ILT R
Subjt:  AYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSR

Query:  FGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP
        F A DF +NLP K+ IG SSL+ SI++MLISFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP
Subjt:  FGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP

XP_022995620.1 uncharacterized protein LOC111491104 isoform X1 [Cucurbita maxima]1.4e-14541.63Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT RG WE V++KY E PEAQ+L LT  GDTALHLA++DN+EE+V++LV  I  +  Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV  +N V ETPL+LAA  G++DAF+ LY+FCR+N ++ T NC   SNGDTVLH  L+N+HFDLAFQ++H++ +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV
                  A+ C  VDQL+PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  ++DE+KKD  +  ++      ++
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV

Query:  ETDRS------------------GITIMP-----------------------------------------------------------------------
        ETD S                   IT +P                                                                       
Subjt:  ETDRS------------------GITIMP-----------------------------------------------------------------------

Query:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE
                                                  MLLAAK GV+E+V  +F RF  +I D  +D+KN+VLLAAE  Q  VYRFLL  K   E
Subjt:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE

Query:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA
        +LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA 
Subjt:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA

Query:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF
        YNK+GK +++IF ETH+DLV+K  +WL KTS+SCSVV TLI TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS  S+ MFL+ILT RF
Subjt:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF

Query:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS
         A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP+
Subjt:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS

XP_022995621.1 uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima]8.1e-14641.74Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT RG WE V++KY E PEAQ+L LT  GDTALHLA++DN+EE+V++LV  I  +  Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV  +N V ETPL+LAA  G++DAF+ LY+FCR+N ++ T NC   SNGDTVLH  L+N+HFDLAFQ++H++ +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV
                  A+ C  VDQL+PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  ++DE+KKD  +  ++      ++
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV

Query:  ETDRS------------------GITIMP-----------------------------------------------------------------------
        ETD S                   IT +P                                                                       
Subjt:  ETDRS------------------GITIMP-----------------------------------------------------------------------

Query:  ---------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESL
                                                MLLAAK GV+E+V  +F RF  +I D  +D+KN+VLLAAE  Q  VYRFLL  K   E+L
Subjt:  ---------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESL

Query:  FRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYN
        FRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA YN
Subjt:  FRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYN

Query:  KDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGA
        K+GK +++IF ETH+DLV+K  +WL KTS+SCSVV TLI TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS  S+ MFL+ILT RF A
Subjt:  KDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGA

Query:  KDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS
         DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP+
Subjt:  KDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS

XP_022995622.1 uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima]1.4e-14541.63Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT RG WE V++KY E PEAQ+L LT  GDTALHLA++DN+EE+V++LV  I  +  Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV  +N V ETPL+LAA  G++DAF+ LY+FCR+N ++ T NC   SNGDTVLH  L+N+HFDLAFQ++H++ +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV
                  A+ C  VDQL+PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  ++DE+KKD  +  ++      ++
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV

Query:  ETDRS------------------GITIMP-----------------------------------------------------------------------
        ETD S                   IT +P                                                                       
Subjt:  ETDRS------------------GITIMP-----------------------------------------------------------------------

Query:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE
                                                  MLLAAK GV+E+V  +F RF  +I D  +D+KN+VLLAAE  Q  VYRFLL  K   E
Subjt:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE

Query:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA
        +LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA 
Subjt:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA

Query:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF
        YNK+GK +++IF ETH+DLV+K  +WL KTS+SCSVV TLI TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS  S+ MFL+ILT RF
Subjt:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF

Query:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS
         A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP+
Subjt:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS

XP_023534306.1 uncharacterized protein LOC111795904 isoform X2 [Cucurbita pepo subsp. pepo]2.1e-14141.87Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT R  WE V++KY + PEA +L LT  GDTALHLA++DN+E++V++LV  I  S+  K  LE  N R  NPLHLAA +GSA MC+ IASA +
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LVH++N + ETPL+LAA +G++DAF+ LY+FCR+  ++ T NC   SNGDTVLH  L+NEHFDLAFQ++H++ +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------
                  A+ C  VDQL+PQ I++L   W   +   N +T  FPANY TCI FFT +WDGFLK S LK + +  K+DE KKD D             
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------

Query:  LEED----------------------VNV-----------------------------------------------------------------------
        LE D                       NV                                                                       
Subjt:  LEED----------------------VNV-----------------------------------------------------------------------

Query:  ETDRSGITI-----------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIES
          D++G T+                                MLLAAK GV+E+V  +F+RF  +I+D  +D+KN+VLLAAE  Q  VYRFLL      E+
Subjt:  ETDRSGITI-----------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIES

Query:  LFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAY
        LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA Y
Subjt:  LFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAY

Query:  NKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFG
        NK+GK +++IF ETH+DLVQK  DWL KTS+SCSVV  LI TVAF + A++PGG N   G+  L+  + FF F++ SLIALCLS  S+ MFL+ILT RF 
Subjt:  NKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFG

Query:  AKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP
        A DF +NLP K+ IG SSL+ SI++MLISFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP
Subjt:  AKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP

TrEMBL top hitse value%identityAlignment
A0A6J1ESA5 uncharacterized protein LOC111437338 isoform X11.7e-14141.4Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT R  WE V++KY + PEAQ L LT  GDTALHLA++DN+EE+V++LV  I  S+   + LE  N R  NPLHLAA +GSA MC+ IASAH 
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV K+N + ETPL+LAA  G++DAF+ LY+FCR   +  T NC   SNGDTVLH  L+N+HFDLAF ++H+H +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------
                  A+ C  VDQL PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  K+DE+KKD D             
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------

Query:  -------------------LEEDVNVET------------------------------------------------------------------------
                           L ++ ++                                                                          
Subjt:  -------------------LEEDVNVET------------------------------------------------------------------------

Query:  --DRSGITI---------------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKD
          D++G+T+                                    MLLAAK GV+E+V  +F RF  +I+D  +D+KN+VLLAAE  Q  VYRFLL  K 
Subjt:  --DRSGITI---------------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKD

Query:  EIESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHF
          E+LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +F
Subjt:  EIESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHF

Query:  FAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILT
        FA YNK+GK +++IF ETH+DLVQK  DWL KTS+SCSVV  LI TVAF + A++PGG N   G+  L+  + FF F++ SLIALCLS  S+ +FL+ILT
Subjt:  FAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILT

Query:  SRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP
         RF A DF +NLP K+ IG SSL+ SI++MLISFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP
Subjt:  SRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP

A0A6J1EX58 uncharacterized protein LOC111437338 isoform X21.0e-14141.51Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT R  WE V++KY + PEAQ L LT  GDTALHLA++DN+EE+V++LV  I  S+   + LE  N R  NPLHLAA +GSA MC+ IASAH 
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV K+N + ETPL+LAA  G++DAF+ LY+FCR   +  T NC   SNGDTVLH  L+N+HFDLAF ++H+H +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------
                  A+ C  VDQL PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  K+DE+KKD D             
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKD-------------

Query:  -------------------LEEDVNVET------------------------------------------------------------------------
                           L ++ ++                                                                          
Subjt:  -------------------LEEDVNVET------------------------------------------------------------------------

Query:  --DRSGITI-------------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEI
          D++G+T+                                  MLLAAK GV+E+V  +F RF  +I+D  +D+KN+VLLAAE  Q  VYRFLL  K   
Subjt:  --DRSGITI-------------------------------MPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEI

Query:  ESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFA
        E+LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA
Subjt:  ESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFA

Query:  AYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSR
         YNK+GK +++IF ETH+DLVQK  DWL KTS+SCSVV  LI TVAF + A++PGG N   G+  L+  + FF F++ SLIALCLS  S+ +FL+ILT R
Subjt:  AYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGG-NDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSR

Query:  FGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP
        F A DF +NLP K+ IG SSL+ SI++MLISFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP
Subjt:  FGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVP

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X36.7e-14641.63Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT RG WE V++KY E PEAQ+L LT  GDTALHLA++DN+EE+V++LV  I  +  Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV  +N V ETPL+LAA  G++DAF+ LY+FCR+N ++ T NC   SNGDTVLH  L+N+HFDLAFQ++H++ +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV
                  A+ C  VDQL+PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  ++DE+KKD  +  ++      ++
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV

Query:  ETDRS------------------GITIMP-----------------------------------------------------------------------
        ETD S                   IT +P                                                                       
Subjt:  ETDRS------------------GITIMP-----------------------------------------------------------------------

Query:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE
                                                  MLLAAK GV+E+V  +F RF  +I D  +D+KN+VLLAAE  Q  VYRFLL  K   E
Subjt:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE

Query:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA
        +LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA 
Subjt:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA

Query:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF
        YNK+GK +++IF ETH+DLV+K  +WL KTS+SCSVV TLI TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS  S+ MFL+ILT RF
Subjt:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF

Query:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS
         A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP+
Subjt:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X23.9e-14641.74Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT RG WE V++KY E PEAQ+L LT  GDTALHLA++DN+EE+V++LV  I  +  Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV  +N V ETPL+LAA  G++DAF+ LY+FCR+N ++ T NC   SNGDTVLH  L+N+HFDLAFQ++H++ +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV
                  A+ C  VDQL+PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  ++DE+KKD  +  ++      ++
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV

Query:  ETDRS------------------GITIMP-----------------------------------------------------------------------
        ETD S                   IT +P                                                                       
Subjt:  ETDRS------------------GITIMP-----------------------------------------------------------------------

Query:  ---------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESL
                                                MLLAAK GV+E+V  +F RF  +I D  +D+KN+VLLAAE  Q  VYRFLL  K   E+L
Subjt:  ---------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESL

Query:  FRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYN
        FRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA YN
Subjt:  FRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYN

Query:  KDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGA
        K+GK +++IF ETH+DLV+K  +WL KTS+SCSVV TLI TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS  S+ MFL+ILT RF A
Subjt:  KDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGA

Query:  KDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS
         DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP+
Subjt:  KDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X16.7e-14641.63Show/hide
Query:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE
        L +FL  NT RG WE V++KY E PEAQ+L LT  GDTALHLA++DN+EE+V++LV  I  +  Y + L+  N R   PLHLAA +GSA MC+ IASAH+
Subjt:  LLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHE

Query:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------
         LV  +N V ETPL+LAA  G++DAF+ LY+FCR+N ++ T NC   SNGDTVLH  L+N+HFDLAFQ++H++ +A HWV   G TPLH+L         
Subjt:  NLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCS-RSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHIL---------

Query:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV
                  A+ C  VDQL+PQ I++L   W   +   NT+T  FPANY TCI FFT +WDGFLK S LK + +  ++DE+KKD  +  ++      ++
Subjt:  ----------AWIC--VDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDV------NV

Query:  ETDRS------------------GITIMP-----------------------------------------------------------------------
        ETD S                   IT +P                                                                       
Subjt:  ETDRS------------------GITIMP-----------------------------------------------------------------------

Query:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE
                                                  MLLAAK GV+E+V  +F RF  +I D  +D+KN+VLLAAE  Q  VYRFLL  K   E
Subjt:  -----------------------------------------VMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIE

Query:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA
        +LFRAVD  GNSALHLAAA   S    ITGAALQMQWE++WYK                                            +VE SVP +FFA 
Subjt:  SLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAA

Query:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF
        YNK+GK +++IF ETH+DLV+K  +WL KTS+SCSVV TLI TVAF + A++PGG + H G+  L+  + FF F++ SLIALCLS  S+ MFL+ILT RF
Subjt:  YNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH-GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRF

Query:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS
         A DF +NLP K+ IG SSL+ SI++ML+SFC+GHYFL+   + H   LLYT++ +P+ LIF I +LPL+ D+++ I +IVP+
Subjt:  GAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPS

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR42.6e-0622.98Show/hide
Query:  AAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKES
        AA+ G +E+V  L ++         +  +  + +A +     V R L++    I  L    DK GN+ALH+A      K   I    L++          
Subjt:  AAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKES

Query:  PYASPLSVSFQDFKMEY-VGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGE-----------------DWL
           + ++   +D K  Y +  +LP+  + S + +     G     E + P       ++  KT + I ++ H  L Q  +                 + +
Subjt:  PYASPLSVSFQDFKMEY-VGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGE-----------------DWL

Query:  YKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAML
           + S +VVA L ATVAFA   TVPGGN ++G AV+ +   F IF I + IAL  S+  +++ ++++     ++     + +K+      ++ + V   
Subjt:  YKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAML

Query:  ISFCSGHYFLIAHRLQHVPILL
        ISF +  Y ++    Q   +L+
Subjt:  ISFCSGHYFLIAHRLQHVPILL

G5E8K5 Ankyrin-31.2e-0632.7Show/hide
Query:  KGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCR
        +G+TALH+A    Q E+V  LV      +D  Q +EAK K    PLH++A LG A +   +     +  +     G TPL LAA  GH+D      +   
Subjt:  KGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCR

Query:  HNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIH--MHKDAAHWVNTQGFTPLHILA
        H    A+ + +   G T LH   K    ++A  L+      DAA      G TPLH+ A
Subjt:  HNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIH--MHKDAAHWVNTQGFTPLHILA

O70511 Ankyrin-32.6e-0632.08Show/hide
Query:  KGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCR
        +G+TALH+A    Q E+V  LV      +D  Q +EAK K    PLH++A LG A +   +     +  +     G TPL L+A  GH+D      +   
Subjt:  KGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCR

Query:  HNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIH--MHKDAAHWVNTQGFTPLHILA
        H    A+ + +   G T LH   K    ++A  L+      DAA      G TPLH+ A
Subjt:  HNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIH--MHKDAAHWVNTQGFTPLHILA

Q12955 Ankyrin-33.5e-0632.08Show/hide
Query:  KGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCR
        +G+TALH+A    Q E+V  LV      +D  Q +EAK K    PLH++A LG A +   +     +  +     G TPL L+A  GH+D      +   
Subjt:  KGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCR

Query:  HNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIH--MHKDAAHWVNTQGFTPLHILA
        H    A+ + +   G T LH   K    ++A  L+      DAA      G TPLH+ A
Subjt:  HNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIH--MHKDAAHWVNTQGFTPLHILA

Q9C7A2 Ankyrin repeat-containing protein ITN13.5e-0622.6Show/hide
Query:  MLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWY
        ++ AA  G  E+V +L  +    +  +  + KN + LAA    + V + LL K  +   L R +DK G +ALH+A    +S+   +    L     +   
Subjt:  MLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWY

Query:  KESPYASPLSVSFQDFKMEYVGRSLPI-DMQPSSVLNNRK----------ILGNTPYVENSVPHHFFAAYNKDGK-------TSSSIFEETHVDLVQK--
         +    + L V+ +  + E V   L + D   +++  + K          +   + Y++  +        N+  +       T + I  + H+ L Q   
Subjt:  KESPYASPLSVSFQDFKMEYVGRSLPI-DMQPSSVLNNRK----------ILGNTPYVENSVPHHFFAAYNKDGK-------TSSSIFEETHVDLVQK--

Query:  ---------------GEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSN
                         + +   + S +VVA L ATVAFA   TVPGG+++ G+AV+     F IF I + +AL  S+  +++ ++++     A+     
Subjt:  ---------------GEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSN

Query:  LPSKMLIGLSSLYFSIVAMLISF
        + +K L+ L+S+  S+  +  S+
Subjt:  LPSKMLIGLSSLYFSIVAMLISF

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein5.1e-3725.55Show/hide
Query:  SADLDLLEFLQENTM---------RGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAAL
        +ADL   E  QEN+           G  EA  +     PEA    LT  GDT +H A++    +IVE +++   R  D +Q L+ KN      L  AA  
Subjt:  SADLDLLEFLQENTM---------RGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAAL

Query:  GSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCSRS------NGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVN
        G  ++   + +    LV  +N     P+ +A+LYGHK   + + Y   H      + C  S      NG  ++   + +  + +A  LI  +   A+  +
Subjt:  GSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCSRS------NGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVN

Query:  TQGFTPLHILAWICVDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGIT
        +   T +  LA                             T Y           F S+     +V  LKL   + K       +++ +    +   +G+ 
Subjt:  TQGFTPLHILAWICVDQLRPQSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGIT

Query:  IMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLG-ITGAALQMQW
            +  A + G++E + E+ + +   +        NI   A   RQ  ++  +     +   L    D + N+ LH AA    +  L  I GAALQMQ 
Subjt:  IMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLG-ITGAALQMQW

Query:  EVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSV-PHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSV
        E++W+KE                                            VE  V P H      K  KT  ++F + H DLV++GE W+ +T+ SC+V
Subjt:  EVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSV-PHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSV

Query:  VATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYF
        VA LI T+ F++A TVPGG    G  +   +  F IF IS  I+L  S +S++MFL IL SR+  +DF  +LP+K+++GL +L+ S+  M+++F      
Subjt:  VATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYF

Query:  LIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLK
        L+  ++  V      L  +P+ + F +LQ P+  ++ +
Subjt:  LIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLK

AT3G54070.1 Ankyrin repeat family protein1.1e-2824.44Show/hide
Query:  MRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIV
        + G+W+      + K       +T   + ALH+A+    ++ V  L+    R  D    L  KNK  N PL  AAALG  +   ++ +   +L    N  
Subjt:  MRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIV

Query:  GETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHILAWICVDQLRPQSIETLSEA
          TP+ +AALYGH +    L+         +  +    N    L+      H  ++  +  +  D   W+  +            VD  R +        
Subjt:  GETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHILAWICVDQLRPQSIETLSEA

Query:  WKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGITIMPVMLLAAKYGVMEMVMELFKRFRTTIH
                    + +P N +  +H                LL  K  +   K   +L + V             ++  AA+ G +E+++ L +     + 
Subjt:  WKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGITIMPVMLLAAKYGVMEMVMELFKRFRTTIH

Query:  DTTEDEKNIVLLAAECRQLHVYRFLLEK---KDEIESLFRAVDKYGNSALHLAAAVGTSKHLGI-TGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYV
            + + +  +AA  R  +++  + E    KD I S      K  ++ LHL A +       + +GAAL MQ E+ W+K                    
Subjt:  DTTEDEKNIVLLAAECRQLHVYRFLLEK---KDEIESLFRAVDKYGNSALHLAAAVGTSKHLGI-TGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYV

Query:  GRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAV
                                 V+  VP  +    N  G+ +  IF E H +L ++GE W+ +T+ +C + ATLIATV FA A T+PGGNDD G   
Subjt:  GRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAV

Query:  -------LEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLP
                 K   F IF++S  +AL  S++SI++FLSI TSR+  +DF  +LP+K++ GLS+L+ SI++M+++F    + +I  R++   + L  +  L 
Subjt:  -------LEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLP

Query:  II--LIFGILQLPLFFDLLKPI
         +  L F  L   L+F+ L+ +
Subjt:  II--LIFGILQLPLFFDLLKPI

AT5G04690.1 Ankyrin repeat family protein9.6e-2825.6Show/hide
Query:  LQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLE--AKNKRYNNPLHLAAALGSAKMCHIIASAHENL
        L +   +G  EAV +    +P+A    + P     L      N E     +VKL+ R    +Q L   ++N  YN PL + A  G+ ++   + + +  L
Subjt:  LQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLE--AKNKRYNNPLHLAAALGSAKMCHIIASAHENL

Query:  VHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCSRSNGDTVLHFV--LKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHILAWICVDQLRP
        +      GE P+ +A      +    LY     N T       +     +L F+  +  +  D+A  L +  +  A                     LR 
Subjt:  VHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCSRSNGDTVLHFV--LKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHILAWICVDQLRP

Query:  QSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGI-----TIMPVMLLAAKYGVME
        +S+  +  A K  +         FP      +    S   G  +V  LK++       +AKK   L + ++ ET   G+     ++   +L A +YG ++
Subjt:  QSIETLSEAWKKSIPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGI-----TIMPVMLLAAKYGVME

Query:  MVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLG-ITGAALQMQWEVRWYKESPYASPLS
         ++E+ K     +  T      +   A + RQ  V+  L    D         D  GNS LHLA     +  L  +  A LQMQ E++W+KE        
Subjt:  MVMELFKRFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLG-ITGAALQMQWEVRWYKESPYASPLS

Query:  VSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATV
               ME +                       P +EN          N +  T   IF + H  +  + E W+  T+ SCS+VA LI TV FA   TV
Subjt:  VSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATV

Query:  PGGNDDH--GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLY
        PGG DD+  G      E+ F IF +S LI+   +  S+++FL ILT+R+   DF  +LP+ M+ GLS+L+ SI AML++F S  + +      + P ++ 
Subjt:  PGGNDDH--GAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLY

Query:  TLIFLPII--LIFGILQLPLFFDLL
          IF      L+F ++Q PL  +L+
Subjt:  TLIFLPII--LIFGILQLPLFFDLL

AT5G04700.1 Ankyrin repeat family protein9.3e-3125.9Show/hide
Query:  DTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHN
        +T L  A    + EIV+ L++ +   +   +   ++N  Y+ PL + A  G+ ++   + + +  L+    I G+ P+ +A      +    LY      
Subjt:  DTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLVHKKNIVGETPLFLAALYGHKDAFYSLYYFCRHN

Query:  LTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQ----------------------GFTPLHILAWICVDQLRPQSIETLSEAWKKSI
          Q   +    +G  +    +  +  D+A  L +M +  A   ++Q                      G     I +WI V       + TL +  + + 
Subjt:  LTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQ----------------------GFTPLHILAWICVDQLRPQSIETLSEAWKKSI

Query:  PKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGI-----TIMPVMLLAAKYGVMEMVMELFKRFRTTIH
         + NT                 S W G  +V  LK++       +AKK   L   ++ ET   G+     T+   +L A +YG ++ ++E+ +     + 
Subjt:  PKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGI-----TIMPVMLLAAKYGVMEMVMELFKRFRTTIH

Query:  DT-TEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVG-TSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGR
         T T     + LLA E RQ  V+  L    D    L    D  GN  LHLA      SK   + GA LQ+Q E++W+KE                     
Subjt:  DT-TEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVG-TSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGR

Query:  SLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH--GAAV
                               VE   P       N + +T   IF + H  L Q+ E W+  T+ SCS+VA LI TV FA   TVPGG DD+  G   
Subjt:  SLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDH--GAAV

Query:  LEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGI
          +++ F IF +S LI+   S  S+++FL ILT+R+   DF   LP+KM+ GLS L+ SI AMLI+F S  + ++    + +         LP  L+F +
Subjt:  LEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGI

Query:  LQLPLFFDLL
        LQ PL  +++
Subjt:  LQLPLFFDLL

AT5G04730.1 Ankyrin-repeat containing protein2.1e-3031.78Show/hide
Query:  TIMPVMLLAAKYGVMEMVMELFK---RFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLG-ITGAAL
        T+   +L AAK G  +  +E+ K   +    ++ T+   +N+  LA E ++  ++  +    D   +L R+ DK  N+ LH+A  + T   L  I+GAAL
Subjt:  TIMPVMLLAAKYGVMEMVMELFK---RFRTTIHDTTEDEKNIVLLAAECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLG-ITGAAL

Query:  QMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQS
        +MQ E +W+KE                                            VE+ V        NKD KT   IFE  H  L ++GE+W+  T+ +
Subjt:  QMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVENSVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQS

Query:  CSVVATLIATVAFATAATVPGGND-DHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCS
        CS VA LIATV F    TVPGG D   G+ ++  +  F  F  +  +A   S IS+++FLSILTSR+   DF  +LP KM++G S L+ SI +ML++F +
Subjt:  CSVVATLIATVAFATAATVPGGND-DHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGAKDFGSNLPSKMLIGLSSLYFSIVAMLISFCS

Query:  GHYFLIAHRLQHVPILLYTLIFLPII--LIFGILQLPLFFDLL
             ++  ++H P L+Y L  L     L+F +LQ PL  +++
Subjt:  GHYFLIAHRLQHVPILLYTLIFLPII--LIFGILQLPLFFDLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGCGCTGACTTAGATTTATTAGAGTTTCTACAGGAAAACACGATGAGAGGGAACTGGGAAGCAGTGGTTGAGAAGTATGCGGAAAAGCCTGAAGCTCAGAAGCT
GCCGCTGACCCCAAAGGGAGACACAGCACTACATTTGGCTATTATTGACAATCAAGAAGAAATAGTCGAAAGGCTTGTCAAACTCATTTGCAGATCAAGAGATTACAAGC
AACATCTTGAGGCTAAAAACAAGAGATATAACAATCCTCTCCACCTCGCTGCGGCACTAGGGAGTGCCAAAATGTGCCACATCATTGCTTCAGCCCATGAGAACTTAGTG
CATAAGAAAAACATCGTCGGCGAAACGCCTCTCTTCTTGGCGGCTCTGTATGGCCACAAGGACGCCTTTTATTCCCTTTACTACTTTTGCAGACACAACCTCACTCAAGC
CACCAACAATTGCTCACGTTCTAATGGAGACACTGTACTACATTTTGTCCTGAAAAACGAGCATTTTGATTTAGCATTTCAATTAATTCACATGCACAAGGACGCTGCTC
ATTGGGTGAATACGCAAGGCTTTACCCCTCTCCATATTCTAGCATGGATATGTGTGGATCAACTAAGGCCTCAATCAATAGAAACCCTCAGTGAAGCGTGGAAGAAAAGT
ATCCCCAAAACAAACACAACTACTTCCTATTTTCCAGCTAACTACCACACATGCATCCACTTCTTTACCAGCTTGTGGGATGGATTTTTAAAAGTCAGCACTTTGAAACT
ACTGGTCAACAAGAAGAAAAGTGATGAAGCGAAGAAGGATAAAGATTTGGAAGAAGATGTTAATGTGGAAACTGATCGATCGGGTATAACCATAATGCCAGTGATGTTAC
TAGCAGCAAAGTACGGTGTGATGGAGATGGTGATGGAACTGTTCAAACGTTTTCGAACGACGATCCATGATACTACGGAAGATGAGAAAAATATAGTGCTTCTGGCTGCA
GAGTGTAGGCAGCTACACGTGTACAGGTTTTTACTCGAGAAAAAAGATGAGATAGAAAGCCTGTTTCGAGCCGTGGATAAATATGGCAATAGCGCTTTGCATCTTGCAGC
AGCCGTCGGAACTTCAAAGCATTTGGGCATCACTGGAGCCGCACTACAGATGCAATGGGAAGTTAGGTGGTACAAGGAATCTCCCTATGCATCGCCTCTCTCAGTTAGTT
TTCAAGATTTTAAAATGGAGTATGTAGGGAGATCCCTACCCATTGATATGCAGCCAAGTAGTGTTCTTAACAATCGGAAAATACTTGGCAATACACCGTACGTGGAGAAT
TCTGTGCCGCACCATTTTTTTGCCGCCTATAATAAGGATGGAAAAACTTCAAGTTCAATCTTTGAAGAAACCCACGTGGATCTAGTGCAAAAGGGCGAGGATTGGCTTTA
CAAAACGTCACAGTCATGCTCTGTTGTGGCTACTCTGATTGCAACAGTGGCTTTTGCAACCGCAGCCACTGTCCCGGGCGGCAATGATGACCATGGCGCCGCAGTACTCG
AAAAAGAGCAAGGCTTTTTTATCTTCTCCATCTCTTCCCTCATTGCTCTATGCCTCTCTATAATCTCAATCATCATGTTTCTTTCCATCTTAACCTCCAGGTTCGGTGCC
AAAGACTTCGGATCAAACTTGCCTTCCAAAATGCTCATTGGCTTATCCTCTCTTTACTTTTCCATCGTCGCCATGTTAATTTCCTTTTGTAGCGGCCATTACTTTCTCAT
CGCTCACCGCCTTCAACATGTGCCTATTCTACTCTACACACTTATTTTTCTTCCCATCATATTGATCTTCGGAATATTGCAGCTTCCTCTCTTTTTCGATTTGCTGAAGC
CTATTCGTGAAATAGTGCCTAGTGGGGGCACCAAGGACGTCGTATTCAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGCGCTGACTTAGATTTATTAGAGTTTCTACAGGAAAACACGATGAGAGGGAACTGGGAAGCAGTGGTTGAGAAGTATGCGGAAAAGCCTGAAGCTCAGAAGCT
GCCGCTGACCCCAAAGGGAGACACAGCACTACATTTGGCTATTATTGACAATCAAGAAGAAATAGTCGAAAGGCTTGTCAAACTCATTTGCAGATCAAGAGATTACAAGC
AACATCTTGAGGCTAAAAACAAGAGATATAACAATCCTCTCCACCTCGCTGCGGCACTAGGGAGTGCCAAAATGTGCCACATCATTGCTTCAGCCCATGAGAACTTAGTG
CATAAGAAAAACATCGTCGGCGAAACGCCTCTCTTCTTGGCGGCTCTGTATGGCCACAAGGACGCCTTTTATTCCCTTTACTACTTTTGCAGACACAACCTCACTCAAGC
CACCAACAATTGCTCACGTTCTAATGGAGACACTGTACTACATTTTGTCCTGAAAAACGAGCATTTTGATTTAGCATTTCAATTAATTCACATGCACAAGGACGCTGCTC
ATTGGGTGAATACGCAAGGCTTTACCCCTCTCCATATTCTAGCATGGATATGTGTGGATCAACTAAGGCCTCAATCAATAGAAACCCTCAGTGAAGCGTGGAAGAAAAGT
ATCCCCAAAACAAACACAACTACTTCCTATTTTCCAGCTAACTACCACACATGCATCCACTTCTTTACCAGCTTGTGGGATGGATTTTTAAAAGTCAGCACTTTGAAACT
ACTGGTCAACAAGAAGAAAAGTGATGAAGCGAAGAAGGATAAAGATTTGGAAGAAGATGTTAATGTGGAAACTGATCGATCGGGTATAACCATAATGCCAGTGATGTTAC
TAGCAGCAAAGTACGGTGTGATGGAGATGGTGATGGAACTGTTCAAACGTTTTCGAACGACGATCCATGATACTACGGAAGATGAGAAAAATATAGTGCTTCTGGCTGCA
GAGTGTAGGCAGCTACACGTGTACAGGTTTTTACTCGAGAAAAAAGATGAGATAGAAAGCCTGTTTCGAGCCGTGGATAAATATGGCAATAGCGCTTTGCATCTTGCAGC
AGCCGTCGGAACTTCAAAGCATTTGGGCATCACTGGAGCCGCACTACAGATGCAATGGGAAGTTAGGTGGTACAAGGAATCTCCCTATGCATCGCCTCTCTCAGTTAGTT
TTCAAGATTTTAAAATGGAGTATGTAGGGAGATCCCTACCCATTGATATGCAGCCAAGTAGTGTTCTTAACAATCGGAAAATACTTGGCAATACACCGTACGTGGAGAAT
TCTGTGCCGCACCATTTTTTTGCCGCCTATAATAAGGATGGAAAAACTTCAAGTTCAATCTTTGAAGAAACCCACGTGGATCTAGTGCAAAAGGGCGAGGATTGGCTTTA
CAAAACGTCACAGTCATGCTCTGTTGTGGCTACTCTGATTGCAACAGTGGCTTTTGCAACCGCAGCCACTGTCCCGGGCGGCAATGATGACCATGGCGCCGCAGTACTCG
AAAAAGAGCAAGGCTTTTTTATCTTCTCCATCTCTTCCCTCATTGCTCTATGCCTCTCTATAATCTCAATCATCATGTTTCTTTCCATCTTAACCTCCAGGTTCGGTGCC
AAAGACTTCGGATCAAACTTGCCTTCCAAAATGCTCATTGGCTTATCCTCTCTTTACTTTTCCATCGTCGCCATGTTAATTTCCTTTTGTAGCGGCCATTACTTTCTCAT
CGCTCACCGCCTTCAACATGTGCCTATTCTACTCTACACACTTATTTTTCTTCCCATCATATTGATCTTCGGAATATTGCAGCTTCCTCTCTTTTTCGATTTGCTGAAGC
CTATTCGTGAAATAGTGCCTAGTGGGGGCACCAAGGACGTCGTATTCAACTAG
Protein sequenceShow/hide protein sequence
MSSADLDLLEFLQENTMRGNWEAVVEKYAEKPEAQKLPLTPKGDTALHLAIIDNQEEIVERLVKLICRSRDYKQHLEAKNKRYNNPLHLAAALGSAKMCHIIASAHENLV
HKKNIVGETPLFLAALYGHKDAFYSLYYFCRHNLTQATNNCSRSNGDTVLHFVLKNEHFDLAFQLIHMHKDAAHWVNTQGFTPLHILAWICVDQLRPQSIETLSEAWKKS
IPKTNTTTSYFPANYHTCIHFFTSLWDGFLKVSTLKLLVNKKKSDEAKKDKDLEEDVNVETDRSGITIMPVMLLAAKYGVMEMVMELFKRFRTTIHDTTEDEKNIVLLAA
ECRQLHVYRFLLEKKDEIESLFRAVDKYGNSALHLAAAVGTSKHLGITGAALQMQWEVRWYKESPYASPLSVSFQDFKMEYVGRSLPIDMQPSSVLNNRKILGNTPYVEN
SVPHHFFAAYNKDGKTSSSIFEETHVDLVQKGEDWLYKTSQSCSVVATLIATVAFATAATVPGGNDDHGAAVLEKEQGFFIFSISSLIALCLSIISIIMFLSILTSRFGA
KDFGSNLPSKMLIGLSSLYFSIVAMLISFCSGHYFLIAHRLQHVPILLYTLIFLPIILIFGILQLPLFFDLLKPIREIVPSGGTKDVVFN