; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009673 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009673
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionAnkyrin repeat-containing protein
Genome locationchr09:21641203..21643507
RNA-Seq ExpressionPI0009673
SyntenyPI0009673
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592382.1 Ankyrin repeat-containing protein BDA1, partial [Cucurbita argyrosperma subsp. sororia]1.0e-11663.61Show/hide
Query:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI
        EVDA     LHLAS NG +EIVQAL+EKNTS+CMVRDLNGLIPLHHAVING+I +MQ LI+ RPQS+WMK  NG+T+LHLCVK+NHLEALK LI I+I  
Subjt:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI

Query:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVAT
              LN  + +ENTILD SMMLR IE VRYLLS+PGIK G N      + + EE+C             I +LW+ N  +Y+G+W++EVQ T+MLVAT
Subjt:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVAT

Query:  VIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLT
        VIATV FQGAINPPGG+WQQD PFNSN T I+RSF STNT K F+AGTA+MAYP S    Y YTAYLVTNSISFFAS+CVIMFII R PLKNRIC+ LLT
Subjt:  VIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLT

Query:  TAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTWVAR
          MCVA+A+LS+SY++GVW++N S+RNS  L++TF  A+ I L M+GVV   CI+IPFL  VVKLL W  +
Subjt:  TAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTWVAR

KAG6592383.1 Ankyrin repeat-containing protein BDA1, partial [Cucurbita argyrosperma subsp. sororia]2.4e-12165.61Show/hide
Query:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI
        EVD L    LHLAS NG +EIVQAL+EKNTS+CMVRDLNGLIPLHHAVING+I +MQ LI  R +S W+KL NGQT+LHLCVK+NHLEALK  IT +I  
Subjt:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI

Query:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTN-------NHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQN
          D   LN  D +ENTILDLSMMLRRIE+V+YLLS+PGIKTGTN       ++ + S +R +E+C K  S S T ++ I +LW  N L+YKG+W Q+VQ 
Subjt:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTN-------NHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQN

Query:  TMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNR
        TMMLVATVIATV FQGAINPPGG+WQ+D PFNSN+T I+R F S+NT+K F+AGTA+MAYPTSD Q   YT YLVTNSISFFASICVIMFII R PLKNR
Subjt:  TMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNR

Query:  ICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTWVAR
        ICA +LT  MC+A+ASLS SYLMGVW++NVSYR S  LK+    A+ I L  +GVV   CIIIPFL  VVKLL W  +
Subjt:  ICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTWVAR

XP_022925435.1 uncharacterized protein LOC111432732 [Cucurbita moschata]1.0e-11965.5Show/hide
Query:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI
        EVDAL    LHLAS NG I+IVQAL+EKNTSSCMVRDLNGLIPLHHAVING+I +M  LI+ RPQS+WMKL NGQT+LHLCV +NHLEALK    +LIT+
Subjt:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI

Query:  FNDEQ---LLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMML
          D+Q    LN  D +ENTILDLSMMLRRIE+V+YLLS+ GIKT TN                  STS  PK L+T+  K+  L+YKG+W QEV  TMML
Subjt:  FNDEQ---LLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMML

Query:  VATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICAS
        VATVIATV FQGAINPPGG+WQ+D P+NSN+T I+RSF S+NT+K F+AGTAIMAYP S    Y YTAYL+TNS+SFFAS+CVIMFII R PLKNRICA 
Subjt:  VATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICAS

Query:  LLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW
        LLT  MC+A+ASLS+SYL+GVW++N S+RNSY  +VTFV A+ I L M+GVV   CI+IP ++ VVKLL W
Subjt:  LLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW

XP_022973933.1 uncharacterized protein LOC111472564 [Cucurbita maxima]3.0e-12463.94Show/hide
Query:  MDATQEQQLGGD----EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKN
        M+ T +Q +       EVDAL    LHLAS  G +EIVQAL+EKNTS+CMVRDL+GLIPLHHAVING+I +M  LI+ RPQS+WMKL NGQT+LHLCV +
Subjt:  MDATQEQQLGGD----EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKN

Query:  NHLEALKFLITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHN-------ETSSTRIEEECYKGSSTSSTPKRLITTLWK
        NHLEALK LIT +I    D   LN  D +ENTILDLSMMLR+IE+V+YLLS+ GIKTGTN  N       + S  RIEE CY G STS  PK L+T+  K
Subjt:  NHLEALKFLITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHN-------ETSSTRIEEECYKGSSTSSTPKRLITTLWK

Query:  NNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFAS
        +  L+YKG+W QEVQ TMMLVATVIATV FQGAINPPGG+WQ+D P+NSN+T I+RSF S+NT+K F+AGTA+M YP S    Y YTAYL+TNS+SFFAS
Subjt:  NNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFAS

Query:  ICVIMFIISRFPLKNRICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW
        + VIMFII R PLKNRICA LLT  MC+A+ASLS+SYL+GVW++N S+RNSY  +VTF+ A+ I L M+GVV   CI+IPF++ VVKLL W
Subjt:  ICVIMFIISRFPLKNRICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW

XP_023535332.1 alpha-latrocrustotoxin-Lt1a-like [Cucurbita pepo subsp. pepo]5.0e-11964.29Show/hide
Query:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI
        EVD L    LHLAS NG +EIVQAL+EKNTS+CMVRDLNG IPLHHAVING+I +MQ LI  R +S W+KL NGQT+LHLCVK+NHLEALKF IT +I  
Subjt:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI

Query:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTN-------NHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQN
          D   LN  D +ENTILD SM+LRRIE+V+YLLS+PGIKTGTN       ++ + S +R +E+C K  S S T ++ I +LW  N L+YKG+W Q+VQ 
Subjt:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTN-------NHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQN

Query:  TMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNR
        TMMLVATVIATV FQGAINPPGG+WQ++ PFNSN+T I+R F S+NT+K F+AGTA+MAYPTSD Q   YT YLVTNSISFFASICVIMFII R PLKNR
Subjt:  TMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNR

Query:  ICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTWVAR
        ICA +LT  MC+A+ASLS SYLMGVW++NVSYR S  LK+    A+ I L  +G V   CI+IPFL  VVKLL W  +
Subjt:  ICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTWVAR

TrEMBL top hitse value%identityAlignment
A0A6J1D887 ankyrin repeat-containing protein NPR4-like1.3e-9962.11Show/hide
Query:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI
        EVDAL    LHLASANGE+EIVQAL+EKNTS+C+VRDLNGLIPLHHAVI+GQI++MQ LI  RPQS+W KL NGQT+LHLC K+NHLEA++ LI  L  I
Subjt:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI

Query:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGT--------NNHNETSS-----------TRIEEECYKGSSTSSTPKRLITTLWKNNGL
         NDE+ LN  DN +NT+LDLS+MLR+IE+VRYLLS+PGIK GT        +N N +S            +R+ E+C + SST +T K+L+ + WK + L
Subjt:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGT--------NNHNETSS-----------TRIEEECYKGSSTSSTPKRLITTLWKNNGL

Query:  QYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVI
        +YKG+W+Q+VQ TMMLVATVIATV FQGAINPPGG+WQ+D  FN N TA +RS+   + +  F+AGTA+MAYPTS +Q YGY  YL+ NSISF ASICVI
Subjt:  QYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVI

Query:  MFIISRFPLKNRICASLLTTAM
        +FII+R PL+NRICA LLT  M
Subjt:  MFIISRFPLKNRICASLLTTAM

A0A6J1EC78 uncharacterized protein LOC1114327351.5e-10061.61Show/hide
Query:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV
        LHLAS NG IEIVQAL+EKNTS+CMVRDLNGLIPLHHAVING+I ++Q LI  RP+S W+KL NGQ+ LHLC+K+NHLE+LK LI I+I    +   LN 
Subjt:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV

Query:  PDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQG
         + +ENTILDLS+                         E S +RIEE+C KGSSTS+T K+L+ +LW  N L+YKG+W+QEVQ TMMLVATVIATV FQG
Subjt:  PDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQG

Query:  AINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLTTAMCVAVAS
        AINPPGG+WQ+D  +NSN ++I+RS    NT+K F+AGTAIMAYP    Q Y Y AYL+TNSISFFASICVIMFII R PLKNRIC+ LLT  MC+A+AS
Subjt:  AINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLTTAMCVAVAS

Query:  LSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGM
        LSYSYL+ V ++ VS+ +S +L+VTFV A+ IWL M
Subjt:  LSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGM

A0A6J1EHY8 uncharacterized protein LOC1114327324.9e-12065.5Show/hide
Query:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI
        EVDAL    LHLAS NG I+IVQAL+EKNTSSCMVRDLNGLIPLHHAVING+I +M  LI+ RPQS+WMKL NGQT+LHLCV +NHLEALK    +LIT+
Subjt:  EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITI

Query:  FNDEQ---LLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMML
          D+Q    LN  D +ENTILDLSMMLRRIE+V+YLLS+ GIKT TN                  STS  PK L+T+  K+  L+YKG+W QEV  TMML
Subjt:  FNDEQ---LLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMML

Query:  VATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICAS
        VATVIATV FQGAINPPGG+WQ+D P+NSN+T I+RSF S+NT+K F+AGTAIMAYP S    Y YTAYL+TNS+SFFAS+CVIMFII R PLKNRICA 
Subjt:  VATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICAS

Query:  LLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW
        LLT  MC+A+ASLS+SYL+GVW++N S+RNSY  +VTFV A+ I L M+GVV   CI+IP ++ VVKLL W
Subjt:  LLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW

A0A6J1EHZ5 ankyrin repeat domain-containing protein 16-like1.4e-10666.67Show/hide
Query:  EQQLGGDEVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFL
        E +L G EVD L    LHLAS NG +EIVQAL+EKNTS+CMVRDLNGLIPLHHAVINGQI ++  LI+ RP+S W+KL NGQ++LHLCVK+NHLEALK L
Subjt:  EQQLGGDEVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFL

Query:  ITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHN-------ETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGE
        IT +I    D   LN  D +ENTILD SMMLRRIE+V+YLLS+PGIKTGTN  N       + S +RIEE+C K +S S T K  I +LW  N L+YKG+
Subjt:  ITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHN-------ETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGE

Query:  WVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIIS
        W QEVQ TMMLVATVIATV FQGAINPP G+WQ+D PFNSN T  +R F S+NT+K  +AGTA+MAYPTSD Q   YT YL+TNSISFFASICVIMFII 
Subjt:  WVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIIS

Query:  RFPLKNRICASLLTTAMCVAVASLSYSYLM
        R PLKNRICA +LT  MCVA+A LS SYL+
Subjt:  RFPLKNRICASLLTTAMCVAVASLSYSYLM

A0A6J1IEM9 uncharacterized protein LOC1114725641.5e-12463.94Show/hide
Query:  MDATQEQQLGGD----EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKN
        M+ T +Q +       EVDAL    LHLAS  G +EIVQAL+EKNTS+CMVRDL+GLIPLHHAVING+I +M  LI+ RPQS+WMKL NGQT+LHLCV +
Subjt:  MDATQEQQLGGD----EVDAL----LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKN

Query:  NHLEALKFLITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHN-------ETSSTRIEEECYKGSSTSSTPKRLITTLWK
        NHLEALK LIT +I    D   LN  D +ENTILDLSMMLR+IE+V+YLLS+ GIKTGTN  N       + S  RIEE CY G STS  PK L+T+  K
Subjt:  NHLEALKFLITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHN-------ETSSTRIEEECYKGSSTSSTPKRLITTLWK

Query:  NNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFAS
        +  L+YKG+W QEVQ TMMLVATVIATV FQGAINPPGG+WQ+D P+NSN+T I+RSF S+NT+K F+AGTA+M YP S    Y YTAYL+TNS+SFFAS
Subjt:  NNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFAS

Query:  ICVIMFIISRFPLKNRICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW
        + VIMFII R PLKNRICA LLT  MC+A+ASLS+SYL+GVW++N S+RNSY  +VTF+ A+ I L M+GVV   CI+IPF++ VVKLL W
Subjt:  ICVIMFIISRFPLKNRICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTW

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR43.7e-0824.32Show/hide
Query:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV
        LH A+  G +EIV+AL+EK+       D  G   LH AV      V++ L+D  P  + +   NG T LH+  +    E +  L+ +       +  +N 
Subjt:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV

Query:  PDNDENTILDLSMMLRRIE---MVRYLLSVPG-IKTGTNNHNETSSTRIEEECYKG-----SSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVAT
           D  T  D++  L   E    ++ +LS  G +++   N       +   E  K        T  T K +     +   L  +G  +    N++ +VA 
Subjt:  PDNDENTILDLSMMLRRIE---MVRYLLSVPG-IKTGTNNHNETSSTRIEEECYKG-----SSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVAT

Query:  VIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPT--SDHQRYGYTAYLVTNSISFFASIC-VIMFIISRFPLKNR
        + ATV F      PGG           A A +R F   N + +F +   ++   T      +       V N + + AS+C  I FI S + +  R
Subjt:  VIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPT--SDHQRYGYTAYLVTNSISFFASIC-VIMFIISRFPLKNR

Q6AWW5 Ankyrin repeat-containing protein At5g026201.1e-0729.58Show/hide
Query:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV
        LH A++ G  EIV  L++K      +   NG   LH A  NG   +++ LI+ +   +      GQT LH+ VK  + E +  L+        D  L+N 
Subjt:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV

Query:  PDNDENTILDLSMMLRRIEMVRYLLSVPGI-KTGTNNHNETS
         DN  NT L +++   R E+V+ +L    + +   N   ET+
Subjt:  PDNDENTILDLSMMLRRIEMVRYLLSVPGI-KTGTNNHNETS

Q8GYH5 Ankyrin repeat-containing protein BDA12.4e-0723.71Show/hide
Query:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDE----Q
        LHLA  N ++E+   LV+ + S   +R   G+ PLH     G + ++   +   P+S+     NG+TILH+ + N+  E LK L   +  + + +     
Subjt:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDE----Q

Query:  LLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKG--EWV---------------
        +LN  D   NT+L L+      ++V+ L  V  +    N  N++  T ++    +GS  +   + +I       G    G  EW                
Subjt:  LLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKG--EWV---------------

Query:  ----------QEVQNTMMLVATVIATVTFQGA
                     +N ++++A +I + TFQ A
Subjt:  ----------QEVQNTMMLVATVIATVTFQGA

Q9C7A2 Ankyrin repeat-containing protein ITN13.2e-0723Show/hide
Query:  DATQEQQLGGDEVDALLHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFL
        DAT  Q  G      L+  A+  G  E+V  L+ K  +   +   N    LH A   G ++V++ L+   PQ        GQT LH+ VK    E +K L
Subjt:  DATQEQQLGGDEVDALLHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFL

Query:  ITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQN
        +        D  ++  PD   NT L ++   +R E+V  LLS+P     T   +  ++  I E    G   S     +   L ++  L+   E  Q    
Subjt:  ITILITIFNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQN

Query:  TMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQ---------STNTMKIF------VAGTAIMAYPTSDHQ--------RYGYTAYLVTN
            V  +   V  Q          +Q    N N   I +  +         +TN++ +       VA  AI   P  D+         R  +  + + N
Subjt:  TMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQ---------STNTMKIF------VAGTAIMAYPTSDHQ--------RYGYTAYLVTN

Query:  SISFFASICVIMFIISRFPLKNR--------ICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVV
        +++ F S+ V++  I+    + +        I   +   +MC +VA L+ SY++      V  +N +  ++  V   VI  G++G +
Subjt:  SISFFASICVIMFIISRFPLKNR--------ICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVV

Q9ZU96 Ankyrin repeat-containing protein At2g016802.4e-0727.74Show/hide
Query:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV
        L+ A+    +EIV A+++ + S  M+   NG   LH A   G +++++ LI+     + +K   GQT LH+ VK   LE ++ ++    TI N+      
Subjt:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV

Query:  PDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGT-NNHNETSSTRIEEECYKGSS
         D   NT L ++    R ++   LL+   I+    NN  ET+    ++  Y  S+
Subjt:  PDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGT-NNHNETSSTRIEEECYKGSS

Arabidopsis top hitse value%identityAlignment
AT1G10340.1 Ankyrin repeat family protein2.3e-1322.98Show/hide
Query:  LLHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLN
        LLH A   G+ E+   L+  +       + NGL PLH AV+ G + +++  +D  P S      + +T+ HL  +N +++A  F+   L    N + LL 
Subjt:  LLHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLN

Query:  VPDNDENTILDLSMMLR-RIEMVRYL------------------------------LSVPGIKTGTNNHNETSSTRIEEECYKGS--------------S
          D   NT+L ++  +     ++RY+                              L    ++ GT    E  S    E+ ++GS              +
Subjt:  VPDNDENTILDLSMMLR-RIEMVRYL------------------------------LSVPGIKTGTNNHNETSSTRIEEECYKGS--------------S

Query:  TSSTPKRL------ITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSD
        TS   +R       +    +N   Q   E +Q  +NT+ +VA +IA+V + G INPPGG++ QD P+   +         T   K+F             
Subjt:  TSSTPKRL------ITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSD

Query:  HQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLTTA---MCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVA
                  + N+I+ F S+ +++ ++S  P K +    LL      M V+V  ++ +Y+   W+    Y  +  L    VA
Subjt:  HQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLTTA---MCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVA

AT1G10340.2 Ankyrin repeat family protein2.3e-1322.98Show/hide
Query:  LLHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLN
        LLH A   G+ E+   L+  +       + NGL PLH AV+ G + +++  +D  P S      + +T+ HL  +N +++A  F+   L    N + LL 
Subjt:  LLHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLN

Query:  VPDNDENTILDLSMMLR-RIEMVRYL------------------------------LSVPGIKTGTNNHNETSSTRIEEECYKGS--------------S
          D   NT+L ++  +     ++RY+                              L    ++ GT    E  S    E+ ++GS              +
Subjt:  VPDNDENTILDLSMMLR-RIEMVRYL------------------------------LSVPGIKTGTNNHNETSSTRIEEECYKGS--------------S

Query:  TSSTPKRL------ITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSD
        TS   +R       +    +N   Q   E +Q  +NT+ +VA +IA+V + G INPPGG++ QD P+   +         T   K+F             
Subjt:  TSSTPKRL------ITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSD

Query:  HQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLTTA---MCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVA
                  + N+I+ F S+ +++ ++S  P K +    LL      M V+V  ++ +Y+   W+    Y  +  L    VA
Subjt:  HQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLTTA---MCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVA

AT3G13950.1 unknown protein1.2e-1428.74Show/hide
Query:  WKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFF
        W    L+ +G+W+++ +  +M+ ATVIA ++FQ  +NPPGG+WQ D     N T      +         AGTA++ Y +S  +R  Y   ++++++SF 
Subjt:  WKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFF

Query:  ASICVIMFIISRFPLKNRICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIW
         S+ +I+ +IS   L+NR+  ++L T M VAV  +S ++   + ++    +    + + +V  +V++
Subjt:  ASICVIMFIISRFPLKNRICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIW

AT5G51160.1 Ankyrin repeat family protein9.1e-1825.15Show/hide
Query:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV
        LH A+A G++E V+A +      C ++D +G  PLH A + G+I V++ ++      +  +   GQT LHL V +  +EA+   I  LIT  N   +LN 
Subjt:  LHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFNDEQLLNV

Query:  PDNDENTILDLSMMLRRIEMVRYLL-SVP---------------------------------------------GIKTGTNNHNETSSTRIEEECYKGSS
         D   NT L L+   +  +++  L+ ++P                                             G   GT N   T+ST     C + + 
Subjt:  PDNDENTILDLSMMLRRIEMVRYLL-SVP---------------------------------------------GIKTGTNNHNETSSTRIEEECYKGSS

Query:  TSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNT--MKIFVAGTAIMAYPTSDHQRY
         S + K L+    K    +   +   E ++ +++VA+++AT TFQ ++ PPGG WQ     +S+  A+ ++  + NT   +   AG +IM          
Subjt:  TSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNT--MKIFVAGTAIMAYPTSDHQRY

Query:  GYTAYLVTNSISFFASICVIMFIISRFPLK
         +T ++  N+I F  S+ ++  +   FPL+
Subjt:  GYTAYLVTNSISFFASICVIMFIISRFPLK

AT5G54710.1 Ankyrin repeat family protein2.5e-1525Show/hide
Query:  DEVDALLHLASANGEIEIVQAL--VEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNG-QTILHLCVKNNHLEALKFLITILITI
        +E   LLH A  +G +E+ + L  V+ N       D +GL PLH AVING +++++  +   P S  +      +T+ HL  K    +A  F+       
Subjt:  DEVDALLHLASANGEIEIVQAL--VEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNG-QTILHLCVKNNHLEALKFLITILITI

Query:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRI---------------EEECYK----------------------GSS
         N  QLL   D ++NT+L ++  +    +VR++LS   I     N    ++  +                +E  K                         
Subjt:  FNDEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRI---------------EEECYK----------------------GSS

Query:  TSSTPKRLITTLW-----KNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDH
          S+  R +  L      +N   +   E +Q  +NT+ +VA +IA+V F   INPPGG+  QD PF   ATA       T   KIF              
Subjt:  TSSTPKRLITTLW-----KNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAINPPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDH

Query:  QRYGYTAYLVTNSISFFASICVIMFIISRFPLKN---RICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCII
                 V N+I+ F S+ ++  ++S    +    ++C  +    M +AVAS++ +Y    WI       S  L  T  A   + LG + V VS  ++
Subjt:  QRYGYTAYLVTNSISFFASICVIMFIISRFPLKN---RICASLLTTAMCVAVASLSYSYLMGVWIINVSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCAACTCAAGAACAACAACTTGGTGGTGATGAAGTTGATGCCTTGCTCCACTTAGCTTCGGCTAATGGAGAGATTGAGATTGTTCAAGCTTTGGTAGAGAAGAA
CACGAGCAGTTGCATGGTTCGTGATTTAAATGGCTTGATTCCTCTTCACCATGCAGTGATTAATGGCCAAATTAAGGTTATGCAACATTTGATTGATGTAAGACCACAAT
CTATGTGGATGAAGCTTTCCAATGGCCAAACTATTCTTCATTTGTGTGTTAAGAACAATCATTTGGAGGCTCTTAAGTTTCTCATCACAATACTCATCACGATCTTCAAC
GATGAACAACTTTTGAATGTACCTGATAATGATGAAAATACTATTCTGGATTTGTCTATGATGTTAAGGAGAATTGAGATGGTACGTTATTTATTGTCTGTCCCAGGAAT
AAAAACTGGAACAAACAATCATAATGAAACATCATCAACTAGAATTGAAGAAGAATGTTACAAGGGGTCATCAACATCATCAACACCAAAAAGGTTGATAACAACTTTAT
GGAAGAATAATGGTCTTCAATACAAAGGTGAATGGGTTCAAGAAGTGCAAAATACAATGATGTTAGTAGCAACCGTGATAGCAACTGTGACTTTTCAAGGTGCAATCAAC
CCTCCCGGTGGCCTTTGGCAACAAGATTTTCCTTTCAACTCCAATGCCACCGCTATTTATCGTTCATTTCAAAGTACCAATACTATGAAAATTTTCGTAGCTGGAACTGC
AATAATGGCCTACCCAACTTCAGATCATCAAAGATATGGTTACACAGCTTATTTGGTTACAAATTCCATCTCATTCTTTGCATCAATTTGTGTGATTATGTTCATTATAA
GTCGTTTTCCTTTGAAGAATAGGATTTGTGCATCTTTGTTAACAACTGCTATGTGTGTTGCAGTTGCATCCTTGTCATATAGTTACTTAATGGGGGTTTGGATCATAAAT
GTCTCATATAGAAATTCCTATGTTTTGAAAGTAACATTTGTAGCTGCATTTGTCATCTGGTTAGGAATGATTGGAGTAGTTGTTTCTTCTTGTATTATAATTCCCTTCCT
TAGTTATGTGGTGAAGCTTCTCACTTGGGTAGCTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCAACTCAAGAACAACAACTTGGTGGTGATGAAGTTGATGCCTTGCTCCACTTAGCTTCGGCTAATGGAGAGATTGAGATTGTTCAAGCTTTGGTAGAGAAGAA
CACGAGCAGTTGCATGGTTCGTGATTTAAATGGCTTGATTCCTCTTCACCATGCAGTGATTAATGGCCAAATTAAGGTTATGCAACATTTGATTGATGTAAGACCACAAT
CTATGTGGATGAAGCTTTCCAATGGCCAAACTATTCTTCATTTGTGTGTTAAGAACAATCATTTGGAGGCTCTTAAGTTTCTCATCACAATACTCATCACGATCTTCAAC
GATGAACAACTTTTGAATGTACCTGATAATGATGAAAATACTATTCTGGATTTGTCTATGATGTTAAGGAGAATTGAGATGGTACGTTATTTATTGTCTGTCCCAGGAAT
AAAAACTGGAACAAACAATCATAATGAAACATCATCAACTAGAATTGAAGAAGAATGTTACAAGGGGTCATCAACATCATCAACACCAAAAAGGTTGATAACAACTTTAT
GGAAGAATAATGGTCTTCAATACAAAGGTGAATGGGTTCAAGAAGTGCAAAATACAATGATGTTAGTAGCAACCGTGATAGCAACTGTGACTTTTCAAGGTGCAATCAAC
CCTCCCGGTGGCCTTTGGCAACAAGATTTTCCTTTCAACTCCAATGCCACCGCTATTTATCGTTCATTTCAAAGTACCAATACTATGAAAATTTTCGTAGCTGGAACTGC
AATAATGGCCTACCCAACTTCAGATCATCAAAGATATGGTTACACAGCTTATTTGGTTACAAATTCCATCTCATTCTTTGCATCAATTTGTGTGATTATGTTCATTATAA
GTCGTTTTCCTTTGAAGAATAGGATTTGTGCATCTTTGTTAACAACTGCTATGTGTGTTGCAGTTGCATCCTTGTCATATAGTTACTTAATGGGGGTTTGGATCATAAAT
GTCTCATATAGAAATTCCTATGTTTTGAAAGTAACATTTGTAGCTGCATTTGTCATCTGGTTAGGAATGATTGGAGTAGTTGTTTCTTCTTGTATTATAATTCCCTTCCT
TAGTTATGTGGTGAAGCTTCTCACTTGGGTAGCTAGGTAA
Protein sequenceShow/hide protein sequence
MDATQEQQLGGDEVDALLHLASANGEIEIVQALVEKNTSSCMVRDLNGLIPLHHAVINGQIKVMQHLIDVRPQSMWMKLSNGQTILHLCVKNNHLEALKFLITILITIFN
DEQLLNVPDNDENTILDLSMMLRRIEMVRYLLSVPGIKTGTNNHNETSSTRIEEECYKGSSTSSTPKRLITTLWKNNGLQYKGEWVQEVQNTMMLVATVIATVTFQGAIN
PPGGLWQQDFPFNSNATAIYRSFQSTNTMKIFVAGTAIMAYPTSDHQRYGYTAYLVTNSISFFASICVIMFIISRFPLKNRICASLLTTAMCVAVASLSYSYLMGVWIIN
VSYRNSYVLKVTFVAAFVIWLGMIGVVVSSCIIIPFLSYVVKLLTWVAR