; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC00g0110 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC00g0110
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionp-hydroxybenzoic acid efflux pump subunit aaeB
Genome locationscaffold60:1204583..1207959
RNA-Seq ExpressionMC00g0110
SyntenyMC00g0110
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607229.1 hypothetical protein SDJN03_00571, partial [Cucurbita argyrosperma subsp. sororia]1.22e-31263.23Show/hide
Query:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT
        +   RALW  RLASAFRTALAC++VA  TLYGP  LR QVAFPAFSYLTA+LIVTNA+LGD I G  LALFAT+QTVCPAM +FWFIGPTKFS +T A+T
Subjt:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT

Query:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT
        VALAS+VVVL  ST +LAK+IALGQIV+IYVVGFIGG  TDPLMHP+HVAATTA+GA ASVCATLLPFPRLASLQV++K K VVEN  ERL L+VKA LT
Subjt:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT

Query:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV
         DD +AA SISKA+LLSSSA KLLHSI+H+QES  WERLP KI  + W  N E LQ+L   L GMELALS+   P  P  + QN+    D  +NAL+  +
Subjt:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV

Query:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD
        +LAL QAN  SL      P DD  N   ++QIMPT+     H FF FC+KLL +K Q++     + +   +    + +M ALK A+SLG AVFLGLMY  
Subjt:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD

Query:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF
        ENGFWASLAVA+S TS+RE TF+ ANVKVHGTMLGSVYG+LSFV+F   L GRLLCLLPWFVFT+FL+H+ MYGSAGGV+A+VGALVVLGRTNYG+P EF
Subjt:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF

Query:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS
        AFVR +ETFIG+SIS+  D+I QPTRASK+AKIQL  +L++LQ CI  L+  S +E+  RAL IQV ELKKLI+EAE EPN WFLPF S+ Y KL  SLS
Subjt:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS

Query:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC
        K VDFLA S +AM  LK+           ++VE L+G +E+FKE++ + V+CY D+SSLK LRV+E EGEK + C DVEMGE  +I +DEI EKE LI C
Subjt:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC

Query:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI
         L+HS E IVD+ G G+D + + ILSL A+AFCL++L+R I+EIGEA REL+QWENP S  +DF+ I
Subjt:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI

KAG7036914.1 hypothetical protein SDJN02_00534, partial [Cucurbita argyrosperma subsp. argyrosperma]7.01e-31263.1Show/hide
Query:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT
        +   RALW  RLASAFRTALAC++VA  TLYGP  LR  VAFPAFSYLTA+LIVTNA+LGD I G  LALFAT+QTVCPAM +FWFIGPTKFS +T A+T
Subjt:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT

Query:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT
        VALAS+VVVL  ST +LAK+IALGQIV+IYVVGFIGG  TDPLMHP+HVAATTA+GA ASVCATLLPFPRLASLQV++K K VVEN  ERL L+VKA LT
Subjt:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT

Query:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV
         DD +AA SISKA+LLSSSA KLLHSI+H+QES  WERLP KI  + W  N E LQ+L   L GMELALS+   P  P  + QN+    D  +NAL+  +
Subjt:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV

Query:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD
        +LAL QAN  SL      P DD  N   ++QIMPT+     H FF FC+KLL +K Q++     + +   +    + +M ALK A+SLG AVFLGLMY  
Subjt:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD

Query:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF
        ENGFWASLAVA+S TS+RE TF+ ANVKVHGTMLGSVYG+LSFV+F   L GRLLCLLPWFVFT+FL+H+ MYGSAGGV+A+VGALVVLGRTNYG+P EF
Subjt:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF

Query:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS
        AFVR +ETFIG+SIS+  D+I QPTRASK+AKIQL  +L++LQ CI  L+  S +E+  RAL IQV ELKKLI+EAE EPN WFLPF S+ Y KL  SLS
Subjt:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS

Query:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC
        K VDFLA S +AM  LK+           ++VE L+G +E+FKE++ + V+CY D+SSLK LRV+E EGEK + C DVEMGE  +I +DEI EKE LI C
Subjt:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC

Query:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI
         L+HS E IVD+ G G+D + + ILSL A+AFCL++L+R I+EIGEA REL+QWENP S  +DF+ I
Subjt:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI

XP_022141933.1 uncharacterized protein LOC111012189 [Momordica charantia]0.0100Show/hide
Query:  QSRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTAL
        QSRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTAL
Subjt:  QSRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTAL

Query:  TVALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL
        TVALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL
Subjt:  TVALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL

Query:  TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKH
        TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKH
Subjt:  TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKH

Query:  VALALNQANNASLPSDDPINAVQIMPTHHNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAIS
        VALALNQANNASLPSDDPINAVQIMPTHHNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAIS
Subjt:  VALALNQANNASLPSDDPINAVQIMPTHHNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAIS

Query:  FTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLS
        FTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLS
Subjt:  FTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLS

Query:  ISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMD
        ISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMD
Subjt:  ISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMD

Query:  GLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRG
        GLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRG
Subjt:  GLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRG

Query:  AGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI
        AGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI
Subjt:  AGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI

XP_022948311.1 uncharacterized protein LOC111452025 [Cucurbita moschata]1.06e-31363.49Show/hide
Query:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT
        +   RALW  RLASAFRTALAC++VA  TLYGP  LR QVAFPAFSYLTA+LIVTNA+LGD I G  LALFAT+QTVCPAM +FWFIGPTKFS +T A+T
Subjt:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT

Query:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT
        VALAS+VVVL  ST +LAK+IALGQIV+IYVVGFIGG  TDPLMHP+HVAATTA+GA ASVCATLLPFPRLASLQV++K K VVEN  ERL L+VKA LT
Subjt:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT

Query:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV
         DD +AA SISKA+LLSSSA KLLHSI+H+QES  WERLP KI  + W  N E LQ+L   L GMELALS+   P  P  + QN+    D  +NAL+  +
Subjt:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV

Query:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD
        +LAL QAN  SL      P DD  N   ++QIMPT+     H FF FC+KLL +K Q++     + +   +    + +M ALK A+SLG AVFLGLMY  
Subjt:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD

Query:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF
        ENGFWASLAVA+S TS+RE TF+ ANVKVHGTMLGSVYG+LSFV+F   L GRLLCLLPWFVFT+FL+H+ MYGSAGGV+A+VGALVVLGRTNYG+P EF
Subjt:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF

Query:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS
        AFVR +ETFIG+SIS+  D+I QPTRASK+AKIQL  +L++LQ CI  L+  S +E+  RAL IQV ELKKLI+EAE EPN WFLPF S+ Y KL  SLS
Subjt:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS

Query:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC
        K VDFLA S +AM  LK+           ++VE L+G +E+FKE++ + V+CY D+SSLK LRV+E EGEK + C DVEMGE  +I +DEI EKE LI C
Subjt:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC

Query:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI
         L+HS E IVD+ G G+D + + ILSL A+AFCLS+LMR I+EIGEA REL+QWENP S  +DF+ I
Subjt:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI

XP_022998765.1 uncharacterized protein LOC111493334 [Cucurbita maxima]1.50e-31363.49Show/hide
Query:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT
        +   RALW  RLASAFRTALAC++VA  TLYGP  LR QVAFPAFSYLTA+LIVTNA+LGD I G  LALFAT+QTVCPAM +FWFIGPTKFS +T A+T
Subjt:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT

Query:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT
        VALAS+VVVL  ST +LAK+IALGQIV+IYVVGFIGG  TDPLMHP+HVAATTA+GA ASVCATLLPFPRLASLQV++K K VVEN  ERL L+VKA LT
Subjt:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT

Query:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV
         DD VAA SISKA+LLSSSA KLLHSI+H+QES  WE LP KI  + W  N E L++L   L+GMELALS+   P  P  + QN+    D  +NAL+  +
Subjt:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV

Query:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD
        +LAL QAN  SL      P DD  N   ++QIMPT+     H FF FC+KLL +K Q++     + +   +    + +M ALK A+SLG AVFLGLMY  
Subjt:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD

Query:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF
        ENGFWASLAVA+S TS+RE TF+ ANVKVHGTMLGSVYG+LSFV+F   L GRLLCLLPWFVFT+FL+H+ MYGSAGGV+A+VGALVVLGRTNYG+P EF
Subjt:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF

Query:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS
        AFVR +ETFIG+SIS+  D+I QPTRASK+AKIQL  +L++LQ CI  L+  S +E+  RAL IQV ELKKLI+EAEAEPN WFLPF S+ Y KL  SLS
Subjt:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS

Query:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC
        K VDFLA S +AM  LK+           ++VE L+G +E+FKE++   V+CY D+SSLK L+V+E EGEK + C DVEMGE  +I +DEI EKE LI C
Subjt:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC

Query:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI
         L+HS E IVD+ G G+D K + ILSL A+AFCLS+LMR I+EIGEA REL+QWENP S  +DF+ I
Subjt:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI

TrEMBL top hitse value%identityAlignment
A0A0A0LUT1 Uncharacterized protein7.14e-28658.42Show/hide
Query:  RALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALTVALA
        RA+W TRLASA R ALACSIVA  TLYGPA+LR  VAFPAFSYLTA LIVTNA LGDA+ G CL +FAT+QTVCPAM +FWFIGP KFS +TTA+TVALA
Subjt:  RALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALTVALA

Query:  SVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLTDDDR
        SVVVVL  ST LLAK+IALGQIVIIYVVGFIGG HTDPLMHP+HVAATTA+GA AS+ ATLLPFPRLASLQV+ K K+VVEN  ERL L+VKA L +D  
Subjt:  SVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLTDDDR

Query:  VAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNE---LAMRDDDINALEKHVA
        +AAASIS+AQ LSSSA KLLHSIK YQES QWE+ PL+IC M W  N E L+DL M L GMELALS       P Y  QN          D+N LE  + 
Subjt:  VAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNE---LAMRDDDINALEKHVA

Query:  LALNQANNASLPSD-----------------DPINAVQIMPTHHN-----FFTFCLKLLHSKCQLKRPTKLEQQV-------------HANYW----RER
        L+L QAN    PSD                 + + ++QI PT H      FF FC+KLL+ K Q+K P K +++              + + W      +
Subjt:  LALNQANNASLPSD-----------------DPINAVQIMPTHHN-----FFTFCLKLLHSKCQLKRPTKLEQQV-------------HANYW----RER

Query:  LIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSA
         ++ ALK A+SLG +V LGL+Y+ ENGFW SLAVA+S  S+REPTF+ AN+KVHGTMLGS++G+LSFV+F   L GRLLCLLPWFVFT+FL+HSTMYGSA
Subjt:  LIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSA

Query:  GGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRP-LSFGSSV---EDLRALGIQVCELKKLI
        GG+SA+VGALVVLGRTNYGSP EFAF RM+ETFIG+SIS+  D+IFQP RASKL KIQL  +L+ LQ CI     + SS    +DL+ L  QV E+KKLI
Subjt:  GGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRP-LSFGSSV---EDLRALGIQVCELKKLI

Query:  DEAEAEPNLWFL-PFQSSCYGKLFDSLSKIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVV-NVSVTCYADVSSLKSLRVLEKEGEK
        DEAE EPN  FL PF    + K+F+SLSK+V  LAL+ EAM+ LK+ L       W K+ E L+G  EKFKE++ N  VT Y ++ S  SL+ L+ +  K
Subjt:  DEAEAEPNLWFL-PFQSSCYGKLFDSLSKIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVV-NVSVTCYADVSSLKSLRVLEKEGEK

Query:  NDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWE
         D C D+EMGE Q+I   + +EKE LI  FLQH  E++  + G  E    E ILSL A+AFCL++LM+E++E+GEA REL++WE
Subjt:  NDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWE

A0A1S3C6B7 uncharacterized protein LOC1034971742.89e-28858.62Show/hide
Query:  LWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALTVALASV
        LW T  A+  RTA+ACSI+A AT+YGP  LR QV FPAFSY+TAILIVTNATLGD + G  LAL+ATLQTVCPAM VFWFIGPTKFS  T ALTVALASV
Subjt:  LWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALTVALASV

Query:  VVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLTDDDRVA
        VVVL  S+ +LAKRIALGQIVIIYVVGFIGG  T+PLMHP+HVA+TTAMG  AS  ATLLPFPRLASL+V++K KA+VE  GERL++LVKAFL D+D VA
Subjt:  VVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLTDDDRVA

Query:  AASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHVALALNQ
          S+SKA LLS+SA KLL  IK YQESM+WE +PLK+C + W  N + LQDL   +RGMELALS+ A+ P         L    + IN+LE  +  +LNQ
Subjt:  AASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHVALALNQ

Query:  ------ANNASLPSDDP----------INAVQIM-PTHHN-----FFTFCLKLLHSKCQ-------LKRPTKLEQQVHANYWR-------ERLIMAALKL
              +++ + P  +P          IN +Q+  PT+H      FF FCLKLL  K Q        K+  + +Q  +   W         + +M ALK 
Subjt:  ------ANNASLPSDDP----------INAVQIM-PTHHN-----FFTFCLKLLHSKCQ-------LKRPTKLEQQVHANYWR-------ERLIMAALKL

Query:  AVSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVG
        A+SLG AV+LGL+YS ENGFWASL VA+S    RE TF+ ANVK+ GT++GSVYGVL FV+F   L GRLLCLLP FVFT+FL+ S MYG+AGGVSA++G
Subjt:  AVSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVG

Query:  ALVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRA----LGIQVCELKKLIDEAEAEPN
        A+++LGRTNYGSP E AF R+VET IG+S SI  D+I  PTRASKLAK QLT+TLR L  CI   SF    EDL+     LG  V ELKKLIDEA  EPN
Subjt:  ALVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRA----LGIQVCELKKLIDEAEAEPN

Query:  LWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMDGLKQNLSMTTEDS-WGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDI----CG
         WFLPFQS CYGKL  SLSK VD  A  + +++G+ QNL +  + S W K+ E L+  +E FKE+++  V C ADVSSLKSL+VLEKE EK +      G
Subjt:  LWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMDGLKQNLSMTTEDS-WGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDI----CG

Query:  DVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI
        DVEMGE + + + E ME+E L+  F++H  E IV+Q    E+GKREA+LS  ALAFCLS+LM+EI+EIG+ATRELIQWENP SHV DFNEISSKI
Subjt:  DVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI

A0A6J1CLZ8 uncharacterized protein LOC1110121890.0100Show/hide
Query:  QSRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTAL
        QSRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTAL
Subjt:  QSRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTAL

Query:  TVALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL
        TVALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL
Subjt:  TVALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL

Query:  TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKH
        TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKH
Subjt:  TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKH

Query:  VALALNQANNASLPSDDPINAVQIMPTHHNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAIS
        VALALNQANNASLPSDDPINAVQIMPTHHNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAIS
Subjt:  VALALNQANNASLPSDDPINAVQIMPTHHNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAIS

Query:  FTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLS
        FTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLS
Subjt:  FTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLS

Query:  ISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMD
        ISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMD
Subjt:  ISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMD

Query:  GLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRG
        GLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRG
Subjt:  GLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRG

Query:  AGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI
        AGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI
Subjt:  AGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI

A0A6J1G8X2 uncharacterized protein LOC1114520255.15e-31463.49Show/hide
Query:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT
        +   RALW  RLASAFRTALAC++VA  TLYGP  LR QVAFPAFSYLTA+LIVTNA+LGD I G  LALFAT+QTVCPAM +FWFIGPTKFS +T A+T
Subjt:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT

Query:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT
        VALAS+VVVL  ST +LAK+IALGQIV+IYVVGFIGG  TDPLMHP+HVAATTA+GA ASVCATLLPFPRLASLQV++K K VVEN  ERL L+VKA LT
Subjt:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT

Query:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV
         DD +AA SISKA+LLSSSA KLLHSI+H+QES  WERLP KI  + W  N E LQ+L   L GMELALS+   P  P  + QN+    D  +NAL+  +
Subjt:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV

Query:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD
        +LAL QAN  SL      P DD  N   ++QIMPT+     H FF FC+KLL +K Q++     + +   +    + +M ALK A+SLG AVFLGLMY  
Subjt:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD

Query:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF
        ENGFWASLAVA+S TS+RE TF+ ANVKVHGTMLGSVYG+LSFV+F   L GRLLCLLPWFVFT+FL+H+ MYGSAGGV+A+VGALVVLGRTNYG+P EF
Subjt:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF

Query:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS
        AFVR +ETFIG+SIS+  D+I QPTRASK+AKIQL  +L++LQ CI  L+  S +E+  RAL IQV ELKKLI+EAE EPN WFLPF S+ Y KL  SLS
Subjt:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS

Query:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC
        K VDFLA S +AM  LK+           ++VE L+G +E+FKE++ + V+CY D+SSLK LRV+E EGEK + C DVEMGE  +I +DEI EKE LI C
Subjt:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC

Query:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI
         L+HS E IVD+ G G+D + + ILSL A+AFCLS+LMR I+EIGEA REL+QWENP S  +DF+ I
Subjt:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI

A0A6J1KDE1 uncharacterized protein LOC1114933347.28e-31463.49Show/hide
Query:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT
        +   RALW  RLASAFRTALAC++VA  TLYGP  LR QVAFPAFSYLTA+LIVTNA+LGD I G  LALFAT+QTVCPAM +FWFIGPTKFS +T A+T
Subjt:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT

Query:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT
        VALAS+VVVL  ST +LAK+IALGQIV+IYVVGFIGG  TDPLMHP+HVAATTA+GA ASVCATLLPFPRLASLQV++K K VVEN  ERL L+VKA LT
Subjt:  VALASVVVVLQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLT

Query:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV
         DD VAA SISKA+LLSSSA KLLHSI+H+QES  WE LP KI  + W  N E L++L   L+GMELALS+   P  P  + QN+    D  +NAL+  +
Subjt:  DDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHV

Query:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD
        +LAL QAN  SL      P DD  N   ++QIMPT+     H FF FC+KLL +K Q++     + +   +    + +M ALK A+SLG AVFLGLMY  
Subjt:  ALALNQANNASL------PSDDPIN---AVQIMPTH-----HNFFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSD

Query:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF
        ENGFWASLAVA+S TS+RE TF+ ANVKVHGTMLGSVYG+LSFV+F   L GRLLCLLPWFVFT+FL+H+ MYGSAGGV+A+VGALVVLGRTNYG+P EF
Subjt:  ENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEF

Query:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS
        AFVR +ETFIG+SIS+  D+I QPTRASK+AKIQL  +L++LQ CI  L+  S +E+  RAL IQV ELKKLI+EAEAEPN WFLPF S+ Y KL  SLS
Subjt:  AFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL-RALGIQVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLS

Query:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC
        K VDFLA S +AM  LK+           ++VE L+G +E+FKE++   V+CY D+SSLK L+V+E EGEK + C DVEMGE  +I +DEI EKE LI C
Subjt:  KIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDICGDVEMGEPQKIGKDEIMEKENLIVC

Query:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI
         L+HS E IVD+ G G+D K + ILSL A+AFCLS+LMR I+EIGEA REL+QWENP S  +DF+ I
Subjt:  FLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G28780.1 unknown protein1.1e-16343.73Show/hide
Query:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT
        + +GRA+W T LASAFRTALAC+IV  ATLYGP  +   VAFPAFSY+T ILI+T+ATLGD + G  LAL+AT Q+V PA++    I P + +  TTAL 
Subjt:  SRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALT

Query:  VALASVVVVL-QRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL
         ALA+ VVVL   ST L+AKRIALGQIV+IYV+G+I G  TDP+MHP+ VAA+TA+G  A V A L+P PRLA+ +V+   K + +N   R+KL +KAF 
Subjt:  VALASVVVVL-QRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFL

Query:  TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSR-ND---ESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINA
        +DD   A AS+S+A++L+ S+ KL  ++K YQ SM WERLP KI   RW   ND   E LQ + + LRGME+ ++S +  PS     + +     +D+  
Subjt:  TDDDRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSR-ND---ESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINA

Query:  LEKHVALALNQANNASLPS----DDPINAVQIMPTHHN-----------FFTFCLKLLHSKCQLKRPTKLEQQVHANY---------WRERLIMAALKLA
        +++ V L++ + NN+S PS     DP N  + + T              FF FC++LL +   + +P + + +V  N          W  + IM ALKL+
Subjt:  LEKHVALALNQANNASLPS----DDPINAVQIMPTHHN-----------FFTFCLKLLHSKCQLKRPTKLEQQVHANY---------WRERLIMAALKLA

Query:  VSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGA
        +SLG A+ LG M+S  NG+WA L VA+SF + RE TF+  NVK  GT++G+VYGV+   VF   L  R L LLPWF+F++FL  S MYG AGG+SA +GA
Subjt:  VSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGA

Query:  LVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL----RALGIQVCELKKLIDEAEAEPNL
        +++LGR N+G P+EFA  R++ETFIGLS SI  +++FQPTRA+ +AK++L+ +  AL  C       +S  D+    + L   + ELKK   EA AEP+ 
Subjt:  LVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDL----RALGIQVCELKKLIDEAEAEPNL

Query:  WFLPFQSSCYGKLFDSLSKIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKND-ICGDVEMG
        WF PF  SCY KLF SLSK+ D L  S  A+ G       T      +++  +D  L+   E + +    + +++ LKSL  LEK   K+D    D+E+G
Subjt:  WFLPFQSSCYGKLFDSLSKIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKND-ICGDVEMG

Query:  E-PQKIGKDEIMEKENLIVCFLQHSAEVIVDQRGAGEDG-------KREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI
        + P       + E E ++  +LQH   V        EDG       K E +LSL AL FC+  + +E  EI E  +E++Q ENP SH V+ +EIS KI
Subjt:  E-PQKIGKDEIMEKENLIVCFLQHSAEVIVDQRGAGEDG-------KREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI

AT3G09450.1 CONTAINS InterPro DOMAIN/s: Fusaric acid resistance protein, conserved region (InterPro:IPR006726)3.4e-10135.08Show/hide
Query:  WVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILI---VTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKF-SPVTTALTVAL
        W+ RL  A RTA+AC IV+  TLYGP  LRH   FPAFSYLT ILI       T G+ +       +AT QT+  A++    +GP    + +   + VAL
Subjt:  WVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILI---VTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKF-SPVTTALTVAL

Query:  ASVVVVLQRSTQLLAKRIALGQIVIIYVVGFI-GGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLTDD
        AS +V    ST LL KRIA GQIV++YV   +  G      M P+HVA +TA+GA AS+ A LLPFPRLA  Q+    K   ENA ERL + V+  +  D
Subjt:  ASVVVVLQRSTQLLAKRIALGQIVIIYVVGFI-GGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLTDD

Query:  DRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRND-ESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHVA
        +  A   I++A  LS++A   L +IK + E + WER   +    +   +  E L      LRG+ELAL S ++ P           M  D++  L +   
Subjt:  DRVAAASISKAQLLSSSAPKLLHSIKHYQESMQWERLPLKICGMRWSRND-ESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHVA

Query:  LALNQANNASLPSDDPINAVQIMPTHHN------------FFTFCLKLLHSK-CQLKRPTK------LEQQVH---------ANYW--------RERLIM
          +   + ++L S D +        HH             FF +C++L       L++ +K       E+++H           +W        RER + 
Subjt:  LALNQANNASLPSDDPINAVQIMPTHHN------------FFTFCLKLLHSK-CQLKRPTK------LEQQVH---------ANYW--------RERLIM

Query:  AALKLAVSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGV
         A K ++SLG AV  G++Y+  NG+W+ L VAIS  S R+ T   AN ++ GT +GSVYG++   VF      R L LLPW +   F+RHS +YG  GGV
Subjt:  AALKLAVSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGRLLCLLPWFVFTTFLRHSTMYGSAGGV

Query:  SAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSS------VEDLR----ALGIQVCELK
        +A + AL++LGR NYG+PTEFA  R+VE  IGL   +  +++  P RA+ LA+ +++  L AL +CI+ L   S       V DLR     L   V  L+
Subjt:  SAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSS------VEDLR----ALGIQVCELK

Query:  KLIDEAEAEPNLWFL-PFQSSCYGKLFDSLSKIVDFLALSTEAMDGLK--QNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEK
        +   EA  EP + FL    +  Y +L  S SKI D   L     DGLK    +  T    W  +       L  F+E ++ SV C  ++S  KS   L+K
Subjt:  KLIDEAEAEPNLWFL-PFQSSCYGKLFDSLSKIVDFLALSTEAMDGLK--QNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEK

Query:  EGEKNDICGDVEMGEP-------QKIGKDEI-MEKENLIVCFLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMRE
        E +K  IC DVE G          ++G  +  +E+ ++    L   A   +    A +  K E  L L +L FC+S LM+E
Subjt:  EGEKNDICGDVEMGEP-------QKIGKDEI-MEKENLIVCFLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAATCTAGGCAAGGCCGAGCCTTGTGGGTCACGCGCCTGGCCTCTGCCTTTCGGACTGCACTGGCCTGCTCCATAGTGGCCTGTGCCACTTTGTACGGCCCGGCTTCTCT
CCGCCACCAAGTGGCCTTTCCGGCCTTCTCCTATCTCACCGCTATCCTCATCGTCACCAACGCCACTCTCGGCGACGCCATCTGTGGCTCCTGCCTCGCACTCTTCGCCA
CGCTCCAGACCGTGTGTCCAGCCATGCTCGTGTTTTGGTTCATCGGGCCAACCAAATTCTCACCCGTCACGACGGCACTCACGGTGGCGTTGGCCTCAGTGGTGGTGGTG
CTGCAGAGGTCCACCCAATTGCTGGCCAAAAGGATTGCTCTGGGCCAGATTGTGATCATTTACGTCGTGGGCTTCATCGGAGGACCCCACACGGACCCTCTAATGCACCC
AATCCACGTGGCAGCCACGACGGCCATGGGCGCCACCGCCAGTGTCTGTGCCACGCTGCTTCCCTTCCCACGCCTTGCTTCTCTTCAGGTGAGAGATAAGAGGAAGGCAG
TGGTGGAGAACGCGGGGGAGAGATTAAAGCTATTGGTGAAAGCATTCCTCACCGACGACGACAGGGTGGCTGCTGCCTCCATATCCAAAGCTCAGTTATTGTCTTCTTCA
GCACCAAAACTCCTCCACTCCATAAAACATTACCAAGAAAGCATGCAATGGGAGAGGCTTCCATTAAAAATTTGTGGAATGAGATGGTCGAGGAATGACGAGAGTTTGCA
AGATTTAGGAATGGGCTTAAGGGGAATGGAACTAGCATTGTCGTCCACTGCCACTCCTCCTTCTCCCTGGTATCAATTTCAAAACGAACTAGCGATGAGAGATGACGATA
TAAATGCATTGGAGAAGCACGTCGCCCTTGCCTTAAACCAAGCCAACAATGCTTCTTTGCCGTCTGATGATCCAATCAACGCCGTTCAGATCATGCCAACGCATCATAAT
TTCTTCACATTCTGCCTCAAACTCCTCCACAGTAAATGCCAACTCAAACGCCCCACCAAATTAGAACAACAAGTTCACGCAAATTATTGGAGGGAGAGGCTAATAATGGC
AGCGCTGAAACTCGCAGTTTCATTGGGCGCTGCGGTGTTTTTGGGGTTAATGTACAGTGACGAAAACGGGTTCTGGGCGAGCCTTGCAGTGGCCATCAGCTTCACTTCCG
ACAGAGAGCCCACATTCAGAGCCGCCAACGTTAAGGTTCATGGGACAATGTTAGGATCTGTGTATGGAGTTTTGAGTTTTGTCGTTTTCCCAGGTTGTTTGGCCGGAAGG
CTTCTTTGTCTTCTTCCTTGGTTCGTTTTCACCACGTTTCTCCGACACAGTACCATGTACGGTTCCGCCGGCGGAGTTTCCGCGGTTGTCGGAGCTTTGGTTGTTTTGGG
AAGAACAAATTATGGTTCTCCAACAGAATTTGCTTTTGTGAGGATGGTGGAAACTTTTATTGGGCTATCAATTTCCATCGCTGCGGATGTCATTTTTCAGCCAACTAGAG
CTTCTAAACTGGCCAAAATTCAGCTCACTGCCACTCTACGAGCGCTGCAAAACTGCATCCGCCCACTCAGTTTTGGATCAAGTGTTGAAGATTTGAGAGCTTTGGGGATT
CAAGTTTGTGAGCTGAAGAAGCTCATCGACGAGGCTGAGGCGGAGCCCAACTTGTGGTTTTTGCCGTTTCAGAGCAGTTGCTATGGGAAGCTGTTCGATTCATTGTCGAA
AATCGTCGATTTCTTGGCTCTGAGTACAGAGGCAATGGATGGTCTTAAGCAGAATCTATCAATGACAACGGAGGATTCGTGGGGAAAACTTGTGGAAACTTTAGACGGAG
GGCTCGAGAAATTTAAGGAAGTAGTGAATGTGTCGGTGACATGTTATGCGGATGTGAGTTCGTTGAAATCTCTGAGAGTGCTTGAAAAGGAGGGCGAGAAGAATGATATT
TGTGGCGATGTTGAGATGGGAGAGCCACAGAAGATTGGAAAAGATGAAATAATGGAGAAGGAAAATTTGATTGTTTGTTTTCTGCAACACTCGGCAGAGGTTATTGTTGA
TCAGAGAGGTGCTGGTGAAGATGGTAAAAGGGAAGCAATTCTTAGTCTGAGGGCTTTGGCATTTTGCTTAAGCAATTTGATGAGAGAGATAGACGAGATTGGAGAGGCAA
CTAGAGAACTGATTCAATGGGAAAATCCTTGCTCTCATGTTGTCGACTTCAATGAAATATCATCTAAGATT
mRNA sequenceShow/hide mRNA sequence
CAATCTAGGCAAGGCCGAGCCTTGTGGGTCACGCGCCTGGCCTCTGCCTTTCGGACTGCACTGGCCTGCTCCATAGTGGCCTGTGCCACTTTGTACGGCCCGGCTTCTCT
CCGCCACCAAGTGGCCTTTCCGGCCTTCTCCTATCTCACCGCTATCCTCATCGTCACCAACGCCACTCTCGGCGACGCCATCTGTGGCTCCTGCCTCGCACTCTTCGCCA
CGCTCCAGACCGTGTGTCCAGCCATGCTCGTGTTTTGGTTCATCGGGCCAACCAAATTCTCACCCGTCACGACGGCACTCACGGTGGCGTTGGCCTCAGTGGTGGTGGTG
CTGCAGAGGTCCACCCAATTGCTGGCCAAAAGGATTGCTCTGGGCCAGATTGTGATCATTTACGTCGTGGGCTTCATCGGAGGACCCCACACGGACCCTCTAATGCACCC
AATCCACGTGGCAGCCACGACGGCCATGGGCGCCACCGCCAGTGTCTGTGCCACGCTGCTTCCCTTCCCACGCCTTGCTTCTCTTCAGGTGAGAGATAAGAGGAAGGCAG
TGGTGGAGAACGCGGGGGAGAGATTAAAGCTATTGGTGAAAGCATTCCTCACCGACGACGACAGGGTGGCTGCTGCCTCCATATCCAAAGCTCAGTTATTGTCTTCTTCA
GCACCAAAACTCCTCCACTCCATAAAACATTACCAAGAAAGCATGCAATGGGAGAGGCTTCCATTAAAAATTTGTGGAATGAGATGGTCGAGGAATGACGAGAGTTTGCA
AGATTTAGGAATGGGCTTAAGGGGAATGGAACTAGCATTGTCGTCCACTGCCACTCCTCCTTCTCCCTGGTATCAATTTCAAAACGAACTAGCGATGAGAGATGACGATA
TAAATGCATTGGAGAAGCACGTCGCCCTTGCCTTAAACCAAGCCAACAATGCTTCTTTGCCGTCTGATGATCCAATCAACGCCGTTCAGATCATGCCAACGCATCATAAT
TTCTTCACATTCTGCCTCAAACTCCTCCACAGTAAATGCCAACTCAAACGCCCCACCAAATTAGAACAACAAGTTCACGCAAATTATTGGAGGGAGAGGCTAATAATGGC
AGCGCTGAAACTCGCAGTTTCATTGGGCGCTGCGGTGTTTTTGGGGTTAATGTACAGTGACGAAAACGGGTTCTGGGCGAGCCTTGCAGTGGCCATCAGCTTCACTTCCG
ACAGAGAGCCCACATTCAGAGCCGCCAACGTTAAGGTTCATGGGACAATGTTAGGATCTGTGTATGGAGTTTTGAGTTTTGTCGTTTTCCCAGGTTGTTTGGCCGGAAGG
CTTCTTTGTCTTCTTCCTTGGTTCGTTTTCACCACGTTTCTCCGACACAGTACCATGTACGGTTCCGCCGGCGGAGTTTCCGCGGTTGTCGGAGCTTTGGTTGTTTTGGG
AAGAACAAATTATGGTTCTCCAACAGAATTTGCTTTTGTGAGGATGGTGGAAACTTTTATTGGGCTATCAATTTCCATCGCTGCGGATGTCATTTTTCAGCCAACTAGAG
CTTCTAAACTGGCCAAAATTCAGCTCACTGCCACTCTACGAGCGCTGCAAAACTGCATCCGCCCACTCAGTTTTGGATCAAGTGTTGAAGATTTGAGAGCTTTGGGGATT
CAAGTTTGTGAGCTGAAGAAGCTCATCGACGAGGCTGAGGCGGAGCCCAACTTGTGGTTTTTGCCGTTTCAGAGCAGTTGCTATGGGAAGCTGTTCGATTCATTGTCGAA
AATCGTCGATTTCTTGGCTCTGAGTACAGAGGCAATGGATGGTCTTAAGCAGAATCTATCAATGACAACGGAGGATTCGTGGGGAAAACTTGTGGAAACTTTAGACGGAG
GGCTCGAGAAATTTAAGGAAGTAGTGAATGTGTCGGTGACATGTTATGCGGATGTGAGTTCGTTGAAATCTCTGAGAGTGCTTGAAAAGGAGGGCGAGAAGAATGATATT
TGTGGCGATGTTGAGATGGGAGAGCCACAGAAGATTGGAAAAGATGAAATAATGGAGAAGGAAAATTTGATTGTTTGTTTTCTGCAACACTCGGCAGAGGTTATTGTTGA
TCAGAGAGGTGCTGGTGAAGATGGTAAAAGGGAAGCAATTCTTAGTCTGAGGGCTTTGGCATTTTGCTTAAGCAATTTGATGAGAGAGATAGACGAGATTGGAGAGGCAA
CTAGAGAACTGATTCAATGGGAAAATCCTTGCTCTCATGTTGTCGACTTCAATGAAATATCATCTAAGATT
Protein sequenceShow/hide protein sequence
QSRQGRALWVTRLASAFRTALACSIVACATLYGPASLRHQVAFPAFSYLTAILIVTNATLGDAICGSCLALFATLQTVCPAMLVFWFIGPTKFSPVTTALTVALASVVVV
LQRSTQLLAKRIALGQIVIIYVVGFIGGPHTDPLMHPIHVAATTAMGATASVCATLLPFPRLASLQVRDKRKAVVENAGERLKLLVKAFLTDDDRVAAASISKAQLLSSS
APKLLHSIKHYQESMQWERLPLKICGMRWSRNDESLQDLGMGLRGMELALSSTATPPSPWYQFQNELAMRDDDINALEKHVALALNQANNASLPSDDPINAVQIMPTHHN
FFTFCLKLLHSKCQLKRPTKLEQQVHANYWRERLIMAALKLAVSLGAAVFLGLMYSDENGFWASLAVAISFTSDREPTFRAANVKVHGTMLGSVYGVLSFVVFPGCLAGR
LLCLLPWFVFTTFLRHSTMYGSAGGVSAVVGALVVLGRTNYGSPTEFAFVRMVETFIGLSISIAADVIFQPTRASKLAKIQLTATLRALQNCIRPLSFGSSVEDLRALGI
QVCELKKLIDEAEAEPNLWFLPFQSSCYGKLFDSLSKIVDFLALSTEAMDGLKQNLSMTTEDSWGKLVETLDGGLEKFKEVVNVSVTCYADVSSLKSLRVLEKEGEKNDI
CGDVEMGEPQKIGKDEIMEKENLIVCFLQHSAEVIVDQRGAGEDGKREAILSLRALAFCLSNLMREIDEIGEATRELIQWENPCSHVVDFNEISSKI