; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002748 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002748
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionglutamic acid-rich protein isoform X1
Genome locationscaffold66:245737..247998
RNA-Seq ExpressionMS002748
SyntenyMS002748
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031049.1 hypothetical protein SDJN02_05088, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-25068.67Show/hide
Query:  MDSDRHFRTTSNST----ASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MDSDRHFR T++ST    ASSELFICFTSR   SSSS+MKISSKSILSPGR REP+QISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPS
Subjt:  MDSDRHFRTTSNST----ASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPF
        SPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFRKSE VQ Q+      FV + SH + HL RQNSN  GGNGF      QQECLSHRNQRWVHLPF
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPF

Query:  TICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSE
        TICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++ E
Subjt:  TICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSE

Query:  VVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-------MDAPLTVILSNDEDEEETKVE
        VV  +EEESRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+E    EEDE+E+ EE S Q EA K       M   +T+I   +E+EEETKVE
Subjt:  VVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-------MDAPLTVILSNDEDEEETKVE

Query:  LNVKLKNDEEMSEESVSDGEEEN---------YLVLQQEEEEHNE--EETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNN-DEDNQP
        LN KLKN+EEM EESVSD EEE           +VLQ+EEEE  +  EE++E+AT+NEIDVQKLDITV NH +QEE AE+++E+E E E  +  D+DNQ 
Subjt:  LNVKLKNDEEMSEESVSDGEEEN---------YLVLQQEEEEHNE--EETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNN-DEDNQP

Query:  EELAEETRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQ-----TEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEE
        ++L EET A    I + C+PE+ QDAEK+ESA E DE K  HGNE   E E+ EEQ      EE+ ENG NP SP LSV          ETE D NWEEE
Subjt:  EELAEETRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQ-----TEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEE

Query:  EEEEEEEETIHWEDREKATE----EEGMRPDIGDGGAM------EESKERETPAAEAKREAET------GLLPDCLLLMMYEPKLSMEVSKETWVCSTDF
        EEEEE       E+  K T     +EG+ P I +   M      ++SKERETP  E +RE +T       +LPDCLLLMMYEPKLSMEVSKETWVCSTDF
Subjt:  EEEEEEEETIHWEDREKATE----EEGMRPDIGDGGAM------EESKERETPAAEAKREAET------GLLPDCLLLMMYEPKLSMEVSKETWVCSTDF

Query:  IRCVPTREKKAA---AAKKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAK--GYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPH
        IRCVPTREKKAA     KKRE K AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAK  GYEPFVLTRCKSEPMRSSAKLAPD  F KDR LEPH
Subjt:  IRCVPTREKKAA---AAKKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAK--GYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPH

Query:  RPATFGVGAAGVGF
        RPATFG+GAAG+GF
Subjt:  RPATFGVGAAGVGF

XP_004146243.2 glutamic acid-rich protein [Cucumis sativus]1.2e-24968.34Show/hide
Query:  MDSDRHFRTT------SNSTASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE
        MDSD HFRTT      S +T SSELFICFTSR   SSSS+MKISSKSILSPGR REPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPE
Subjt:  MDSDRHFRTT------SNSTASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE

Query:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV
        PSSPKVTCIGQVRVKTKKQGKKMRARSQKRR+NSEASFR+SE +    Q NG DQQF    SHH+ HL RQNSNS+ GNGF      QQECLSHRNQRWV
Subjt:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV

Query:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK
        HLPFTICEALRAFGAELNCFLPCHSSC+ +R+  KESKPA +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFK
Subjt:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK

Query:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-MDAPLTVILSNDEDEEETKVE--
        DK+E V  EEEESRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV +EEDEE E ED E K RQ E  + +  P++ I++ +++EEE K E  
Subjt:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-MDAPLTVILSNDEDEEETKVE--

Query:  ------LNVKLKNDEEMSEESVSDGE---EENYLVLQQ---EEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPE
              L VKL+N+EEM+EE VSD +   EE  LVLQ+   EEEE NEEET+E+AT+NEID QK    VN  NQE+  E+++E++        D+ NQ  
Subjt:  ------LNVKLKNDEEMSEESVSDGE---EENYLVLQQ---EEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPE

Query:  ELAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD-------EEQTEEEGENGGN---PASPSLSVETERAAEDMEETEAD
           +ET AIP      +HC+PE+AQD EK+ES E+E E K  H +E   + E+D       EE+ EEEGENG N     SPSLSVETE    D  ETE D
Subjt:  ELAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD-------EEQTEEEGENGGN---PASPSLSVETERAAEDMEETEAD

Query:  VNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDG---GAMEE--SKERET--------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCS
        VN EEEEEEEEE+           T +EG+ PD  +    G  EE  SKE ET        P  E K + ET +LPDCLLLMMYEPKLSMEVSKETWVCS
Subjt:  VNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDG---GAMEE--SKERET--------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCS

Query:  TDFIRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRK
         DFIRCVPTREKKA           KKRE K  + TQ AVVQPARWSCSFPAAAAAAA IEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDAC WKDRK
Subjt:  TDFIRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRK

Query:  LEPHRPATFGVGAAGVGF
        LEPHRPATFGVGAA VGF
Subjt:  LEPHRPATFGVGAAGVGF

XP_008456014.1 PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo]5.3e-25368.1Show/hide
Query:  MDSDRHFRTT------SNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MD DRHFRTT      S +T SSELFICFTSR SSSS+MKISSKSILSPGR REPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPS
Subjt:  MDSDRHFRTT------SNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        SPKVTCIGQVRVKTKKQGKKMRARSQKRR+NSEASFR+SE V    Q N  DQQF    SHH+ HL RQNSNS+ GNGF      QQECLSHRNQRWVHL
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+ +R+  KE KPA +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFKDK
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE
        +E V  EEEESRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE++EE E ED E K R+ E  +     + + +TV    +E+EEE K E
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE

Query:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE--EEEHNEEETLEIATDNEIDVQKLDIT-VNHHNQEEPAEDEQEEEEEHENNNNDEDNQP
                 VKL+N+EE++EESVSD +   EE  LVLQ+E  EE+ NEEET+E+AT+N+ D QK DIT VN  NQE+  E+++E++        D+ NQ 
Subjt:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE--EEEHNEEETLEIATDNEIDVQKLDIT-VNHHNQEEPAEDEQEEEEEHENNNNDEDNQP

Query:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD---------EEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN
            +ET AIP    +HC+PE+AQDAEK+ES E+E ESK  H +E   + E+D         EE+ EEEGENG NP SPSLSVET+     ++ETE +V+
Subjt:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD---------EEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN

Query:  WEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERET------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDF
         + EEEEEEEEE       EKAT +EG+ PD  + GA+      ++SKERET      P  E K + ET +LPDCLLLMMYEPKLSMEVSKETWVCS DF
Subjt:  WEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERET------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDF

Query:  IRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEP
        IRCVPTREKK            KKRE K  +  Q  VVQPARWSCSFPAAAAAAA IEQKL RAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEP
Subjt:  IRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEP

Query:  HRPATFGVGAAGVGF
        HRPATFGVGAA VGF
Subjt:  HRPATFGVGAAGVGF

XP_022142926.1 uncharacterized protein LOC111012918, partial [Momordica charantia]0.0e+0099.1Show/hide
Query:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
        MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
Subjt:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC

Query:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
        IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
Subjt:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL

Query:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
        RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
Subjt:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE

Query:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
        SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQ+DKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
Subjt:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES

Query:  VSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESA
        VSDGEEENYLVLQQ EEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESA
Subjt:  VSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESA

Query:  EEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEES
        EEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNW  EEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEES
Subjt:  EEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEES

Query:  KERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA
        KERETPAAEAKREAETG+LPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAK A
Subjt:  KERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA

XP_038894264.1 glutamic acid-rich protein [Benincasa hispida]3.9e-26471.04Show/hide
Query:  MDSDRHFRTTS-NSTA-----SSELFICFTSRL---SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNP
        MD+DRHFRTTS NST+     SSELFICFTSR    SSSS+MKISSKSILSPGRAREPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNP
Subjt:  MDSDRHFRTTS-NSTA-----SSELFICFTSRL---SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNP

Query:  EPSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRW
        EPSSPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFR+SE V    Q NG +QQF    SHH+ HL RQNSN++GGNGF      QQECLSHRNQRW
Subjt:  EPSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRW

Query:  VHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDF
        VHLPFTICEALRAFGAELNCFLPCHSSC+SDR+  KESKPA +SSE+ESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DF
Subjt:  VHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDF

Query:  KDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-MDAPL------TVILSNDEDEE
        KD++E+V  E+EESRISICIPPKNALLLMRCRSDPVK+AELAKRFC+SPAPKV+EE+ EE+E+ED E K++Q E  + +  PL      TV +S +++EE
Subjt:  KDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-MDAPL------TVILSNDEDEE

Query:  ETKVELNVKLKNDEEMSEESVSDGEEEN-----YLVLQQEEEEHNEEETLEIATDNEIDVQKLD-ITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEE
        E KVEL VKL+NDEE +EESV D E+EN     +L  ++EEEE N E T+E+AT NEID QKLD I +N  NQE+  E+++EE++       D+DNQ   
Subjt:  ETKVELNVKLKNDEEMSEESVSDGEEEN-----YLVLQQEEEEHNEEETLEIATDNEIDVQKLD-ITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEE

Query:  LAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEE
          +ET AIP      +H +PE AQDAEK+ES  EEDESK  H +E   + ED+  + EEE  NG NP SPS SVETE    D  ETE D NWEEEEEEEE
Subjt:  LAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEE

Query:  EEETIHWEDREKATEEEGMRPDIGD-----GGAMEESKERETPAAEAKR--EAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAA-
        EEE    E+ EKAT +EG+ PD  +     G   ++SKERETP  E +R  + ET +LPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKK A  
Subjt:  EEETIHWEDREKATEEEGMRPDIGD-----GGAMEESKERETPAAEAKR--EAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAA-

Query:  -------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVG
                KKRE K A+ TQ AVVQPARWSCSFPAAAAAAA IEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAA VG
Subjt:  -------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVG

Query:  F
        F
Subjt:  F

TrEMBL top hitse value%identityAlignment
A0A0A0L789 Uncharacterized protein5.9e-25068.34Show/hide
Query:  MDSDRHFRTT------SNSTASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE
        MDSD HFRTT      S +T SSELFICFTSR   SSSS+MKISSKSILSPGR REPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPE
Subjt:  MDSDRHFRTT------SNSTASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE

Query:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV
        PSSPKVTCIGQVRVKTKKQGKKMRARSQKRR+NSEASFR+SE +    Q NG DQQF    SHH+ HL RQNSNS+ GNGF      QQECLSHRNQRWV
Subjt:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV

Query:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK
        HLPFTICEALRAFGAELNCFLPCHSSC+ +R+  KESKPA +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFK
Subjt:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK

Query:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-MDAPLTVILSNDEDEEETKVE--
        DK+E V  EEEESRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV +EEDEE E ED E K RQ E  + +  P++ I++ +++EEE K E  
Subjt:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-MDAPLTVILSNDEDEEETKVE--

Query:  ------LNVKLKNDEEMSEESVSDGE---EENYLVLQQ---EEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPE
              L VKL+N+EEM+EE VSD +   EE  LVLQ+   EEEE NEEET+E+AT+NEID QK    VN  NQE+  E+++E++        D+ NQ  
Subjt:  ------LNVKLKNDEEMSEESVSDGE---EENYLVLQQ---EEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPE

Query:  ELAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD-------EEQTEEEGENGGN---PASPSLSVETERAAEDMEETEAD
           +ET AIP      +HC+PE+AQD EK+ES E+E E K  H +E   + E+D       EE+ EEEGENG N     SPSLSVETE    D  ETE D
Subjt:  ELAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD-------EEQTEEEGENGGN---PASPSLSVETERAAEDMEETEAD

Query:  VNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDG---GAMEE--SKERET--------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCS
        VN EEEEEEEEE+           T +EG+ PD  +    G  EE  SKE ET        P  E K + ET +LPDCLLLMMYEPKLSMEVSKETWVCS
Subjt:  VNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDG---GAMEE--SKERET--------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCS

Query:  TDFIRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRK
         DFIRCVPTREKKA           KKRE K  + TQ AVVQPARWSCSFPAAAAAAA IEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDAC WKDRK
Subjt:  TDFIRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRK

Query:  LEPHRPATFGVGAAGVGF
        LEPHRPATFGVGAA VGF
Subjt:  LEPHRPATFGVGAAGVGF

A0A1S3C2C2 glutamic acid-rich protein isoform X12.5e-25368.1Show/hide
Query:  MDSDRHFRTT------SNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MD DRHFRTT      S +T SSELFICFTSR SSSS+MKISSKSILSPGR REPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPS
Subjt:  MDSDRHFRTT------SNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        SPKVTCIGQVRVKTKKQGKKMRARSQKRR+NSEASFR+SE V    Q N  DQQF    SHH+ HL RQNSNS+ GNGF      QQECLSHRNQRWVHL
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+ +R+  KE KPA +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFKDK
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE
        +E V  EEEESRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE++EE E ED E K R+ E  +     + + +TV    +E+EEE K E
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE

Query:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE--EEEHNEEETLEIATDNEIDVQKLDIT-VNHHNQEEPAEDEQEEEEEHENNNNDEDNQP
                 VKL+N+EE++EESVSD +   EE  LVLQ+E  EE+ NEEET+E+AT+N+ D QK DIT VN  NQE+  E+++E++        D+ NQ 
Subjt:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE--EEEHNEEETLEIATDNEIDVQKLDIT-VNHHNQEEPAEDEQEEEEEHENNNNDEDNQP

Query:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD---------EEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN
            +ET AIP    +HC+PE+AQDAEK+ES E+E ESK  H +E   + E+D         EE+ EEEGENG NP SPSLSVET+     ++ETE +V+
Subjt:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDD---------EEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN

Query:  WEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERET------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDF
         + EEEEEEEEE       EKAT +EG+ PD  + GA+      ++SKERET      P  E K + ET +LPDCLLLMMYEPKLSMEVSKETWVCS DF
Subjt:  WEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERET------PAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDF

Query:  IRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEP
        IRCVPTREKK            KKRE K  +  Q  VVQPARWSCSFPAAAAAAA IEQKL RAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEP
Subjt:  IRCVPTREKKAAA--------AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEP

Query:  HRPATFGVGAAGVGF
        HRPATFGVGAA VGF
Subjt:  HRPATFGVGAAGVGF

A0A6J1CNN3 uncharacterized protein LOC1110129180.0e+0099.1Show/hide
Query:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
        MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
Subjt:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC

Query:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
        IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
Subjt:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL

Query:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
        RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
Subjt:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE

Query:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
        SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQ+DKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
Subjt:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES

Query:  VSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESA
        VSDGEEENYLVLQQ EEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESA
Subjt:  VSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESA

Query:  EEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEES
        EEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNW  EEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEES
Subjt:  EEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEES

Query:  KERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA
        KERETPAAEAKREAETG+LPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAK A
Subjt:  KERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA

A0A6J1FMZ8 glutamic acid-rich protein-like2.9e-24968.85Show/hide
Query:  MDSDRHFRTTSNST----ASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MDSDRHFR T++ST    ASSELFICFTSR   SSSS+MKISSKSILSPGR REP+QISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPS
Subjt:  MDSDRHFRTTSNST----ASSELFICFTSRL--SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPF
        SPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFRKSE VQ Q+      FV + SH + HL RQNSN  GGNGF      QQECLSHRNQRWVHLPF
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPF

Query:  TICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSE
        TICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++ E
Subjt:  TICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSE

Query:  VVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQK-EAAKMDAPLTVILSNDEDEEETKVELNVKLK
        VV  +EEESRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEE++EE+++E+  +   QK     +  PLTV L  +E+EEETKVELN KLK
Subjt:  VVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQK-EAAKMDAPLTVILSNDEDEEETKVELNVKLK

Query:  NDEEMSEESVSDGEEEN---YLVLQQEEEEH--NEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA----I
        N+EEM EESVSD EEE     +VLQ+EEEE   N EE++E+AT+NEIDVQKLDITV +H  +E AE+++E+E E E +  D+DNQ ++L EET A    I
Subjt:  NDEEMSEESVSDGEEEN---YLVLQQEEEEH--NEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA----I

Query:  PSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREK
         + C+PE+ QDAEK+ESA E DE K  HGNE   E E+  ++ EEE   ENG NP SP LSV          ETE D NWEEEEE          +    
Subjt:  PSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREK

Query:  ATEEEGMRPDIGDGGAM------EESKERETPAAEAKREAET------GLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA---AAKKRE
           +EG+ P I +   M      ++SKERETP  E +RE +T       +LPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA     KKRE
Subjt:  ATEEEGMRPDIGDGGAM------EESKERETPAAEAKREAET------GLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA---AAKKRE

Query:  AKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAK--GYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
         K AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAK  GYEPFVLTRCKSEPMRSSAKLAPD  F KDR LEPHRPATFG+GAAG+GF
Subjt:  AKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAK--GYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF

A0A6J1IN35 calponin homology domain-containing protein DDB_G0272472-like isoform X22.9e-24969.4Show/hide
Query:  MDSDRHFRTTSNST----ASSELFICFTSRL----SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE
        MDSDRHFR T++ST    ASSELFICFTSR+    SSSS+MKISSKSILSPGR REP+QISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGC F+NPE
Subjt:  MDSDRHFRTTSNST----ASSELFICFTSRL----SSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE

Query:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        PSSPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFRKSE VQ Q+      F+ + SHH+ HL RQNSN  GGNGF      QQECLSHRNQRWVHL
Subjt:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EKDNGSQRRHVFEG+DFK++
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKL
         EVV  +EEESRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEEE+E+DE+   + ++++     +  PL+V L   E+EEETKVELN KL
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKL

Query:  KNDEEMSEESVSDGEEEN--YLVLQQE-EEEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA----I
        KN+EEM EESVSD EEE    LVLQ+E EEE N EE++E+AT+NEIDVQKLDITV NH +QEE AE+++E+E     +  D+DNQ ++L EET A    I
Subjt:  KNDEEMSEESVSDGEEEN--YLVLQQE-EEEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA----I

Query:  PSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREK
         + C+PE+ QDAEK+ESA E DE K  HGNE   E E+  ++ EEE   ENG NP SP LSV          ET  D NWEEEEEEEEE      E+ + 
Subjt:  PSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREK

Query:  ATE---EEGMRPDIGDGGAM-----EESKERETPAAEAKREAET------GLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA---AAKK
         T    +EG+ P I +   M     ++SKERETP  E +RE +T       +LPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA   A KK
Subjt:  ATE---EEGMRPDIGDGGAM-----EESKERETPAAEAKREAET------GLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA---AAKK

Query:  REAK-AAENTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAK-GYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
        RE K A ++TQ  V +QP RWSCSFPAAAAAAA IEQKL RAK GYEPFVLTRCKSEPMRSSAKLAPD  F KDR LEPHRPATFG+GAAGVGF
Subjt:  REAK-AAENTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAK-GYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78110.1 unknown protein6.0e-0526.02Show/hide
Query:  RTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASP-MFPTGAKKRGCAFDN-PEPSSPKVTCIGQVR
        R +S+S  S++L +CF SR    + + ++ K I SP R  + S         +  K SG   GG  SP ++   A  +    D   EP+SPKVTC GQ++
Subjt:  RTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASP-MFPTGAKKRGCAFDN-PEPSSPKVTCIGQVR

Query:  VKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGA
        V+  K G +                           G + Q V ++      + R + N S    F ++                 + F  C  LR    
Subjt:  VKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGA

Query:  ELNCFLPC-HSSCTSDRDKE------SKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEE
        +  CF    H+  TSD D+E       +      E E +  TVF++W + +Q              +E+   + D  + +          D+   +E+ E
Subjt:  ELNCFLPC-HSSCTSDRDKE------SKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEE

Query:  EESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEE--KSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEM
         E      +PP NALLLMRCRS P K + L +R       +  EE+ EE E ED+E   K+++K+   +              EE K+EL V ++ D E 
Subjt:  EESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEE--KSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEM

Query:  SEESVSDGEEENYLV
           S SD  +E ++V
Subjt:  SEESVSDGEEENYLV

AT3G15095.1 unknown protein5.3e-9439.2Show/hide
Query:  TTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQA----SPMFPT--GAKKRGCAFDNP--------EPS
        + +NS +S++LFICFTSR SSSS+M++SSKSI SP R+       L+TSLSRRL+TSGSLK   A    SPMF    G K+ G  ++N         EPS
Subjt:  TTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQA----SPMFPT--GAKKRGCAFDNP--------EPS

Query:  SPKVTCIGQVRVKTKKQ-GKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLP
        SPKVTCIGQVRVKT+K   KKMRARS  RR   E SFR+S  V     GG  +F A ++                                   R VHLP
Subjt:  SPKVTCIGQVRVKTKKQ-GKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLP

Query:  FTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGV
         TICE+LR+FG+ELNCF PC SSCT +   + + A  +++         +SCG VF RW VAV++   G+ REIELVVG E+   E    S+RRHVFEG+
Subjt:  FTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGV

Query:  DFKD---KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEET
        D  +   K+E  E  EE  R+SIC PPKNALLLMRCRSDPVKVA LA R                          R+++ +  D    V    +EDE   
Subjt:  DFKD---KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEET

Query:  KVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAI
        + EL ++ K   ++ E+ +S            E     EE ++ +A                      AE E E E    +N   E+ +  ++ E++   
Subjt:  KVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAI

Query:  PSHCDPELAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKA
              E  Q+A K+ +S EEE E+  +             ++ E+E  N         ++E E    +MEE       E EE EE +E       + + 
Subjt:  PSHCDPELAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKA

Query:  TEEEGMR-PDIGDGGAMEESKERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAA
          E+G R PD      M  S + ET   E        +LPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+P R                    +K+   
Subjt:  TEEEGMR-PDIGDGGAMEESKERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAA

Query:  AKKREAKAAENT---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVG
        A    A +   +    P  +QP R SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLEPH PAT GVG AGVG
Subjt:  AKKREAKAAENT---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVG

Query:  F
        F
Subjt:  F

AT3G15095.2 unknown protein4.2e-6736.32Show/hide
Query:  MRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCH
        MRARS  RR   E SFR+S  V     GG  +F A ++                                   R VHLP TICE+LR+FG+ELNCF PC 
Subjt:  MRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCH

Query:  SSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKD---KSEVVEEEEEESRI
        SSCT +   + + A  +++         +SCG VF RW VAV++   G+ REIELVVG E+   E    S+RRHVFEG+D  +   K+E  E  EE  R+
Subjt:  SSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKD---KSEVVEEEEEESRI

Query:  SICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSD
        SIC PPKNALLLMRCRSDPVKVA LA R                          R+++ +  D    V    +EDE   + EL ++ K   ++ E+ +S 
Subjt:  SICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSD

Query:  GEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKV-ESAEE
                   E     EE ++ +A                      AE E E E    +N   E+ +  ++ E++         E  Q+A K+ +S EE
Subjt:  GEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKV-ESAEE

Query:  EDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMR-PDIGDGGAMEESK
        E E+  +             ++ E+E  N         ++E E    +MEE       E EE EE +E       + +   E+G R PD      M  S 
Subjt:  EDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMR-PDIGDGGAMEESK

Query:  ERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAKAAENT---QPAVVQ
        + ET   E        +LPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+P R                    +K+   A    A +   +    P  +Q
Subjt:  ERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAKAAENT---QPAVVQ

Query:  PARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
        P R SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLEPH PAT GVG AGVGF
Subjt:  PARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF

AT3G15095.3 unknown protein3.7e-3935.43Show/hide
Query:  KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVK
        K+E  E  EE  R+SIC PPKNALLLMRCRSDPVKVA LA R                          R+++ +  D    V    +EDE   + EL ++
Subjt:  KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVK

Query:  LKNDEEMSEESVSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPE
         K   ++ E+ +S            E     EE ++ +A                      AE E E E    +N   E+ +  ++ E++         E
Subjt:  LKNDEEMSEESVSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPE

Query:  LAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMR
          Q+A K+ +S EEE E+  +             ++ E+E  N         ++E E    +MEE       E EE EE +E       + +   E+G R
Subjt:  LAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMR

Query:  -PDIGDGGAMEESKERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAK
         PD      M  S + ET   E        +LPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+P R                    +K+   A    A 
Subjt:  -PDIGDGGAMEESKERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAK

Query:  AAENT---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
        +   +    P  +QP R SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLEPH PAT GVG AGVGF
Subjt:  AAENT---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTGACCGCCATTTTCGTACCACCAGCAACAGCACTGCCTCCAGCGAGCTTTTCATCTGCTTCACTTCTCGCTTGTCTTCTTCTTCCGCCATGAAGATCTCTTC
CAAATCCATTCTCAGCCCCGGCCGCGCCAGAGAACCCTCCCAAATCTCCCTTTCTACTTCTCTCAGCCGCCGCCTCAAGACCAGCGGCAGCCTCAAGGGCGGCCAGGCCT
CGCCGATGTTTCCCACCGGCGCCAAGAAGCGCGGCTGCGCGTTTGATAATCCCGAGCCCTCGTCGCCCAAGGTTACCTGCATTGGACAGGTTAGGGTTAAGACGAAGAAG
CAGGGGAAGAAGATGAGGGCCAGATCGCAGAAGCGGAGGAGTAATTCCGAGGCGAGTTTTCGGAAATCGGAGCAGGTTCAACCGCAGACCAATGGCGGTGACCAGCAATT
CGTCGCGAAACAATCGCATCACCATCTTCATCTTCATCGTCAGAATAGTAATAGCAGTGGGGGAAACGGTTTCCAGATTCAGAACTCGCAGCAGCAGGAGTGCTTGTCGC
ATCGGAACCAGCGATGGGTGCATTTGCCGTTCACGATTTGCGAGGCGCTTAGGGCTTTTGGAGCAGAGCTCAACTGCTTCTTGCCGTGCCATTCCTCGTGTACCAGCGAC
AGGGATAAGGAATCAAAGCCTGCCGCGAAGTCGTCGGAGAGCGAGAGTTCTTGCGGGACGGTGTTCGCGCGGTGGTTGGTGGCAGTGCAGGACGGCGACGGCCAGGGAAG
GGAGATCGAGCTGGTGGTCGGAGACGAAGAAACTCGAATGGAGAAGGATAACGGGAGCCAGAGGCGGCATGTCTTCGAGGGAGTAGATTTCAAAGACAAGAGCGAAGTTG
TGGAGGAGGAAGAAGAAGAATCCAGGATCAGCATTTGCATTCCTCCGAAGAACGCCTTGTTGCTAATGAGGTGCAGATCTGATCCAGTGAAAGTGGCAGAGCTGGCCAAA
CGATTCTGCGAATCTCCTGCGCCAAAAGTCGAAGAAGAAGAAGACGAGGAAGATGAACAAGAAGACAAGGAAGAGAAATCCAGACAAAAGGAAGCAGCAAAAATGGACGC
GCCTCTCACTGTGATACTGAGTAACGATGAAGACGAAGAAGAAACGAAAGTTGAACTAAATGTCAAGCTTAAAAACGATGAAGAAATGAGTGAAGAATCTGTTTCTGATG
GCGAAGAAGAAAATTATTTAGTTTTGCAGCAAGAAGAAGAAGAACATAACGAGGAGGAAACCCTAGAAATAGCCACAGATAACGAAATCGATGTACAGAAATTAGACATT
ACTGTAAATCACCACAATCAAGAAGAACCAGCAGAGGACGAACAAGAAGAAGAAGAAGAACACGAGAACAACAACAACGACGAAGATAATCAGCCAGAGGAATTAGCAGA
AGAAACAAGGGCGATTCCATCCCACTGTGATCCGGAGCTGGCTCAAGATGCAGAGAAAGTAGAATCAGCAGAAGAAGAAGACGAATCCAAATTTCTCCATGGAAACGAAT
CCATCAACGAAATAGAAGACGACGAAGAACAGACTGAAGAAGAAGGCGAAAATGGCGGAAATCCCGCATCGCCTTCGTTATCAGTTGAGACAGAAAGAGCAGCGGAAGAC
ATGGAAGAAACTGAAGCTGATGTAAATTGGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGACGATTCATTGGGAAGACAGAGAGAAAGCGACAGAGGAAGAAGGAAT
GAGACCCGACATCGGAGACGGCGGCGCAATGGAGGAGTCAAAGGAGCGAGAAACTCCGGCGGCGGAAGCGAAGAGAGAAGCAGAAACCGGCCTGCTCCCAGATTGCTTGC
TTCTGATGATGTACGAACCGAAGCTATCAATGGAGGTTTCGAAGGAGACGTGGGTTTGCAGCACGGATTTCATCAGGTGCGTTCCGACGAGGGAGAAGAAGGCGGCGGCG
GCAAAGAAGCGGGAGGCGAAGGCGGCGGAGAACACGCAGCCGGCGGTGGTGCAGCCGGCGAGGTGGTCGTGTTCGTTTCCTGCGGCGGCGGCTGCGGCGGCGGCGATAGA
GCAGAAGCTAGTGAGGGCGAAGGGGTACGAGCCGTTTGTGCTGACTCGGTGTAAGTCGGAGCCGATGAGGTCGTCGGCTAAGCTGGCGCCGGACGCTTGCTTTTGGAAGG
ACAGGAAGCTCGAGCCGCACCGCCCGGCTACGTTCGGCGTCGGCGCGGCTGGAGTTGGATTT
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTGACCGCCATTTTCGTACCACCAGCAACAGCACTGCCTCCAGCGAGCTTTTCATCTGCTTCACTTCTCGCTTGTCTTCTTCTTCCGCCATGAAGATCTCTTC
CAAATCCATTCTCAGCCCCGGCCGCGCCAGAGAACCCTCCCAAATCTCCCTTTCTACTTCTCTCAGCCGCCGCCTCAAGACCAGCGGCAGCCTCAAGGGCGGCCAGGCCT
CGCCGATGTTTCCCACCGGCGCCAAGAAGCGCGGCTGCGCGTTTGATAATCCCGAGCCCTCGTCGCCCAAGGTTACCTGCATTGGACAGGTTAGGGTTAAGACGAAGAAG
CAGGGGAAGAAGATGAGGGCCAGATCGCAGAAGCGGAGGAGTAATTCCGAGGCGAGTTTTCGGAAATCGGAGCAGGTTCAACCGCAGACCAATGGCGGTGACCAGCAATT
CGTCGCGAAACAATCGCATCACCATCTTCATCTTCATCGTCAGAATAGTAATAGCAGTGGGGGAAACGGTTTCCAGATTCAGAACTCGCAGCAGCAGGAGTGCTTGTCGC
ATCGGAACCAGCGATGGGTGCATTTGCCGTTCACGATTTGCGAGGCGCTTAGGGCTTTTGGAGCAGAGCTCAACTGCTTCTTGCCGTGCCATTCCTCGTGTACCAGCGAC
AGGGATAAGGAATCAAAGCCTGCCGCGAAGTCGTCGGAGAGCGAGAGTTCTTGCGGGACGGTGTTCGCGCGGTGGTTGGTGGCAGTGCAGGACGGCGACGGCCAGGGAAG
GGAGATCGAGCTGGTGGTCGGAGACGAAGAAACTCGAATGGAGAAGGATAACGGGAGCCAGAGGCGGCATGTCTTCGAGGGAGTAGATTTCAAAGACAAGAGCGAAGTTG
TGGAGGAGGAAGAAGAAGAATCCAGGATCAGCATTTGCATTCCTCCGAAGAACGCCTTGTTGCTAATGAGGTGCAGATCTGATCCAGTGAAAGTGGCAGAGCTGGCCAAA
CGATTCTGCGAATCTCCTGCGCCAAAAGTCGAAGAAGAAGAAGACGAGGAAGATGAACAAGAAGACAAGGAAGAGAAATCCAGACAAAAGGAAGCAGCAAAAATGGACGC
GCCTCTCACTGTGATACTGAGTAACGATGAAGACGAAGAAGAAACGAAAGTTGAACTAAATGTCAAGCTTAAAAACGATGAAGAAATGAGTGAAGAATCTGTTTCTGATG
GCGAAGAAGAAAATTATTTAGTTTTGCAGCAAGAAGAAGAAGAACATAACGAGGAGGAAACCCTAGAAATAGCCACAGATAACGAAATCGATGTACAGAAATTAGACATT
ACTGTAAATCACCACAATCAAGAAGAACCAGCAGAGGACGAACAAGAAGAAGAAGAAGAACACGAGAACAACAACAACGACGAAGATAATCAGCCAGAGGAATTAGCAGA
AGAAACAAGGGCGATTCCATCCCACTGTGATCCGGAGCTGGCTCAAGATGCAGAGAAAGTAGAATCAGCAGAAGAAGAAGACGAATCCAAATTTCTCCATGGAAACGAAT
CCATCAACGAAATAGAAGACGACGAAGAACAGACTGAAGAAGAAGGCGAAAATGGCGGAAATCCCGCATCGCCTTCGTTATCAGTTGAGACAGAAAGAGCAGCGGAAGAC
ATGGAAGAAACTGAAGCTGATGTAAATTGGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGACGATTCATTGGGAAGACAGAGAGAAAGCGACAGAGGAAGAAGGAAT
GAGACCCGACATCGGAGACGGCGGCGCAATGGAGGAGTCAAAGGAGCGAGAAACTCCGGCGGCGGAAGCGAAGAGAGAAGCAGAAACCGGCCTGCTCCCAGATTGCTTGC
TTCTGATGATGTACGAACCGAAGCTATCAATGGAGGTTTCGAAGGAGACGTGGGTTTGCAGCACGGATTTCATCAGGTGCGTTCCGACGAGGGAGAAGAAGGCGGCGGCG
GCAAAGAAGCGGGAGGCGAAGGCGGCGGAGAACACGCAGCCGGCGGTGGTGCAGCCGGCGAGGTGGTCGTGTTCGTTTCCTGCGGCGGCGGCTGCGGCGGCGGCGATAGA
GCAGAAGCTAGTGAGGGCGAAGGGGTACGAGCCGTTTGTGCTGACTCGGTGTAAGTCGGAGCCGATGAGGTCGTCGGCTAAGCTGGCGCCGGACGCTTGCTTTTGGAAGG
ACAGGAAGCTCGAGCCGCACCGCCCGGCTACGTTCGGCGTCGGCGCGGCTGGAGTTGGATTT
Protein sequenceShow/hide protein sequence
MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKK
QGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSD
RDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAK
RFCESPAPKVEEEEDEEDEQEDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEEHNEEETLEIATDNEIDVQKLDI
TVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAED
MEETEADVNWEEEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKERETPAAEAKREAETGLLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAA
AKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF