; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0916 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0916
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionglutamic acid-rich protein isoform X1
Genome locationMC10:7886492..7889079
RNA-Seq ExpressionMC10g0916
SyntenyMC10g0916
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031049.1 hypothetical protein SDJN02_05088, partial [Cucurbita argyrosperma subsp. argyrosperma]3.22e-31468.7Show/hide
Query:  MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MDSDRHFR T++STA    SSELFICFTSR SSSS+  MKISSKSILSPGR REP+QISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPS
Subjt:  MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        SPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFRKSE  QVQ Q NG D  FV + SH + HL RQNSN  GGNGFQ      QECLSHRNQRWVHL
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-------MDAPLTVILSNDEDEEETK
         EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEE++EE++    EE S Q EA K       M   +T+I   +E+EEETK
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-------MDAPLTVILSNDEDEEETK

Query:  VELNVKLKNDEEMSEESVSDGEEENY---------LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNN-DEDN
        VELN KLKN+EEM EESVSD EEE           +VLQ+EEE  E   EE++E+AT+NEIDVQKLDITV NH +QEE AE+++E+E E E  +  D+DN
Subjt:  VELNVKLKNDEEMSEESVSDGEEENY---------LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNN-DEDN

Query:  QPEELAEETRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQT-----EEEGENGGNPASPSLSVETERAAEDMEETEADVNWE
        Q ++L EET A    I + C+PE+ QDAEK+ESAE  DE K  HGNE   E E+ EEQ      EE+ ENG NP SP LSVETE           D NWE
Subjt:  QPEELAEETRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQT-----EEEGENGGNPASPSLSVETERAAEDMEETEADVNWE

Query:  EEEEEEEETIHWEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKET
        EEEEEEE      +R ++TEEE           G+ P I +   M      ++SKERETP  E +RE +T       VLPDCLLLMMYEPKLSMEVSKET
Subjt:  EEEEEEEETIHWEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKET

Query:  WVCSTDFIRCVPTREKKAAAA---KKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAKLAPDACFWK
        WVCSTDFIRCVPTREKKAA     KKRE K AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAKG  YEPFVLTRCKSEPMRSSAKLAPD  F K
Subjt:  WVCSTDFIRCVPTREKKAAAA---KKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAKLAPDACFWK

Query:  DRKLEPHRPATFGVGAAGVGF
        DR LEPHRPATFG+GAAG+GF
Subjt:  DRKLEPHRPATFGVGAAGVGF

XP_008456014.1 PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo]1.97e-31568.27Show/hide
Query:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MD DRHFRTTS NST+S     SELFICFTSR SSSS+MKISSKSILSPGR REPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPS
Subjt:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        SPKVTCIGQVRVKTKKQGKKMRARSQKRR+NSEASFR+SE V    Q N  DQQF    SHH+ HL RQNSNS+ GNGFQ      QECLSHRNQRWVHL
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+ +R+  KE KPA +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFKDK
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE
        +E VEEEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE++EE E +D E K R+ E  +     + + +TV    +E+EEE K E
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE

Query:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQP
                 VKL+N+EE++EESVSD +   EE  LVLQ+E   E+ NEEET+E+AT+N+ D QK DITV N  NQE+  E+++E++        D+ NQ 
Subjt:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQP

Query:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDE---------EQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN
            +ET AIP    +HC+PE+AQDAEK+ES E+E ESK  H +E   + E+DE         E+ EEEGENG NP SPSLSVET+     ++ETE +V+
Subjt:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDE---------EQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN

Query:  WEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERETPAAE------AKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIR
         + EEEEEEE    E+ EKAT+E G+ PD  + GA+      ++SKERETP  E       K + ET VLPDCLLLMMYEPKLSMEVSKETWVCS DFIR
Subjt:  WEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERETPAAE------AKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIR

Query:  CVPTREKKAAAA--------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHR
        CVPTREKK            KKRE K  +  Q  VVQPARWSCSFPAAAAAAA IEQKL RAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHR
Subjt:  CVPTREKKAAAA--------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHR

Query:  PATFGVGAAGVGF
        PATFGVGAA VGF
Subjt:  PATFGVGAAGVGF

XP_022142926.1 uncharacterized protein LOC111012918, partial [Momordica charantia]0.099.85Show/hide
Query:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
        MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
Subjt:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC

Query:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
        IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
Subjt:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL

Query:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
        RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
Subjt:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE

Query:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
        SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
Subjt:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES

Query:  VSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAE
        VSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAE
Subjt:  VSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAE

Query:  EEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER
        EEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER
Subjt:  EEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER

Query:  ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA
        ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAK A
Subjt:  ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA

XP_022941617.1 glutamic acid-rich protein-like [Cucurbita moschata]4.09e-31369.51Show/hide
Query:  MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MDSDRHFR T++STA    SSELFICFTSR SSSS+  MKISSKSILSPGR REP+QISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPS
Subjt:  MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        SPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFRKSE  QVQ Q NG D  FV + SH + HL RQNSN  GGNGFQ      QECLSHRNQRWVHL
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE
         EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEE++EE++    EE S Q EA K     +  PLTV L  +E+EEETKVE
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE

Query:  LNVKLKNDEEMSEESVSDGEEENY---LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAED-EQEEEEEHENNNNDEDNQPEELAEE
        LN KLKN+EEM EESVSD EEE     +VLQ+EEE  E   EE++E+AT+NEIDVQKLDITV NH +QEE  ED EQE E+EH     D+DNQ ++L EE
Subjt:  LNVKLKNDEEMSEESVSDGEEENY---LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAED-EQEEEEEHENNNNDEDNQPEELAEE

Query:  TRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIH
        T A    I + C+PE+ QDAEK+ESAE  DE K  HGNE   E E+  ++ EEE   ENG NP SP LSVETE           D NWEEEEE       
Subjt:  TRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIH

Query:  WEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCV
          +R ++TEEE           G+ P I +   M      ++SKERETP  E +RE +T       VLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCV
Subjt:  WEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCV

Query:  PTREKKAAAA---KKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPAT
        PTREKKAA     KKRE K AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAKG  YEPFVLTRCKSEPMRSSAKLAPD  F KDR LEPHRPAT
Subjt:  PTREKKAAAA---KKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPAT

Query:  FGVGAAGVGF
        FG+GAAG+GF
Subjt:  FGVGAAGVGF

XP_038894264.1 glutamic acid-rich protein [Benincasa hispida]0.070.71Show/hide
Query:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSA---MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNP
        MD+DRHFRTTS NST+S     SELFICFTSR SSSS+   MKISSKSILSPGRAREPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNP
Subjt:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSA---MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNP

Query:  EPSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRW
        EPSSPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFR+SE V    Q NG +QQF    SHH+ HL RQNSN++GGNGFQ      QECLSHRNQRW
Subjt:  EPSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRW

Query:  VHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDF
        VHLPFTICEALRAFGAELNCFLPCHSSC+SDR+  KESKPA +SSE+ESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DF
Subjt:  VHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDF

Query:  KDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-MDAPL------TVILSNDEDEE
        KD++E+VE+EE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFC+SPAPKV+EE+ EE+E++D E K++Q E  + +  PL      TV +S +++EE
Subjt:  KDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-MDAPL------TVILSNDEDEE

Query:  ETKVELNVKLKNDEEMSEESVSDGEEEN---YLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQPEE
        E KVEL VKL+NDEE +EESV D E+EN    L LQ+E   EE N E T+E+AT NEID QKLDI V N  NQE+  E+++EE++       D+DNQ   
Subjt:  ETKVELNVKLKNDEEMSEESVSDGEEEN---YLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQPEE

Query:  LAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEE
          +ET AIP      +H +PE AQDAEK+ES EE DESK  H +E   + ED+  + EEE  NG NP SPS SVETE   ++ E TE D NWEEEEEEEE
Subjt:  LAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEE

Query:  ETIHWEDREKATEEEGMRPDIGD-----GGAMEESKERETPAAEAKR--EAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA--
        E    E+ E+   +EG+ PD  +     G   ++SKERETP  E +R  + ET VLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKK A    
Subjt:  ETIHWEDREKATEEEGMRPDIGD-----GGAMEESKERETPAAEAKR--EAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA--

Query:  ------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
              KKRE K A+ TQ AVVQPARWSCSFPAAAAAAA IEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAA VGF
Subjt:  ------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF

TrEMBL top hitse value%identityAlignment
A0A0A0L789 Uncharacterized protein4.64e-31268.5Show/hide
Query:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE
        MDSD HFRTTS NST+S     SELFICFTSR SSSS+  MKISSKSILSPGR REPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPE
Subjt:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE

Query:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV
        PSSPKVTCIGQVRVKTKKQGKKMRARSQKRR+NSEASFR+SE +    Q NG DQQF    SHH+ HL RQNSNS+ GNGFQ      QECLSHRNQRWV
Subjt:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV

Query:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK
        HLPFTICEALRAFGAELNCFLPCHSSC+ +R+  KESKPA +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFK
Subjt:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK

Query:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-MDAPLTVILSNDEDEEETKVE--
        DK+E VEEEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE DEE E +D E K RQ E  + +  P++ I++ +++EEE K E  
Subjt:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-MDAPLTVILSNDEDEEETKVE--

Query:  ------LNVKLKNDEEMSEESVSDGE---EENYLVLQQEE----EHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPE
              L VKL+N+EEM+EE VSD +   EE  LVLQ+EE    E NEEET+E+AT+NEID QK    VN  NQE+  E+++E++        D+ NQ  
Subjt:  ------LNVKLKNDEEMSEESVSDGE---EENYLVLQQEE----EHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPE

Query:  ELAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDE-------EQTEEEGENGGN---PASPSLSVETERAAEDMEETEAD
           +ET AIP      +HC+PE+AQD EK+ES E+E E K  H +E   + E+DE       E+ EEEGENG N     SPSLSVETE  +++ E TE D
Subjt:  ELAEETRAIP------SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDE-------EQTEEEGENGGN---PASPSLSVETERAAEDMEETEAD

Query:  VNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDG---GAMEE--SKERETPAAEA--------KREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTD
        VN EEEEEEEEE          T +EG+ PD  +    G  EE  SKE ETP  E         K + ET VLPDCLLLMMYEPKLSMEVSKETWVCS D
Subjt:  VNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDG---GAMEE--SKERETPAAEA--------KREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTD

Query:  FIRCVPTREKKAAAA--------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLE
        FIRCVPTREKKA           KKRE K  + TQ AVVQPARWSCSFPAAAAAAA IEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDAC WKDRKLE
Subjt:  FIRCVPTREKKAAAA--------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLE

Query:  PHRPATFGVGAAGVGF
        PHRPATFGVGAA VGF
Subjt:  PHRPATFGVGAAGVGF

A0A1S3C2C2 glutamic acid-rich protein isoform X19.54e-31668.27Show/hide
Query:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MD DRHFRTTS NST+S     SELFICFTSR SSSS+MKISSKSILSPGR REPSQISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPS
Subjt:  MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        SPKVTCIGQVRVKTKKQGKKMRARSQKRR+NSEASFR+SE V    Q N  DQQF    SHH+ HL RQNSNS+ GNGFQ      QECLSHRNQRWVHL
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+ +R+  KE KPA +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFKDK
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE
        +E VEEEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE++EE E +D E K R+ E  +     + + +TV    +E+EEE K E
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE

Query:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQP
                 VKL+N+EE++EESVSD +   EE  LVLQ+E   E+ NEEET+E+AT+N+ D QK DITV N  NQE+  E+++E++        D+ NQ 
Subjt:  -------LNVKLKNDEEMSEESVSDGE---EENYLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQP

Query:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDE---------EQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN
            +ET AIP    +HC+PE+AQDAEK+ES E+E ESK  H +E   + E+DE         E+ EEEGENG NP SPSLSVET+     ++ETE +V+
Subjt:  EELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDE---------EQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN

Query:  WEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERETPAAE------AKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIR
         + EEEEEEE    E+ EKAT+E G+ PD  + GA+      ++SKERETP  E       K + ET VLPDCLLLMMYEPKLSMEVSKETWVCS DFIR
Subjt:  WEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERETPAAE------AKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIR

Query:  CVPTREKKAAAA--------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHR
        CVPTREKK            KKRE K  +  Q  VVQPARWSCSFPAAAAAAA IEQKL RAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHR
Subjt:  CVPTREKKAAAA--------KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHR

Query:  PATFGVGAAGVGF
        PATFGVGAA VGF
Subjt:  PATFGVGAAGVGF

A0A6J1CNN3 uncharacterized protein LOC1110129180.099.85Show/hide
Query:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
        MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC
Subjt:  MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTC

Query:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
        IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL
Subjt:  IGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEAL

Query:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
        RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
Subjt:  RAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE

Query:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
        SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES
Subjt:  SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEES

Query:  VSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAE
        VSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAE
Subjt:  VSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAE

Query:  EEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER
        EEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER
Subjt:  EEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER

Query:  ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA
        ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAK A
Subjt:  ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAA

A0A6J1FMZ8 glutamic acid-rich protein-like1.98e-31369.51Show/hide
Query:  MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS
        MDSDRHFR T++STA    SSELFICFTSR SSSS+  MKISSKSILSPGR REP+QISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPS
Subjt:  MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPS

Query:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL
        SPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFRKSE  QVQ Q NG D  FV + SH + HL RQNSN  GGNGFQ      QECLSHRNQRWVHL
Subjt:  SPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHL

Query:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK
        PFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++
Subjt:  PFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK

Query:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE
         EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEE++EE++    EE S Q EA K     +  PLTV L  +E+EEETKVE
Subjt:  SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE

Query:  LNVKLKNDEEMSEESVSDGEEENY---LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAED-EQEEEEEHENNNNDEDNQPEELAEE
        LN KLKN+EEM EESVSD EEE     +VLQ+EEE  E   EE++E+AT+NEIDVQKLDITV NH +QEE  ED EQE E+EH     D+DNQ ++L EE
Subjt:  LNVKLKNDEEMSEESVSDGEEENY---LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAED-EQEEEEEHENNNNDEDNQPEELAEE

Query:  TRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIH
        T A    I + C+PE+ QDAEK+ESAE  DE K  HGNE   E E+  ++ EEE   ENG NP SP LSVETE           D NWEEEEE       
Subjt:  TRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIH

Query:  WEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCV
          +R ++TEEE           G+ P I +   M      ++SKERETP  E +RE +T       VLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCV
Subjt:  WEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCV

Query:  PTREKKAAAA---KKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPAT
        PTREKKAA     KKRE K AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAKG  YEPFVLTRCKSEPMRSSAKLAPD  F KDR LEPHRPAT
Subjt:  PTREKKAAAA---KKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPAT

Query:  FGVGAAGVGF
        FG+GAAG+GF
Subjt:  FGVGAAGVGF

A0A6J1IN35 calponin homology domain-containing protein DDB_G0272472-like isoform X23.85e-31269.54Show/hide
Query:  MDSDRHFRTTSNSTAS----SELFICFTSRLSSSSA----MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE
        MDSDRHFR T++STAS    SELFICFTSR+SSSS+    MKISSKSILSPGR REP+QISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGC F+NPE
Subjt:  MDSDRHFRTTSNSTAS----SELFICFTSRLSSSSA----MKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPE

Query:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV
        PSSPKVTCIGQVRVKTKKQGKKMRARS KRRSNSEASFRKSE  QVQ Q NG D  F+ + SHH+ HL RQNSN  GGNGFQ      QECLSHRNQRWV
Subjt:  PSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWV

Query:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK
        HLPFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EKDNGSQRRHVFEG+DFK
Subjt:  HLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK

Query:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNV
        ++ EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEEE+E+DE+   + ++++     +  PL+V L  +E EEETKVELN 
Subjt:  DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNV

Query:  KLKNDEEMSEESVSDGEEENY--LVLQQE--EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA---
        KLKN+EEM EESVSD EEE    LVLQ+E  EE N EE++E+AT+NEIDVQKLDITV NH +QEE AE+++E+E     +  D+DNQ ++L EET A   
Subjt:  KLKNDEEMSEESVSDGEEENY--LVLQQE--EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA---

Query:  -IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREK
         I + C+PE+ QDAEK+ESAE  DE K  HGNE   E E+  ++ EEE   ENG NP SP LSVET            D NWEEEEEEEEE      R +
Subjt:  -IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREK

Query:  ATEE----------EGMRPDIGDGGAM-----EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA
        +TEE          EG+ P I +   M     ++SKERETP  E +RE +T       VLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA
Subjt:  ATEE----------EGMRPDIGDGGAM-----EESKERETPAAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA

Query:  ---AAKKREAKAA-ENTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG-YEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVG
           A KKRE K A ++TQ  V +QP RWSCSFPAAAAAAA IEQKL RAKG YEPFVLTRCKSEPMRSSAKLAPD  F KDR LEPHRPATFG+GAAGVG
Subjt:  ---AAKKREAKAA-ENTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG-YEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVG

Query:  F
        F
Subjt:  F

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78110.1 unknown protein1.2e-0526.15Show/hide
Query:  RTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASP-MFPTGAKKRGCAFDN-PEPSSPKVTCIGQVR
        R +S+S  S++L +CF SR    + + ++ K I SP R  + S         +  K SG   GG  SP ++   A  +    D   EP+SPKVTC GQ++
Subjt:  RTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASP-MFPTGAKKRGCAFDN-PEPSSPKVTCIGQVR

Query:  VKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGA
        V+  K G +                           G + Q V ++      + R + N S    F ++                 + F  C  LR    
Subjt:  VKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGA

Query:  ELNCFLPC-HSSCTSDRDKE------SKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEE
        +  CF    H+  TSD D+E       +      E E +  TVF++W + +Q              +E+   + D  + +          D+   +E+ E
Subjt:  ELNCFLPC-HSSCTSDRDKE------SKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEE

Query:  EESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSE
         E      +PP NALLLMRCRS P K + L +R       KV+ E+++ +EQK+++E   Q+ + K        L      EE K+EL V ++ D E   
Subjt:  EESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSE

Query:  ESVSDGEEENYLV
         S SD  +E ++V
Subjt:  ESVSDGEEENYLV

AT3G15095.1 unknown protein2.8e-9539.27Show/hide
Query:  TTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQA----SPMFPT--GAKKRGCAFDNP--------EPS
        + +NS +S++LFICFTSR SSSS+M++SSKSI SP R+       L+TSLSRRL+TSGSLK   A    SPMF    G K+ G  ++N         EPS
Subjt:  TTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQA----SPMFPT--GAKKRGCAFDNP--------EPS

Query:  SPKVTCIGQVRVKTKKQ-GKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLP
        SPKVTCIGQVRVKT+K   KKMRARS  RR   E SFR+S  V     GG  +F A ++                                   R VHLP
Subjt:  SPKVTCIGQVRVKTKKQ-GKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLP

Query:  FTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGV
         TICE+LR+FG+ELNCF PC SSCT +   + + A  +++         +SCG VF RW VAV++   G+ REIELVVG E+   E    S+RRHVFEG+
Subjt:  FTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGV

Query:  DFKD---KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEET
        D  +   K+E  E  EE  R+SIC PPKNALLLMRCRSDPVKVA LA R                          R+++ +  D    V    +EDE   
Subjt:  DFKD---KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEET

Query:  KVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIP
        + EL ++ K   ++ E+ +S                          +  ++ +++ + V        AE E E E    +N   E+ +  ++ E++    
Subjt:  KVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIP

Query:  SHCDPELAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEE
             E  Q+A K+ +S EEE E+  +       +IED+     EE E         L+   E A   + ETE     E EE +E         E+ +E+
Subjt:  SHCDPELAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEE

Query:  EGMRPDIGDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKR
            PD      M  S + ET   E        VLPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+P R                    +K+   A   
Subjt:  EGMRPDIGDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKR

Query:  EAKAAENT---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
         A +   +    P  +QP R SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLEPH PAT GVG AGVGF
Subjt:  EAKAAENT---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF

AT3G15095.2 unknown protein2.2e-6836.39Show/hide
Query:  MRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCH
        MRARS  RR   E SFR+S  V     GG  +F A ++                                   R VHLP TICE+LR+FG+ELNCF PC 
Subjt:  MRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCH

Query:  SSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKD---KSEVVEEEEEESRI
        SSCT +   + + A  +++         +SCG VF RW VAV++   G+ REIELVVG E+   E    S+RRHVFEG+D  +   K+E  E  EE  R+
Subjt:  SSCTSDRDKESKPAAKSSE-------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKD---KSEVVEEEEEESRI

Query:  SICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSD
        SIC PPKNALLLMRCRSDPVKVA LA R                          R+++ +  D    V    +EDE   + EL ++ K   ++ E+ +S 
Subjt:  SICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSD

Query:  GEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKV-ESAEEE
                                 +  ++ +++ + V        AE E E E    +N   E+ +  ++ E++         E  Q+A K+ +S EEE
Subjt:  GEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKV-ESAEEE

Query:  DESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKERET
         E+  +       +IED+     EE E         L+   E A   + ETE     E EE +E         E+ +E+    PD      M  S + ET
Subjt:  DESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKERET

Query:  PAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAKAAENT---QPAVVQPARW
           E        VLPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+P R                    +K+   A    A +   +    P  +QP R 
Subjt:  PAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAKAAENT---QPAVVQPARW

Query:  SCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
        SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLEPH PAT GVG AGVGF
Subjt:  SCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF

AT3G15095.3 unknown protein2.0e-4035.51Show/hide
Query:  KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVK
        K+E  E  EE  R+SIC PPKNALLLMRCRSDPVKVA LA R                          R+++ +  D    V    +EDE   + EL ++
Subjt:  KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVK

Query:  LKNDEEMSEESVSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPEL
         K   ++ E+ +S                          +  ++ +++ + V        AE E E E    +N   E+ +  ++ E++         E 
Subjt:  LKNDEEMSEESVSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPEL

Query:  AQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDI
         Q+A K+ +S EEE E+  +       +IED+     EE E         L+   E A   + ETE     E EE +E         E+ +E+    PD 
Subjt:  AQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDI

Query:  GDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAKAAEN
             M  S + ET   E        VLPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+P R                    +K+   A    A +   
Subjt:  GDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTR--------------------EKKAAAAKKREAKAAEN

Query:  T---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
        +    P  +QP R SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLEPH PAT GVG AGVGF
Subjt:  T---QPAVVQPARWSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTGACCGCCATTTTCGTACCACCAGCAACAGCACTGCCTCCAGCGAGCTTTTCATCTGCTTCACTTCTCGCTTGTCTTCTTCTTCCGCCATGAAGATCTCTTC
CAAATCCATTCTCAGCCCCGGCCGCGCCAGAGAACCCTCCCAAATCTCCCTTTCTACTTCTCTCAGCCGCCGCCTCAAGACCAGCGGCAGCCTCAAGGGCGGCCAGGCCT
CGCCGATGTTTCCCACCGGCGCCAAGAAGCGCGGCTGCGCGTTTGATAATCCCGAGCCCTCGTCGCCCAAGGTTACCTGCATTGGACAGGTTAGGGTTAAGACGAAGAAG
CAGGGGAAGAAGATGAGGGCCAGATCGCAGAAGCGGAGGAGTAATTCCGAGGCGAGTTTTCGGAAATCGGAGCAGGTTCAACCGCAGACCAATGGCGGTGACCAGCAATT
CGTCGCGAAACAATCGCATCACCATCTTCATCTTCATCGTCAGAATAGTAATAGCAGTGGGGGAAACGGTTTCCAGATTCAGAACTCGCAGCAGCAGGAGTGCTTGTCGC
ATCGGAACCAGCGATGGGTGCATTTGCCGTTCACGATTTGCGAGGCGCTTAGGGCTTTTGGAGCAGAGCTCAACTGCTTCTTGCCGTGCCATTCCTCGTGTACCAGCGAC
AGGGATAAGGAATCAAAGCCTGCCGCGAAGTCGTCGGAGAGCGAGAGTTCTTGCGGGACGGTGTTCGCGCGGTGGTTGGTGGCAGTGCAGGACGGCGACGGCCAGGGAAG
GGAGATCGAGCTGGTGGTCGGAGACGAAGAAACTCGAATGGAGAAGGATAACGGGAGCCAGAGGCGGCATGTCTTCGAGGGAGTAGATTTCAAAGACAAGAGCGAAGTTG
TGGAGGAGGAAGAAGAAGAATCCAGGATCAGCATTTGCATTCCTCCGAAGAACGCCTTGTTGCTAATGAGGTGCAGATCTGATCCAGTGAAAGTGGCAGAGCTGGCCAAA
CGATTCTGCGAATCTCCTGCGCCAAAAGTCGAAGAAGAAGAAGACGAGGAAGATGAACAAAAAGACAAGGAAGAGAAATCCAGACAAAAGGAAGCAGCAAAAATGGACGC
GCCTCTCACTGTGATACTGAGTAACGATGAAGACGAAGAAGAAACGAAAGTTGAACTAAATGTCAAGCTTAAAAACGATGAAGAAATGAGTGAAGAATCTGTTTCTGATG
GCGAAGAAGAAAATTATTTAGTTTTGCAGCAAGAAGAAGAACATAACGAGGAGGAAACCCTAGAAATAGCCACAGATAACGAAATCGATGTACAGAAATTAGACATTACT
GTAAATCACCACAATCAAGAAGAACCAGCAGAGGACGAACAAGAAGAAGAAGAAGAACACGAGAACAACAACAACGACGAAGATAATCAGCCAGAGGAATTAGCAGAAGA
AACAAGGGCGATTCCATCCCACTGTGATCCGGAGCTGGCTCAAGATGCAGAGAAAGTAGAATCAGCAGAAGAAGAAGACGAATCCAAATTTCTCCATGGAAACGAATCCA
TCAACGAAATAGAAGACGACGAAGAACAGACTGAAGAAGAAGGCGAAAATGGCGGAAATCCCGCATCGCCTTCGTTATCAGTTGAGACAGAAAGAGCAGCGGAAGACATG
GAAGAAACTGAAGCTGATGTAAATTGGGAAGAAGAAGAAGAAGAAGAAGAAGAGACGATTCATTGGGAAGACAGAGAGAAAGCGACAGAGGAAGAAGGAATGAGACCCGA
CATCGGAGACGGCGGCGCAATGGAGGAGTCAAAGGAGCGAGAAACTCCGGCGGCGGAAGCGAAGAGAGAAGCAGAAACCGGCGTGCTCCCAGATTGCTTGCTTCTGATGA
TGTACGAACCGAAGCTATCAATGGAGGTTTCGAAGGAGACGTGGGTTTGCAGCACGGATTTCATCAGGTGCGTTCCGACGAGGGAGAAGAAGGCGGCGGCGGCAAAGAAG
CGGGAGGCGAAGGCGGCGGAGAACACGCAGCCGGCGGTGGTGCAGCCGGCGAGGTGGTCGTGTTCGTTTCCTGCGGCGGCGGCTGCGGCGGCGGCGATAGAGCAGAAGCT
AGTGAGGGCGAAGGGGTACGAGCCGTTTGTGCTGACTCGGTGTAAGTCGGAGCCGATGAGGTCGTCGGCTAAGCTGGCGCCGGACGCTTGCTTTTGGAAGGACAGGAAGC
TCGAGCCGCACCGCCCGGCTACGTTCGGCGTCGGCGCGGCTGGAGTTGGATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTGACCGCCATTTTCGTACCACCAGCAACAGCACTGCCTCCAGCGAGCTTTTCATCTGCTTCACTTCTCGCTTGTCTTCTTCTTCCGCCATGAAGATCTCTTC
CAAATCCATTCTCAGCCCCGGCCGCGCCAGAGAACCCTCCCAAATCTCCCTTTCTACTTCTCTCAGCCGCCGCCTCAAGACCAGCGGCAGCCTCAAGGGCGGCCAGGCCT
CGCCGATGTTTCCCACCGGCGCCAAGAAGCGCGGCTGCGCGTTTGATAATCCCGAGCCCTCGTCGCCCAAGGTTACCTGCATTGGACAGGTTAGGGTTAAGACGAAGAAG
CAGGGGAAGAAGATGAGGGCCAGATCGCAGAAGCGGAGGAGTAATTCCGAGGCGAGTTTTCGGAAATCGGAGCAGGTTCAACCGCAGACCAATGGCGGTGACCAGCAATT
CGTCGCGAAACAATCGCATCACCATCTTCATCTTCATCGTCAGAATAGTAATAGCAGTGGGGGAAACGGTTTCCAGATTCAGAACTCGCAGCAGCAGGAGTGCTTGTCGC
ATCGGAACCAGCGATGGGTGCATTTGCCGTTCACGATTTGCGAGGCGCTTAGGGCTTTTGGAGCAGAGCTCAACTGCTTCTTGCCGTGCCATTCCTCGTGTACCAGCGAC
AGGGATAAGGAATCAAAGCCTGCCGCGAAGTCGTCGGAGAGCGAGAGTTCTTGCGGGACGGTGTTCGCGCGGTGGTTGGTGGCAGTGCAGGACGGCGACGGCCAGGGAAG
GGAGATCGAGCTGGTGGTCGGAGACGAAGAAACTCGAATGGAGAAGGATAACGGGAGCCAGAGGCGGCATGTCTTCGAGGGAGTAGATTTCAAAGACAAGAGCGAAGTTG
TGGAGGAGGAAGAAGAAGAATCCAGGATCAGCATTTGCATTCCTCCGAAGAACGCCTTGTTGCTAATGAGGTGCAGATCTGATCCAGTGAAAGTGGCAGAGCTGGCCAAA
CGATTCTGCGAATCTCCTGCGCCAAAAGTCGAAGAAGAAGAAGACGAGGAAGATGAACAAAAAGACAAGGAAGAGAAATCCAGACAAAAGGAAGCAGCAAAAATGGACGC
GCCTCTCACTGTGATACTGAGTAACGATGAAGACGAAGAAGAAACGAAAGTTGAACTAAATGTCAAGCTTAAAAACGATGAAGAAATGAGTGAAGAATCTGTTTCTGATG
GCGAAGAAGAAAATTATTTAGTTTTGCAGCAAGAAGAAGAACATAACGAGGAGGAAACCCTAGAAATAGCCACAGATAACGAAATCGATGTACAGAAATTAGACATTACT
GTAAATCACCACAATCAAGAAGAACCAGCAGAGGACGAACAAGAAGAAGAAGAAGAACACGAGAACAACAACAACGACGAAGATAATCAGCCAGAGGAATTAGCAGAAGA
AACAAGGGCGATTCCATCCCACTGTGATCCGGAGCTGGCTCAAGATGCAGAGAAAGTAGAATCAGCAGAAGAAGAAGACGAATCCAAATTTCTCCATGGAAACGAATCCA
TCAACGAAATAGAAGACGACGAAGAACAGACTGAAGAAGAAGGCGAAAATGGCGGAAATCCCGCATCGCCTTCGTTATCAGTTGAGACAGAAAGAGCAGCGGAAGACATG
GAAGAAACTGAAGCTGATGTAAATTGGGAAGAAGAAGAAGAAGAAGAAGAAGAGACGATTCATTGGGAAGACAGAGAGAAAGCGACAGAGGAAGAAGGAATGAGACCCGA
CATCGGAGACGGCGGCGCAATGGAGGAGTCAAAGGAGCGAGAAACTCCGGCGGCGGAAGCGAAGAGAGAAGCAGAAACCGGCGTGCTCCCAGATTGCTTGCTTCTGATGA
TGTACGAACCGAAGCTATCAATGGAGGTTTCGAAGGAGACGTGGGTTTGCAGCACGGATTTCATCAGGTGCGTTCCGACGAGGGAGAAGAAGGCGGCGGCGGCAAAGAAG
CGGGAGGCGAAGGCGGCGGAGAACACGCAGCCGGCGGTGGTGCAGCCGGCGAGGTGGTCGTGTTCGTTTCCTGCGGCGGCGGCTGCGGCGGCGGCGATAGAGCAGAAGCT
AGTGAGGGCGAAGGGGTACGAGCCGTTTGTGCTGACTCGGTGTAAGTCGGAGCCGATGAGGTCGTCGGCTAAGCTGGCGCCGGACGCTTGCTTTTGGAAGGACAGGAAGC
TCGAGCCGCACCGCCCGGCTACGTTCGGCGTCGGCGCGGCTGGAGTTGGATTTTGACAATTTAACCCCCCAGGTAAAAAAAAAAAAATAGCGAAATGTAAAAATAGCGTG
GTAGTAATTTGTATTTTGGTTAGAAAATTTGTAAGTGTAGCTACAATTGTGTTTTTCCATCTCTATCTCTTTTTCTTTTTGGTCTCAAATTTATTATGATAGAAAATTGA
ATGAAACAAATTTATATGATCGATAATATTTTGTCAATTTATTAAAGACTGAATTAAGAAAAATTTACGACGTGAAAATAGGTCAAGCATCTATTTTCAACCAAATAATT
CTGGAATCCCTAAAACCAAAAAAATTGTGAAATCCCTAAATGTTATCGATTTTATTAG
Protein sequenceShow/hide protein sequence
MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKK
QGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSD
RDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAK
RFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDIT
VNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDM
EETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKK
REAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF