; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0067 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0067
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationMC03:979956..984188
RNA-Seq ExpressionMC03g0067
SyntenyMC03g0067
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.51e-20676.17Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDR     LE
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE

Query:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG
        QSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VG
Subjt:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG

Query:  LS-SQRK
        L  +QRK
Subjt:  LS-SQRK

XP_022150972.1 protein CHUP1, chloroplastic [Momordica charantia]1.28e-27598.75Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
        MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ

Query:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
        RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
Subjt:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE

Query:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILEQSVSNVE
        GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDR     LEQSVSNVE
Subjt:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILEQSVSNVE

Query:  KTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
        KTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
Subjt:  KTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata]1.75e-20576.41Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDR     LE
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE

Query:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG
        QSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VG
Subjt:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG

Query:  LS-SQRK
        L  +QRK
Subjt:  LS-SQRK

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima]4.00e-20776.28Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI
        VN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQDR     
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI

Query:  LEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKK
        LEQSVSNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+
Subjt:  LEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKK

Query:  VGLS-SQRK
        VGL  SQRK
Subjt:  VGLS-SQRK

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]2.43e-20876.53Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS IQSLKAHNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI
        VN LI+EVE AAPRDI EVERFV WLD ELGSLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQDR     
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI

Query:  LEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKK
        LEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+
Subjt:  LEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKK

Query:  VGLS-SQRK
        VGL  +QRK
Subjt:  VGLS-SQRK

TrEMBL top hitse value%identityAlignment
A0A0A0LVK7 Uncharacterized protein2.22e-20476.1Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK
        MP+EEDE LAMEI  L+KEL+I++ KS FLEKENQELRQEL RL+SQIQS KA NN+RKS+LWKKF++S+D     A+SPP      A DKRE+TKS PK
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK

Query:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG
        Q  W  VKES RM  G PA  P PPPPPLPTKLL GSKAVRRVPEVLELYR+LTKRDAQKENK AHGG PAVAFTKNMIGEIENRSAYL+AIKSEVETHG
Subjt:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG

Query:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC
        +FVNWLIKEVE  APRDI+EVERFV WLD +L SLVDERAVLK+FPRWPE KADALREAAFSYRDLK LES+VC FRDNPKEEM VVLKRAQALQDR   
Subjt:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC

Query:  TILEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGL
          +EQSVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+K+S+LRLAKEYM RITRELQS + T Q +NL LQG RFAYRVHQYAGGFDS+ I AFEGL
Subjt:  TILEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGL

Query:  KKVGLSSQRK
        KK GLSSQRK
Subjt:  KKVGLSSQRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X16.34e-20476.34Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK
        MP+E+DEELAMEI  L+K+L+I++ KS FLE+ENQELR EL RLKSQIQSLKA NN+RKS+LWKKF++SMD     A+SPP      A DKRE TK  PK
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK

Query:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG
        Q  W  VKESQRM    PA AP PPPPPLP KLL GSKAVRRVPEVL+LYR+LTKRDAQKENK AHGG P VAFTKNMIGEIENRSAYL+AIKSEVETHG
Subjt:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG

Query:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC
        EFVNWLIKEVE  APRDI+E E+FV WLD +L SLVDERAVLKHFPRWPE KADALREAAFSYRDLKSLES+VC FRDNPKEEM VVLKRAQALQDR   
Subjt:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC

Query:  TILEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGL
          +EQSVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+KLS+LRLAKEYM RITREL+S + T QA+NL LQGVRFAYRVHQYAGGFDS+AI AFEGL
Subjt:  TILEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGL

Query:  KKVGLSSQRK
        KK GLSSQRK
Subjt:  KKVGLSSQRK

A0A6J1DC83 protein CHUP1, chloroplastic6.19e-27698.75Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
        MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ

Query:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
        RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
Subjt:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE

Query:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILEQSVSNVE
        GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDR     LEQSVSNVE
Subjt:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILEQSVSNVE

Query:  KTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
        KTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
Subjt:  KTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK

A0A6J1G8X0 protein CHUP1, chloroplastic8.49e-20676.41Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDR     LE
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE

Query:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG
        QSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VG
Subjt:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG

Query:  LS-SQRK
        L  +QRK
Subjt:  LS-SQRK

A0A6J1K8G4 protein CHUP1, chloroplastic1.94e-20776.28Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI
        VN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQDR     
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI

Query:  LEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKK
        LEQSVSNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+
Subjt:  LEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKK

Query:  VGLS-SQRK
        VGL  SQRK
Subjt:  VGLS-SQRK

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic4.4e-6848.67Show/hide
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L ++     +E
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE

Query:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        QSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown3.4e-10049Show/hide
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKR--EATKSSPKQPVWVAVKE
        +P  ED+    ++  L KELQ  + ++D LEKEN ELRQE+ RL++Q+ +LK+H N+RKS+LWKK  +S D  +   ++ +  E+ KS+ K      V+ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKR--EATKSSPKQPVWVAVKE

Query:  SQRMP--EGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLI
            P  +G       PPPPPLP+K   G ++VRR PEV+E YR+LTKR++   NK    G  + AF +NMIGEIENRS YL+ IKS+ + H + ++ LI
Subjt:  SQRMP--EGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLI

Query:  KEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILEQSV
         +VE A   DI+EVE FV W+D EL SLVDERAVLKHFP+WPE K D+LREAA +Y+  K+L +E+ SF+DNPK+ +   L+R Q+LQDR     LE+SV
Subjt:  KEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILEQSV

Query:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSS
        +N EK R+ +  +Y++F+IP EWM ++GL+GQ+K SSLRLA+EYM+RI +EL+S + + +  NL+LQGVRFAY +HQ+AGGFD + ++ F  LKK+    
Subjt:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSS

Query:  QR
         R
Subjt:  QR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein3.1e-6948.67Show/hide
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L ++     +E
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE

Query:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        QSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein3.1e-6948.67Show/hide
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L ++     +E
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE

Query:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        QSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein3.1e-6948.67Show/hide
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L ++     +E
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILE

Query:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        QSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  QSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-7247.09Show/hide
Query:  AESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVA-------FT
        A+ PP    +++    P  P    +++    P  + AP P PPPPP P  L   S  VRRVPEV+E Y SL +RD+    + + GG  A A         
Subjt:  AESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVA-------FT

Query:  KNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCS
        ++MIGEIENRS YL AIK++VET G+F+ +LIKEV  AA  DI +V  FV WLD EL  LVDERAVLKHF  WPE KADALREAAF Y DLK L SE   
Subjt:  KNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCS

Query:  FRDNPKEEMGVVLKRAQALQDRRECTILEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSID-NTQQADNLLLQ
        FR++P++     LK+ QAL ++     LE  V ++ + RE +  K+++F+IP +WM E+G+  Q+KL+S++LA +YM+R++ EL++I+    + + L++Q
Subjt:  FRDNPKEEMGVVLKRAQALQDRRECTILEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSID-NTQQADNLLLQ

Query:  GVRFAYRVHQYAGGFDSDAIAAFEGLK
        GVRFA+RVHQ+AGGFD++ + AFE L+
Subjt:  GVRFAYRVHQYAGGFDSDAIAAFEGLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGT
CGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCT
CCGGCGCCCCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGC
GCAAAAGGAAAACAAGGCCGCTCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAG
AGGTGGAAACACACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAG
CTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAG
CCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAATGTACAATCCTGGAGC
AGAGTGTGAGCAATGTGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAATGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAG
TTAAGCTCATTGAGGCTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCAACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATT
TGCTTACAGGGTTCACCAGTATGCAGGCGGTTTCGATTCAGACGCTATAGCAGCATTTGAAGGACTGAAGAAAGTTGGGCTGAGTAGTCAAAGAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCTTCTCGACAAAGTGATGAGTCAACTTCTGATTCGAGGTGAATGTCTAGTCAGTTGAGATGATGACTTTGCAAATAAAATCTTTGCTTGTTCGAGCTCGTGTT
GAGCCAAAGTTTAGTTAGGTTTCACAGAAACTGCGTATCTTTATGAGGAATGGATCCCGCCTTTATTAGGAGAGTTTTTAGAAGTGTTGTGTTACCCCATATTCAGGCAC
TTCGTACTTGATGTTCAACAAAATTTACGTTTGTGATGTCACTGTTTTCAAGTTAGTGCGAGCTAGGTTTTCTGAGGGTCTGTCGGGGTGACCCCGCTTGGCCCAATGGG
AATAAAATCTTTATGGCACTTTTCTTTTTAGGTCCTTTCAGCGGGACAGAGGGTTCGAAATGTCAAAAATACCCTTCTTCGTGAAAATCTTCTCTCCCATTTTCTAGAGT
CAACATTGCGATAAAAATTTTAGTGAAGGGCATTCCATGGAATATTTTTTCACAGAAACTTTGGCGCATTGTATACCATCCTTTCTCTGCAAGCAAGAGATTGAAAATTG
AAATAAATAAATGAGACCAAAAAATCAAGAAAAAAAACTGTATGATGTTCCTGTCTCATTTATTAGAGAAGGTCGGTGAAAATCAAGAGAAAATATAATATAATGCCGCA
GGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACTCAGACAAG
AATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGTCGCCGCCG
GCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCTCCGGCGCC
CCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGCGCAAAAGG
AAAACAAGGCCGCTCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAGAGGTGGAA
ACACACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAGCTGGGGTC
GCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAGCCTGGAGA
GTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAATGTACAATCCTGGAGCAGAGTGTG
AGCAATGTGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAATGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAGTTAAGCTC
ATTGAGGCTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCAACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACA
GGGTTCACCAGTATGCAGGCGGTTTCGATTCAGACGCTATAGCAGCATTTGAAGGACTGAAGAAAGTTGGGCTGAGTAGTCAAAGAAAATAGACTTCTGGGTTATAGATT
ATAGGCTTTTACTAGAAATCTGTCAGCTGTTGGCAACTTTACTAGAAAACAGCTCTTGTAAGAATCAGCATTGTAGAGCACATTGTAGAAAATGATGTGTACAAAATGCT
TGATCAAGACAATTGAGGGTTCTTCTATACCTAAATCAATCCTATCCTATTGCAACTCTTGCACAATCGCAAGACAAATAGCATAACAGAATAACTATCTTAGCTATAAA
AAAATAAAGTCATTAAATTGATATAACCCTCCAAGAGCAACAATCCAACTTTCATAACAAAAAATATGAAGAACCTAAATTATCAATTGGACAACTATTTTTTAAAAAAA
TGGTACAAGTATCCCCACCCTCAGAAATCAAAAAACCCTGTATTTTAGTTTCCTAAGGGCTCAAGGGTACAAAACTCAATGCAAATGAAAAATCAAGAAATCAAATTCCA
TATCCATGATGACATGATTATGATTCAATCAAATGGTTTGTTCTGAACCAAATGAATCTATGAGAAAAGATCTAGAAGACCTATTCATGGCACCTTCATCTGCTACATTT
CCAAATTGAGCACTTCCAGAAATTTCCCAAGGGTTCATGTTTAAAGAGAATAGGATAGATATAACGCCGCAAGCTTAATAATAACACTACTGCTGCAGAAAATAAGGAAT
ATTGAACAGTATATGAAAATCACTTGATAAAAACGATTAAGCTGATGAAAATCCAGTTCTCAATTGAAGACAACTAAATGTTTCTCGATCACATAGCAATAAATAAAAAA
GCATTTCAAAATGGTAACGCGACAAAAACGAAGATAACATCAAGAAATCTAAACAGTAACAACAAAAACCCTAGGTCGGAGCAATCCATCAGATAATGGCCCCTACTTGG
ATCTCTGCCTCATCTTTCGGCGCTTACGCTTCAACCTCCTCATTCTCTTCTTCTTCCACTGCAAAACAAAACCAAAAATGATCAGAAGCCTAGAATTCACGCATCTGTAG
GTCAAGCATCACTAAAAAGAAGTAAAAGTTGAAAACGAGTTCAATCTATTGATGAATCTTCAAATGAATTACCTTAGCTCTCATGGCGACGGTGTGGAAAAAGTGCGACT
GCTCTCTGTCAAGTGCTGCGGAGGAACGGCTGGAGGAAGAAGGTAGGGCAACGTAGAATCC
Protein sequenceShow/hide protein sequence
MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPA
PAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRE
LGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMK
LSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK