CuGenDBv2

Moc03g01290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g01290
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein CHUP1, chloroplastic
Genome location: chr3:995858..998259
RNA-Seq Expression: Moc03g01290
Synteny: Moc03g01290
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process)
                     GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like
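
The genome location above is written as `seqid:start..end`. As a minimal sketch (not part of the database record), the string can be split into its parts and the span length computed. The helper name `parse_location` is hypothetical, and the length assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper for illustration; the field format is taken
    from the 'Genome location' line of this record.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("chr3:995858..998259")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end - start + 1
print(seqid, start, end, span)
```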


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.7e-165 | % identity: 77.11
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDRLEQSVSN
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN

Query:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-SQ
        VE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VGL  +Q
Subjt:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-SQ

Query:  RK
        RK
Subjt:  RK

XP_022150972.1 protein CHUP1, chloroplastic [Momordica charantia] | e value: 1.4e-217 | % identity: 100
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
        MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ

Query:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
        RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
Subjt:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE

Query:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREF
        GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREF
Subjt:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREF

Query:  SCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
        SCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
Subjt:  SCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata] | e value: 2.4e-164 | % identity: 77.36
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDRLEQSVSN
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN

Query:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-SQ
        VE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VGL  +Q
Subjt:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-SQ

Query:  RK
        RK
Subjt:  RK

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima] | e value: 1.3e-165 | % identity: 77.23
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSV
        VN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQDRLEQSV
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSV

Query:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG-LS
        SNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VG L 
Subjt:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG-LS

Query:  SQRK
        SQRK
Subjt:  SQRK

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.5e-166 | % identity: 77.48
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS IQSLKAHNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSV
        VN LI+EVE AAPRDI EVERFV WLD ELGSLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQDRLEQSV
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSV

Query:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-
        SNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VGL  
Subjt:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-

Query:  SQRK
        +QRK
Subjt:  SQRK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LVK7 Uncharacterized protein | e value: 1.3e-163 | % identity: 77.04
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK
        MP+EEDE LAMEI  L+KEL+I++ KS FLEKENQELRQEL RL+SQIQS KA NN+RKS+LWKKF++S+D     A+SPP      A DKRE+TK SPK
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK

Query:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG
        Q  W  VKES RM  G PA  P PPPPPLPTKLL GSKAVRRVPEVLELYR+LTKRDAQKENK AHGG PAVAFTKNMIGEIENRSAYL+AIKSEVETHG
Subjt:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG

Query:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQ
        +FVNWLIKEVE  APRDI+EVERFV WLD +L SLVDERAVLK+FPRWPE KADALREAAFSYRDLK LES+VC FRDNPKEEM VVLKRAQALQDR+EQ
Subjt:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQ

Query:  SVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGL
        SVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+K+S+LRLAKEYM RITRELQS + T Q +NL LQG RFAYRVHQYAGGFDS+ I AFEGLKK GL
Subjt:  SVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGL

Query:  SSQRK
        SSQRK
Subjt:  SSQRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X1 | e value: 2.8e-163 | % identity: 77.28
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK
        MP+E+DEELAMEI  L+K+L+I++ KS FLE+ENQELR EL RLKSQIQSLKA NN+RKS+LWKKF++SMD     A+SPP      A DKRE TK  PK
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK

Query:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG
        Q  W  VKESQRM    PA AP PPPPPLP KLL GSKAVRRVPEVL+LYR+LTKRDAQKENK AHGG P VAFTKNMIGEIENRSAYL+AIKSEVETHG
Subjt:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG

Query:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQ
        EFVNWLIKEVE  APRDI+E E+FV WLD +L SLVDERAVLKHFPRWPE KADALREAAFSYRDLKSLES+VC FRDNPKEEM VVLKRAQALQDR+EQ
Subjt:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQ

Query:  SVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGL
        SVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+KLS+LRLAKEYM RITREL+S + T QA+NL LQGVRFAYRVHQYAGGFDS+AI AFEGLKK GL
Subjt:  SVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGL

Query:  SSQRK
        SSQRK
Subjt:  SSQRK

A0A6J1DC83 protein CHUP1, chloroplastic | e value: 6.9e-218 | % identity: 100
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
        MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ

Query:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
        RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
Subjt:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE

Query:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREF
        GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREF
Subjt:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREF

Query:  SCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
        SCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
Subjt:  SCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK

A0A6J1G8X0 protein CHUP1, chloroplastic | e value: 1.2e-164 | % identity: 77.36
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDRLEQSVSN
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN

Query:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-SQ
        VE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VGL  +Q
Subjt:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-SQ

Query:  RK
        RK
Subjt:  RK

A0A6J1K8G4 protein CHUP1, chloroplastic | e value: 6.1e-166 | % identity: 77.23
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSV
        VN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQDRLEQSV
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSV

Query:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG-LS
        SNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VG L 
Subjt:  SNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVG-LS

Query:  SQRK
        SQRK
Subjt:  SQRK

SwissProt top hits (columns: e value, % identity, alignment)
Q9LI74 Protein CHUP1, chloroplastic | e value: 7.9e-70 | % identity: 49.49
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +++EQSV  
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN

Query:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G07120.1 FUNCTIONS IN: molecular_function unknown | e value: 6.2e-102 | % identity: 49.62
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKR--EATKSSPKQPVWVAVKE
        +P  ED+    ++  L KELQ  + ++D LEKEN ELRQE+ RL++Q+ +LK+H N+RKS+LWKK  +S D  +   ++ +  E+ KS+ K      V+ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKR--EATKSSPKQPVWVAVKE

Query:  SQRMP--EGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLI
            P  +G       PPPPPLP+K   G ++VRR PEV+E YR+LTKR++   NK    G  + AF +NMIGEIENRS YL+ IKS+ + H + ++ LI
Subjt:  SQRMP--EGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLI

Query:  KEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEK
         +VE A   DI+EVE FV W+D EL SLVDERAVLKHFP+WPE K D+LREAA +Y+  K+L +E+ SF+DNPK+ +   L+R Q+LQDRLE+SV+N EK
Subjt:  KEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEK

Query:  TREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQR
         R+ +  +Y++F+IP EWM ++GL+GQ+K SSLRLA+EYM+RI +EL+S + + +  NL+LQGVRFAY +HQ+AGGFD + ++ F  LKK+     R
Subjt:  TREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | e value: 5.6e-71 | % identity: 49.49
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +++EQSV  
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN

Query:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | e value: 5.6e-71 | % identity: 49.49
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +++EQSV  
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN

Query:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein | e value: 5.6e-71 | % identity: 49.49
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +++EQSV  
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN

Query:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK
        + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGFDSDAIAAFEGLK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 3.2e-74 | % identity: 47.83
Query:  AESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVA-------FT
        A+ PP    +++    P  P    +++    P  + AP P PPPPP P  L   S  VRRVPEV+E Y SL +RD+    + + GG  A A         
Subjt:  AESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVA-------FT

Query:  KNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCS
        ++MIGEIENRS YL AIK++VET G+F+ +LIKEV  AA  DI +V  FV WLD EL  LVDERAVLKHF  WPE KADALREAAF Y DLK L SE   
Subjt:  KNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCS

Query:  FRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSID-NTQQADNLLLQGVRFA
        FR++P++     LK+ QAL ++LE  V ++ + RE +  K+++F+IP +WM E+G+  Q+KL+S++LA +YM+R++ EL++I+    + + L++QGVRFA
Subjt:  FRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSID-NTQQADNLLLQGVRFA

Query:  YRVHQYAGGFDSDAIAAFEGLK
        +RVHQ+AGGFD++ + AFE L+
Subjt:  YRVHQYAGGFDSDAIAAFEGLK


Sequences
CDS sequence
ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGT
CGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCT
CCGGCGCCCCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGC
GCAAAAGGAAAACAAGGCCGCTCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAG
AGGTGGAAACACACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAG
CTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAG
CCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCTGGAGCAGAGTGTGAGCAATG
TGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAATGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAGTTAAGCTCATTGAGG
CTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCAACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACAGGGTTCA
CCAGTATGCAGGCGGTTTCGATTCAGACGCTATAGCAGCATTTGAAGGACTGAAGAAAGTTGGGCTGAGTAGTCAAAGAAAATAG
mRNA sequence
ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGT
CGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCT
CCGGCGCCCCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGC
GCAAAAGGAAAACAAGGCCGCTCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAG
AGGTGGAAACACACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAG
CTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAG
CCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCTGGAGCAGAGTGTGAGCAATG
TGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAATGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAGTTAAGCTCATTGAGG
CTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCAACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACAGGGTTCA
CCAGTATGCAGGCGGTTTCGATTCAGACGCTATAGCAGCATTTGAAGGACTGAAGAAAGTTGGGCTGAGTAGTCAAAGAAAATAG
Protein sequence
MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPA
PAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRE
LGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLR
LAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK
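
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only and is not taken from the database. It assumes the standard (NCBI table 1) code applies to this nuclear gene, the function name `translate` is hypothetical, and only the first 30 and last 15 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Hypothetical helper for illustration; any trailing partial codon
    is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the CDS sequence listed in this record.
cds_start = "ATGCCGCAGGAAGAAGATGAAGAATTGGCC"
cds_end = "AGTCAAAGAAAATAG"

print(translate(cds_start))  # start of the protein: MPQEEDEELA
print(translate(cds_end))    # end of the protein: SQRK
```

Running the same function over the full CDS and comparing the result with the listed protein sequence would extend this check to the whole record.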