CuGenDBv2

Lag0023372 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0023372
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: protein CHUP1, chloroplastic
Genome location: chr7:47685574..47688365
RNA-Seq Expression: Lag0023372
Synteny: Lag0023372
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process)
                     GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like
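
The genome location in the record is written as chromosome:start..end. The following is a minimal sketch of how such a string could be split into its parts and the length of the locus computed. It assumes the coordinates are 1-based and inclusive, which the page does not state; the names `Location` and `parse_location` are hypothetical and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Number of bases covered under the inclusive-coordinate assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'chr7:47685574..47688365'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr7:47685574..47688365")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the locus spans 2792 bases.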


Homology
GenBank top hits    e value    % identity    Alignment
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]    1.8e-183    84.41
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MP EEDEELAMEI + LK++L+I+LQKSNFLEKENQEL+QEL R KS + SLK HNNDRKSILWKKFHNS+D  VAG DS PQSP   A+DKW+ TR+QK
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QS+W VVKENQRM AAAP P PPPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVN LIR+VEA APRDI EVERFVKWLDGELA LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE EVC FR+NPKEE N +LKRAQALQDRLEQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-
        VSNVERTREFNCKKYN FQIPCQWM DSGLP QMKLSSLRL KE MRRIT+E+Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK+VGL  
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-

Query:  SQRK
        +QRK
Subjt:  SQRK

XP_011658693.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]    1.1e-183    83.62
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MPKEEDE LAMEI N LKK+L+I+LQKS FLEKENQELRQEL RL+SQI S KA NN+RKSILWKKFH+SID+SVAGADSPP SPA  A DK + T+S K
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRMAAAPA-PVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QSSW  VKE+ RM   PA P PPPPPPLPTKLL GSKAVRRVPEVLELYR+LTKRDAQKENK AHGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+
Subjt:  QSSWGVVKENQRMAAAPA-PVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVNWLI++VE IAPRDI+EVERFVKWLDG+LA LVDERAVLK+FPRWPE KADALREAAFSYRDLKGLES+VC FRDNPKEEMNVVLKRAQALQDR+EQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSS
        VSN+ERTREFNC+KY  FQIPCQWMFDS LP Q+K+S+LRLAKEYM RITRELQS ETPQ ENLFLQG RFAYRVHQYAGGFDSE I AFEG+KK GLSS
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSS

Query:  QRK
        QRK
Subjt:  QRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata]    1.4e-183    84.9
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MP EEDEELAMEI + LK++L+I+LQKS FLEKENQEL+QEL R KS IHSLKAHNNDRKSILWKKFHNS+D  VAG DS PQSP   A+DKW+ TR+QK
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QS+W VVKENQRM AAAP P PPPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVN LIR+VEA APRDI EVERFVKWLDGELA LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE EVC FR+NPKEE N +LKRAQALQDRLEQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-
        VSNVERTREFNCKKYN FQIPCQWM DSGLP QMKLSSLRL KE MRRIT+E Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK+VGL  
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-

Query:  SQRK
        +QRK
Subjt:  SQRK

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima]    4.3e-185    84.65
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MP EEDEELAMEI + LK++L+I+LQKSNFLEKENQEL+QEL R KS + SLK HNNDRKSILWKKFHNS+DV+VAG DS PQSP   A+DKW+ TR+QK
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QS+W VVKENQRM AAAP P PPPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVN LIR+VEA APRDI EVERFVKWLDGELA LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EVC FR+NPKEE N +LKRAQALQDRLEQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVG-LS
        VSNVERTREFNC KYN FQIPCQWM DSGLP QMKLSSLRL KE MRRIT+ELQ NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK+VG L 
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVG-LS

Query:  SQRK
        SQRK
Subjt:  SQRK

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]    3.9e-186    84.9
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MP EEDEELAMEI + LK++L+I+LQKSNFLEKENQEL+QEL R KS I SLKAHNNDRKSILWKKFHNS+DV+VAG DS PQSP   A+DKW+ TR+QK
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QS+W VVKENQRM AAAP P PPPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVN LIR+VEA APRDI EVERFVKWLDGEL  LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EVC FR+NPKEE N +LKRAQALQDRLEQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-
        VSNVERTREFNCKKYN FQIPCQWM DSGLP QMKLSSLRL KE MRRIT+E+Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK+VGL  
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-

Query:  SQRK
        +QRK
Subjt:  SQRK

TrEMBL top hits    e value    % identity    Alignment
A0A0A0LVK7 Uncharacterized protein    5.1e-184    83.62
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MPKEEDE LAMEI N LKK+L+I+LQKS FLEKENQELRQEL RL+SQI S KA NN+RKSILWKKFH+SID+SVAGADSPP SPA  A DK + T+S K
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRMAAAPA-PVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QSSW  VKE+ RM   PA P PPPPPPLPTKLL GSKAVRRVPEVLELYR+LTKRDAQKENK AHGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+
Subjt:  QSSWGVVKENQRMAAAPA-PVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVNWLI++VE IAPRDI+EVERFVKWLDG+LA LVDERAVLK+FPRWPE KADALREAAFSYRDLKGLES+VC FRDNPKEEMNVVLKRAQALQDR+EQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSS
        VSN+ERTREFNC+KY  FQIPCQWMFDS LP Q+K+S+LRLAKEYM RITRELQS ETPQ ENLFLQG RFAYRVHQYAGGFDSE I AFEG+KK GLSS
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSS

Query:  QRK
        QRK
Subjt:  QRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X1    3.8e-179    82.38
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MPKE+DEELAMEI + LKK L+I+LQKS FLE+ENQELR EL RLKSQI SLKA NN+RKSILWKKFH+S+D++VAGADSPP +PA AA DK + T+  K
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRMAAAPAPV-PPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QSSW  VKE+QRM A PA   PPPPPPLP KLL GSKAVRRVPEVL+LYR+LTKRDAQKENK AHGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWGVVKENQRMAAAPAPV-PPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVNWLI++VE IAPRDI+E E+FVKWLD +LA LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+VC FRDNPKEEMNVVLKRAQALQDR+EQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSS
        VSN+ERTREFNCKKY  FQIPCQWMFDS LP Q+KLS+LRLAKEYM RITREL+S ET QAENLFLQGVRFAYRVHQYAGGFDSEAI AFEG+KK GLSS
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSS

Query:  QRK
        QRK
Subjt:  QRK

A0A6J1DC83 protein CHUP1, chloroplastic    6.2e-174    81.77
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQ-
        MP+EEDEELAMEI  +L+K+L IA+ KS+FLEKENQELRQELGRLKSQI SLKAHNNDRKS+LWKKF+NS+D     A+SPP      A+DK + T+S  
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQ-

Query:  KQSSWGVVKENQRM-AAAPAPVP-PPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETH
        KQ  W  VKE+QRM   APAP P PPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGG PAVAFTKNMIGEIENRSAYL+AIKSEVETH
Subjt:  KQSSWGVVKENQRM-AAAPAPVP-PPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETH

Query:  GEFVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLE
        GEFVNWLI++VE  APRDITEVERFV WLD EL  LVDERAVLKHFPRWPEGKADALREAAFSYRDLK LESEVC FRDNPKEEM VVLKRAQALQDRLE
Subjt:  GEFVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLE

Query:  QSVSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQS-NETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVG
        QSVSNVE+TREF+C KY NF+IPC+WMF+SGL  QMKLSSLRLAKEYMRRITRELQS + T QA+NL LQGVRFAYRVHQYAGGFDS+AI AFEG+KKVG
Subjt:  QSVSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQS-NETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVG

Query:  LSSQRK
        LSSQRK
Subjt:  LSSQRK

A0A6J1G8X0 protein CHUP1, chloroplastic    6.6e-184    84.9
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MP EEDEELAMEI + LK++L+I+LQKS FLEKENQEL+QEL R KS IHSLKAHNNDRKSILWKKFHNS+D  VAG DS PQSP   A+DKW+ TR+QK
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QS+W VVKENQRM AAAP P PPPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVN LIR+VEA APRDI EVERFVKWLDGELA LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE EVC FR+NPKEE N +LKRAQALQDRLEQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-
        VSNVERTREFNCKKYN FQIPCQWM DSGLP QMKLSSLRL KE MRRIT+E Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK+VGL  
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLS-

Query:  SQRK
        +QRK
Subjt:  SQRK

A0A6J1K8G4 protein CHUP1, chloroplastic    2.1e-185    84.65
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        MP EEDEELAMEI + LK++L+I+LQKSNFLEKENQEL+QEL R KS + SLK HNNDRKSILWKKFHNS+DV+VAG DS PQSP   A+DKW+ TR+QK
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QS+W VVKENQRM AAAP P PPPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWGVVKENQRM-AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS
        FVN LIR+VEA APRDI EVERFVKWLDGELA LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EVC FR+NPKEE N +LKRAQALQDRLEQS
Subjt:  FVNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVG-LS
        VSNVERTREFNC KYN FQIPCQWM DSGLP QMKLSSLRL KE MRRIT+ELQ NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK+VG L 
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVG-LS

Query:  SQRK
        SQRK
Subjt:  SQRK

SwissProt top hits    e value    % identity    Alignment
Q9LI74 Protein CHUP1, chloroplastic    2.4e-66    49.48
Query:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V A
Subjt:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA

Query:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN
         +  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V  F D+P       LK+   L +++EQSV  + RTR+  
Subjt:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y  F IP  W+ D+G+  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

Arabidopsis top hits    e value    % identity    Alignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown    1.0e-99    48.38
Query:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK
        +P  ED+   +     L K+L   L +++ LEKEN ELRQE+ RL++Q+ +LK+H N+RKS+LWKK  +S D S     +     +  ++ K    R+  
Subjt:  MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQK

Query:  QSSWGVVKENQRMAAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
                +      + A  PPPPPPLP+K   G ++VRR PEV+E YR+LTKR++   NK    G+ + AF +NMIGEIENRS YLS IKS+ + H + 
Subjt:  QSSWGVVKENQRMAAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSV
        ++ LI +VEA    DI+EVE FVKW+D EL+ LVDERAVLKHFP+WPE K D+LREAA +Y+  K L +E+  F+DNPK+ +   L+R Q+LQDRLE+SV
Subjt:  VNWLIRQVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSV

Query:  SNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSSQ
        +N E+ R+   K+Y +FQIP +WM D+GL  Q+K SSLRLA+EYM+RI +EL+SN + +  NL LQGVRFAY +HQ+AGGFD E +  F  +KK+     
Subjt:  SNVERTREFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSSQ

Query:  R
        R
Subjt:  R

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein    1.7e-67    49.48
Query:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V A
Subjt:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA

Query:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN
         +  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V  F D+P       LK+   L +++EQSV  + RTR+  
Subjt:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y  F IP  W+ D+G+  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein    1.7e-67    49.48
Query:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V A
Subjt:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA

Query:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN
         +  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V  F D+P       LK+   L +++EQSV  + RTR+  
Subjt:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y  F IP  W+ D+G+  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein    1.7e-67    49.48
Query:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V A
Subjt:  PAPVPPPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEA

Query:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN
         +  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V  F D+P       LK+   L +++EQSV  + RTR+  
Subjt:  IAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y  F IP  W+ D+G+  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.9e-74    51.56
Query:  AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIR
        + + AP PPPPPP P  L   S  VRRVPEV+E Y SL +RD+    + + GG  A A         ++MIGEIENRS YL AIK++VET G+F+ +LI+
Subjt:  AAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIR

Query:  QVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERT
        +V   A  DI +V  FVKWLD EL+ LVDERAVLKHF  WPE KADALREAAF Y DLK L SE   FR++P++  +  LK+ QAL ++LE  V ++ R 
Subjt:  QVEAIAPRDITEVERFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERT

Query:  REFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNE--TPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        RE    K+ +FQIP  WM ++G+  Q+KL+S++LA +YM+R++ EL++ E   P+ E L +QGVRFA+RVHQ+AGGFD+E + AFE ++
Subjt:  REFNCKKYNNFQIPCQWMFDSGLPIQMKLSSLRLAKEYMRRITRELQSNE--TPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK


Sequences
CDS sequence
ATGCCAAAGGAAGAAGATGAAGAACTGGCCATGGAGATCAACAACACCCTCAAAAAACAACTCGATATCGCTCTGCAGAAATCTAATTTTCTCGAGAAAGAAAATCAAGA
ACTCCGCCAAGAATTGGGTCGACTGAAATCCCAGATTCACTCTCTCAAGGCTCACAACAATGACAGAAAATCCATTCTTTGGAAGAAATTTCATAACTCCATCGATGTCT
CCGTCGCCGGAGCCGACTCGCCGCCGCAGAGTCCGGCGGCGGCGGCATCGGATAAATGGGACCCGACCAGATCGCAGAAACAGAGCAGCTGGGGCGTGGTGAAGGAGAAT
CAGAGAATGGCGGCGGCACCGGCTCCGGTACCGCCGCCGCCTCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCGAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCT
GTACCGTTCACTGACGAAAAGGGATGCGCAGAAGGAAAATAAGGCCGCACACGGCGGACTTCCGGCGGTGGCGTTCACCAAAAATATGATCGGAGAAATTGAAAACCGAT
CAGCCTATCTGTCTGCGATAAAATCGGAGGTGGAGACACATGGGGAGTTTGTGAACTGGTTGATCAGACAAGTAGAGGCCATAGCGCCGAGAGACATTACGGAGGTGGAG
AGGTTTGTGAAGTGGCTGGACGGGGAGCTGGCGTTGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCGGACGCGCTGAGGGAGGC
GGCGTTCAGCTATAGGGACCTGAAGGGCTTAGAAAGTGAAGTGTGTGGGTTCAGAGACAATCCAAAGGAGGAGATGAATGTGGTACTGAAGAGGGCTCAGGCCTTGCAAG
ACAGGTTGGAGCAGAGTGTGAGCAATGTGGAGAGGACGAGGGAGTTCAATTGTAAAAAGTACAACAATTTTCAAATCCCCTGCCAATGGATGTTCGACTCTGGCCTGCCC
ATTCAGATGAAGTTGAGCTCCTTGAGACTGGCGAAGGAATACATGCGAAGGATAACAAGAGAACTACAATCAAACGAGACCCCACAAGCAGAAAACCTTTTTCTTCAAGG
GGTTCGCTTTGCTTACAGGGTTCACCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCTTTTGAAGGAATGAAGAAAGTTGGGCTTAGTAGTCAGAGAAAATAG
mRNA sequence
ATGCCAAAGGAAGAAGATGAAGAACTGGCCATGGAGATCAACAACACCCTCAAAAAACAACTCGATATCGCTCTGCAGAAATCTAATTTTCTCGAGAAAGAAAATCAAGA
ACTCCGCCAAGAATTGGGTCGACTGAAATCCCAGATTCACTCTCTCAAGGCTCACAACAATGACAGAAAATCCATTCTTTGGAAGAAATTTCATAACTCCATCGATGTCT
CCGTCGCCGGAGCCGACTCGCCGCCGCAGAGTCCGGCGGCGGCGGCATCGGATAAATGGGACCCGACCAGATCGCAGAAACAGAGCAGCTGGGGCGTGGTGAAGGAGAAT
CAGAGAATGGCGGCGGCACCGGCTCCGGTACCGCCGCCGCCTCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCGAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCT
GTACCGTTCACTGACGAAAAGGGATGCGCAGAAGGAAAATAAGGCCGCACACGGCGGACTTCCGGCGGTGGCGTTCACCAAAAATATGATCGGAGAAATTGAAAACCGAT
CAGCCTATCTGTCTGCGATAAAATCGGAGGTGGAGACACATGGGGAGTTTGTGAACTGGTTGATCAGACAAGTAGAGGCCATAGCGCCGAGAGACATTACGGAGGTGGAG
AGGTTTGTGAAGTGGCTGGACGGGGAGCTGGCGTTGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCGGACGCGCTGAGGGAGGC
GGCGTTCAGCTATAGGGACCTGAAGGGCTTAGAAAGTGAAGTGTGTGGGTTCAGAGACAATCCAAAGGAGGAGATGAATGTGGTACTGAAGAGGGCTCAGGCCTTGCAAG
ACAGGTTGGAGCAGAGTGTGAGCAATGTGGAGAGGACGAGGGAGTTCAATTGTAAAAAGTACAACAATTTTCAAATCCCCTGCCAATGGATGTTCGACTCTGGCCTGCCC
ATTCAGATGAAGTTGAGCTCCTTGAGACTGGCGAAGGAATACATGCGAAGGATAACAAGAGAACTACAATCAAACGAGACCCCACAAGCAGAAAACCTTTTTCTTCAAGG
GGTTCGCTTTGCTTACAGGGTTCACCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCTTTTGAAGGAATGAAGAAAGTTGGGCTTAGTAGTCAGAGAAAATAG
Protein sequence
MPKEEDEELAMEINNTLKKQLDIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAASDKWDPTRSQKQSSWGVVKEN
QRMAAAPAPVPPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEAIAPRDITEVE
RFVKWLDGELALLVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVCGFRDNPKEEMNVVLKRAQALQDRLEQSVSNVERTREFNCKKYNNFQIPCQWMFDSGLP
IQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSSQRK
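
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are hypothetical helpers and do not come from CuGenDB. The short inputs in the example are the first 21 and last 12 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG order.
# ASSUMPTION: the standard nuclear code is the right table for this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Drop line breaks and spaces so multi-line sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCCAAAGGAAGAAGATGAA"))  # start of the CDS -> MPKEEDE
print(translate("CAGAGAAAATAG"))           # end of the CDS   -> QRK
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is one way to confirm that the two records agree.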