; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021806 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021806
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationChr05:16995356..16998732
RNA-Seq ExpressionHG10021806
SyntenyHG10021806
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008457349.1 PREDICTED: protein CHUP1, chloroplastic isoform X1 [Cucumis melo]9.7e-16379.6Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKE+DE LAMEI+CLK++LEISLQK  FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSSMD+AVAGADSPP +PA  A DKRE+TK  KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+QRM A PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQKENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VC+F+DN KEEMNVVLK+AQALQDR E ++
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ
        +                   C  MF       +KLS+LRLAKEYM RITREL+S ET QAENL LQGVRFAYRVHQYAGGFDSEAI AFEGLKKAGLSSQ
Subjt:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ

Query:  RK
        RK
Subjt:  RK

XP_011658693.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]4.2e-16680.1Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE+LAMEINCLK+ELEISLQK  FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS+D++VAGADSPP SPA  A DKRE TKS KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RM   PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI EVERFVKWLDGKLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VC+F+DN KEEMNVVLK+AQALQDR E ++
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ
        +                   C  MF       +K+S+LRLAKEYM RITRELQS ETPQ ENL LQG RFAYRVHQYAGGFDSE I AFEGLKKAGLSSQ
Subjt:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ

Query:  RK
        RK
Subjt:  RK

XP_011658695.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]1.2e-15279.26Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE+LAMEINCLK+ELEISLQK  FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS+D++VAGADSPP SPA  A DKRE TKS KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RM   PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI EVERFVKWLDGKLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VC+F+DN KEEMNVVLK+AQALQDR E ++
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ
        +                   C  MF       +K+S+LRLAKEYM RITRELQS ETPQ ENL LQG RFAYRVHQ
Subjt:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima]6.5e-15175.19Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MP EEDE LAMEI+ LKRELEISLQK NFLEKENQEL+QEL R KS +QSLK  NN+RKSILWKKFH+SMDVAVAG DS P SP   A+DK E T++QKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKENQRM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK T+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRE---
        VN LI+EVEA APRDI EVERFVKWLDG+LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE+EVC F++N KEE N +LK+AQALQDR E   
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRE---

Query:  --------------------CTINLCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAG-LSS
                            C   L   +   MKLSSLRL KE M RIT+ELQ NETPQ ENL LQGVRFAYRVHQYAGGFDSEAI+AFEG+K+ G L S
Subjt:  --------------------CTINLCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAG-LSS

Query:  QRK
        QRK
Subjt:  QRK

XP_038896069.1 protein CHUP1, chloroplastic [Benincasa hispida]2.9e-16780.8Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE LAMEIN LK+ELEISLQK NFLE ENQELRQELGRLKSQIQSLKA NNERKSILWKKFHSSMDVAVAGADS PPSPA  A +KRE TKSQKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV
        SSWGDVKENQRMM APA APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENK THGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEFV
Subjt:  SSWGDVKENQRMMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV

Query:  NWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTIN
        NWLIKEVEA APRDI EVERFVKW+D KL SLVDERAVLKHFPRWPEAKADALREAAFSYRDLK LE+EVC+F+DN KEE+NVVLK+AQALQDR E +++
Subjt:  NWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTIN

Query:  -----------------------LCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQR
                                   +   MKLSSLRL KE M RITRE++S ETPQAENL LQGVRFAYRVHQ+AGGFDSEA + FE LKKAGLSSQR
Subjt:  -----------------------LCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQR

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A0A0LVK7 Uncharacterized protein2.0e-16680.1Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE+LAMEINCLK+ELEISLQK  FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS+D++VAGADSPP SPA  A DKRE TKS KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RM   PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI EVERFVKWLDGKLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VC+F+DN KEEMNVVLK+AQALQDR E ++
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ
        +                   C  MF       +K+S+LRLAKEYM RITRELQS ETPQ ENL LQG RFAYRVHQYAGGFDSE I AFEGLKKAGLSSQ
Subjt:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ

Query:  RK
        RK
Subjt:  RK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X14.7e-16379.6Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKE+DE LAMEI+CLK++LEISLQK  FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSSMD+AVAGADSPP +PA  A DKRE+TK  KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+QRM A PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQKENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VC+F+DN KEEMNVVLK+AQALQDR E ++
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ
        +                   C  MF       +KLS+LRLAKEYM RITREL+S ET QAENL LQGVRFAYRVHQYAGGFDSEAI AFEGLKKAGLSSQ
Subjt:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQ

Query:  RK
        RK
Subjt:  RK

A0A1S3C5E9 protein CHUP1, chloroplastic isoform X23.9e-14978.46Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKE+DE LAMEI+CLK++LEISLQK  FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSSMD+AVAGADSPP +PA  A DKRE+TK  KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+QRM A PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQKENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VC+F+DN KEEMNVVLK+AQALQDR E ++
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ
        +                   C  MF       +KLS+LRLAKEYM RITREL+S ET QAENL LQGVRFAYRVHQ
Subjt:  N------------------LCCVMF-----VLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ

A0A6J1G8X0 protein CHUP1, chloroplastic3.1e-14673.95Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MP EEDE LAMEI+ LKRELEISLQK  FLEKENQEL+QEL R KS I SLKA NN+RKSILWKKFH+SMD  VAG DS P SP   A+DK E T++QKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKENQRM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK  +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRE---
        VN LI+EVEA APRDI EVERFVKWLDG+LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE EVC F++N KEE N +LK+AQALQDR E   
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRE---

Query:  --------------------CTINLCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLS-S
                            C   L   +   MKLSSLRL KE M RIT+E Q NETPQ ENL LQGVRFAYRVHQYAGGFDSEAI+AFEG+K+ GL  +
Subjt:  --------------------CTINLCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLS-S

Query:  QRK
        QRK
Subjt:  QRK

A0A6J1K8G4 protein CHUP1, chloroplastic3.2e-15175.19Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MP EEDE LAMEI+ LKRELEISLQK NFLEKENQEL+QEL R KS +QSLK  NN+RKSILWKKFH+SMDVAVAG DS P SP   A+DK E T++QKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKENQRM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK T+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRE---
        VN LI+EVEA APRDI EVERFVKWLDG+LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE+EVC F++N KEE N +LK+AQALQDR E   
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRE---

Query:  --------------------CTINLCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAG-LSS
                            C   L   +   MKLSSLRL KE M RIT+ELQ NETPQ ENL LQGVRFAYRVHQYAGGFDSEAI+AFEG+K+ G L S
Subjt:  --------------------CTINLCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAG-LSS

Query:  QRK
        QRK
Subjt:  QRK

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.3e-5345.64Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++ E ++          
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------

Query:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
                          +   +KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown5.2e-8245.48Show/hide
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        +P  ED+    ++  L +EL+  L + + LEKEN ELRQE+ RL++Q+ +LK+  NERKS+LWKK  SS D             +NT     +  +S K 
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQR-----MMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVET
        ++ G    N          + A+ PPPPPPLP+K   G ++VRR PEV+E YR LTKR++   NK+   G  + AF +NMIGEIENRS YLS IKS+ + 
Subjt:  SSWGDVKENQR-----MMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVET

Query:  HGEFVNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRR
        H + ++ LI +VEA    DI EVE FVKW+D +L+SLVDERAVLKHFP+WPE K D+LREAA +Y+  K+L +E+  FKDN K+ +   L++ Q+LQDR 
Subjt:  HGEFVNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRR

Query:  ECTIN-----------------------LCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKK
        E ++N                       L   +   +K SSLRLA+EYM RI +EL+SN + +  NL+LQGVRFAY +HQ+AGGFD E +  F  LKK
Subjt:  ECTIN-----------------------LCCVMFVLMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKK

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein9.3e-5545.64Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++ E ++          
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------

Query:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
                          +   +KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein9.3e-5545.64Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++ E ++          
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------

Query:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
                          +   +KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein9.3e-5545.64Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++ E ++          
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLC-------

Query:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
                          +   +KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  ----------------CVMFVLMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-5848.07Show/hide
Query:  ASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        A  PPPPPP P  L   S  VRRVPEV+E Y +L +RD+    + + GGG A A         ++MIGEIENRS YL AIK++VET G+F+ +LIKEV  
Subjt:  ASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVM----
         A  DI +V  FVKWLD +L+ LVDERAVLKHF  WPE KADALREAAF Y DLK L SE   F+++ ++  +  LKK QAL ++ E  +     M    
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVM----

Query:  ------------FVL-------MKLSSLRLAKEYMTRITRELQSNE--TPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
                    ++L       +KL+S++LA +YM R++ EL++ E   P+ E L++QGVRFA+RVHQ+AGGFD+E + AFE L+
Subjt:  ------------FVL-------MKLSSLRLAKEYMTRITRELQSNE--TPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAAGGAAGAAGATGAAATTTTGGCTATGGAGATCAATTGCTTGAAAAGAGAATTGGAAATTTCTCTACAAAAATTAAATTTTCTCGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGATTGAAATCCCAGATTCAGTCTTTGAAAGCTCAAAACAATGAGAGAAAATCAATTCTCTGGAAGAAATTCCATAGCTCCATGGATGTCGCCG
TCGCCGGAGCTGACTCGCCGCCGCCAAGTCCGGCTAATACGGCGAGTGATAAACGAGAGCTGACCAAATCGCAGAAACAGAGTAGTTGGGGTGATGTGAAAGAGAATCAG
AGAATGATGGCAGCACCGGCATCGGCGCCGCCGCCTCCGCCGCCACTTCCGACGAAGCTGCTCGGAGGATCGAAGGCAGTGCGGCGAGTTCCGGAAGTGTTGGAGTTGTA
CCGTACGCTGACGAAAAGGGATGCACAGAAGGAAAACAAGGTCACACACGGCGGAGGTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATTGAAAACCGATCAG
CCTATCTCTCGGCAATAAAATCAGAGGTGGAAACACATGGGGAGTTCGTGAATTGGTTGATCAAAGAAGTGGAAGCGACGGCGCCAAGAGACATAGTAGAGGTAGAGAGG
TTTGTGAAATGGCTGGATGGGAAACTAGCCTCGTTGGTGGACGAGAGAGCAGTATTGAAGCACTTCCCGCGGTGGCCAGAGGCGAAAGCGGATGCACTGCGGGAGGCAGC
ATTTAGCTATAGAGACCTAAAGAGCTTGGAGAGTGAAGTGTGTCTATTTAAGGACAATCTAAAAGAGGAGATGAATGTAGTGTTAAAGAAGGCTCAAGCATTGCAAGACA
GGCGAGAATGTACTATCAATCTTTGTTGTGTAATGTTTGTTTTGATGAAGTTGAGCTCATTGAGGCTAGCAAAGGAATACATGACAAGGATAACAAGAGAACTGCAATCA
AACGAAACCCCACAAGCAGAAAACCTTCTCCTTCAAGGGGTTCGCTTTGCTTACAGGGTTCATCAGTATGCAGGTGGTTTCGATTCAGAGGCTATACTGGCTTTTGAAGG
ACTGAAGAAAGCTGGGCTGAGTAGCCAAAGAAAATACGCTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAAGGAAGAAGATGAAATTTTGGCTATGGAGATCAATTGCTTGAAAAGAGAATTGGAAATTTCTCTACAAAAATTAAATTTTCTCGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGATTGAAATCCCAGATTCAGTCTTTGAAAGCTCAAAACAATGAGAGAAAATCAATTCTCTGGAAGAAATTCCATAGCTCCATGGATGTCGCCG
TCGCCGGAGCTGACTCGCCGCCGCCAAGTCCGGCTAATACGGCGAGTGATAAACGAGAGCTGACCAAATCGCAGAAACAGAGTAGTTGGGGTGATGTGAAAGAGAATCAG
AGAATGATGGCAGCACCGGCATCGGCGCCGCCGCCTCCGCCGCCACTTCCGACGAAGCTGCTCGGAGGATCGAAGGCAGTGCGGCGAGTTCCGGAAGTGTTGGAGTTGTA
CCGTACGCTGACGAAAAGGGATGCACAGAAGGAAAACAAGGTCACACACGGCGGAGGTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATTGAAAACCGATCAG
CCTATCTCTCGGCAATAAAATCAGAGGTGGAAACACATGGGGAGTTCGTGAATTGGTTGATCAAAGAAGTGGAAGCGACGGCGCCAAGAGACATAGTAGAGGTAGAGAGG
TTTGTGAAATGGCTGGATGGGAAACTAGCCTCGTTGGTGGACGAGAGAGCAGTATTGAAGCACTTCCCGCGGTGGCCAGAGGCGAAAGCGGATGCACTGCGGGAGGCAGC
ATTTAGCTATAGAGACCTAAAGAGCTTGGAGAGTGAAGTGTGTCTATTTAAGGACAATCTAAAAGAGGAGATGAATGTAGTGTTAAAGAAGGCTCAAGCATTGCAAGACA
GGCGAGAATGTACTATCAATCTTTGTTGTGTAATGTTTGTTTTGATGAAGTTGAGCTCATTGAGGCTAGCAAAGGAATACATGACAAGGATAACAAGAGAACTGCAATCA
AACGAAACCCCACAAGCAGAAAACCTTCTCCTTCAAGGGGTTCGCTTTGCTTACAGGGTTCATCAGTATGCAGGTGGTTTCGATTCAGAGGCTATACTGGCTTTTGAAGG
ACTGAAGAAAGCTGGGCTGAGTAGCCAAAGAAAATACGCTTCTTGA
Protein sequenceShow/hide protein sequence
MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQSSWGDVKENQ
RMMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEATAPRDIVEVER
FVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMFVLMKLSSLRLAKEYMTRITRELQS
NETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQRKYAS