CuGenDBv2

Lsi02G012620 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G012620
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein CHUP1, chloroplastic
Genome location: chr02:16758515..16762290
RNA-Seq Expression: Lsi02G012620
Synteny: Lsi02G012620
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process); GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like


Homology
GenBank top hits
XP_008457349.1 PREDICTED: protein CHUP1, chloroplastic isoform X1 [Cucumis melo] (e value: 3.7e-184; % identity: 84.02)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKE+DE LAMEI+CLK++LEISLQK  FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSSMD+AVAGADSPP +PA  A DKRE+TK  KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+QRM A PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQKENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VC+F+DN KEEMNVVLK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF
              +EQSV+NMERTREFNCKKY +FQIP QWMFDSALP Q+KLS+LRLAKEYM RITREL+S ET QAENL LQGVRFAYRVHQYAGGFDSEAI AF
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF

Query:  EGLKKAGLSSQRK
        EGLKKAGLSSQRK
Subjt:  EGLKKAGLSSQRK

XP_011658693.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] (e value: 3.6e-187; % identity: 84.26)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE+LAMEINCLK+ELEISLQK  FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS+D++VAGADSPP SPA  A DKRE TKS KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RM   PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI EVERFVKWLDGKLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VC+F+DN KEEMNVVLK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF
              +EQSV+NMERTREFNC+KY +FQIP QWMFDSALP Q+K+S+LRLAKEYM RITRELQS ETPQ ENL LQG RFAYRVHQYAGGFDSE I AF
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF

Query:  EGLKKAGLSSQRK
        EGLKKAGLSSQRK
Subjt:  EGLKKAGLSSQRK

XP_011658695.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] (e value: 1.0e-173; % identity: 83.72)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE+LAMEINCLK+ELEISLQK  FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS+D++VAGADSPP SPA  A DKRE TKS KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RM   PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI EVERFVKWLDGKLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VC+F+DN KEEMNVVLK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ
              +EQSV+NMERTREFNC+KY +FQIP QWMFDSALP Q+K+S+LRLAKEYM RITRELQS ETPQ ENL LQG RFAYRVHQ
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.1e-171; % identity: 79.23)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MP EEDE LAMEI+ LKRELEISLQK NFLEKENQEL+QEL R KS IQSLKA NN+RKSILWKKFH+SMDVAVAG DS P SP   A+DK E T++QKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKENQRM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK  +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VN LI+EVEA APRDI EVERFVKWLDG+L SLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE+EVC F++N KEE N +LK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF
              LEQSV+N+ERTREFNCKKY+ FQIP QWM DS LPAQMKLSSLRL KE M RIT+E+Q NETPQ ENL LQGVRFAYRVHQYAGGFDSEAI+AF
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF

Query:  EGLKKAGLS-SQRK
        EG+K+ GL  +QRK
Subjt:  EGLKKAGLS-SQRK

XP_038896069.1 protein CHUP1, chloroplastic [Benincasa hispida] (e value: 9.5e-188; % identity: 85.44)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE LAMEIN LK+ELEISLQK NFLE ENQELRQELGRLKSQIQSLKA NNERKSILWKKFHSSMDVAVAGADS PPSPA  A +KRE TKSQKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV
        SSWGDVKENQRMM APA APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENK THGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEFV
Subjt:  SSWGDVKENQRMMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV

Query:  NWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTIN
        NWLIKEVEA APRDI EVERFVKW+D KL SLVDERAVLKHFPRWPEAKADALREAAFSYRDLK LE+EVC+F+DN KEE+NVVLK+AQALQDR      
Subjt:  NWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTIN

Query:  LCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFE
             +EQSV+N+E+TREFN KKY  FQIP QWMFDSALPAQMKLSSLRL KE M RITRE++S ETPQAENL LQGVRFAYRVHQ+AGGFDSEA + FE
Subjt:  LCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFE

Query:  GLKKAGLSSQRK
         LKKAGLSSQRK
Subjt:  GLKKAGLSSQRK

TrEMBL top hits
A0A0A0LVK7 Uncharacterized protein (e value: 1.7e-187; % identity: 84.26)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKEEDE+LAMEINCLK+ELEISLQK  FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS+D++VAGADSPP SPA  A DKRE TKS KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RM   PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWGDVKENQRMMAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI EVERFVKWLDGKLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VC+F+DN KEEMNVVLK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF
              +EQSV+NMERTREFNC+KY +FQIP QWMFDSALP Q+K+S+LRLAKEYM RITRELQS ETPQ ENL LQG RFAYRVHQYAGGFDSE I AF
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF

Query:  EGLKKAGLSSQRK
        EGLKKAGLSSQRK
Subjt:  EGLKKAGLSSQRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X1 (e value: 1.8e-184; % identity: 84.02)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKE+DE LAMEI+CLK++LEISLQK  FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSSMD+AVAGADSPP +PA  A DKRE+TK  KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+QRM A PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQKENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VC+F+DN KEEMNVVLK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF
              +EQSV+NMERTREFNCKKY +FQIP QWMFDSALP Q+KLS+LRLAKEYM RITREL+S ET QAENL LQGVRFAYRVHQYAGGFDSEAI AF
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF

Query:  EGLKKAGLSSQRK
        EGLKKAGLSSQRK
Subjt:  EGLKKAGLSSQRK

A0A1S3C5E9 protein CHUP1, chloroplastic isoform X2 (e value: 1.5e-170; % identity: 83.2)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MPKE+DE LAMEI+CLK++LEISLQK  FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSSMD+AVAGADSPP +PA  A DKRE+TK  KQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+QRM A PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQKENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRMMAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VNWLIKEVE  APRDI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VC+F+DN KEEMNVVLK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ
              +EQSV+NMERTREFNCKKY +FQIP QWMFDSALP Q+KLS+LRLAKEYM RITREL+S ET QAENL LQGVRFAYRVHQ
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQ

A0A6J1G8X0 protein CHUP1, chloroplastic (e value: 3.4e-167; % identity: 78.5)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MP EEDE LAMEI+ LKRELEISLQK  FLEKENQEL+QEL R KS I SLKA NN+RKSILWKKFH+SMD  VAG DS P SP   A+DK E T++QKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKENQRM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK  +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VN LI+EVEA APRDI EVERFVKWLDG+LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE EVC F++N KEE N +LK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF
              LEQSV+N+ERTREFNCKKY+ FQIP QWM DS LPAQMKLSSLRL KE M RIT+E Q NETPQ ENL LQGVRFAYRVHQYAGGFDSEAI+AF
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF

Query:  EGLKKAGLS-SQRK
        EG+K+ GL  +QRK
Subjt:  EGLKKAGLS-SQRK

A0A6J1K8G4 protein CHUP1, chloroplastic (e value: 1.0e-171; % identity: 79.47)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        MP EEDE LAMEI+ LKRELEISLQK NFLEKENQEL+QEL R KS +QSLK  NN+RKSILWKKFH+SMDVAVAG DS P SP   A+DK E T++QKQ
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKENQRM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK T+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWGDVKENQRM-MAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI
        VN LI+EVEA APRDI EVERFVKWLDG+LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE+EVC F++N KEE N +LK+AQALQDR     
Subjt:  VNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTI

Query:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF
              LEQSV+N+ERTREFNC KY+ FQIP QWM DS LPAQMKLSSLRL KE M RIT+ELQ NETPQ ENL LQGVRFAYRVHQYAGGFDSEAI+AF
Subjt:  NLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAF

Query:  EGLKKAG-LSSQRK
        EG+K+ G L SQRK
Subjt:  EGLKKAG-LSSQRK

SwissProt top hits
Q9LI74 Protein CHUP1, chloroplastic (e value: 1.8e-64; % identity: 47.99)
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++           +EQS
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS

Query:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

Arabidopsis top hits
AT1G07120.1 FUNCTIONS IN: molecular_function unknown (e value: 9.1e-96; % identity: 48.17)
Query:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ
        +P  ED+    ++  L +EL+  L + + LEKEN ELRQE+ RL++Q+ +LK+  NERKS+LWKK  SS D             +NT     +  +S K 
Subjt:  MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQ

Query:  SSWGDVKENQR-----MMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVET
        ++ G    N          + A+ PPPPPPLP+K   G ++VRR PEV+E YR LTKR++   NK+   G  + AF +NMIGEIENRS YLS IKS+ + 
Subjt:  SSWGDVKENQR-----MMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVET

Query:  HGEFVNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRR
        H + ++ LI +VEA    DI EVE FVKW+D +L+SLVDERAVLKHFP+WPE K D+LREAA +Y+  K+L +E+  FKDN K+ +   L++ Q+LQDR 
Subjt:  HGEFVNWLIKEVEATAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRR

Query:  ECTINLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEA
                  LE+SVNN E+ R+   K+Y  FQIP++WM D+ L  Q+K SSLRLA+EYM RI +EL+SN + +  NL+LQGVRFAY +HQ+AGGFD E 
Subjt:  ECTINLCCVMLEQSVNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEA

Query:  ILAFEGLKK
        +  F  LKK
Subjt:  ILAFEGLKK

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein (e value: 1.3e-65; % identity: 47.99)
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++           +EQS
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS

Query:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein (e value: 1.3e-65; % identity: 47.99)
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++           +EQS
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS

Query:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein (e value: 1.3e-65; % identity: 47.99)
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++KE   + ++ G G + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQKE---NKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+        LKK   L ++           +EQS
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS

Query:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL     S++ P  E LLLQGVRFA+RVHQ+AGGFD+E++ AFE L+
Subjt:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQ----SNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.7e-70; % identity: 50.34)
Query:  ASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA
        A  PPPPPP P  L   S  VRRVPEV+E Y +L +RD+    + + GGG A A         ++MIGEIENRS YL AIK++VET G+F+ +LIKEV  
Subjt:  ASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEA

Query:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS
         A  DI +V  FVKWLD +L+ LVDERAVLKHF  WPE KADALREAAF Y DLK L SE   F+++ ++  +  LKK QAL ++           LE  
Subjt:  TAPRDIVEVERFVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQS

Query:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNE--TPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK
        V ++ R RE    K+ SFQIP  WM ++ + +Q+KL+S++LA +YM R++ EL++ E   P+ E L++QGVRFA+RVHQ+AGGFD+E + AFE L+
Subjt:  VNNMERTREFNCKKYHSFQIPYQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNE--TPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLK

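The unlabelled middle row of each alignment block above follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how identity can be counted for a single block, the helper below tallies those characters. The function name `midline_stats` is hypothetical and not part of CuGenDB. The reported % identity is computed over the whole alignment, so one block on its own will not reproduce it.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one BLAST midline.

    The midline must be passed at full alignment width, because
    spaces (mismatches or gaps) count toward the aligned length.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Final alignment block of the AT1G07120.1 hit (query row: ILAFEGLKK).
midline = "+  F  LKK"
ident, pos, length = midline_stats(midline)
print(f"{ident}/{length} identical, {pos}/{length} positive")
# prints "4/9 identical, 5/9 positive"
```

Summing the three counts over every block of a hit and dividing identities by aligned length gives an approximate identity figure for that hit.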

Sequences
CDS sequence
ATGCCAAAGGAAGAAGATGAAATTTTGGCTATGGAGATCAATTGCTTGAAAAGAGAATTGGAAATTTCTCTACAAAAATTAAATTTTCTCGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGATTGAAATCCCAGATTCAGTCTTTGAAAGCTCAAAACAATGAGAGAAAATCAATTCTCTGGAAGAAATTCCATAGCTCCATGGATGTCGCCG
TCGCCGGAGCTGACTCGCCGCCGCCAAGTCCGGCTAATACGGCGAGTGATAAACGAGAGCTGACCAAATCGCAGAAACAGAGTAGTTGGGGTGATGTGAAAGAGAATCAG
AGAATGATGGCAGCACCGGCATCGGCGCCGCCGCCTCCGCCGCCACTTCCGACGAAGCTGCTCGGAGGATCGAAGGCAGTGCGGCGAGTTCCGGAAGTGTTGGAGTTGTA
CCGTACGCTGACGAAAAGGGATGCACAGAAGGAAAACAAGGTCACACACGGCGGAGGTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATTGAAAACCGATCAG
CCTATCTCTCGGCAATAAAATCAGAGGTGGAAACACATGGGGAGTTCGTGAATTGGTTGATCAAAGAAGTGGAAGCGACAGCGCCAAGAGACATAGTAGAGGTAGAGAGG
TTTGTGAAATGGCTGGATGGGAAACTAGCCTCGTTGGTGGACGAGAGAGCAGTATTGAAGCACTTCCCGCGGTGGCCAGAGGCGAAAGCAGATGCACTGCGGGAGGCAGC
ATTTAGCTATAGAGACCTAAAGAGCTTGGAGAGTGAAGTGTGTCTATTTAAGGACAATCTAAAAGAGGAGATGAATGTAGTGTTAAAGAAGGCTCAAGCATTGCAAGACA
GGCGAGAATGTACTATCAATCTTTGTTGTGTAATGCTGGAGCAAAGTGTCAACAACATGGAGAGAACAAGGGAGTTTAATTGTAAGAAGTACCATAGTTTTCAAATCCCC
TATCAGTGGATGTTCGATTCCGCATTGCCCGCTCAGATGAAGTTGAGCTCATTGAGGCTAGCAAAGGAATACATGACAAGGATAACAAGAGAACTGCAATCAAACGAAAC
CCCACAAGCAGAAAACCTTCTCCTTCAAGGGGTTCGCTTTGCTTACAGGGTTCATCAGTATGCAGGTGGTTTCGATTCAGAGGCTATACTGGCTTTTGAAGGACTGAAGA
AAGCTGGGCTGAGTAGCCAAAGAAAATACGCTTCTTGA
mRNA sequence
GCAAGAAGCAAAAAATCAAGCAAACTTTAATAGCTTGGCTTCTCCTTTAGCCCATTTATTAAAAAAAATCACAAAATCAAAGGAGAATGCCAAAGGAAGAAGATGAAATT
TTGGCTATGGAGATCAATTGCTTGAAAAGAGAATTGGAAATTTCTCTACAAAAATTAAATTTTCTCGAGAAAGAAAATCAAGAACTCAGACAAGAATTGGGTCGATTGAA
ATCCCAGATTCAGTCTTTGAAAGCTCAAAACAATGAGAGAAAATCAATTCTCTGGAAGAAATTCCATAGCTCCATGGATGTCGCCGTCGCCGGAGCTGACTCGCCGCCGC
CAAGTCCGGCTAATACGGCGAGTGATAAACGAGAGCTGACCAAATCGCAGAAACAGAGTAGTTGGGGTGATGTGAAAGAGAATCAGAGAATGATGGCAGCACCGGCATCG
GCGCCGCCGCCTCCGCCGCCACTTCCGACGAAGCTGCTCGGAGGATCGAAGGCAGTGCGGCGAGTTCCGGAAGTGTTGGAGTTGTACCGTACGCTGACGAAAAGGGATGC
ACAGAAGGAAAACAAGGTCACACACGGCGGAGGTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATTGAAAACCGATCAGCCTATCTCTCGGCAATAAAATCAG
AGGTGGAAACACATGGGGAGTTCGTGAATTGGTTGATCAAAGAAGTGGAAGCGACAGCGCCAAGAGACATAGTAGAGGTAGAGAGGTTTGTGAAATGGCTGGATGGGAAA
CTAGCCTCGTTGGTGGACGAGAGAGCAGTATTGAAGCACTTCCCGCGGTGGCCAGAGGCGAAAGCAGATGCACTGCGGGAGGCAGCATTTAGCTATAGAGACCTAAAGAG
CTTGGAGAGTGAAGTGTGTCTATTTAAGGACAATCTAAAAGAGGAGATGAATGTAGTGTTAAAGAAGGCTCAAGCATTGCAAGACAGGCGAGAATGTACTATCAATCTTT
GTTGTGTAATGCTGGAGCAAAGTGTCAACAACATGGAGAGAACAAGGGAGTTTAATTGTAAGAAGTACCATAGTTTTCAAATCCCCTATCAGTGGATGTTCGATTCCGCA
TTGCCCGCTCAGATGAAGTTGAGCTCATTGAGGCTAGCAAAGGAATACATGACAAGGATAACAAGAGAACTGCAATCAAACGAAACCCCACAAGCAGAAAACCTTCTCCT
TCAAGGGGTTCGCTTTGCTTACAGGGTTCATCAGTATGCAGGTGGTTTCGATTCAGAGGCTATACTGGCTTTTGAAGGACTGAAGAAAGCTGGGCTGAGTAGCCAAAGAA
AATACGCTTCTTGATTAGAAACTTATAGGGAATGAATCTTCTTGTTGGCTGTTGCCAACTCTAATGCAGAGCAAATTCAACTAGATGTAATACAAATGCTTGAATGGGTA
TTCTATACATAATCAATCCTATTGCAACTTGTACAAACCATATCAAGGAAAATGGCATATCAGCATAACTAGACTTAGCTCTATCAAAAAGAGAACTAATTAAATAGATA
AAACTCCAAGAGAAACCATCCAATGGGTCCTAACAAAAATATGAAGAGCTAAGTTCTCACTAGAACAAAATTTCTTTCCTTCTATAATGTGGCATTGGGATCCTACC
Protein sequence
MPKEEDEILAMEINCLKRELEISLQKLNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSMDVAVAGADSPPPSPANTASDKRELTKSQKQSSWGDVKENQ
RMMAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVTHGGGPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEATAPRDIVEVER
FVKWLDGKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCLFKDNLKEEMNVVLKKAQALQDRRECTINLCCVMLEQSVNNMERTREFNCKKYHSFQIP
YQWMFDSALPAQMKLSSLRLAKEYMTRITRELQSNETPQAENLLLQGVRFAYRVHQYAGGFDSEAILAFEGLKKAGLSSQRKYAS
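The protein sequence above should be the translation of the CDS under the standard genetic code. The sketch below checks that on the first and last 27 nucleotides of the CDS. The `translate` helper is a hypothetical illustration, not a CuGenDB tool, and it assumes the standard nuclear code and an in-frame input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    seq = cds.upper()
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 27 nt of the Lsi02G012620 CDS listed above.
print(translate("ATGCCAAAGGAAGAAGATGAAATTTTG"))  # prints "MPKEEDEIL"
print(translate("AGTAGCCAAAGAAAATACGCTTCTTGA"))  # prints "SSQRKYAS"
```

Joining the CDS lines into one string and passing it to `translate` should reproduce the full protein sequence if the two records are consistent.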