; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G00610 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G00610
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein CHUP1, chloroplastic
Genome locationClcChr03:524477..528576
RNA-Seq ExpressionClc03G00610
SyntenyClc03G00610
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008457349.1 PREDICTED: protein CHUP1, chloroplastic isoform X1 [Cucumis melo]5.7e-18584.91Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKE+DEELAMEI+CLKK+LEISLQKS FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSS D+AV   DSP   PA A GDKRE T+  KQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RMTA PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQ ENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VNWLIKEVE  AP+DI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VCMFRDNPKEEMNVVLKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG
            +EQSVSNMERTREFNCKKYQ F IP QWMFDSALP Q+KLS+LRLAKEYMIRITREL S ET QAENLFLQGVRFAYRVHQYAGGFDS+AI AFEG
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG

Query:  LKKAGLSSQRK
        LKKAGLSSQRK
Subjt:  LKKAGLSSQRK

XP_008457350.1 PREDICTED: protein CHUP1, chloroplastic isoform X2 [Cucumis melo]2.1e-17184.42Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKE+DEELAMEI+CLKK+LEISLQKS FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSS D+AV   DSP   PA A GDKRE T+  KQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RMTA PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQ ENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VNWLIKEVE  AP+DI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VCMFRDNPKEEMNVVLKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQ
            +EQSVSNMERTREFNCKKYQ F IP QWMFDSALP Q+KLS+LRLAKEYMIRITREL S ET QAENLFLQGVRFAYRVHQ
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQ

XP_011658693.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]6.7e-18684.43Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKEEDE LAMEINCLKKELEISLQKS FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS D++V   DSP   PA   GDKRE+T+S KQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RMT  PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQ ENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWVDVKENLRMTAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VNWLIKEVE+ AP+DI EVERFVKWLD KLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VCMFRDNPKEEMNVVLKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG
            +EQSVSNMERTREFNC+KYQ F IP QWMFDSALP Q+K+S+LRLAKEYMIRITREL S ETPQ ENLFLQG RFAYRVHQYAGGFDS+ I AFEG
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG

Query:  LKKAGLSSQRK
        LKKAGLSSQRK
Subjt:  LKKAGLSSQRK

XP_011658695.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]8.5e-17384.16Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKEEDE LAMEINCLKKELEISLQKS FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS D++V   DSP   PA   GDKRE+T+S KQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RMT  PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQ ENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWVDVKENLRMTAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VNWLIKEVE+ AP+DI EVERFVKWLD KLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VCMFRDNPKEEMNVVLKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQ
            +EQSVSNMERTREFNC+KYQ F IP QWMFDSALP Q+K+S+LRLAKEYMIRITREL S ETPQ ENLFLQG RFAYRVHQ
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQ

XP_038896069.1 protein CHUP1, chloroplastic [Benincasa hispida]3.9e-18685.12Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKEEDEELAMEIN LKKELEISLQKSNFLE ENQELRQELGRLKSQIQSLKA NNERKSILWKKFHSS DVAV   DS  P PAAA G+KRETT+SQKQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV
        SSW DVKEN RM  APA APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQ ENK  HGGVP VAFTKNMIGEIENRSAYLSAIKSEVETHGEFV
Subjt:  SSWVDVKENLRMTAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV

Query:  NWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTIN
        NWLIKEVE+ AP+DI EVERFVKW+D KL SLVDERAVLKHFPRWPEAKADALREAAFSYRDLK LE+EVCMFRDN KEE+NVVLKRAQALQDR      
Subjt:  NWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTIN

Query:  LLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGL
           +EQSVSN+E+TREFN KKYQ F IP QWMFDSALPAQMKLSSLRL KE M+RITRE+ S ETPQAENLFLQGVRFAYRVHQ+AGGFDS+A + FE L
Subjt:  LLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGL

Query:  KKAGLSSQRK
        KKAGLSSQRK
Subjt:  KKAGLSSQRK

TrEMBL top hitse value%identityAlignment
A0A0A0LVK7 Uncharacterized protein3.3e-18684.43Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKEEDE LAMEINCLKKELEISLQKS FLEKENQELRQEL RL+SQIQS KAQNNERKSILWKKFHSS D++V   DSP   PA   GDKRE+T+S KQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RMT  PAS  PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQ ENKV HGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SSWVDVKENLRMTAAPAS-APPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VNWLIKEVE+ AP+DI EVERFVKWLD KLASLVDERAVLK+FPRWPEAKADALREAAFSYRDLK LES+VCMFRDNPKEEMNVVLKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG
            +EQSVSNMERTREFNC+KYQ F IP QWMFDSALP Q+K+S+LRLAKEYMIRITREL S ETPQ ENLFLQG RFAYRVHQYAGGFDS+ I AFEG
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG

Query:  LKKAGLSSQRK
        LKKAGLSSQRK
Subjt:  LKKAGLSSQRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X12.8e-18584.91Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKE+DEELAMEI+CLKK+LEISLQKS FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSS D+AV   DSP   PA A GDKRE T+  KQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RMTA PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQ ENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VNWLIKEVE  AP+DI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VCMFRDNPKEEMNVVLKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG
            +EQSVSNMERTREFNCKKYQ F IP QWMFDSALP Q+KLS+LRLAKEYMIRITREL S ET QAENLFLQGVRFAYRVHQYAGGFDS+AI AFEG
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG

Query:  LKKAGLSSQRK
        LKKAGLSSQRK
Subjt:  LKKAGLSSQRK

A0A1S3C5E9 protein CHUP1, chloroplastic isoform X21.0e-17184.42Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MPKE+DEELAMEI+CLKK+LEISLQKS FLE+ENQELR EL RLKSQIQSLKA NNERKSILWKKFHSS D+AV   DSP   PA A GDKRE T+  KQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        SSW DVKE+ RMTA PASA PPPPPPLP KLLGGSKAVRRVPEVL+LYRTLTKRDAQ ENKV HGG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWVDVKENLRMTAAPASA-PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VNWLIKEVE  AP+DI E E+FVKWLD KLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLES+VCMFRDNPKEEMNVVLKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQ
            +EQSVSNMERTREFNCKKYQ F IP QWMFDSALP Q+KLS+LRLAKEYMIRITREL S ET QAENLFLQGVRFAYRVHQ
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQ

A0A6J1G8X0 protein CHUP1, chloroplastic1.9e-16577.91Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        MP EEDEELAMEI+ LK+ELEISLQKS FLEKENQEL+QEL R KS I SLKA NN+RKSILWKKFH+S DVA     +PQ  PA    DK ETTR+QKQ
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRM-TAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKEN RM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQ ENK  +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SSWVDVKENLRM-TAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI
        VN LI+EVE+ AP+DI EVERFVKWLD +LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE EVC FR+NPKEE N +LKRAQALQDR     
Subjt:  VNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTI

Query:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG
            LEQSVSN+ERTREFNCKKY  F IP QWM DS LPAQMKLSSLRL KE M RIT+E   NETPQ ENLFLQGVRFAYRVHQYAGGFDS+AI+AFEG
Subjt:  NLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEG

Query:  LKKAGLS-SQRK
        +K+ GL  +QRK
Subjt:  LKKAGLS-SQRK

A0A6J1K8G4 protein CHUP1, chloroplastic6.2e-16978.93Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTD-SPQPIPAAAVGDKRETTRSQK
        MP EEDEELAMEI+ LK+ELEISLQKSNFLEKENQEL+QEL R KS +QSLK  NN+RKSILWKKFH+S DVAV  TD SPQ  PA    DK ETTR+QK
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTD-SPQPIPAAAVGDKRETTRSQK

Query:  QSSWVDVKENLRM-TAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        QS+W  VKEN RM  AAP  APPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQ ENK  +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  QSSWVDVKENLRM-TAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACT
        FVN LI+EVE+ AP+DI EVERFVKWLD +LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE+EVC FR+NPKEE N +LKRAQALQDR    
Subjt:  FVNWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACT

Query:  INLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFE
             LEQSVSN+ERTREFNC KY  F IP QWM DS LPAQMKLSSLRL KE M RIT+EL  NETPQ ENLFLQGVRFAYRVHQYAGGFDS+AI+AFE
Subjt:  INLLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFE

Query:  GLKKAG-LSSQRK
        G+K+ G L SQRK
Subjt:  GLKKAG-LSSQRK

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.5e-6346.96Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++ E   + +  G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV +
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES

Query:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+P       LK+   L ++         +EQSV 
Subjt:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS

Query:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK
         + RTR+    +Y+ F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL    GS++ P  E L LQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown6.2e-9748.01Show/hide
Query:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ
        +P  ED+    ++  L KEL+  L +++ LEKEN ELRQE+ RL++Q+ +LK+  NERKS+LWKK  SS D +     + +   +     K +  R+   
Subjt:  MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQ

Query:  SSWVDVKENLRMTAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV
           +  +       + A+ PPPPPPLP+K   G ++VRR PEV+E YR LTKR++ M NK+   GV + AF +NMIGEIENRS YLS IKS+ + H + +
Subjt:  SSWVDVKENLRMTAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFV

Query:  NWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTIN
        + LI +VE+    DI EVE FVKW+D +L+SLVDERAVLKHFP+WPE K D+LREAA +Y+  K+L +E+  F+DNPK+ +   L+R Q+LQDR      
Subjt:  NWLIKEVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTIN

Query:  LLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGL
           LE+SV+N E+ R+   K+Y+ F IP++WM D+ L  Q+K SSLRLA+EYM RI +EL SN + +  NL LQGVRFAY +HQ+AGGFD + +  F  L
Subjt:  LLLLEQSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGL

Query:  KK
        KK
Subjt:  KK

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.1e-6446.96Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++ E   + +  G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV +
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES

Query:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+P       LK+   L ++         +EQSV 
Subjt:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS

Query:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK
         + RTR+    +Y+ F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL    GS++ P  E L LQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.1e-6446.96Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++ E   + +  G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV +
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES

Query:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+P       LK+   L ++         +EQSV 
Subjt:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS

Query:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK
         + RTR+    +Y+ F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL    GS++ P  E L LQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein1.1e-6446.96Show/hide
Query:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES
        P   PPPPPP P  L    GG   V R PE++E Y++L KR+++ E   + +  G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV +
Subjt:  PASAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRTLTKRDAQME---NKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVES

Query:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS
        ++  DI ++  FV WLD +L+ LVDERAVLKHF  WPE KADALREAAF Y+DL  LE +V  F D+P       LK+   L ++         +EQSV 
Subjt:  TAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVS

Query:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK
         + RTR+    +Y+ F IP  W+ D+ +  ++KLSS++LAK+YM R+  EL    GS++ P  E L LQGVRFA+RVHQ+AGGFD++++ AFE L+
Subjt:  NMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITREL----GSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-6948.99Show/hide
Query:  TAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIK
        + + A  PPPPPP P  L   S  VRRVPEV+E Y +L +RD+    +   GG  A A         ++MIGEIENRS YL AIK++VET G+F+ +LIK
Subjt:  TAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIK

Query:  EVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLE
        EV + A  DI +V  FVKWLD +L+ LVDERAVLKHF  WPE KADALREAAF Y DLK L SE   FR++P++  +  LK+ QAL ++         LE
Subjt:  EVESTAPKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLE

Query:  QSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNE--TPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK
          V ++ R RE    K++ F IP  WM ++ + +Q+KL+S++LA +YM R++ EL + E   P+ E L +QGVRFA+RVHQ+AGGFD++ + AFE L+
Subjt:  QSVSNMERTREFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNE--TPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAAGGAAGAAGATGAAGAATTGGCTATGGAGATCAATTGCTTGAAAAAAGAATTGGAAATTTCTCTACAAAAATCAAATTTTCTCGAGAAAGAAAATCAA
GAATTGAGACAAGAATTGGGTCGATTGAAATCCCAGATTCAGTCTTTAAAAGCTCAAAACAATGAGAGAAAATCTATTCTCTGGAAGAAATTCCACAGCTCCACG
GACGTCGCCGTCGTCAGAACTGACTCGCCGCAGCCCATTCCGGCAGCGGCGGTGGGTGATAAACGCGAGACGACCAGATCGCAGAAACAGAGTAGTTGGGTTGAT
GTGAAAGAGAATCTGAGAATGACGGCGGCACCGGCATCGGCGCCGCCACCTCCGCCGCCACTTCCGACGAAGCTGCTGGGAGGATCGAAGGCAGTGCGGCGAGTT
CCGGAAGTGTTGGAGTTGTACCGTACGCTGACGAAAAGGGATGCACAGATGGAAAATAAGGTCAAACACGGCGGAGTTCCGGCAGTGGCGTTCACCAAAAACATG
ATCGGCGAAATTGAAAATCGATCAGCCTATCTCTCGGCGATAAAATCAGAGGTGGAAACACATGGAGAGTTCGTGAACTGGTTGATCAAAGAAGTGGAATCGACA
GCACCAAAAGACATATTAGAGGTAGAGAGGTTTGTGAAGTGGCTGGACAGGAAGTTGGCATCGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCACGGTGG
CCAGAAGCGAAAGCAGATGCATTGCGAGAGGCGGCATTTAGCTATAGAGACTTAAAGAGCTTGGAGAGTGAAGTGTGTATGTTCAGGGACAATCCAAAGGAGGAG
ATGAATGTGGTGTTAAAGAGGGCTCAAGCGTTGCAAGACAGGCGAGCATGTACAATCAATCTGTTGTTGCTAGAGCAAAGCGTCAGTAACATGGAGAGGACAAGG
GAGTTTAATTGTAAGAAGTACCAAGGTTTTCATATCCCCTACCAATGGATGTTCGATTCCGCTTTGCCCGCTCAGATGAAGTTGAGCTCATTGAGGCTGGCGAAG
GAATACATGATAAGGATAACAAGAGAACTAGGATCAAATGAAACCCCACAAGCAGAAAACCTTTTCCTTCAAGGAGTTCGCTTCGCTTACAGGGTTCATCAGTAC
GCAGGTGGTTTCGATTCAGACGCTATACTGGCTTTTGAAGGACTCAAGAAAGCTGGGCTGAGTAGTCAAAGAAAGTAG
mRNA sequenceShow/hide mRNA sequence
CTTCCATTGTAAGCAAGAAGAAGAAATTGAAATAAATGAGCCCAAAAAATCAAGCAAACTGTAATAGCTTGGCTTCTCCTTTAGCCATTTATTAAAAAAGAAGAA
CAAAAGCACAAAATCAAAGGAGAATGCCAAAGGAAGAAGATGAAGAATTGGCTATGGAGATCAATTGCTTGAAAAAAGAATTGGAAATTTCTCTACAAAAATCAA
ATTTTCTCGAGAAAGAAAATCAAGAATTGAGACAAGAATTGGGTCGATTGAAATCCCAGATTCAGTCTTTAAAAGCTCAAAACAATGAGAGAAAATCTATTCTCT
GGAAGAAATTCCACAGCTCCACGGACGTCGCCGTCGTCAGAACTGACTCGCCGCAGCCCATTCCGGCAGCGGCGGTGGGTGATAAACGCGAGACGACCAGATCGC
AGAAACAGAGTAGTTGGGTTGATGTGAAAGAGAATCTGAGAATGACGGCGGCACCGGCATCGGCGCCGCCACCTCCGCCGCCACTTCCGACGAAGCTGCTGGGAG
GATCGAAGGCAGTGCGGCGAGTTCCGGAAGTGTTGGAGTTGTACCGTACGCTGACGAAAAGGGATGCACAGATGGAAAATAAGGTCAAACACGGCGGAGTTCCGG
CAGTGGCGTTCACCAAAAACATGATCGGCGAAATTGAAAATCGATCAGCCTATCTCTCGGCGATAAAATCAGAGGTGGAAACACATGGAGAGTTCGTGAACTGGT
TGATCAAAGAAGTGGAATCGACAGCACCAAAAGACATATTAGAGGTAGAGAGGTTTGTGAAGTGGCTGGACAGGAAGTTGGCATCGCTGGTGGACGAGAGGGCGG
TGCTGAAGCACTTCCCACGGTGGCCAGAAGCGAAAGCAGATGCATTGCGAGAGGCGGCATTTAGCTATAGAGACTTAAAGAGCTTGGAGAGTGAAGTGTGTATGT
TCAGGGACAATCCAAAGGAGGAGATGAATGTGGTGTTAAAGAGGGCTCAAGCGTTGCAAGACAGGCGAGCATGTACAATCAATCTGTTGTTGCTAGAGCAAAGCG
TCAGTAACATGGAGAGGACAAGGGAGTTTAATTGTAAGAAGTACCAAGGTTTTCATATCCCCTACCAATGGATGTTCGATTCCGCTTTGCCCGCTCAGATGAAGT
TGAGCTCATTGAGGCTGGCGAAGGAATACATGATAAGGATAACAAGAGAACTAGGATCAAATGAAACCCCACAAGCAGAAAACCTTTTCCTTCAAGGAGTTCGCT
TCGCTTACAGGGTTCATCAGTACGCAGGTGGTTTCGATTCAGACGCTATACTGGCTTTTGAAGGACTCAAGAAAGCTGGGCTGAGTAGTCAAAGAAAGTAGGCTT
CTTCTTGATTATAAATTATAGGGAATGAATCTTGTTGGTGGCTGTAGCCAACTCTAATGCAGAGCACATTCAACTAGATTGATGTAATACAAATGCTTGAATGGA
TGTTGTATACATAATCAATCCTATTGCAACTTATACAAACAATATCAAGGAGAATGGCATATCAGCTTAACTACACCTAGCTCTATAAAAAAGAACTAATGAAAA
AGATAAAACTCCAAGAGAAACCATCCAATGGGTCCTAACTAAAATATGGAGAGATCGGTTTTCACTAGAACAAAGTTTCTCTTCTTCTATAATGTGGTATCGGGA
TTCAACCGCCACATATCCAGGCATCCTAATGCCTACATTTTCTTTAATTGAGCTGTAGAAAACTTCCCCCAAAAACTCCATATTCCACCACTTACAGAATGGGTC
TCTAGAATTAATAGCTGGCCAAGGATTTCTTTCTCGAAAACAAGCAAAGCTAGATAAGATGAACTTGATGTTTCGGAAAAATCACCAAACAAAATATTACAACCA
TGACCAACAAGAGTATTTTAATCAAAATGCAGACCAATGCAATAATGAATCATTGAGAAAAGGAGACCCATCTAAAAGGCCTACTCATCGCAAGTTTATCTGTCA
TACAGTCTCCAATCACCAACTACACTCCTAAATTTAAGTACTTCCAGAAGATGAACAATGGCTCATGTTTAAAGAGAATAGAGAAAGCTAAGTAATGGTATTCAT
TCAGGAAGGCAAAAGCTAATTAATTAGACTATTGCATCTTTGAAATGGCAATCATTAGAACAAATATATAGCATAGAAATCACTAGATAAAAACTGTAAAGCTGG
AAAACCAGTTCTCGATTAAAGACAACAAAATGTGTCGATCACAAAGCAATTTAAAAACAATGCAAAACGGTTGAAAACTAAGATAACATAAAGAAATCTAAACAG
TAACCAAAACCCTAGGTTGGAAACCATCCCTAAGAAAACGGTCGCTACTTCGATCTCTGCCTCATCTTTCGGCGCTTTCTCTTCAACCTCCTCATTCTCTTCTTC
TTCCACTTAGCTCTCATGGCGACGCTGCGAAAAAAGTTTGAGTGCTGTCTGTAAAGTGAGCGTCCTCAAATACAGACGAACGGCGGAGGGAAGAAGGCTGGAGAC
AGTAGAATC
Protein sequenceShow/hide protein sequence
MPKEEDEELAMEINCLKKELEISLQKSNFLEKENQELRQELGRLKSQIQSLKAQNNERKSILWKKFHSSTDVAVVRTDSPQPIPAAAVGDKRETTRSQKQSSWVD
VKENLRMTAAPASAPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQMENKVKHGGVPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEST
APKDILEVERFVKWLDRKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESEVCMFRDNPKEEMNVVLKRAQALQDRRACTINLLLLEQSVSNMERTR
EFNCKKYQGFHIPYQWMFDSALPAQMKLSSLRLAKEYMIRITRELGSNETPQAENLFLQGVRFAYRVHQYAGGFDSDAILAFEGLKKAGLSSQRK