; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1626 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1626
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationMC01:20467201..20472922
RNA-Seq ExpressionMC01g1626
SyntenyMC01g1626
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134549.1 protein CHUP1, chloroplastic [Cucumis sativus]0.090.28Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS
        MKE+NP E RGK SRFADQNQNPKC NQN AKG+TGNG SKLRAASSWGS+IVKGFSTDK+TKAQ  SNLQPKKAP L +SDL NQK+KFVPSHSRIKRS
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS

Query:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL
        +IGDL+CSANPAQVHP SYQTH RRQSSRDLFVELDQLRSLL+ESKQREFELQNELAELKRN RNYELERELEEKK E+D L +KVS+LEEDRR LSEQL
Subjt:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL

Query:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA
        V LPS+SEKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKNSESEA+AKIKAE SLLRHTNEDLCKQVEGLQMSRLNEVEELA
Subjt:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA

Query:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR
        YLRWVNSCLR+ELRNS PSA+S + SSPQ  ERSSE++GSLSSQKEYM+Y+SAKRINLIKKLKKWPITDEDLSNLDCSD  +L DKNW D EEGRSPRRR
Subjt:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR

Query:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP
        HSISGAK WPEELEPNKRRQSDGF+CAKEMEK+ DPLS+QKYDLGV QRPH+L NCHE +R+ ASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP
Subjt:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP

Query:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
        PLPPPPPPP LPKF+VRSA GMVQRAPQVVEFYHSLMKRDSRKDSSNG ICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
Subjt:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY

Query:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC
        L IEDIV FVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS YNLLR+RESLMRNC
Subjt:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC

Query:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKA+SEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

XP_008439508.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]0.089.53Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS
        MKE+NP E RGK SRFADQNQNPKC NQN AKG++GNG SKLRAASSWGS+IVKGFSTDK+  A+TQSNLQPKKAP L +SDL NQK+KFVPSHSRIKRS
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS

Query:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL
        +IGDL+CSANPAQVHP SYQTH RRQSSRDLFVELDQLRSLL+ESKQREFELQNELAELKRN RNYELERELEEKK E+D L +KVS+LEEDRR LSEQL
Subjt:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL

Query:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKN-SESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEEL
        V L S+SEKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKN SESEA+AK+KAE SLLRHTNEDLCKQVEGLQMSRLNEVEEL
Subjt:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKN-SESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEEL

Query:  AYLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRR
        AYLRWVNSCLR+ELRNSCPSA+S + SSPQ  ERSSE V SLSSQKEYM+Y+SAKRINLIKKLKKWPITDEDLSNLDCSD   L+DK W D EEGRSPRR
Subjt:  AYLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRR

Query:  RHSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP
        RHSISGAK WPEELEPNKRRQSDGF+CAKEMEK+ DPLS+QKYDLGV QRPH+L N HE +R+ ASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP
Subjt:  RHSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP

Query:  PPLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV
        PPLPPPPPPP LPKFAVRSA GMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV
Subjt:  PPLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV

Query:  YLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRN
        YL IEDIV FVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS YNLLR+RESLMRN
Subjt:  YLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRN

Query:  CKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        CKEFQIPTDWMLD+GIISKIKLGSVKLAKMYMKRVA ELQSKA+SEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  CKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

XP_022146557.1 protein CHUP1, chloroplastic-like [Momordica charantia]0.0100Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI
        MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI

Query:  GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA
        GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA
Subjt:  GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA

Query:  LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL
        LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL
Subjt:  LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL

Query:  RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS
        RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS
Subjt:  RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS

Query:  ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL
        ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL
Subjt:  ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL

Query:  PPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDI
        PPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDI
Subjt:  PPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDI

Query:  EDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEF
        EDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEF
Subjt:  EDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEF

Query:  QIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        QIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  QIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

XP_038881874.1 protein CHUP1, chloroplastic-like isoform X1 [Benincasa hispida]0.087.38Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS
        MKE+NPSE+RGK SRFADQNQNPKC NQN AKGN+GNG SKLRAASSWGS+IVKGFSTDK+TKAQ  SNLQPKK P L +SDLANQK+KFVPSHSRIKRS
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS

Query:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL
        +IGDL+CSANPAQVHP SYQTH RRQSSRDLF+ELDQLRSLL+ESKQREFELQNELAELKRN RNYELERELEEKKAE+D LTQKVS+LEE+RR LSEQL
Subjt:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL

Query:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA
        V L SISEKQEEPQTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKNSESEA+AKIKAEASLLRH NEDLCKQVEGLQMSRLNEVEELA
Subjt:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA

Query:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR
        YLRWVNSCLR+ELRNSC SA+S + SSPQ  ERS ES+GSLSSQKEYM+YNSAKRINL+KKLKKWPITDEDLSNLDCSD  +L DKNW D EEGRSPRRR
Subjt:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR

Query:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP
        HSISGAK WPEELEPNKRRQSDGFICAKEMEKEADPLS+QKYDLGV QRPH+  NCHE +RS  SL+VEKRALRIPNPPPRPSCSISSEPKEEN AQVPP
Subjt:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP

Query:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
        PLPPPPPPP LPKFAVRSA GMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
Subjt:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY

Query:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC
        L IEDIV FVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS YNLLR+RESLMRNC
Subjt:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC

Query:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQ--------------------------------FA
        KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKA SEKDPAMDYMLLQGVRFAFRIHQ                                FA
Subjt:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQ--------------------------------FA

Query:  GGFDAETMHAFEDLRNLANLLNKK
        GGFDAETMHAFEDLRNLANLLNKK
Subjt:  GGFDAETMHAFEDLRNLANLLNKK

XP_038881875.1 protein CHUP1, chloroplastic-like isoform X2 [Benincasa hispida]0.090.91Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS
        MKE+NPSE+RGK SRFADQNQNPKC NQN AKGN+GNG SKLRAASSWGS+IVKGFSTDK+TKAQ  SNLQPKK P L +SDLANQK+KFVPSHSRIKRS
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS

Query:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL
        +IGDL+CSANPAQVHP SYQTH RRQSSRDLF+ELDQLRSLL+ESKQREFELQNELAELKRN RNYELERELEEKKAE+D LTQKVS+LEE+RR LSEQL
Subjt:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL

Query:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA
        V L SISEKQEEPQTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKNSESEA+AKIKAEASLLRH NEDLCKQVEGLQMSRLNEVEELA
Subjt:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA

Query:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR
        YLRWVNSCLR+ELRNSC SA+S + SSPQ  ERS ES+GSLSSQKEYM+YNSAKRINL+KKLKKWPITDEDLSNLDCSD  +L DKNW D EEGRSPRRR
Subjt:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR

Query:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP
        HSISGAK WPEELEPNKRRQSDGFICAKEMEKEADPLS+QKYDLGV QRPH+  NCHE +RS  SL+VEKRALRIPNPPPRPSCSISSEPKEEN AQVPP
Subjt:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP

Query:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
        PLPPPPPPP LPKFAVRSA GMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
Subjt:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY

Query:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC
        L IEDIV FVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS YNLLR+RESLMRNC
Subjt:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC

Query:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKA SEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

TrEMBL top hitse value%identityAlignment
A0A0A0KMA9 Uncharacterized protein0.090.28Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS
        MKE+NP E RGK SRFADQNQNPKC NQN AKG+TGNG SKLRAASSWGS+IVKGFSTDK+TKAQ  SNLQPKKAP L +SDL NQK+KFVPSHSRIKRS
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS

Query:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL
        +IGDL+CSANPAQVHP SYQTH RRQSSRDLFVELDQLRSLL+ESKQREFELQNELAELKRN RNYELERELEEKK E+D L +KVS+LEEDRR LSEQL
Subjt:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL

Query:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA
        V LPS+SEKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKNSESEA+AKIKAE SLLRHTNEDLCKQVEGLQMSRLNEVEELA
Subjt:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELA

Query:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR
        YLRWVNSCLR+ELRNS PSA+S + SSPQ  ERSSE++GSLSSQKEYM+Y+SAKRINLIKKLKKWPITDEDLSNLDCSD  +L DKNW D EEGRSPRRR
Subjt:  YLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRR

Query:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP
        HSISGAK WPEELEPNKRRQSDGF+CAKEMEK+ DPLS+QKYDLGV QRPH+L NCHE +R+ ASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP
Subjt:  HSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPP

Query:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
        PLPPPPPPP LPKF+VRSA GMVQRAPQVVEFYHSLMKRDSRKDSSNG ICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY
Subjt:  PLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVY

Query:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC
        L IEDIV FVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS YNLLR+RESLMRNC
Subjt:  LDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNC

Query:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKA+SEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  KEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

A0A1S3AZK1 protein CHUP1, chloroplastic-like0.089.53Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS
        MKE+NP E RGK SRFADQNQNPKC NQN AKG++GNG SKLRAASSWGS+IVKGFSTDK+  A+TQSNLQPKKAP L +SDL NQK+KFVPSHSRIKRS
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS

Query:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL
        +IGDL+CSANPAQVHP SYQTH RRQSSRDLFVELDQLRSLL+ESKQREFELQNELAELKRN RNYELERELEEKK E+D L +KVS+LEEDRR LSEQL
Subjt:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL

Query:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKN-SESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEEL
        V L S+SEKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKN SESEA+AK+KAE SLLRHTNEDLCKQVEGLQMSRLNEVEEL
Subjt:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKN-SESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEEL

Query:  AYLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRR
        AYLRWVNSCLR+ELRNSCPSA+S + SSPQ  ERSSE V SLSSQKEYM+Y+SAKRINLIKKLKKWPITDEDLSNLDCSD   L+DK W D EEGRSPRR
Subjt:  AYLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRR

Query:  RHSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP
        RHSISGAK WPEELEPNKRRQSDGF+CAKEMEK+ DPLS+QKYDLGV QRPH+L N HE +R+ ASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP
Subjt:  RHSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP

Query:  PPLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV
        PPLPPPPPPP LPKFAVRSA GMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV
Subjt:  PPLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV

Query:  YLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRN
        YL IEDIV FVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS YNLLR+RESLMRN
Subjt:  YLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRN

Query:  CKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        CKEFQIPTDWMLD+GIISKIKLGSVKLAKMYMKRVA ELQSKA+SEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  CKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

A0A5A7UD87 Protein CHUP10.089.53Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS
        MKE+NP E RGK SRFADQNQNPKC NQN AKG++GNG SKLRAASSWGS+IVKGFSTDK+  A+TQSNLQPKKAP L +SDL NQK+KFVPSHSRIKRS
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQN-AKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAP-LTSSDLANQKDKFVPSHSRIKRS

Query:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL
        +IGDL+CSANPAQVHP SYQTH RRQSSRDLFVELDQLRSLL+ESKQREFELQNELAELKRN RNYELERELEEKK E+D L +KVS+LEEDRR LSEQL
Subjt:  LIGDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL

Query:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKN-SESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEEL
        V L S+SEKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKN SESEA+AK+KAE SLLRHTNEDLCKQVEGLQMSRLNEVEEL
Subjt:  VALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKN-SESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEEL

Query:  AYLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRR
        AYLRWVNSCLR+ELRNSCPSA+S + SSPQ  ERSSE V SLSSQKEYM+Y+SAKRINLIKKLKKWPITDEDLSNLDCSD   L+DK W D EEGRSPRR
Subjt:  AYLRWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRR

Query:  RHSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP
        RHSISGAK WPEELEPNKRRQSDGF+CAKEMEK+ DPLS+QKYDLGV QRPH+L N HE +R+ ASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP
Subjt:  RHSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVP

Query:  PPLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV
        PPLPPPPPPP LPKFAVRSA GMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV
Subjt:  PPLPPPPPPP-LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAV

Query:  YLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRN
        YL IEDIV FVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERS YNLLR+RESLMRN
Subjt:  YLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRN

Query:  CKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        CKEFQIPTDWMLD+GIISKIKLGSVKLAKMYMKRVA ELQSKA+SEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  CKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

A0A6J1CZW8 protein CHUP1, chloroplastic-like0.0100Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI
        MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI

Query:  GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA
        GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA
Subjt:  GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA

Query:  LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL
        LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL
Subjt:  LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL

Query:  RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS
        RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS
Subjt:  RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS

Query:  ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL
        ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL
Subjt:  ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL

Query:  PPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDI
        PPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDI
Subjt:  PPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDI

Query:  EDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEF
        EDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEF
Subjt:  EDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEF

Query:  QIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        QIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  QIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

A0A6J1KYE4 protein CHUP1, chloroplastic-like0.088.24Show/hide
Query:  MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI
        MKE+N SE+RGK SRFADQN       Q  KG +GNG SKLRAASSWGS+IVKGFSTDKKTKAQT  NLQ KKAPLT+S+L NQK+K VPSH+RIKRSLI
Subjt:  MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLI

Query:  GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA
        GDL+CS NPAQVHP SYQTH RRQSSRDLFVELDQLRSLL+ESKQREFELQNELAEL+RN RN+ELERELEEKKAE++ LTQK SLLEEDRR LSEQLVA
Subjt:  GDLSCSANPAQVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVA

Query:  LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL
          SI+EK EEPQTAP NVEVEVVELRRLNKELQLQKRNLACRLSSVESELA LAKNSESEA+AKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL
Subjt:  LPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYL

Query:  RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS
        RWVNSCLR+ELRNSCPSA+SD+ SSPQA ERSSE VGSLSSQKE+MDYN+AKRIN IKKLKKWPITDEDLSNLDCSD   L+ KNW D EE  SPRRRHS
Subjt:  RWVNSCLRNELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHS

Query:  ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL
        ISGAK WPEELEPNKRRQSDGF+CAKE+EKEADPLS+QKYDLGV QRPHILEN HE +R+ ASLDVEKRALRIPNPPPRPSCSISSEPKEEN  +VPPPL
Subjt:  ISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPL

Query:  PPPPPPP--LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYL
        PPPPPPP  LPKFA RS+ GMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYL
Subjt:  PPPPPPP--LPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYL

Query:  DIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCK
         IED+VAFVKWLDDELCFLVDERAVLKHFDWPERKAD LREAAFGYRDLKKLECEISAYKDDPRLPC+IALKKMV LSEKMERS YNLLR+RESLMRNCK
Subjt:  DIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCK

Query:  EFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
        EFQIP DWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSK++SEKDPAMDYMLLQGVR+AFRIHQFAGGFDAETMHAFEDLRNLANLLNKK
Subjt:  EFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic4.9e-11937.82Show/hide
Query:  ELDQLRSLLHESKQREFELQNELAEL----KRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL-----------VALPSISEKQEEPQ----
        EL++L+ L+ E ++RE +L+ EL E     ++ +   EL+R+L+ K  EID L   ++ L+ +R+ L E+L           VA   I E Q + Q    
Subjt:  ELDQLRSLLHESKQREFELQNELAEL----KRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL-----------VALPSISEKQEEPQ----

Query:  ----------------------------------TAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNED
                                           A Q++EV+V+EL+R N+ELQ +KR L+ +L S E+ +A+L+  +ES+ +AK++ E + L+H NED
Subjt:  ----------------------------------TAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNED

Query:  LCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER--------SSESVGSLSSQKEYM--------DYNSA---
        L KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN    A        S NLS   QA  +        S    G    +  Y         D+++A   
Subjt:  LCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER--------SSESVGSLSSQKEYM--------DYNSA---

Query:  ----------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISGA----------------KGWPE-----ELEPNKRR
                  K+  LI+KLKKW  + +D    S+   S YG    +  +   + R P     I  A                 G PE      +   ++ 
Subjt:  ----------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISGA----------------KGWPE-----ELEPNKRR

Query:  QSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI------------------------------------------------------
         S G           + +K ++   D   P    ++ L V +  HI                                                      
Subjt:  QSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI------------------------------------------------------

Query:  LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP------------PPPPPPLPKFAVRSAMG--MVQRAPQVVEFY
         EN   +++ +  +D+EKR  R+P PPPR +         S+ P        PPP P            PPPPPP P    R A G   V RAP++VEFY
Subjt:  LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP------------PPPPPPLPKFAVRSAMG--MVQRAPQVVEFY

Query:  HSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDW
         SLMKR+S+K+ +   I +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV SL  EV  + + DIED++AFV WLD+EL FLVDERAVLKHFDW
Subjt:  HSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDW

Query:  PERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMK
        PE KADALREAAF Y+DL KLE +++++ DDP L C+ ALKKM  L EK+E+S+Y LLR R+  +   KEF IP DW+ D G++ KIKL SV+LAK YMK
Subjt:  PERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMK

Query:  RVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA
        RVA EL S + S+KDP  +++LLQGVRFAFR+HQFAGGFDAE+M AFE+LR+ A
Subjt:  RVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein3.5e-8043.96Show/hide
Query:  ELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRAL--------------RIPNPPPRPSCSISSE----PKEE
        ELE N R+ S   + A   E +   LS+        Q     +    I+  L    V+K                 R+P  PP P   +S       ++E
Subjt:  ELEPNKRRQSDGFICAKEMEKEADPLSTQKYDLGVSQRPHILENCHEISRSLASLDVEKRAL--------------RIPNPPPRPSCSISSE----PKEE

Query:  NRAQVPPPLPPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIRE
        N +   PP PPPPPPP P   +  A    Q++P V + +  L K+D+ ++ S     N   V++  +S++GEI+NRS+HL+AIKADIET+GEF+N LI++
Subjt:  NRAQVPPPLPPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIRE

Query:  VNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRE
        V    + D+ED++ FV WLD EL  L DERAVLKHF WPE+KAD L+EAA  YR+LKKLE E+S+Y DDP +   +ALKKM  L +K E+ I  L+R+R 
Subjt:  VNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRE

Query:  SLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLR
        S MR+ ++F+IP +WMLD+G+I KIK  S+KLAK YM RVA ELQS    +++   + +LLQGVRFA+R HQFAGG D ET+ A E+++
Subjt:  SLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein3.5e-12037.82Show/hide
Query:  ELDQLRSLLHESKQREFELQNELAEL----KRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL-----------VALPSISEKQEEPQ----
        EL++L+ L+ E ++RE +L+ EL E     ++ +   EL+R+L+ K  EID L   ++ L+ +R+ L E+L           VA   I E Q + Q    
Subjt:  ELDQLRSLLHESKQREFELQNELAEL----KRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL-----------VALPSISEKQEEPQ----

Query:  ----------------------------------TAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNED
                                           A Q++EV+V+EL+R N+ELQ +KR L+ +L S E+ +A+L+  +ES+ +AK++ E + L+H NED
Subjt:  ----------------------------------TAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNED

Query:  LCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER--------SSESVGSLSSQKEYM--------DYNSA---
        L KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN    A        S NLS   QA  +        S    G    +  Y         D+++A   
Subjt:  LCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER--------SSESVGSLSSQKEYM--------DYNSA---

Query:  ----------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISGA----------------KGWPE-----ELEPNKRR
                  K+  LI+KLKKW  + +D    S+   S YG    +  +   + R P     I  A                 G PE      +   ++ 
Subjt:  ----------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISGA----------------KGWPE-----ELEPNKRR

Query:  QSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI------------------------------------------------------
         S G           + +K ++   D   P    ++ L V +  HI                                                      
Subjt:  QSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI------------------------------------------------------

Query:  LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP------------PPPPPPLPKFAVRSAMG--MVQRAPQVVEFY
         EN   +++ +  +D+EKR  R+P PPPR +         S+ P        PPP P            PPPPPP P    R A G   V RAP++VEFY
Subjt:  LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP------------PPPPPPLPKFAVRSAMG--MVQRAPQVVEFY

Query:  HSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDW
         SLMKR+S+K+ +   I +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV SL  EV  + + DIED++AFV WLD+EL FLVDERAVLKHFDW
Subjt:  HSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDW

Query:  PERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMK
        PE KADALREAAF Y+DL KLE +++++ DDP L C+ ALKKM  L EK+E+S+Y LLR R+  +   KEF IP DW+ D G++ KIKL SV+LAK YMK
Subjt:  PERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMK

Query:  RVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA
        RVA EL S + S+KDP  +++LLQGVRFAFR+HQFAGGFDAE+M AFE+LR+ A
Subjt:  RVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein3.5e-12037.82Show/hide
Query:  ELDQLRSLLHESKQREFELQNELAEL----KRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL-----------VALPSISEKQEEPQ----
        EL++L+ L+ E ++RE +L+ EL E     ++ +   EL+R+L+ K  EID L   ++ L+ +R+ L E+L           VA   I E Q + Q    
Subjt:  ELDQLRSLLHESKQREFELQNELAEL----KRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQL-----------VALPSISEKQEEPQ----

Query:  ----------------------------------TAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNED
                                           A Q++EV+V+EL+R N+ELQ +KR L+ +L S E+ +A+L+  +ES+ +AK++ E + L+H NED
Subjt:  ----------------------------------TAPQNVEVEVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNED

Query:  LCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER--------SSESVGSLSSQKEYM--------DYNSA---
        L KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN    A        S NLS   QA  +        S    G    +  Y         D+++A   
Subjt:  LCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER--------SSESVGSLSSQKEYM--------DYNSA---

Query:  ----------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISGA----------------KGWPE-----ELEPNKRR
                  K+  LI+KLKKW  + +D    S+   S YG    +  +   + R P     I  A                 G PE      +   ++ 
Subjt:  ----------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISGA----------------KGWPE-----ELEPNKRR

Query:  QSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI------------------------------------------------------
         S G           + +K ++   D   P    ++ L V +  HI                                                      
Subjt:  QSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI------------------------------------------------------

Query:  LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP------------PPPPPPLPKFAVRSAMG--MVQRAPQVVEFY
         EN   +++ +  +D+EKR  R+P PPPR +         S+ P        PPP P            PPPPPP P    R A G   V RAP++VEFY
Subjt:  LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP------------PPPPPPLPKFAVRSAMG--MVQRAPQVVEFY

Query:  HSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDW
         SLMKR+S+K+ +   I +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV SL  EV  + + DIED++AFV WLD+EL FLVDERAVLKHFDW
Subjt:  HSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDW

Query:  PERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMK
        PE KADALREAAF Y+DL KLE +++++ DDP L C+ ALKKM  L EK+E+S+Y LLR R+  +   KEF IP DW+ D G++ KIKL SV+LAK YMK
Subjt:  PERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMK

Query:  RVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA
        RVA EL S + S+KDP  +++LLQGVRFAFR+HQFAGGFDAE+M AFE+LR+ A
Subjt:  RVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein4.0e-11638.66Show/hide
Query:  QREFEL-QNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRL
        ++E E+ +N++ EL+R     +++ +  + K ++  L Q VS L+        +         + E    A Q++EV+V+EL+R N+ELQ +KR L+ +L
Subjt:  QREFEL-QNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVALPSISEKQEEPQTAPQNVEVEVVELRRLNKELQLQKRNLACRL

Query:  SSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER----
         S E+ +A+L+  +ES+ +AK++ E + L+H NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN    A        S NLS   QA  +    
Subjt:  SSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSAS-------SDNLS-SPQATER----

Query:  ----SSESVGSLSSQKEYM--------DYNSA-------------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISG
            S    G    +  Y         D+++A             K+  LI+KLKKW  + +D    S+   S YG    +  +   + R P     I  
Subjt:  ----SSESVGSLSSQKEYM--------DYNSA-------------KRINLIKKLKKWPITDED---LSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISG

Query:  A----------------KGWPE-----ELEPNKRRQSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI-------------------
        A                 G PE      +   ++  S G           + +K ++   D   P    ++ L V +  HI                   
Subjt:  A----------------KGWPE-----ELEPNKRRQSDG----------FICAKEMEKEAD---PLSTQKYDLGVSQRPHI-------------------

Query:  -----------------------------------LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP--------
                                            EN   +++ +  +D+EKR  R+P PPPR +         S+ P        PPP P        
Subjt:  -----------------------------------LENCHEISRSLASLDVEKRALRIPNPPPRPSCS------ISSEPKEENRAQVPPPLP--------

Query:  ----PPPPPPLPKFAVRSAMG--MVQRAPQVVEFYHSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNN
            PPPPPP P    R A G   V RAP++VEFY SLMKR+S+K+ +   I +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV SL  EV  
Subjt:  ----PPPPPPLPKFAVRSAMG--MVQRAPQVVEFYHSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNN

Query:  AVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLM
        + + DIED++AFV WLD+EL FLVDERAVLKHFDWPE KADALREAAF Y+DL KLE +++++ DDP L C+ ALKKM  L EK+E+S+Y LLR R+  +
Subjt:  AVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLM

Query:  RNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA
           KEF IP DW+ D G++ KIKL SV+LAK YMKRVA EL S + S+KDP  +++LLQGVRFAFR+HQFAGGFDAE+M AFE+LR+ A
Subjt:  RNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-9042.16Show/hide
Query:  ASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRN----ELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYM
        A+ A  S +  +++++ +   LR   E L K  E L++  L E   +  L       +N    ELR      + DN    +  +RS E      ++++ M
Subjt:  ASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRN----ELRNSCPSASSDNLSSPQATERSSESVGSLSSQKEYM

Query:  DYNSAKRINLIKKLKKWPITDEDLSNLDCSD-YGVLVDKNWADAEEGRSPRRRHSISGAKGWPEEL----EPNKRRQSDGFICAKEMEKEADPLSTQKYD
        +         I +L+K   ++ D   L  S  +  L+D + A +   RS +R   +   +  PE +      NK   S G         +AD    +K +
Subjt:  DYNSAKRINLIKKLKKWPITDEDLSNLDCSD-YGVLVDKNWADAEEGRSPRRRHSISGAKGWPEEL----EPNKRRQSDGFICAKEMEKEADPLSTQKYD

Query:  LGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRA---------------------QVPPP-----LPPPPPPPLPKFAVRS
        +    R     N  E++ S +   V  R  R+P PPP+ S S+      ENRA                     Q PPP      PPPPPPP P  ++  
Subjt:  LGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRA---------------------QVPPP-----LPPPPPPPLPKFAVRS

Query:  AMGMVQRAPQVVEFYHSLMKRD---SRKDSSNGAICNVPDV---SNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKW
        A   V+R P+VVEFYHSLM+RD   SR+DS+ G       +   SN R  MIGEIENRS +LLAIK D+ETQG+F+  LI+EV NA + DIED+V FVKW
Subjt:  AMGMVQRAPQVVEFYHSLMKRD---SRKDSSNGAICNVPDV---SNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKW

Query:  LDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLD
        LDDEL +LVDERAVLKHF+WPE+KADALREAAF Y DLKKL  E S +++DPR     ALKKM AL EK+E  +Y+L R+RES     K FQIP DWML+
Subjt:  LDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLD

Query:  NGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA
         GI S+IKL SVKLA  YMKRV+ EL+  A     P  + +++QGVRFAFR+HQFAGGFDAETM AFE+LR+ A
Subjt:  NGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAGGAGAACCCATCAGAAAGCAGAGGCAAATCGTCGAGGTTTGCCGATCAAAATCAGAATCCCAAGTGTCAAAATCAGAATGCGAAAGGAAATACTGGGAATGG
GGGCTCGAAATTGAGGGCGGCTTCTTCCTGGGGTTCTAATATTGTGAAAGGTTTCTCCACAGACAAGAAAACCAAAGCTCAGACTCAGAGCAATCTCCAGCCCAAGAAGG
CGCCACTTACGAGTTCGGACTTGGCAAATCAGAAGGACAAATTTGTTCCTTCACATTCTCGCATCAAGCGATCACTGATTGGGGATTTGTCATGCTCTGCAAATCCTGCT
CAAGTTCATCCACACTCGTATCAGACCCATCACCGCAGACAATCTTCTCGGGATTTGTTCGTAGAGCTCGATCAACTCAGAAGTCTGCTCCACGAATCCAAGCAGAGAGA
ATTCGAACTTCAGAACGAACTAGCAGAGCTTAAGCGAAATGCTAGAAATTATGAACTTGAAAGGGAACTTGAGGAGAAGAAAGCAGAAATAGACTGTCTTACTCAGAAAG
TTAGTCTATTGGAAGAAGATAGAAGAGTTCTGTCCGAACAATTGGTGGCTTTACCATCAATTTCTGAGAAGCAAGAGGAGCCGCAGACTGCACCTCAAAACGTAGAAGTG
GAAGTTGTTGAGTTGAGACGTTTGAATAAGGAACTCCAGCTTCAGAAGAGGAACCTCGCTTGCAGGCTTTCTTCGGTGGAATCTGAGCTAGCTTCTCTTGCAAAGAATTC
CGAGAGTGAAGCTATTGCTAAGATCAAAGCAGAAGCATCTTTGCTAAGACACACTAATGAAGATTTGTGCAAGCAAGTTGAAGGTCTACAGATGAGCAGATTGAATGAGG
TTGAAGAACTTGCATATCTTAGGTGGGTCAATTCATGTTTAAGGAATGAGCTCCGTAATTCGTGCCCCTCGGCAAGTTCTGATAACCTATCCAGCCCTCAGGCAACCGAG
AGGAGCAGTGAATCCGTTGGTTCATTATCTAGCCAGAAGGAGTACATGGATTATAACAGTGCTAAGAGGATAAATCTTATTAAAAAGTTGAAGAAATGGCCCATTACTGA
TGAGGATCTGTCTAATTTAGATTGCTCGGATTATGGTGTTTTAGTAGACAAAAATTGGGCTGATGCGGAGGAAGGAAGAAGCCCCAGAAGAAGACACTCCATTAGCGGAG
CAAAAGGCTGGCCCGAAGAATTGGAACCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAGGAAGCAGACCCTCTTTCCACTCAGAAATAT
GATTTGGGCGTGAGTCAAAGACCACATATTTTGGAAAATTGTCATGAAATTAGCAGGAGTCTTGCCTCTTTAGACGTGGAGAAACGAGCATTGCGCATACCAAATCCCCC
GCCGAGGCCTTCTTGCTCCATTTCAAGTGAACCTAAAGAGGAAAACAGAGCTCAAGTCCCACCACCTCTGCCCCCACCTCCGCCCCCTCCTCTTCCAAAGTTCGCTGTGA
GGAGCGCCATGGGAATGGTACAGAGAGCTCCACAAGTGGTTGAATTCTACCATTCACTGATGAAGAGAGATTCTAGGAAGGATTCCTCCAATGGTGCCATATGCAATGTT
CCAGATGTTTCAAATGTCCGGAGTAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTATAAAGGCAGATATTGAAACCCAGGGAGAGTTTGTAAATTC
ACTGATAAGAGAGGTCAACAATGCAGTTTATCTGGATATTGAAGATATAGTGGCTTTTGTCAAGTGGCTTGACGATGAACTTTGCTTTCTGGTGGACGAAAGGGCAGTTC
TAAAGCACTTTGATTGGCCAGAGAGAAAGGCAGACGCATTGCGAGAAGCAGCCTTTGGGTACAGGGATTTAAAAAAATTGGAGTGTGAAATCTCAGCCTACAAAGATGAT
CCCAGATTGCCTTGTGACATAGCTCTGAAAAAAATGGTTGCTTTATCAGAGAAGATGGAACGTAGCATTTATAACCTTCTCCGGATAAGAGAATCATTGATGCGGAACTG
CAAAGAATTTCAAATTCCCACTGATTGGATGCTTGACAATGGGATCATAAGCAAGATAAAGTTGGGTTCAGTGAAGTTGGCAAAGATGTACATGAAGAGAGTAGCAATGG
AACTTCAGTCAAAGGCTGCATCAGAGAAAGATCCTGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGGATTCATCAGTTTGCAGGGGGTTTCGATGCA
GAAACAATGCATGCCTTTGAGGATCTGCGTAACTTGGCCAACCTTCTGAACAAAAAGTGA
mRNA sequenceShow/hide mRNA sequence
GAAAAAAGAACAAAGTAGAGTAGTTGCTAAGTTATACAAAATTTCATTAAATACGTCCAAAACAAGGCGTAAGAACTAACTAACTGTTGATGGTCAACTATAAAATTGGA
AATTAATGTGGAGTTTAAATAAAGAAAGTCCAAAACCAGAGAGAGAGAGAGAAATAATATGACCGTTGCCCTCCCTCCCTCGTACTACCCAACGGACGCACAAAATTCAA
ATTTCAAAAATTCAAAACTAAAGCCATAAAGAGAAAGCTGCCACCTTTTCTCCCAATCCCAGTCCCCTATGTTTCTTGGATTTGGAGCTCTCAATTAGTTGCTCCCACAT
ACAACACCTTCTCCGGCCGGCATAATCTGCGCTCATTGGCTCCACCTCACTGTTCGAAGCTTCTTTTTTTCAAAAAACAAACCCCGCTCACCCTCCCTTCATTTGAGCTG
AGCTTTGTTTCGTACCCATGTTTCATGTTTGTATCTTGGACATGGAACTGGAGAGGTTGTCGTGAAATTCTACTTGCATTGGCAGTTTCTCCAACGGGACGATTTTGGTT
TCCTTTTAAGGTGGTTCATACAGAATGAAGGAGGAGAACCCATCAGAAAGCAGAGGCAAATCGTCGAGGTTTGCCGATCAAAATCAGAATCCCAAGTGTCAAAATCAGAA
TGCGAAAGGAAATACTGGGAATGGGGGCTCGAAATTGAGGGCGGCTTCTTCCTGGGGTTCTAATATTGTGAAAGGTTTCTCCACAGACAAGAAAACCAAAGCTCAGACTC
AGAGCAATCTCCAGCCCAAGAAGGCGCCACTTACGAGTTCGGACTTGGCAAATCAGAAGGACAAATTTGTTCCTTCACATTCTCGCATCAAGCGATCACTGATTGGGGAT
TTGTCATGCTCTGCAAATCCTGCTCAAGTTCATCCACACTCGTATCAGACCCATCACCGCAGACAATCTTCTCGGGATTTGTTCGTAGAGCTCGATCAACTCAGAAGTCT
GCTCCACGAATCCAAGCAGAGAGAATTCGAACTTCAGAACGAACTAGCAGAGCTTAAGCGAAATGCTAGAAATTATGAACTTGAAAGGGAACTTGAGGAGAAGAAAGCAG
AAATAGACTGTCTTACTCAGAAAGTTAGTCTATTGGAAGAAGATAGAAGAGTTCTGTCCGAACAATTGGTGGCTTTACCATCAATTTCTGAGAAGCAAGAGGAGCCGCAG
ACTGCACCTCAAAACGTAGAAGTGGAAGTTGTTGAGTTGAGACGTTTGAATAAGGAACTCCAGCTTCAGAAGAGGAACCTCGCTTGCAGGCTTTCTTCGGTGGAATCTGA
GCTAGCTTCTCTTGCAAAGAATTCCGAGAGTGAAGCTATTGCTAAGATCAAAGCAGAAGCATCTTTGCTAAGACACACTAATGAAGATTTGTGCAAGCAAGTTGAAGGTC
TACAGATGAGCAGATTGAATGAGGTTGAAGAACTTGCATATCTTAGGTGGGTCAATTCATGTTTAAGGAATGAGCTCCGTAATTCGTGCCCCTCGGCAAGTTCTGATAAC
CTATCCAGCCCTCAGGCAACCGAGAGGAGCAGTGAATCCGTTGGTTCATTATCTAGCCAGAAGGAGTACATGGATTATAACAGTGCTAAGAGGATAAATCTTATTAAAAA
GTTGAAGAAATGGCCCATTACTGATGAGGATCTGTCTAATTTAGATTGCTCGGATTATGGTGTTTTAGTAGACAAAAATTGGGCTGATGCGGAGGAAGGAAGAAGCCCCA
GAAGAAGACACTCCATTAGCGGAGCAAAAGGCTGGCCCGAAGAATTGGAACCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAGGAAGCA
GACCCTCTTTCCACTCAGAAATATGATTTGGGCGTGAGTCAAAGACCACATATTTTGGAAAATTGTCATGAAATTAGCAGGAGTCTTGCCTCTTTAGACGTGGAGAAACG
AGCATTGCGCATACCAAATCCCCCGCCGAGGCCTTCTTGCTCCATTTCAAGTGAACCTAAAGAGGAAAACAGAGCTCAAGTCCCACCACCTCTGCCCCCACCTCCGCCCC
CTCCTCTTCCAAAGTTCGCTGTGAGGAGCGCCATGGGAATGGTACAGAGAGCTCCACAAGTGGTTGAATTCTACCATTCACTGATGAAGAGAGATTCTAGGAAGGATTCC
TCCAATGGTGCCATATGCAATGTTCCAGATGTTTCAAATGTCCGGAGTAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTATAAAGGCAGATATTGA
AACCCAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCAGTTTATCTGGATATTGAAGATATAGTGGCTTTTGTCAAGTGGCTTGACGATGAACTTTGCT
TTCTGGTGGACGAAAGGGCAGTTCTAAAGCACTTTGATTGGCCAGAGAGAAAGGCAGACGCATTGCGAGAAGCAGCCTTTGGGTACAGGGATTTAAAAAAATTGGAGTGT
GAAATCTCAGCCTACAAAGATGATCCCAGATTGCCTTGTGACATAGCTCTGAAAAAAATGGTTGCTTTATCAGAGAAGATGGAACGTAGCATTTATAACCTTCTCCGGAT
AAGAGAATCATTGATGCGGAACTGCAAAGAATTTCAAATTCCCACTGATTGGATGCTTGACAATGGGATCATAAGCAAGATAAAGTTGGGTTCAGTGAAGTTGGCAAAGA
TGTACATGAAGAGAGTAGCAATGGAACTTCAGTCAAAGGCTGCATCAGAGAAAGATCCTGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGGATTCAT
CAGTTTGCAGGGGGTTTCGATGCAGAAACAATGCATGCCTTTGAGGATCTGCGTAACTTGGCCAACCTTCTGAACAAAAAGTGAAAGTCATAGGACAGGGCAAATGGAAA
TTCAGTTGGTTTTGGTTTACTATCTCGGTAAGCAGCAAGCAAGCATTCTAGCAGTACATTCATCACACCAAGTGTGAGGGTGGTGTGCCTTCTTTCACATGTGAGGGAGG
TTTTCGATTTTGAAACTTAGAGCTGCTGGGGAAGCCAGGGAAGATTTGGTATGACCCATTAGACAGGCCTTAAGTCAACTATATTTTTTGTCCTCTGCCATCCTTCCTCC
TCCCTTCATGAAGCAAGCTCCTCCAAAACAGGCAGGGTCCTACAGAAAGAGAACGTGGCAGACTGCCAGAGGGAGCAAGATAATTTTATTATGTTTATGTAAATTGTGTG
TTTGACTAAGATCTCAAACGTGGTCTTTGTTGGAAACAGAGTTTCTTAGTTATGAGATTTTTTCTCAGGTTAACCTTGATTTCTAAGTTTTAGAAATGGTTCAATAAGAA
GAAAACAGGTTTTTATTTTCTGTTTAGGAAACGATTTTGAGGGCATGTTCAAGATATTACAGTAAGATTAAATTGCTGGTGGCATGTTTGGGAAGAGCGTTTGTTTCAAA
ACTGGTAGCTATGAAAACTGGTAGCTATGATTGTGATAATTTGAAATCACAATCCACAAACAAATGGTTCAAATTACATCACCTTAAATTATTCTAAAAATAGTCACTCG
TTAAGTTATCTGTCTACAGTCTACACATACTTTTCCTAATTTATGTTAATCTATAAAGCAAGAAAGAAGTTCAAAAAACATCTCTCAAACAACCTTTATGTATTTTTCTT
GAAAAGATCAACACTCTCTTACAAAAAAGATCATAAAACAAAGCAAAGTAAAAAGAAAAAAGAAAAGAAGCGAATTCAGCGATCCAGAAAATGATGAGAGGGGGGTGGAT
TTGGATGATTGATAAAATCTAAAGCAGTGGAGAAAAGGGAAATAAGAATGGACTGGAAGATTTGACATACAGAAAAGACCTAACATTGCCTGTATTTACCCCATCTTGGG
CGCCAAACTACAAAAGAGGCAGACAAACCAGCAACTCTGCCAACCCCATAAACAATTCAAAAAAAGTCTCCACCGCACACCACCCTATTCTATTTCTATCTTACCTTACT
CCCTCATTTTCTTTTAACCTCTTAGCCCTAGCCCACCATCCCCCACCTCTTTACATCCAACGTTTTCTAAAAATTTCATCTTGGGCGCACATGATTTGGTAATTTCTGCT
GCTGATTCAGACGAGAAAAGACCAATGTCTGGTACCATATAGTACCACTGTATGTCAAGATATTAAATGTCTGGTGTACTACATTGATAGTATGAATTCTAAGCCTAACT
ACTCTCTTGAAAATTCAATTCACCTTCTTTCTTTGCTCTTACAAACACCATCTTTGATCCTCAAGGATTATGAGAGATTACTTCTTCCTCACATCACAGAAATACGGCCA
CATGTATTGTAAAAACAGTAATGTAAACATTCATGATTCATGTTTTGTTCATGGCCTATAACTAGAAATCACATGATTTGAGAAACAGGGTTGGAAAAGAAAGAAAAATT
GGGGGACAAGAGAGGGAAAATTGTTAGGGTTGGTAATTAATTGTGTGGAGAAGGAGATGATACGAAATGAACAGTAAAGCGAGTTAGAAATTCATGTGATGTGACCTGTA
ATTGTGGAAATTTTGGAGCAATTTCCATCCCATCGATCTGCCACGGAAAGCTTAGTAGTCACATCCACCACGATTCCACGGCAGAGGAAAAATTGCTTTGTTTCTTCTTT
CTCAGCTCTCACCGGCGAAAAATGGTGCTTTTTATGCCTTGTTATGTCCCAATTTAAGAACCCCCCTTCCATTTTTCTTCTCCAACGGGTCTTTTCCGTCTGCTTCTGTT
TATCTTTTTCGGGAATTCCTCTTAGAATATCTCTCTCGAGTCTCAGCGACCATGATACGACTTC
Protein sequenceShow/hide protein sequence
MKEENPSESRGKSSRFADQNQNPKCQNQNAKGNTGNGGSKLRAASSWGSNIVKGFSTDKKTKAQTQSNLQPKKAPLTSSDLANQKDKFVPSHSRIKRSLIGDLSCSANPA
QVHPHSYQTHHRRQSSRDLFVELDQLRSLLHESKQREFELQNELAELKRNARNYELERELEEKKAEIDCLTQKVSLLEEDRRVLSEQLVALPSISEKQEEPQTAPQNVEV
EVVELRRLNKELQLQKRNLACRLSSVESELASLAKNSESEAIAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSASSDNLSSPQATE
RSSESVGSLSSQKEYMDYNSAKRINLIKKLKKWPITDEDLSNLDCSDYGVLVDKNWADAEEGRSPRRRHSISGAKGWPEELEPNKRRQSDGFICAKEMEKEADPLSTQKY
DLGVSQRPHILENCHEISRSLASLDVEKRALRIPNPPPRPSCSISSEPKEENRAQVPPPLPPPPPPPLPKFAVRSAMGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNV
PDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLDIEDIVAFVKWLDDELCFLVDERAVLKHFDWPERKADALREAAFGYRDLKKLECEISAYKDD
PRLPCDIALKKMVALSEKMERSIYNLLRIRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKAASEKDPAMDYMLLQGVRFAFRIHQFAGGFDA
ETMHAFEDLRNLANLLNKK