; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028888 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028888
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionpentatricopeptide repeat-containing protein At5g04810, chloroplastic
Genome locationtig00153210:1226993..1244862
RNA-Seq ExpressionSgr028888
SyntenySgr028888
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000209 - Peptidase S8/S53 domain
IPR000504 - RNA recognition motif domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR015500 - Peptidase S8, subtilisin-related
IPR033443 - Pentacotripeptide-repeat region of PRORP
IPR034197 - Cucumisin-like catalytic domain
IPR035979 - RNA-binding domain superfamily
IPR036852 - Peptidase S8/S53 domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034647.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0089.2Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLK
        MD RSLSN TTTTSS  FS+    HRRR H    HP  A+I+FSLKPP P   P+P  SDSDDSS+S+P LSGRIRRPQTLKTTSSPKRT+S+ P NPL+
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLK

Query:  NLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQF
        NLVGSAYVP+LP  PPPPPPPPVSHSLS KLWLSSKLSPPPPPISE  +ED+NE+EEIETEN+ SKGR EVQFRQEGK+FVGNLPNWIKKHEVQEFFRQF
Subjt:  NLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQF

Query:  GPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIE
        GPVKNVILIKGH+ATERNAGYGF+IYDG TAAKSA+KAVEFDGVEFHGRVLTVKLDDGRRLKEKT  RARWMEGDDSVEYRS WHEERDKAR GFRKVIE
Subjt:  GPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIE

Query:  TEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGG
        TEPENWQAVV AF+ IKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEP+SHVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGG
Subjt:  TEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGG

Query:  FAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYG
        FAKMGNAESADHWFQEAKEKHSS+NAIIYGNIIYAYCQ CNMDRAEALVR+MEE+GIDAPIDIYHTMMDGYTM+GD +KCLLVFERFKECGLNPSVITYG
Subjt:  FAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYG

Query:  CLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPT
        CLINLY KLGKVSKALEVSKEMEHAGIKHNMKT+SMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA+CTVKEMQKQRH+PT
Subjt:  CLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPT

Query:  TRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNE
        TRTF+PIIHGFARKGEM+KALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKA +ILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLR+E
Subjt:  TRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNE

Query:  GMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIE
        G+ LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNIL+DGWARRGDVWEAADLMQQMKREGVQ DIHTYTSFINACSKAGDMQRATKTIE
Subjt:  GMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIE

Query:  EMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKI
        EMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVA+ CIYPG+LSVCREM+D  LTVDMGTAVHWSKCL KI
Subjt:  EMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKI

Query:  ERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF
        ERTGGEITEALQKTFPPNWN YN  L SS++DSDDESDIS D DDDIC+GG  SN   D ESD D +GRSWF
Subjt:  ERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF

XP_004135367.2 pentatricopeptide repeat-containing protein At5g04810, chloroplastic [Cucumis sativus]0.0e+0088.67Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHF--HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKN
        MD RSLSN TTTTSS  FS+    HRRR+H   HPS AII+FSLKPP P   P+P+ SDSDDSS+S P LSGRIRRPQTLKTTSSPKRTTS+ P NPL+N
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHF--HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKN

Query:  LVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFG
        LVGSAYVP+LP  PPPPPPPPVSHSLS KLWLSSKLSPPPPPISE  +ED NE+EEIE++N+ SKGR EVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFG
Subjt:  LVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFG

Query:  PVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIET
        PVKNVILIKGH+A ERNAGYGF+IYDG TA KSA+KAVEFDGVEFHGRVLTVKLDDGRRLKEKT+ RARWMEGDDSVEYRS WHEERDKAR G RKVIET
Subjt:  PVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIET

Query:  EPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGF
        EPENWQAVV AFE IKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEP+SHVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILV GF
Subjt:  EPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGF

Query:  AKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGC
        AK GNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQ CNMD+AEALVR+MEE+GIDAPIDIYHTMMDGYTM+GD DKCLLVFERFKECGLNPSVITYGC
Subjt:  AKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGC

Query:  LINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTT
        LINLY KLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA+CTVKEMQKQRH+PTT
Subjt:  LINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTT

Query:  RTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEG
        RTF+PIIHGFARKGEM+KALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKA +ILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAF YFTKLR+EG
Subjt:  RTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEG

Query:  MELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEE
        ++LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNIL+DGWARRGD+WEAADLMQQMKREGVQ DIHTYTSFINACSKAGDMQRATKTIEE
Subjt:  MELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEE

Query:  MKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIE
        MKSVGVKPNVKTYTTLI+GWARASLPEKALSCFEEMKLSGLKPD+AVYHCLMTSLLSRATVA  CIYPG+LSVCREM+D  LTVDMGTAVHWSKCL KIE
Subjt:  MKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIE

Query:  RTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICR-GGVSNTDKDDESDDDAIGRSWF
        RTGGEITEALQKTFPPNWN YN  L +S++DSDDESDIS D DDDIC+ G  SN   D ESD D +GRSWF
Subjt:  RTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICR-GGVSNTDKDDESDDDAIGRSWF

XP_008446749.1 PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g04810, chloroplastic [Cucumis melo]0.0e+0088.35Show/hide
Query:  RQRQRLKKIMDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTT
        RQR+  KKIMD RSLSN TTTTSS  FS+    HRRR H    HP  A+I+FSLKPP P   P+P  SDSDDSS+S+P LSGRIRRPQTLKTTSSPKRT+
Subjt:  RQRQRLKKIMDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTT

Query:  SKAPFNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKH
        S+ P NPL+NLVGSAYVP+LP  PPPPPPPPVSHSLS KLWLSSKLSPPPPPISE  +ED+NE+EEIETEN+ SKGR EVQFRQEGK+FVGNLPNWIKKH
Subjt:  SKAPFNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKH

Query:  EVQEFFRQFGPVKNVILIKGHDATERNAG------YGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWH
        EVQEFFRQFGPVKNVILIKGH+ATERNAG       GF+IYDG TAAKSA+KAVEFDGVEFHGRVLTVKLDDGRRLKEKT  RARWMEGDDSVEYRS WH
Subjt:  EVQEFFRQFGPVKNVILIKGHDATERNAG------YGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWH

Query:  EERDKARKGFRKVIETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEE
        EERDKAR GFRKVIETEPENWQAVV AF+ IKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEP+SHVYT+LIHAYAVGRDMEEALSCVRKMKEE
Subjt:  EERDKARKGFRKVIETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEE

Query:  GIEMSLVTYSILVGGFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFE
        GIEMSLVTYSILVGGFAKMGNAESADHWFQEAKEKHSS+NAIIYGNIIYAYCQ CNMDRAEALVR+MEE+GIDAPIDIYHTMMDGYTM+GD +KCLLVFE
Subjt:  GIEMSLVTYSILVGGFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFE

Query:  RFKECGLNPSVITYGCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRA
        RFKECGLNPSVITYGCLINLY KLGKVSKALEVSKEMEHAGIKHNMKT+SMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA
Subjt:  RFKECGLNPSVITYGCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRA

Query:  ICTVKEMQKQRHRPTTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLG
        +CTVKEMQKQRH+PTTRTF+PIIHGFARKGEM+KALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKA +ILDEMTLAGVSPNEHTYTTIMHGYASLG
Subjt:  ICTVKEMQKQRHRPTTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLG

Query:  DTGKAFAYFTKLRNEGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINA
        DTGKAFAYFTKLR+EG+ LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNIL+DGWARRGDVWEAADLMQQMKREGVQ DIHTYTSFINA
Subjt:  DTGKAFAYFTKLRNEGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINA

Query:  CSKAGDMQRATKTIEEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTV
        CSKAGDMQRATKTIEEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVA+ CIYPG+LSVCREM+D  LTV
Subjt:  CSKAGDMQRATKTIEEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTV

Query:  DMGTAVHWSKCLRKIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF
        DMGTAVHWSKCL KIERTGGEITEALQKTFPPNWN YN  L SS++DSDDESDIS D DDDIC+GG  SN   D ESD D +GRSWF
Subjt:  DMGTAVHWSKCLRKIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF

XP_022150548.1 pentatricopeptide repeat-containing protein At5g04810, chloroplastic isoform X1 [Momordica charantia]0.0e+0092.49Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKP--PPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKN
        MDARSLSNTTTTTSSACFSAVL PHRRR H HPSSA+I+FSLKP  PPPHHSPSP  SDSDDSS+STP LSGRIRRPQTLKTTSSPKRTTSK P NPLKN
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKP--PPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKN

Query:  LVGSAYVPLLPPLPPPPPPPP--VSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQ
        LVGSAYVP+LPP PPPPPPPP  VS+SLSNKLWLSSKLSPPPPP SE   EDENEVEEI TEN+ SKGR E++ RQEGKIFVGNLP+WIKKHE+QEFFRQ
Subjt:  LVGSAYVPLLPPLPPPPPPPP--VSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQ

Query:  FGPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVI
        FGPVKNVILIKGHDATERNAGYGFVIYDG TAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTE RARWMEGDDSVEYRS+WHEERDKAR GFRKVI
Subjt:  FGPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVI

Query:  ETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVG
        ETEPENWQAVVWAFE IKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVG
Subjt:  ETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVG

Query:  GFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITY
        GFAKM NAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNM+RAEALVRQMEE+GIDAPIDIYHTMMDGYTMIGD DKCLLVFERFKECGLNPSVITY
Subjt:  GFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITY

Query:  GCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRP
        GCLINLYTKLGKV+KALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLI+DGIKPD+VLYNNIITAFCGMGKMDRAICTVKEMQKQRHRP
Subjt:  GCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRP

Query:  TTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRN
        TTRTF+PIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTL+GVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLR+
Subjt:  TTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRN

Query:  EGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTI
        EG+ELDVYTYEALLKACCKSGRMQSALAVTKEMSAQ IPRNTFIYNIL+DGWARRGDVWEAADLMQQMKREGVQ DIHTYTSFINACSKAGDMQRATKTI
Subjt:  EGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTI

Query:  EEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATV-AEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLR
        EEM+SVGVKPNVKTYTTLIHGWARASLPE ALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATV AE  IYPG+LSVCREM+DSGLTVDMGTAVHWSKCLR
Subjt:  EEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATV-AEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLR

Query:  KIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAIGRSWF
        KIERTGGEITEALQKTFPPNWNSY+  L SSS+D++DESD+ SDDDDIC GGVSN D+DDE+DDD +GRSWF
Subjt:  KIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAIGRSWF

XP_022945086.1 pentatricopeptide repeat-containing protein At5g04810, chloroplastic [Cucurbita moschata]0.0e+0087.84Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKNLV
        MD RSLSN T+TTSSA F     P RR  H HPSSA+IV SLK PPP   P P+ SDSDDSS ST  +SGRIRRPQ LKT+SSPKRTTSK P NPLKNLV
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKNLV

Query:  GSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPV
        GSA VP+LP  PPPPPP PVSHSL +KLWLSSKLSPPPPPI+E  +EDE+E EEIETE++ S+GR EVQFRQEGKIFVGNLPNWIKKHEVQ+FFRQFGPV
Subjt:  GSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPV

Query:  KNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIETEP
         NVILIKGHD T+RNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVL+VKLDDGRRLKEK   RA+WMEGDDSVE+RS+WHEERDKARKGFR VIETEP
Subjt:  KNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIETEP

Query:  ENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK
        E+WQAVV AFE IKKPSRKEY LMVNYYARRGDMHRARETFEKMRARGIEPT+HVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK
Subjt:  ENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK

Query:  MGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLI
        MGNAE+ADHWFQEAKEKH SLNAIIYGNIIYAYCQ CNMDRAEALVRQMEE+GIDAPIDIYHTMMDGYTM+GD  KCLLVFERFKECGLNPSVITYGCLI
Subjt:  MGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLI

Query:  NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRT
        NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA+CTVKEMQKQRHRPTTRT
Subjt:  NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRT

Query:  FLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGME
        F+PIIHGFAR+G+MRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGV+PNEHTYTTIMHGYASLGDTGKAF YFTKLR+EG++
Subjt:  FLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGME

Query:  LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMK
        LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNIL+DGWARRGDVWEAADL+QQMK+EGVQ DIHTYTSFINACSKAGDMQRATKTI EMK
Subjt:  LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMK

Query:  SVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERT
        S GVKPN+KTYTTLIHGWARASLPEKALSCF EMKLSGLKPDKAVYHCLMTSLLSRATVAE  IYPG+LSVC+EM+DSGLTVDMGTAVHWSKCLRKIERT
Subjt:  SVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERT

Query:  GGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD---DDDICRGGVSNTDKDDESDDDAIGRSWF
        GGEITEALQKTFPPNWNSYN V  SSS+DSDDE  IS D   DDDIC+  VS     D  DDD +GRSWF
Subjt:  GGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD---DDDICRGGVSNTDKDDESDDDAIGRSWF

TrEMBL top hitse value%identityAlignment
A0A1S3BFB6 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g04810, chloroplastic0.0e+0088.35Show/hide
Query:  RQRQRLKKIMDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTT
        RQR+  KKIMD RSLSN TTTTSS  FS+    HRRR H    HP  A+I+FSLKPP P   P+P  SDSDDSS+S+P LSGRIRRPQTLKTTSSPKRT+
Subjt:  RQRQRLKKIMDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTT

Query:  SKAPFNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKH
        S+ P NPL+NLVGSAYVP+LP  PPPPPPPPVSHSLS KLWLSSKLSPPPPPISE  +ED+NE+EEIETEN+ SKGR EVQFRQEGK+FVGNLPNWIKKH
Subjt:  SKAPFNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKH

Query:  EVQEFFRQFGPVKNVILIKGHDATERNAG------YGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWH
        EVQEFFRQFGPVKNVILIKGH+ATERNAG       GF+IYDG TAAKSA+KAVEFDGVEFHGRVLTVKLDDGRRLKEKT  RARWMEGDDSVEYRS WH
Subjt:  EVQEFFRQFGPVKNVILIKGHDATERNAG------YGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWH

Query:  EERDKARKGFRKVIETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEE
        EERDKAR GFRKVIETEPENWQAVV AF+ IKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEP+SHVYT+LIHAYAVGRDMEEALSCVRKMKEE
Subjt:  EERDKARKGFRKVIETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEE

Query:  GIEMSLVTYSILVGGFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFE
        GIEMSLVTYSILVGGFAKMGNAESADHWFQEAKEKHSS+NAIIYGNIIYAYCQ CNMDRAEALVR+MEE+GIDAPIDIYHTMMDGYTM+GD +KCLLVFE
Subjt:  GIEMSLVTYSILVGGFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFE

Query:  RFKECGLNPSVITYGCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRA
        RFKECGLNPSVITYGCLINLY KLGKVSKALEVSKEMEHAGIKHNMKT+SMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA
Subjt:  RFKECGLNPSVITYGCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRA

Query:  ICTVKEMQKQRHRPTTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLG
        +CTVKEMQKQRH+PTTRTF+PIIHGFARKGEM+KALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKA +ILDEMTLAGVSPNEHTYTTIMHGYASLG
Subjt:  ICTVKEMQKQRHRPTTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLG

Query:  DTGKAFAYFTKLRNEGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINA
        DTGKAFAYFTKLR+EG+ LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNIL+DGWARRGDVWEAADLMQQMKREGVQ DIHTYTSFINA
Subjt:  DTGKAFAYFTKLRNEGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINA

Query:  CSKAGDMQRATKTIEEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTV
        CSKAGDMQRATKTIEEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVA+ CIYPG+LSVCREM+D  LTV
Subjt:  CSKAGDMQRATKTIEEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTV

Query:  DMGTAVHWSKCLRKIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF
        DMGTAVHWSKCL KIERTGGEITEALQKTFPPNWN YN  L SS++DSDDESDIS D DDDIC+GG  SN   D ESD D +GRSWF
Subjt:  DMGTAVHWSKCLRKIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF

A0A5D3CFW5 Pentatricopeptide repeat-containing protein0.0e+0089.2Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLK
        MD RSLSN TTTTSS  FS+    HRRR H    HP  A+I+FSLKPP P   P+P  SDSDDSS+S+P LSGRIRRPQTLKTTSSPKRT+S+ P NPL+
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHF---HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLK

Query:  NLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQF
        NLVGSAYVP+LP  PPPPPPPPVSHSLS KLWLSSKLSPPPPPISE  +ED+NE+EEIETEN+ SKGR EVQFRQEGK+FVGNLPNWIKKHEVQEFFRQF
Subjt:  NLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQF

Query:  GPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIE
        GPVKNVILIKGH+ATERNAGYGF+IYDG TAAKSA+KAVEFDGVEFHGRVLTVKLDDGRRLKEKT  RARWMEGDDSVEYRS WHEERDKAR GFRKVIE
Subjt:  GPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIE

Query:  TEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGG
        TEPENWQAVV AF+ IKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEP+SHVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGG
Subjt:  TEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGG

Query:  FAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYG
        FAKMGNAESADHWFQEAKEKHSS+NAIIYGNIIYAYCQ CNMDRAEALVR+MEE+GIDAPIDIYHTMMDGYTM+GD +KCLLVFERFKECGLNPSVITYG
Subjt:  FAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYG

Query:  CLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPT
        CLINLY KLGKVSKALEVSKEMEHAGIKHNMKT+SMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA+CTVKEMQKQRH+PT
Subjt:  CLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPT

Query:  TRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNE
        TRTF+PIIHGFARKGEM+KALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKA +ILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLR+E
Subjt:  TRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNE

Query:  GMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIE
        G+ LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNIL+DGWARRGDVWEAADLMQQMKREGVQ DIHTYTSFINACSKAGDMQRATKTIE
Subjt:  GMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIE

Query:  EMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKI
        EMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVA+ CIYPG+LSVCREM+D  LTVDMGTAVHWSKCL KI
Subjt:  EMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKI

Query:  ERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF
        ERTGGEITEALQKTFPPNWN YN  L SS++DSDDESDIS D DDDIC+GG  SN   D ESD D +GRSWF
Subjt:  ERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD-DDDICRGGV-SNTDKDDESDDDAIGRSWF

A0A6J1D9Q6 pentatricopeptide repeat-containing protein At5g04810, chloroplastic isoform X10.0e+0092.49Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKP--PPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKN
        MDARSLSNTTTTTSSACFSAVL PHRRR H HPSSA+I+FSLKP  PPPHHSPSP  SDSDDSS+STP LSGRIRRPQTLKTTSSPKRTTSK P NPLKN
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKP--PPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKN

Query:  LVGSAYVPLLPPLPPPPPPPP--VSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQ
        LVGSAYVP+LPP PPPPPPPP  VS+SLSNKLWLSSKLSPPPPP SE   EDENEVEEI TEN+ SKGR E++ RQEGKIFVGNLP+WIKKHE+QEFFRQ
Subjt:  LVGSAYVPLLPPLPPPPPPPP--VSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQ

Query:  FGPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVI
        FGPVKNVILIKGHDATERNAGYGFVIYDG TAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTE RARWMEGDDSVEYRS+WHEERDKAR GFRKVI
Subjt:  FGPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVI

Query:  ETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVG
        ETEPENWQAVVWAFE IKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVG
Subjt:  ETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVG

Query:  GFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITY
        GFAKM NAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNM+RAEALVRQMEE+GIDAPIDIYHTMMDGYTMIGD DKCLLVFERFKECGLNPSVITY
Subjt:  GFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITY

Query:  GCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRP
        GCLINLYTKLGKV+KALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLI+DGIKPD+VLYNNIITAFCGMGKMDRAICTVKEMQKQRHRP
Subjt:  GCLINLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRP

Query:  TTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRN
        TTRTF+PIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTL+GVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLR+
Subjt:  TTRTFLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRN

Query:  EGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTI
        EG+ELDVYTYEALLKACCKSGRMQSALAVTKEMSAQ IPRNTFIYNIL+DGWARRGDVWEAADLMQQMKREGVQ DIHTYTSFINACSKAGDMQRATKTI
Subjt:  EGMELDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTI

Query:  EEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATV-AEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLR
        EEM+SVGVKPNVKTYTTLIHGWARASLPE ALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATV AE  IYPG+LSVCREM+DSGLTVDMGTAVHWSKCLR
Subjt:  EEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATV-AEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLR

Query:  KIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAIGRSWF
        KIERTGGEITEALQKTFPPNWNSY+  L SSS+D++DESD+ SDDDDIC GGVSN D+DDE+DDD +GRSWF
Subjt:  KIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAIGRSWF

A0A6J1FZV7 pentatricopeptide repeat-containing protein At5g04810, chloroplastic0.0e+0087.84Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKNLV
        MD RSLSN T+TTSSA F     P RR  H HPSSA+IV SLK PPP   P P+ SDSDDSS ST  +SGRIRRPQ LKT+SSPKRTTSK P NPLKNLV
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKNLV

Query:  GSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPV
        GSA VP+LP  PPPPPP PVSHSL +KLWLSSKLSPPPPPI+E  +EDE+E EEIETE++ S+GR EVQFRQEGKIFVGNLPNWIKKHEVQ+FFRQFGPV
Subjt:  GSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPV

Query:  KNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIETEP
         NVILIKGHD T+RNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVL+VKLDDGRRLKEK   RA+WMEGDDSVE+RS+WHEERDKARKGFR VIETEP
Subjt:  KNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIETEP

Query:  ENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK
        E+WQAVV AFE IKKPSRKEY LMVNYYARRGDMHRARETFEKMRARGIEPT+HVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK
Subjt:  ENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK

Query:  MGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLI
        MGNAE+ADHWFQEAKEKH SLNAIIYGNIIYAYCQ CNMDRAEALVRQMEE+GIDAPIDIYHTMMDGYTM+GD  KCLLVFERFKECGLNPSVITYGCLI
Subjt:  MGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLI

Query:  NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRT
        NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA+CTVKEMQKQRHRPTTRT
Subjt:  NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRT

Query:  FLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGME
        F+PIIHGFAR+G+MRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGV+PNEHTYTTIMHGYASLGDTGKAF YFTKLR+EG++
Subjt:  FLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGME

Query:  LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMK
        LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNIL+DGWARRGDVWEAADL+QQMK+EGVQ DIHTYTSFINACSKAGDMQRATKTI EMK
Subjt:  LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMK

Query:  SVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERT
        S GVKPN+KTYTTLIHGWARASLPEKALSCF EMKLSGLKPDKAVYHCLMTSLLSRATVAE  IYPG+LSVC+EM+DSGLTVDMGTAVHWSKCLRKIERT
Subjt:  SVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERT

Query:  GGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD---DDDICRGGVSNTDKDDESDDDAIGRSWF
        GGEITEALQKTFPPNWNSYN V  SSS+DSDDE  IS D   DDDIC+  VS     D  DDD +GRSWF
Subjt:  GGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD---DDDICRGGVSNTDKDDESDDDAIGRSWF

A0A6J1HUK8 pentatricopeptide repeat-containing protein At5g04810, chloroplastic0.0e+0086.78Show/hide
Query:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKNLV
        MD RSLSN T+TTSSA F+    P RR  H HPSS +IVFSLK PPP   P P  SDSDDSS ST  +S RIRRPQ LKT+SSPKRTTSK P NPLKNLV
Subjt:  MDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKNLV

Query:  GSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPV
        GSA VP+  PLPPPPPPPPVSHS+++KLWLSSKLSP PPPI+E  +EDE+E EEIETE++ S+GR EVQFRQEGKIFVGNLPNWIKKHEVQ+FFRQFGPV
Subjt:  GSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPV

Query:  KNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIETEP
         NVILIKGHD T+RNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEK   RA+WMEGDDSVE+RS+WHEERDKARKG R VIETEP
Subjt:  KNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEERDKARKGFRKVIETEP

Query:  ENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK
         +WQAVV AFE IKKPSRKEY LMVNYYARRGDMHRARETFEKMRARGIEPTSHVYT+LIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSI+VGGFAK
Subjt:  ENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAK

Query:  MGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLI
        MGNAE+ADHWFQEAKEKH SLNAIIYGNIIYAYCQ CNMDRAEALVRQMEE+GIDAPIDIYHTMMDGYTM+GD +KCLLVFERFKECGLNPSV+TYGCLI
Subjt:  MGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLI

Query:  NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRT
        NLYTKLGKVSKALEV KEME+AGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPD+VLYNNIITAFCGMGKMDRA+CTVKEMQKQRHRPTTRT
Subjt:  NLYTKLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRT

Query:  FLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGME
        F+PIIHGFAR+G+MRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLR+EG++
Subjt:  FLPIIHGFARKGEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGME

Query:  LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMK
        LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRN FIYNIL+DGWARRGDVWEAADL+QQMK+EGVQ DIHTYTSFINACSKAGDMQRATKTI EMK
Subjt:  LDVYTYEALLKACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMK

Query:  SVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERT
        S GVKPNVKTYTTLIHGWARASLPEKALSCF EMKLSGLKPDKAVYHCLMTSLLSRATVAE  IYPG+LSVC+EM+DSGLTVDMGTAVHWSKCLRKIERT
Subjt:  SVGVKPNVKTYTTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERT

Query:  GGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD---------DDDICRGGVSNTDKDDESDDDAIGRSWF
        GGEITEALQKTFPPNWNSYN V  SSS+DSDDES IS D         DDD C+ GVS      ++ DD +GRSWF
Subjt:  GGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSD---------DDDICRGGVSNTDKDDESDDDAIGRSWF

SwissProt top hitse value%identityAlignment
Q0WMY5 Pentatricopeptide repeat-containing protein At5g04810, chloroplastic0.0e+0066.14Show/hide
Query:  HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAP-------FNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSL
        H   A I FSLK PPP   P P  S  D            +RRP+    +SS   + S  P        NPLK L   + V  L              SL
Subjt:  HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAP-------FNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSL

Query:  SNKLWLSSKLS-----PPPPPISETCQ-----EDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPVKNVILIKGHDATER
        ++KL LSSKLS     PPPPP+ ET Q       + +  E ET N       + +FRQEGKIFVGNLP WIKK E +EFFRQFGP++NVILIKGH   E+
Subjt:  SNKLWLSSKLS-----PPPPPISETCQ-----EDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPVKNVILIKGHDATER

Query:  NAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWM---EGDDSVEYRSKWHEERDKARKGFRKVIETEPENWQAVVWAFE
        NAG+GF+IY    A KSAMKAVEFDGVEFHGR+LTVKLDDG+RLK K E R RW+   E D  +  +S WH+ER+ +RK  +++++T  +NWQAV+ AFE
Subjt:  NAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWM---EGDDSVEYRSKWHEERDKARKGFRKVIETEPENWQAVVWAFE

Query:  GIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAKMGNAESADHWF
         I KPSR E+GLMV +Y RRGDMHRARETFE+MRARGI PTS +YTSLIHAYAVGRDM+EALSCVRKMKEEGIEMSLVTYS++VGGF+K G+AE+AD+WF
Subjt:  GIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAKMGNAESADHWF

Query:  QEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLINLYTKLGKVSK
         EAK  H +LNA IYG IIYA+CQTCNM+RAEALVR+MEE+GIDAPI IYHTMMDGYTM+ D  K L+VF+R KECG  P+V+TYGCLINLYTK+GK+SK
Subjt:  QEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLINLYTKLGKVSK

Query:  ALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRTFLPIIHGFARK
        ALEVS+ M+  G+KHN+KTYSM+INGF+KLKDWANAFA+FED++K+G+KPD++LYNNII+AFCGMG MDRAI TVKEMQK RHRPTTRTF+PIIHG+A+ 
Subjt:  ALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRTFLPIIHGFARK

Query:  GEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGMELDVYTYEALLK
        G+MR++L+VFDMMR  GC+PTVHT+N LI GLVEKR+MEKAVEILDEMTLAGVS NEHTYT IM GYAS+GDTGKAF YFT+L+NEG+++D++TYEALLK
Subjt:  GEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGMELDVYTYEALLK

Query:  ACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMKSVGVKPNVKTY
        ACCKSGRMQSALAVTKEMSA+NIPRN+F+YNIL+DGWARRGDVWEAADL+QQMK+EGV+ DIHTYTSFI+ACSKAGDM RAT+TIEEM+++GVKPN+KTY
Subjt:  ACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMKSVGVKPNVKTY

Query:  TTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERTGGEITEALQKT
        TTLI GWARASLPEKALSC+EEMK  G+KPDKAVYHCL+TSLLSRA++AE  IY GV+++C+EM+++GL VDMGTAVHWSKCL KIE +GGE+TE LQKT
Subjt:  TTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERTGGEITEALQKT

Query:  FPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAI
        FPP+W+S++          D  SD+ SD+DD+        D +D  DD+ +
Subjt:  FPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAI

Q39547 Cucumisin5.5e-10466.31Show/hide
Query:  VFPNGKKKIHTTRSWDFMGFSQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLPANDVPGPRD
        VF N   ++HTTRSWDF+GF   V R   VES+I+VGVLDTGIWPESPSFDD+G+ PPP KWKG+CE S NF CN KIIGARSY     +   DV GPRD
Subjt:  VFPNGKKKIHTTRSWDFMGFSQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLPANDVPGPRD

Query:  SNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMKKGIL
        +NGHGTHTAST AGGLV QA+++GLGLGTARGGVP ARIAAYKVCW+D C+D DILAA+DDAIADGVDI+S+S+GG   ++YF D+IAIG+FHA+++GIL
Subjt:  SNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMKKGIL

Query:  TSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAAAGFNGSISR
        TS SAGN GP F T  + SPW LSVAAST DRKFVT+V++G+G+ F G S+NTFD     +PLV   DIPN   GF+ S SR
Subjt:  TSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAAAGFNGSISR

Q9FIF8 Subtilisin-like protease SBT4.31.1e-8857.24Show/hide
Query:  VFPNGKKKIHTTRSWDFMGFSQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLPANDVPGPRD
        VFP+   ++ TTRSWDF+GF ++  R    ESD+IVGV+D+GIWPES SFDD+G+GPPP KWKGSC+    F+CNNK+IGAR Y              RD
Subjt:  VFPNGKKKIHTTRSWDFMGFSQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLPANDVPGPRD

Query:  SNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMKKGIL
          GHGTHTAST AG  V+ AS +GL  GTARGGVPSARIAAYKVC+ + CND DILAAFDDAIADGVD++S+S+  D   N  N S+AIG+FHAM +GI+
Subjt:  SNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMKKGIL

Query:  TSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDI----PNAAAGFNGS
        T+ SAGN+GP   ++ N SPW ++VAAS TDR+F+ RV LG+G+   G SVNTF++NGT+FP+VY  ++      A AG+  S
Subjt:  TSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDI----PNAAAGFNGS

Q9FIM6 Subtilisin-like protease SBT4.82.0e-8255.63Show/hide
Query:  KGREEVQFHQEVFPNGKKKIHTTRSWDFMGFSQ--QVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSD
        +G  E++    VF +   K+ TT SWDFMG  +     R  +VESD I+G +D+GIWPES SF DKG+GPPP KWKG C+   NF+CNNK+IGAR Y S+
Subjt:  KGREEVQFHQEVFPNGKKKIHTTRSWDFMGFSQ--QVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSD

Query:  GQLPANDVPGPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSI
                 G RD  GHGTHT ST AG  V   S FG+G GTARGGVP++R+AAYKVC    C+D ++L+AFDDAIADGVD++SVSLGGD    Y  D+I
Subjt:  GQLPANDVPGPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSI

Query:  AIGAFHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPN
        AIGAFHAM KGILT  SAGN GP  +T+ + +PW L+VAA+TT+R+F+T+V LG+G+   G SVN FD+ G ++PL Y GD  N
Subjt:  AIGAFHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPN

Q9LZS6 Subtilisin-like protease SBT4.152.4e-8356.25Show/hide
Query:  VFPNGKKKIHTTRSWDFMGF-SQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFS-CNNKIIGARSY--RSDGQLPANDVP
        VF N ++++HTTRSWDF+G    +  R   +ES+IIVGVLDTGI  ESPSF+DKG GPPPAKWKG C    NF+ CNNK+IGA+ +  +S+G LP  +  
Subjt:  VFPNGKKKIHTTRSWDFMGF-SQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFS-CNNKIIGARSY--RSDGQLPANDVP

Query:  GPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMK
           D +GHGTHT+ST+AG  V  AS+FG+  GTARGGVPSARIAAYKVCW   C D D+LAAFD+AI+DGVDI+S+S+GG  +  +F D IAIGAFHAMK
Subjt:  GPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMK

Query:  KGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAAAGFNGSISRCE
        +GILT+ SAGN+GPG  T+ N +PW ++VAA++ DRKF T V+LG+G    G S+N F+     +PL       N +AG  G  S CE
Subjt:  KGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAAAGFNGSISRCE

Arabidopsis top hitse value%identityAlignment
AT5G03620.1 Subtilisin-like serine endopeptidase family protein1.7e-8456.25Show/hide
Query:  VFPNGKKKIHTTRSWDFMGF-SQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFS-CNNKIIGARSY--RSDGQLPANDVP
        VF N ++++HTTRSWDF+G    +  R   +ES+IIVGVLDTGI  ESPSF+DKG GPPPAKWKG C    NF+ CNNK+IGA+ +  +S+G LP  +  
Subjt:  VFPNGKKKIHTTRSWDFMGF-SQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFS-CNNKIIGARSY--RSDGQLPANDVP

Query:  GPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMK
           D +GHGTHT+ST+AG  V  AS+FG+  GTARGGVPSARIAAYKVCW   C D D+LAAFD+AI+DGVDI+S+S+GG  +  +F D IAIGAFHAMK
Subjt:  GPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMK

Query:  KGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAAAGFNGSISRCE
        +GILT+ SAGN+GPG  T+ N +PW ++VAA++ DRKF T V+LG+G    G S+N F+     +PL       N +AG  G  S CE
Subjt:  KGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAAAGFNGSISRCE

AT5G04810.1 pentatricopeptide (PPR) repeat-containing protein0.0e+0066.14Show/hide
Query:  HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAP-------FNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSL
        H   A I FSLK PPP   P P  S  D            +RRP+    +SS   + S  P        NPLK L   + V  L              SL
Subjt:  HPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNSTPPLSGRIRRPQTLKTTSSPKRTTSKAP-------FNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSL

Query:  SNKLWLSSKLS-----PPPPPISETCQ-----EDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPVKNVILIKGHDATER
        ++KL LSSKLS     PPPPP+ ET Q       + +  E ET N       + +FRQEGKIFVGNLP WIKK E +EFFRQFGP++NVILIKGH   E+
Subjt:  SNKLWLSSKLS-----PPPPPISETCQ-----EDENEVEEIETENALSKGREEVQFRQEGKIFVGNLPNWIKKHEVQEFFRQFGPVKNVILIKGHDATER

Query:  NAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWM---EGDDSVEYRSKWHEERDKARKGFRKVIETEPENWQAVVWAFE
        NAG+GF+IY    A KSAMKAVEFDGVEFHGR+LTVKLDDG+RLK K E R RW+   E D  +  +S WH+ER+ +RK  +++++T  +NWQAV+ AFE
Subjt:  NAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWM---EGDDSVEYRSKWHEERDKARKGFRKVIETEPENWQAVVWAFE

Query:  GIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAKMGNAESADHWF
         I KPSR E+GLMV +Y RRGDMHRARETFE+MRARGI PTS +YTSLIHAYAVGRDM+EALSCVRKMKEEGIEMSLVTYS++VGGF+K G+AE+AD+WF
Subjt:  GIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSILVGGFAKMGNAESADHWF

Query:  QEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLINLYTKLGKVSK
         EAK  H +LNA IYG IIYA+CQTCNM+RAEALVR+MEE+GIDAPI IYHTMMDGYTM+ D  K L+VF+R KECG  P+V+TYGCLINLYTK+GK+SK
Subjt:  QEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLINLYTKLGKVSK

Query:  ALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRTFLPIIHGFARK
        ALEVS+ M+  G+KHN+KTYSM+INGF+KLKDWANAFA+FED++K+G+KPD++LYNNII+AFCGMG MDRAI TVKEMQK RHRPTTRTF+PIIHG+A+ 
Subjt:  ALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRTFLPIIHGFARK

Query:  GEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGMELDVYTYEALLK
        G+MR++L+VFDMMR  GC+PTVHT+N LI GLVEKR+MEKAVEILDEMTLAGVS NEHTYT IM GYAS+GDTGKAF YFT+L+NEG+++D++TYEALLK
Subjt:  GEMRKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGMELDVYTYEALLK

Query:  ACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMKSVGVKPNVKTY
        ACCKSGRMQSALAVTKEMSA+NIPRN+F+YNIL+DGWARRGDVWEAADL+QQMK+EGV+ DIHTYTSFI+ACSKAGDM RAT+TIEEM+++GVKPN+KTY
Subjt:  ACCKSGRMQSALAVTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMKSVGVKPNVKTY

Query:  TTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERTGGEITEALQKT
        TTLI GWARASLPEKALSC+EEMK  G+KPDKAVYHCL+TSLLSRA++AE  IY GV+++C+EM+++GL VDMGTAVHWSKCL KIE +GGE+TE LQKT
Subjt:  TTLIHGWARASLPEKALSCFEEMKLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERTGGEITEALQKT

Query:  FPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAI
        FPP+W+S++          D  SD+ SD+DD+        D +D  DD+ +
Subjt:  FPPNWNSYNKVLASSSMDSDDESDISSDDDDICRGGVSNTDKDDESDDDAI

AT5G58830.1 Subtilisin-like serine endopeptidase family protein1.4e-8355.63Show/hide
Query:  KGREEVQFHQEVFPNGKKKIHTTRSWDFMGFSQ--QVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSD
        +G  E++    VF +   K+ TT SWDFMG  +     R  +VESD I+G +D+GIWPES SF DKG+GPPP KWKG C+   NF+CNNK+IGAR Y S+
Subjt:  KGREEVQFHQEVFPNGKKKIHTTRSWDFMGFSQ--QVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSD

Query:  GQLPANDVPGPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSI
                 G RD  GHGTHT ST AG  V   S FG+G GTARGGVP++R+AAYKVC    C+D ++L+AFDDAIADGVD++SVSLGGD    Y  D+I
Subjt:  GQLPANDVPGPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSI

Query:  AIGAFHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPN
        AIGAFHAM KGILT  SAGN GP  +T+ + +PW L+VAA+TT+R+F+T+V LG+G+   G SVN FD+ G ++PL Y GD  N
Subjt:  AIGAFHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPN

AT5G59090.1 subtilase 4.121.4e-8353.9Show/hide
Query:  EVQFHQEVFPNGKKKIHTTRSWDFMGFSQ--QVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLP
        E++    VFPN   ++HTT SWDFMG  +     R  ++ESD I+GV+DTGIWPES SF DKG+GPPP KWKG C    NF+CNNK+IGAR Y S+    
Subjt:  EVQFHQEVFPNGKKKIHTTRSWDFMGFSQ--QVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLP

Query:  ANDVPGPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGA
             G RD++GHGTHTAST AG  V+  S FG+G GT RGGVP++RIAAYKVC    C+   +L++FDDAIADGVD++++S+G      + +D IAIGA
Subjt:  ANDVPGPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGA

Query:  FHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAA
        FHAM KGILT  SAGN GP  +T+ + +PW  +VAASTT+R F+T+V LG+G+   G SVN FD+ G ++PLVY     ++A
Subjt:  FHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAA

AT5G59190.1 subtilase family protein7.9e-9057.24Show/hide
Query:  VFPNGKKKIHTTRSWDFMGFSQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLPANDVPGPRD
        VFP+   ++ TTRSWDF+GF ++  R    ESD+IVGV+D+GIWPES SFDD+G+GPPP KWKGSC+    F+CNNK+IGAR Y              RD
Subjt:  VFPNGKKKIHTTRSWDFMGFSQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLPANDVPGPRD

Query:  SNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMKKGIL
          GHGTHTAST AG  V+ AS +GL  GTARGGVPSARIAAYKVC+ + CND DILAAFDDAIADGVD++S+S+  D   N  N S+AIG+FHAM +GI+
Subjt:  SNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILAAFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMKKGIL

Query:  TSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDI----PNAAAGFNGS
        T+ SAGN+GP   ++ N SPW ++VAAS TDR+F+ RV LG+G+   G SVNTF++NGT+FP+VY  ++      A AG+  S
Subjt:  TSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAGDI----PNAAAGFNGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCTCCCAACAAGTTGATCGAGTTCCATCGGTAGAAAGCGACATCATCGTTGGAGTACTCGACACAGGAATTTGGCCGGAATCTCCTAGTTTCGACGACAAAGG
ATACAGTCCAGCGCCAGCCAAGTGGAAGGGCAGTTGTGAAGTCTCAGCCAATTTCTCTTGCAACAATAAAATCATCGGAGCTCGATCATATCGCAGCGATGGTCAAGTTC
CAGCAAATGACATCCCAGATCCAAGAGATTCCAACGGCCACGGTACGCACACAGCGTCGACGGTGGCCGGCGGGTTAGTCAGGCAAGCCAGCATGTTTGGTCTCGGCCTC
GGCACAGCTAGGGGAGGAGTCCCGTCGGCACGTATTGCCGCTTATAAGGTATGCTGGTCCGACGGCTGCACAGACGCCGACATTCTTGCTGCATTCGACGACGCCATCGC
CGACGGAGTTGATATCCTCTCTGTCTCTATCGGGTGGCATGTGACTAGAGATTACTTCAACGACTCCATAGCCATTGGAGCTTTCCACGCAATGAAGAAAGGAATCCTCA
CGTCAGTGTCCGCCGGAAACGACGGCCCAGGCTTTTCAACCATTAGAAACTTCTCGCCGTGGTCTCTGTCGGTGGCGGCCAGCACCACCGACCGGAAATTCGTGACCAGA
GTTGAGCTCGGAGATGGAAGAGGATTCGATGGTGCCAGCGTCAATACATTTGATATAAATGGAACACAATTTCCGTTGGTTTACGCTGGAGATATTCCAAATGCTGCTGC
TGGCTTCAACGGATCCATTTCGAGCCGCCGCCGCCGCATCATTCTCCTTCCCCAGCTGGTGCTGATTCCGACGTTTCCTCCAACTCAAACCCCCCCACTCTCCGGCCTCA
TCCGTCGCCCCCAAACCCTAAAAACCACTTCTCCCCTACACGCACCACCTCCAAAGCTCCCTCTACCCTCTCAAGAAACTGGTCGGCTCCGCCTATGTTCCCCTTCTTCC
TCCGCTGCCACCGCCGCCGCCCCCTGTTTCCCACTCGCTCTCCATCAAGAGCTCTGGCTCTGCAGTAAGCTCTCTCCACCGCCTCCCCCGATTTCCGAGACATGCCAGGG
GGATGAGAATAAAGTTAAAGAAATTGAAACCGAGGTTTCTTTGAGTAAGGGGCGGGAAGAAGTTCAATTCCATCAAGAGGTTTTTCCAAATGGAAAAAAGAAAATCCACA
CAACAAGATCTTGGGACTTCATGGGTTTCTCCCAACAAGTCGATCGAGTTCCGTCGGTGGAAAGCGACATCATCGTCGGAGTACTCGACACCGGAATTTGGCCGGAATCT
CCTAGTTTCGACGACAAGGGATATGGTCCTCCGCCAGCCAAGTGGAAGGGCAGTTGTGAAGTCTCAGCCAATTTCTCTTGCAACAATAAAATCATCGGAGCTCGATCATA
TCGCAGCGACGGTCAACTTCCGGCAAATGATGTCCCAGGTCCAAGAGATTCCAACGGCCACGGGACGCACACAGCGTCGACGGTGGCCGGCGGGCTGGTCAGGCAAGCCA
GCATGTTTGGTCTCGGCCTCGGCACGGCGAGGGGAGGAGTCCCGTCGGCACGTATTGCCGCTTACAAGGTTTGCTGGTCGGACAACTGCAACGACGCCGACATTCTTGCC
GCATTCGACGACGCCATTGCCGACGGAGTGGATATCCTCTCTGTCTCTCTCGGAGGAGATGTGGCAAAAAATTATTTCAACGACTCCATAGCCATTGGAGCTTTCCACGC
AATGAAGAAGGGAATCCTCACGTCAGTGTCCGCCGGAAACGACGGCCCAGGCTTTTCAACCATTAGAAACTTCTCGCCGTGGTCTCTGTCGGTGGCGGCCAGCACCACCG
ACCGGAAATTCGTGACCAGAGTTGAGCTTGGAGACGGAAGAGGATTCGACGGTGCCAGCGTCAATACATTTGATATAAATGGAACACAATTTCCGTTGGTTTACGCTGGA
GATATTCCAAATGCTGCTGCTGGTTTCAACGGATCCATTTCGAGGTGTGAGACTATCAACTCAACTCGTCACTTGATACGACGCTACGACCATGAAAATGAAACAAACCA
GAGGCAGCGGCAGAGGCTAAAGAAGATAATGGACGCTCGCTCACTCTCAAACACCACCACCACCACTTCCTCCGCTTGTTTCTCCGCCGTCCTTGCCCCACATCGCCGCC
GCCAACATTTTCACCCTTCTTCCGCCATCATTGTTTTCTCACTGAAGCCGCCGCCGCCGCATCATTCGCCTTCTCCAGCTGGTTCTGACTCCGACGATTCCTCCAACTCA
ACCCCCCCGCTCTCCGGCCGCATCCGTCGCCCCCAAACCCTAAAAACCACTTCCTCCCCTAAACGCACTACCTCCAAAGCTCCCTTTAACCCTCTCAAGAATCTGGTCGG
CTCCGCCTACGTTCCCCTTCTTCCTCCACTGCCGCCGCCGCCGCCGCCGCCCCCTGTTTCCCACTCGCTCTCCAACAAACTCTGGCTCTCCAGTAAGCTCTCTCCACCGC
CTCCCCCGATTTCCGAGACATGCCAGGAGGATGAGAATGAAGTTGAAGAAATTGAAACCGAGAATGCTTTGAGTAAGGGGCGGGAAGAAGTTCAATTCCGTCAAGAGGGT
AAGATTTTTGTCGGGAACTTACCTAATTGGATAAAGAAGCATGAGGTCCAAGAGTTTTTTCGACAGTTTGGTCCTGTAAAGAATGTGATATTGATTAAGGGTCACGATGC
CACGGAAAGAAATGCCGGGTACGGATTCGTCATATATGACGGGTCAACTGCAGCCAAGTCGGCCATGAAAGCGGTTGAGTTTGACGGGGTGGAGTTTCACGGAAGGGTCC
TGACTGTGAAATTGGATGATGGGAGGAGGTTGAAGGAGAAGACGGAGGGGAGGGCGAGATGGATGGAGGGAGACGACAGCGTGGAATATCGTTCCAAATGGCATGAAGAG
AGAGATAAAGCACGGAAGGGCTTTCGCAAGGTTATTGAGACAGAGCCAGAGAACTGGCAGGCGGTTGTCTGGGCGTTCGAGGGGATCAAGAAGCCTTCTAGGAAAGAGTA
TGGTTTGATGGTGAACTACTACGCAAGAAGAGGTGATATGCACCGTGCACGTGAAACCTTTGAAAAGATGCGGGCTAGGGGAATAGAACCCACGTCTCATGTCTACACAA
GCCTTATTCATGCTTATGCGGTTGGTAGAGATATGGAAGAAGCATTATCTTGTGTTAGGAAAATGAAAGAAGAAGGCATAGAAATGAGTTTGGTAACTTACAGCATTCTT
GTGGGTGGATTTGCCAAAATGGGAAATGCAGAATCTGCAGATCACTGGTTTCAGGAGGCCAAAGAGAAACACTCCTCATTGAATGCAATTATTTATGGGAATATTATATA
TGCCTACTGTCAAACATGCAATATGGATAGAGCTGAAGCTTTGGTGAGGCAAATGGAAGAAGATGGCATAGATGCTCCAATTGACATATATCACACTATGATGGATGGTT
ATACAATGATTGGTGATGCGGACAAATGTCTGCTTGTGTTTGAGAGATTTAAGGAATGCGGTTTGAACCCTTCAGTCATTACTTATGGATGTCTCATTAATCTTTACACG
AAGCTTGGGAAAGTTTCTAAAGCTTTGGAAGTTAGCAAAGAAATGGAGCATGCTGGCATAAAACACAACATGAAGACCTACTCCATGTTGATCAATGGGTTCTTGAAGTT
GAAAGATTGGGCTAATGCTTTCGCTATTTTTGAGGATTTGATAAAAGATGGTATTAAGCCTGATATAGTACTCTATAATAATATTATCACCGCATTCTGTGGGATGGGGA
AGATGGATCGTGCCATTTGTACTGTCAAGGAAATGCAGAAGCAGAGGCACAGACCTACAACTCGAACATTTTTGCCCATCATACATGGTTTTGCAAGGAAAGGGGAAATG
AGGAAAGCGCTAGATGTTTTTGATATGATGCGGATGTCTGGATGCATTCCAACTGTGCACACTTATAATGCTCTGATTCTTGGTCTTGTCGAGAAGCGTAAGATGGAGAA
GGCTGTAGAAATACTGGATGAGATGACGCTGGCTGGTGTAAGTCCAAATGAACACACATACACAACCATCATGCATGGCTATGCTTCCTTGGGTGATACAGGAAAAGCAT
TCGCGTACTTCACTAAACTGAGGAATGAGGGTATGGAACTTGATGTTTATACGTATGAAGCACTGCTAAAAGCATGCTGCAAATCAGGCAGGATGCAGAGTGCATTGGCA
GTCACCAAGGAAATGAGTGCTCAAAATATCCCAAGAAACACCTTCATTTATAACATTTTAGTTGATGGATGGGCTCGACGAGGTGACGTTTGGGAGGCAGCTGATCTGAT
GCAACAAATGAAAAGAGAAGGAGTTCAATCTGACATCCATACATACACTTCTTTCATAAATGCCTGCTCCAAGGCTGGAGATATGCAGAGAGCAACGAAAACAATTGAAG
AAATGAAATCTGTAGGAGTGAAGCCTAATGTTAAAACGTATACTACACTGATTCATGGGTGGGCCCGTGCTTCTTTACCAGAGAAGGCATTATCATGCTTTGAAGAGATG
AAACTATCTGGGTTGAAACCAGACAAAGCTGTTTACCATTGTCTGATGACGTCTTTACTTTCGAGGGCTACTGTTGCAGAAGATTGCATTTATCCTGGTGTTCTCTCTGT
TTGCAGAGAGATGATTGATTCTGGACTAACAGTGGATATGGGGACAGCAGTTCACTGGTCCAAGTGTTTACGCAAGATTGAGAGAACTGGTGGGGAGATTACTGAAGCCT
TGCAGAAGACCTTCCCTCCCAATTGGAACTCGTATAACAAAGTCCTCGCGAGTTCCAGCATGGACTCTGATGACGAATCTGATATAAGCAGCGACGACGATGACATATGT
CGGGGGGGTGTATCTAACACTGACAAAGATGATGAAAGCGATGATGATGCAATTGGTAGATCATGGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTCTCCCAACAAGTTGATCGAGTTCCATCGGTAGAAAGCGACATCATCGTTGGAGTACTCGACACAGGAATTTGGCCGGAATCTCCTAGTTTCGACGACAAAGG
ATACAGTCCAGCGCCAGCCAAGTGGAAGGGCAGTTGTGAAGTCTCAGCCAATTTCTCTTGCAACAATAAAATCATCGGAGCTCGATCATATCGCAGCGATGGTCAAGTTC
CAGCAAATGACATCCCAGATCCAAGAGATTCCAACGGCCACGGTACGCACACAGCGTCGACGGTGGCCGGCGGGTTAGTCAGGCAAGCCAGCATGTTTGGTCTCGGCCTC
GGCACAGCTAGGGGAGGAGTCCCGTCGGCACGTATTGCCGCTTATAAGGTATGCTGGTCCGACGGCTGCACAGACGCCGACATTCTTGCTGCATTCGACGACGCCATCGC
CGACGGAGTTGATATCCTCTCTGTCTCTATCGGGTGGCATGTGACTAGAGATTACTTCAACGACTCCATAGCCATTGGAGCTTTCCACGCAATGAAGAAAGGAATCCTCA
CGTCAGTGTCCGCCGGAAACGACGGCCCAGGCTTTTCAACCATTAGAAACTTCTCGCCGTGGTCTCTGTCGGTGGCGGCCAGCACCACCGACCGGAAATTCGTGACCAGA
GTTGAGCTCGGAGATGGAAGAGGATTCGATGGTGCCAGCGTCAATACATTTGATATAAATGGAACACAATTTCCGTTGGTTTACGCTGGAGATATTCCAAATGCTGCTGC
TGGCTTCAACGGATCCATTTCGAGCCGCCGCCGCCGCATCATTCTCCTTCCCCAGCTGGTGCTGATTCCGACGTTTCCTCCAACTCAAACCCCCCCACTCTCCGGCCTCA
TCCGTCGCCCCCAAACCCTAAAAACCACTTCTCCCCTACACGCACCACCTCCAAAGCTCCCTCTACCCTCTCAAGAAACTGGTCGGCTCCGCCTATGTTCCCCTTCTTCC
TCCGCTGCCACCGCCGCCGCCCCCTGTTTCCCACTCGCTCTCCATCAAGAGCTCTGGCTCTGCAGTAAGCTCTCTCCACCGCCTCCCCCGATTTCCGAGACATGCCAGGG
GGATGAGAATAAAGTTAAAGAAATTGAAACCGAGGTTTCTTTGAGTAAGGGGCGGGAAGAAGTTCAATTCCATCAAGAGGTTTTTCCAAATGGAAAAAAGAAAATCCACA
CAACAAGATCTTGGGACTTCATGGGTTTCTCCCAACAAGTCGATCGAGTTCCGTCGGTGGAAAGCGACATCATCGTCGGAGTACTCGACACCGGAATTTGGCCGGAATCT
CCTAGTTTCGACGACAAGGGATATGGTCCTCCGCCAGCCAAGTGGAAGGGCAGTTGTGAAGTCTCAGCCAATTTCTCTTGCAACAATAAAATCATCGGAGCTCGATCATA
TCGCAGCGACGGTCAACTTCCGGCAAATGATGTCCCAGGTCCAAGAGATTCCAACGGCCACGGGACGCACACAGCGTCGACGGTGGCCGGCGGGCTGGTCAGGCAAGCCA
GCATGTTTGGTCTCGGCCTCGGCACGGCGAGGGGAGGAGTCCCGTCGGCACGTATTGCCGCTTACAAGGTTTGCTGGTCGGACAACTGCAACGACGCCGACATTCTTGCC
GCATTCGACGACGCCATTGCCGACGGAGTGGATATCCTCTCTGTCTCTCTCGGAGGAGATGTGGCAAAAAATTATTTCAACGACTCCATAGCCATTGGAGCTTTCCACGC
AATGAAGAAGGGAATCCTCACGTCAGTGTCCGCCGGAAACGACGGCCCAGGCTTTTCAACCATTAGAAACTTCTCGCCGTGGTCTCTGTCGGTGGCGGCCAGCACCACCG
ACCGGAAATTCGTGACCAGAGTTGAGCTTGGAGACGGAAGAGGATTCGACGGTGCCAGCGTCAATACATTTGATATAAATGGAACACAATTTCCGTTGGTTTACGCTGGA
GATATTCCAAATGCTGCTGCTGGTTTCAACGGATCCATTTCGAGGTGTGAGACTATCAACTCAACTCGTCACTTGATACGACGCTACGACCATGAAAATGAAACAAACCA
GAGGCAGCGGCAGAGGCTAAAGAAGATAATGGACGCTCGCTCACTCTCAAACACCACCACCACCACTTCCTCCGCTTGTTTCTCCGCCGTCCTTGCCCCACATCGCCGCC
GCCAACATTTTCACCCTTCTTCCGCCATCATTGTTTTCTCACTGAAGCCGCCGCCGCCGCATCATTCGCCTTCTCCAGCTGGTTCTGACTCCGACGATTCCTCCAACTCA
ACCCCCCCGCTCTCCGGCCGCATCCGTCGCCCCCAAACCCTAAAAACCACTTCCTCCCCTAAACGCACTACCTCCAAAGCTCCCTTTAACCCTCTCAAGAATCTGGTCGG
CTCCGCCTACGTTCCCCTTCTTCCTCCACTGCCGCCGCCGCCGCCGCCGCCCCCTGTTTCCCACTCGCTCTCCAACAAACTCTGGCTCTCCAGTAAGCTCTCTCCACCGC
CTCCCCCGATTTCCGAGACATGCCAGGAGGATGAGAATGAAGTTGAAGAAATTGAAACCGAGAATGCTTTGAGTAAGGGGCGGGAAGAAGTTCAATTCCGTCAAGAGGGT
AAGATTTTTGTCGGGAACTTACCTAATTGGATAAAGAAGCATGAGGTCCAAGAGTTTTTTCGACAGTTTGGTCCTGTAAAGAATGTGATATTGATTAAGGGTCACGATGC
CACGGAAAGAAATGCCGGGTACGGATTCGTCATATATGACGGGTCAACTGCAGCCAAGTCGGCCATGAAAGCGGTTGAGTTTGACGGGGTGGAGTTTCACGGAAGGGTCC
TGACTGTGAAATTGGATGATGGGAGGAGGTTGAAGGAGAAGACGGAGGGGAGGGCGAGATGGATGGAGGGAGACGACAGCGTGGAATATCGTTCCAAATGGCATGAAGAG
AGAGATAAAGCACGGAAGGGCTTTCGCAAGGTTATTGAGACAGAGCCAGAGAACTGGCAGGCGGTTGTCTGGGCGTTCGAGGGGATCAAGAAGCCTTCTAGGAAAGAGTA
TGGTTTGATGGTGAACTACTACGCAAGAAGAGGTGATATGCACCGTGCACGTGAAACCTTTGAAAAGATGCGGGCTAGGGGAATAGAACCCACGTCTCATGTCTACACAA
GCCTTATTCATGCTTATGCGGTTGGTAGAGATATGGAAGAAGCATTATCTTGTGTTAGGAAAATGAAAGAAGAAGGCATAGAAATGAGTTTGGTAACTTACAGCATTCTT
GTGGGTGGATTTGCCAAAATGGGAAATGCAGAATCTGCAGATCACTGGTTTCAGGAGGCCAAAGAGAAACACTCCTCATTGAATGCAATTATTTATGGGAATATTATATA
TGCCTACTGTCAAACATGCAATATGGATAGAGCTGAAGCTTTGGTGAGGCAAATGGAAGAAGATGGCATAGATGCTCCAATTGACATATATCACACTATGATGGATGGTT
ATACAATGATTGGTGATGCGGACAAATGTCTGCTTGTGTTTGAGAGATTTAAGGAATGCGGTTTGAACCCTTCAGTCATTACTTATGGATGTCTCATTAATCTTTACACG
AAGCTTGGGAAAGTTTCTAAAGCTTTGGAAGTTAGCAAAGAAATGGAGCATGCTGGCATAAAACACAACATGAAGACCTACTCCATGTTGATCAATGGGTTCTTGAAGTT
GAAAGATTGGGCTAATGCTTTCGCTATTTTTGAGGATTTGATAAAAGATGGTATTAAGCCTGATATAGTACTCTATAATAATATTATCACCGCATTCTGTGGGATGGGGA
AGATGGATCGTGCCATTTGTACTGTCAAGGAAATGCAGAAGCAGAGGCACAGACCTACAACTCGAACATTTTTGCCCATCATACATGGTTTTGCAAGGAAAGGGGAAATG
AGGAAAGCGCTAGATGTTTTTGATATGATGCGGATGTCTGGATGCATTCCAACTGTGCACACTTATAATGCTCTGATTCTTGGTCTTGTCGAGAAGCGTAAGATGGAGAA
GGCTGTAGAAATACTGGATGAGATGACGCTGGCTGGTGTAAGTCCAAATGAACACACATACACAACCATCATGCATGGCTATGCTTCCTTGGGTGATACAGGAAAAGCAT
TCGCGTACTTCACTAAACTGAGGAATGAGGGTATGGAACTTGATGTTTATACGTATGAAGCACTGCTAAAAGCATGCTGCAAATCAGGCAGGATGCAGAGTGCATTGGCA
GTCACCAAGGAAATGAGTGCTCAAAATATCCCAAGAAACACCTTCATTTATAACATTTTAGTTGATGGATGGGCTCGACGAGGTGACGTTTGGGAGGCAGCTGATCTGAT
GCAACAAATGAAAAGAGAAGGAGTTCAATCTGACATCCATACATACACTTCTTTCATAAATGCCTGCTCCAAGGCTGGAGATATGCAGAGAGCAACGAAAACAATTGAAG
AAATGAAATCTGTAGGAGTGAAGCCTAATGTTAAAACGTATACTACACTGATTCATGGGTGGGCCCGTGCTTCTTTACCAGAGAAGGCATTATCATGCTTTGAAGAGATG
AAACTATCTGGGTTGAAACCAGACAAAGCTGTTTACCATTGTCTGATGACGTCTTTACTTTCGAGGGCTACTGTTGCAGAAGATTGCATTTATCCTGGTGTTCTCTCTGT
TTGCAGAGAGATGATTGATTCTGGACTAACAGTGGATATGGGGACAGCAGTTCACTGGTCCAAGTGTTTACGCAAGATTGAGAGAACTGGTGGGGAGATTACTGAAGCCT
TGCAGAAGACCTTCCCTCCCAATTGGAACTCGTATAACAAAGTCCTCGCGAGTTCCAGCATGGACTCTGATGACGAATCTGATATAAGCAGCGACGACGATGACATATGT
CGGGGGGGTGTATCTAACACTGACAAAGATGATGAAAGCGATGATGATGCAATTGGTAGATCATGGTTTTGA
Protein sequenceShow/hide protein sequence
MGFSQQVDRVPSVESDIIVGVLDTGIWPESPSFDDKGYSPAPAKWKGSCEVSANFSCNNKIIGARSYRSDGQVPANDIPDPRDSNGHGTHTASTVAGGLVRQASMFGLGL
GTARGGVPSARIAAYKVCWSDGCTDADILAAFDDAIADGVDILSVSIGWHVTRDYFNDSIAIGAFHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTR
VELGDGRGFDGASVNTFDINGTQFPLVYAGDIPNAAAGFNGSISSRRRRIILLPQLVLIPTFPPTQTPPLSGLIRRPQTLKTTSPLHAPPPKLPLPSQETGRLRLCSPSS
SAATAAAPCFPLALHQELWLCSKLSPPPPPISETCQGDENKVKEIETEVSLSKGREEVQFHQEVFPNGKKKIHTTRSWDFMGFSQQVDRVPSVESDIIVGVLDTGIWPES
PSFDDKGYGPPPAKWKGSCEVSANFSCNNKIIGARSYRSDGQLPANDVPGPRDSNGHGTHTASTVAGGLVRQASMFGLGLGTARGGVPSARIAAYKVCWSDNCNDADILA
AFDDAIADGVDILSVSLGGDVAKNYFNDSIAIGAFHAMKKGILTSVSAGNDGPGFSTIRNFSPWSLSVAASTTDRKFVTRVELGDGRGFDGASVNTFDINGTQFPLVYAG
DIPNAAAGFNGSISRCETINSTRHLIRRYDHENETNQRQRQRLKKIMDARSLSNTTTTTSSACFSAVLAPHRRRQHFHPSSAIIVFSLKPPPPHHSPSPAGSDSDDSSNS
TPPLSGRIRRPQTLKTTSSPKRTTSKAPFNPLKNLVGSAYVPLLPPLPPPPPPPPVSHSLSNKLWLSSKLSPPPPPISETCQEDENEVEEIETENALSKGREEVQFRQEG
KIFVGNLPNWIKKHEVQEFFRQFGPVKNVILIKGHDATERNAGYGFVIYDGSTAAKSAMKAVEFDGVEFHGRVLTVKLDDGRRLKEKTEGRARWMEGDDSVEYRSKWHEE
RDKARKGFRKVIETEPENWQAVVWAFEGIKKPSRKEYGLMVNYYARRGDMHRARETFEKMRARGIEPTSHVYTSLIHAYAVGRDMEEALSCVRKMKEEGIEMSLVTYSIL
VGGFAKMGNAESADHWFQEAKEKHSSLNAIIYGNIIYAYCQTCNMDRAEALVRQMEEDGIDAPIDIYHTMMDGYTMIGDADKCLLVFERFKECGLNPSVITYGCLINLYT
KLGKVSKALEVSKEMEHAGIKHNMKTYSMLINGFLKLKDWANAFAIFEDLIKDGIKPDIVLYNNIITAFCGMGKMDRAICTVKEMQKQRHRPTTRTFLPIIHGFARKGEM
RKALDVFDMMRMSGCIPTVHTYNALILGLVEKRKMEKAVEILDEMTLAGVSPNEHTYTTIMHGYASLGDTGKAFAYFTKLRNEGMELDVYTYEALLKACCKSGRMQSALA
VTKEMSAQNIPRNTFIYNILVDGWARRGDVWEAADLMQQMKREGVQSDIHTYTSFINACSKAGDMQRATKTIEEMKSVGVKPNVKTYTTLIHGWARASLPEKALSCFEEM
KLSGLKPDKAVYHCLMTSLLSRATVAEDCIYPGVLSVCREMIDSGLTVDMGTAVHWSKCLRKIERTGGEITEALQKTFPPNWNSYNKVLASSSMDSDDESDISSDDDDIC
RGGVSNTDKDDESDDDAIGRSWF