; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001773 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001773
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr11:359952..361694
RNA-Seq ExpressionHG10001773
SyntenyHG10001773
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152299.1 pentatricopeptide repeat-containing protein At4g20090 isoform X1 [Cucumis sativus]7.8e-29788.89Show/hide
Query:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL
        TIR++SALLK + LHF+G SSHFFSTS TT HIAIAPR LARRPTSRTAP PR+P+TLGS+DVVN VC+LLSNKN Q PNLDLDHLLKRFKD LSSD VL
Subjt:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH+GR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+STAIE F EM RAGLVPTR+AVNILIG+LCSLSAK+GAVE+VR  ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L+L+PS+FV VQLI ELCRLGQMQEAI+VLKVVEGDKLRC EECY+VVM+ALCEHRHV+EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMNKKRC PDHVTYSALIHAYGE ++WSAAY LLKEMLS GMSPHFHVYSIVDKLMREHGQIDLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA

Query:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        MKWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIY+RDAFESAFQKKGKFKIARELLQKMDGVHQHES TRNSS
Subjt:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]2.7e-29788.37Show/hide
Query:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL
        TIR++SALLK + LHF+GFSSHFFSTS TTKHIAIAPR L RRPTSRTAP PR+P+T+GS+DVVN VC+LLSNKN Q PNLD++HLLKRFKD LSSDLVL
Subjt:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH+GR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+STAIE F EM RAGLVPTRSA NILIG+LCSLSAK+GA+E+VR RST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L L+PS+FV VQLI ELCR+GQMQEAIKVLKVVE DKLRC EECY+VVM+ALCEHRH++EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMNKKRC PDHVTYSALIHAYGE +NWSAAY LLKEMLS GMSPHFHVYS+VDKLMREHGQ+DLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA

Query:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        MKWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIY+RDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
Subjt:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima]4.9e-29187.76Show/hide
Query:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS
        MLST TIRNASA LKFV    YGFSS+  STS+TTK  AIAPR LARRPTSRTA IPRA DT     V +VC+LLSNK+HQ  NL+LDHLLKRFK+TLSS
Subjt:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS

Query:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        D VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS++GR+SCRTFSICIRFLGRQGRVREALCLFEEMEP F
Subjt:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR
        GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVFDEM RA LVPTRSAVNILIGDLCSLSAK+GAVEQVR
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR

Query:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG
         RSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANR+AL+PSAFVIV+LI ELCRLGQMQEAI+VLKVVE +KLRCTEECY++VMQALCEHR V+EASDLFG
Subjt:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG

Query:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC
        RMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMN+KRC PDHVTYSALIHAYGE +NWSAAYSLLKEMLS G+SPHFHVYSIVDKLMRE GQ DLC
Subjt:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC

Query:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        LKL MKWE+QILQKLCKQGQL AAYEK+KSMLEKGFYPPIY+RDAFESAFQKKGKFKIARELLQ MDGVH+HES TR +S
Subjt:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

XP_023526018.1 pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Cucurbita pepo subsp. pepo]2.0e-29287.59Show/hide
Query:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS
        MLST TIRNASA LKFV    YGFSS+ FSTS+TTK  AIAPR LARRPTSRTAPIPRA DT     V +VC+LLSNKNHQ  NL+LDHLLKRFK+T+SS
Subjt:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS

Query:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        D VLQILMNYRL GRAKTLEFFSWS LQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS++GR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKF
Subjt:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR
        GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVFDEM+RAGLVPTRSAVNILIGDLCSLSAK+GAVEQVR
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR

Query:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG
         RSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLAL+PSAFVIV+LILELCRLGQMQEAI+VLKVVE +KLRCTEECY++VMQALCEHR V+EASDL G
Subjt:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG

Query:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC
        RMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMN+K+C PDHVTYSALIHAYGE +NWSA YSLLK+MLS G+SPHFHVYS+VDKLMRE GQ DLC
Subjt:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC

Query:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        LKL MKWE+QILQKLCKQGQL  AYEK+KSMLEKGFYPPIY+RDAFESAFQKKGKFKIARELLQ MDGVH+HESG+R +S
Subjt:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida]0.0e+0092.77Show/hide
Query:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLS
        MLS  TIRNASA LKF PLHFYGFSSHFFSTS  TKHIAIAPR LARRPTSRTAPIPRA DTLGS+DVVN VC+LLSNKNHQ PNLDLDHLLKRFKDTLS
Subjt:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDE+VVEYMADFLGRRKLFDDMKCLLVTVSSH+GRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTAL+IFRRIELPDKYSYSNVIIGLCKFGRF TAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAK+GAVEQV
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQV

Query:  RFRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLF
        R RSTRRPFTVLVPNVNPKSGAIEPAVG+FWAAN+LAL+PSAFVIVQLI ELCRLGQMQEAIKVLKVVEGDKLRC EECY+VVM+ALCEHRHVEEASDLF
Subjt:  RFRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDL
        GR+LSQGMKPKLAIYNS+ICMLCK+GNL+DAERVFKIMN+KRC PDHVTYS+LIHAYGET+NWSAAYSLLKEMLS GMSPHFH+YS+VDKLMREHGQIDL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDL

Query:  CLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        CLKL MKWEAQILQKLCK GQL+AAYEKMKSMLEKGFYPPIY+RD+FESAFQKKGKFKIARELLQK+DGVHQHESGTRNSS
Subjt:  CLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

TrEMBL top hitse value%identityAlignment
A0A0A0KU61 Uncharacterized protein2.6e-28288.52Show/hide
Query:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL
        TIR++SALLK + LHF+G SSHFFSTS TT HIAIAPR LARRPTSRTAP PR+P+TLGS+DVVN VC+LLSNKN Q PNLDLDHLLKRFKD LSSD VL
Subjt:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH+GR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+STAIE F EM RAGLVPTR+AVNILIG+LCSLSAK+GAVE+VR  ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L+L+PS+FV VQLI ELCRLGQMQEAI+VLKVVEGDKLRC EECY+VVM+ALCEHRHV+EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMNKKRC PDHVTYSALIHAYGE ++WSAAY LLKEMLS GMSPHFHVYSIVDKLMREHGQIDLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA

Query:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKK
        MKWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIY+RDAFESAFQKK
Subjt:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKK

A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.3e-29788.37Show/hide
Query:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL
        TIR++SALLK + LHF+GFSSHFFSTS TTKHIAIAPR L RRPTSRTAP PR+P+T+GS+DVVN VC+LLSNKN Q PNLD++HLLKRFKD LSSDLVL
Subjt:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH+GR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+STAIE F EM RAGLVPTRSA NILIG+LCSLSAK+GA+E+VR RST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L L+PS+FV VQLI ELCR+GQMQEAIKVLKVVE DKLRC EECY+VVM+ALCEHRH++EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMNKKRC PDHVTYSALIHAYGE +NWSAAY LLKEMLS GMSPHFHVYS+VDKLMREHGQ+DLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA

Query:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        MKWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIY+RDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
Subjt:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

A0A5A7TN12 Pentatricopeptide repeat-containing protein3.4e-28287.98Show/hide
Query:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL
        TIR++SALLK + LHF+GFSSHFFSTS TTKHIAIAPR L RRPTSRTAP PR+P+TLGS+DVVN VC+LLSNKN Q PNLD++HLLKRFKD LSSDLVL
Subjt:  TIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVN-VCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH+GR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+STAIE F EM RAGLVPTRSA NILIG+LCSLSAK+GA+E+VR RST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L L+PS+FV VQLI ELCR+GQMQEAIKVLKVVE DKLRC EECY+VVM+ALCEHRH++EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMNKKRC PDHVTYSALIHAYGE +NWSAAY LLKEMLS GMSPHFHVYS+VDKLMREHGQ+DLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLA

Query:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKK
        MKWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIY+RDAFESAFQKK
Subjt:  MKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKK

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like9.0e-29187.41Show/hide
Query:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS
        MLST TIRNASA LKFV    YGFSS+  STS+T K  AIAPR LARRPTSRTAPIPRA DT     V +VC+LLSNKNHQ  NL+LDHLLKRFK+TLSS
Subjt:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS

Query:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        D VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS++GR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKF
Subjt:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR
        GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAK+GAVEQVR
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR

Query:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG
         RSTRRPFTVLVPNVNPKSGAI+ AVGVFWAANRLAL+PS FVIV+LI ELCRLGQMQEAI+VLKVVE +KLRCTEECY++VMQALCEHR V+EASDLFG
Subjt:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG

Query:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC
        RMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMN+KRC PDHVTYSALIHAYGE +NWSAAYSLLKEMLS G+SPHFHVYS+VDKLMRE GQ DLC
Subjt:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC

Query:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        LKL MKWE+QILQKLCKQGQL  AYEK+KSMLEKGFYPPIY+RDAFESAFQKKGKFKIARELLQ MDGVH+HES +R +S
Subjt:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like2.4e-29187.76Show/hide
Query:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS
        MLST TIRNASA LKFV    YGFSS+  STS+TTK  AIAPR LARRPTSRTA IPRA DT     V +VC+LLSNK+HQ  NL+LDHLLKRFK+TLSS
Subjt:  MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSS

Query:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        D VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS++GR+SCRTFSICIRFLGRQGRVREALCLFEEMEP F
Subjt:  DLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR
        GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVFDEM RA LVPTRSAVNILIGDLCSLSAK+GAVEQVR
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVR

Query:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG
         RSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANR+AL+PSAFVIV+LI ELCRLGQMQEAI+VLKVVE +KLRCTEECY++VMQALCEHR V+EASDLFG
Subjt:  FRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFG

Query:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC
        RMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMN+KRC PDHVTYSALIHAYGE +NWSAAYSLLKEMLS G+SPHFHVYSIVDKLMRE GQ DLC
Subjt:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLC

Query:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS
        LKL MKWE+QILQKLCKQGQL AAYEK+KSMLEKGFYPPIY+RDAFESAFQKKGKFKIARELLQ MDGVH+HES TR +S
Subjt:  LKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKMDGVHQHESGTRNSS

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial1.4e-2725Show/hide
Query:  IRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTR
        I  L R  ++ EA   F EM  + G  PD +V+  ++   CK+            +  R   PD  +Y+ +I G C+ G    A ++F EM   GL P  
Subjt:  IRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTR

Query:  SAVNILIGDLCSLSAKDGA--VEQVRFRSTRRP----FTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGD
             LI   C       A  V     ++   P    +T L+  +  K G ++ A  +     ++ L P+ F    ++  LC+ G ++EA+K++   E  
Subjt:  SAVNILIGDLCSLSAKDGA--VEQVRFRSTRRP----FTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGD

Query:  KLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLK
         L      YT +M A C+   +++A ++   ML +G++P +  +N ++   C  G L+D E++   M  K   P+  T+++L+  Y    N  AA ++ K
Subjt:  KLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLK

Query:  EMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM
        +M S G+ P    Y   + L++ H                     CK   ++ A+   + M  KGF   +         F K+ KF  ARE+  +M
Subjt:  EMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.3e-2825.37Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNVIIGLCKFGRFSTAIEVFDEM
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID    + R + L    P+  SY+ VI GLC+ GR      V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNVIIGLCKFGRFSTAIEVFDEM

Query:  NRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVV
        NR G        N LI   C                              K G    A+ +     R  L PS      LI  +C+ G M  A++ L  +
Subjt:  NRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVV

Query:  EGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYS
            L   E  YT ++    +  ++ EA  +   M   G  P +  YN++I   C  G ++DA  V + M +K  +PD V+YS ++  +  + +   A  
Subjt:  EGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYS

Query:  LLKEMLSFGMSPHFHVY-SIVDKLMREHGQIDLC------LKLAMKWE----AQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKF
        + +EM+  G+ P    Y S++     +    + C      L++ +  +      ++   C +G LE A +    M+EKG  P +       +   K+ + 
Subjt:  LLKEMLSFGMSPHFHVY-SIVDKLMREHGQIDLC------LKLAMKWE----AQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKF

Query:  KIARELLQKM
        + A+ LL K+
Subjt:  KIARELLQKM

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011107.7e-2924.81Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAG
        T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ VI GLCK G++  A EVF EM R+G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAG

Query:  LVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKV
        L P  +    L+ + C    K   VE  +  S  R   V+        + ++  +SG ++ A+  F +     LIP   +   LI   CR G +  A+ +
Subjt:  LVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKV

Query:  LKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWS
           +           Y  ++  LC+ + + EA  LF  M  + + P       +I   CKLGNL +A  +F+ M +KR   D VTY+ L+  +G+  +  
Subjt:  LKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWS

Query:  AAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELL
         A  +  +M+S  + P    YSI                        ++  LC +G L  A+     M+ K   P + I ++    + + G        L
Subjt:  AAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELL

Query:  QKM
        +KM
Subjt:  QKM

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic9.0e-3025.4Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y++VI GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNI

Query:  LIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKL
        LI  LC  +  + A E  R  +++     ++P+V   +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKL

Query:  RCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEM
          +   Y  ++   C+     EA ++F  M   G+      YN++I  LCK   ++DA ++   M  +   PD  TY++L+  +    +   A  +++ M
Subjt:  RCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEM

Query:  LSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFYPP
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+   PP
Subjt:  LSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFYPP

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655602.8e-3124.68Show/hide
Query:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKK
        +FF+++ L MGY           D     K+F++M   L     +E   +     +C+       R+ EA+ LF +M+    C P    +  ++ +LC  
Subjt:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKK

Query:  EPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKS
        E   E ++    +      P+ ++Y+ +I  LC   +F  A E+  +M   GL+P     N LI   C     + AV+ V    +R+    L PN    +
Subjt:  EPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKS

Query:  GAIE--------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKL
          I+         A+GV        ++P       LI   CR G    A ++L ++    L   +  YT ++ +LC+ + VEEA DLF  +  +G+ P +
Subjt:  GAIE--------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKL

Query:  AIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQID-------LCLKLA
         +Y ++I   CK G +D+A  + + M  K C P+ +T++ALIH          A  L ++M+  G+ P     +I+   + + G  D         L   
Subjt:  AIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQID-------LCLKLA

Query:  MKWEAQ----ILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM
         K +A      +Q  C++G+L  A + M  M E G  P ++   +    +   G+   A ++L++M
Subjt:  MKWEAQ----ILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.0e-2825Show/hide
Query:  IRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTR
        I  L R  ++ EA   F EM  + G  PD +V+  ++   CK+            +  R   PD  +Y+ +I G C+ G    A ++F EM   GL P  
Subjt:  IRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTR

Query:  SAVNILIGDLCSLSAKDGA--VEQVRFRSTRRP----FTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGD
             LI   C       A  V     ++   P    +T L+  +  K G ++ A  +     ++ L P+ F    ++  LC+ G ++EA+K++   E  
Subjt:  SAVNILIGDLCSLSAKDGA--VEQVRFRSTRRP----FTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGD

Query:  KLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLK
         L      YT +M A C+   +++A ++   ML +G++P +  +N ++   C  G L+D E++   M  K   P+  T+++L+  Y    N  AA ++ K
Subjt:  KLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLK

Query:  EMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM
        +M S G+ P    Y   + L++ H                     CK   ++ A+   + M  KGF   +         F K+ KF  ARE+  +M
Subjt:  EMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein6.4e-3125.4Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y++VI GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNI

Query:  LIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKL
        LI  LC  +  + A E  R  +++     ++P+V   +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKL

Query:  RCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEM
          +   Y  ++   C+     EA ++F  M   G+      YN++I  LCK   ++DA ++   M  +   PD  TY++L+  +    +   A  +++ M
Subjt:  RCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEM

Query:  LSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFYPP
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+   PP
Subjt:  LSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFYPP

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.4e-3024.81Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAG
        T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ VI GLCK G++  A EVF EM R+G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAG

Query:  LVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKV
        L P  +    L+ + C    K   VE  +  S  R   V+        + ++  +SG ++ A+  F +     LIP   +   LI   CR G +  A+ +
Subjt:  LVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKV

Query:  LKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWS
           +           Y  ++  LC+ + + EA  LF  M  + + P       +I   CKLGNL +A  +F+ M +KR   D VTY+ L+  +G+  +  
Subjt:  LKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWS

Query:  AAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELL
         A  +  +M+S  + P    YSI                        ++  LC +G L  A+     M+ K   P + I ++    + + G        L
Subjt:  AAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELL

Query:  QKM
        +KM
Subjt:  QKM

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.3e-3025.37Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNVIIGLCKFGRFSTAIEVFDEM
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID    + R + L    P+  SY+ VI GLC+ GR      V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNVIIGLCKFGRFSTAIEVFDEM

Query:  NRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVV
        NR G        N LI   C                              K G    A+ +     R  L PS      LI  +C+ G M  A++ L  +
Subjt:  NRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVV

Query:  EGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYS
            L   E  YT ++    +  ++ EA  +   M   G  P +  YN++I   C  G ++DA  V + M +K  +PD V+YS ++  +  + +   A  
Subjt:  EGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYS

Query:  LLKEMLSFGMSPHFHVY-SIVDKLMREHGQIDLC------LKLAMKWE----AQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKF
        + +EM+  G+ P    Y S++     +    + C      L++ +  +      ++   C +G LE A +    M+EKG  P +       +   K+ + 
Subjt:  LLKEMLSFGMSPHFHVY-SIVDKLMREHGQIDLC------LKLAMKWE----AQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKF

Query:  KIARELLQKM
        + A+ LL K+
Subjt:  KIARELLQKM

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-3224.68Show/hide
Query:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKK
        +FF+++ L MGY           D     K+F++M   L     +E   +     +C+       R+ EA+ LF +M+    C P    +  ++ +LC  
Subjt:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKK

Query:  EPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKS
        E   E ++    +      P+ ++Y+ +I  LC   +F  A E+  +M   GL+P     N LI   C     + AV+ V    +R+    L PN    +
Subjt:  EPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKS

Query:  GAIE--------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKL
          I+         A+GV        ++P       LI   CR G    A ++L ++    L   +  YT ++ +LC+ + VEEA DLF  +  +G+ P +
Subjt:  GAIE--------PAVGVFWAANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKL

Query:  AIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQID-------LCLKLA
         +Y ++I   CK G +D+A  + + M  K C P+ +T++ALIH          A  L ++M+  G+ P     +I+   + + G  D         L   
Subjt:  AIYNSVICMLCKLGNLDDAERVFKIMNKKRCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQID-------LCLKLA

Query:  MKWEAQ----ILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM
         K +A      +Q  C++G+L  A + M  M E G  P ++   +    +   G+   A ++L++M
Subjt:  MKWEAQ----ILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAFQKKGKFKIARELLQKM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCACCGGCACGATTAGAAATGCTTCGGCGTTGCTCAAATTTGTTCCTCTTCATTTTTATGGCTTCTCTTCACACTTCTTTAGCACTTCAGCGACAACAAAGCA
CATTGCCATAGCTCCAAGACCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTCCAGACACCCTCGGCTCTACCGATGTCGTCAACGTATGTGCTT
TACTTTCAAACAAAAATCACCAAATACCTAATCTCGATCTTGATCATTTATTGAAAAGGTTCAAAGACACCTTAAGTTCGGATCTCGTTCTTCAAATTCTTATGAATTAT
AGGCTGTTGGGTCGGGCTAAAACGCTAGAATTCTTCTCTTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTATATGGCTGATTTCTTAGGTAG
GAGGAAATTGTTTGATGATATGAAGTGTCTTTTGGTGACTGTGTCATCTCATGAGGGTCGGCTTTCTTGTCGAACATTTTCAATTTGTATCAGATTTTTGGGTAGGCAGG
GGAGGGTTAGAGAAGCACTTTGCTTGTTTGAAGAAATGGAACCTAAATTTGGGTGTAAACCTGATAATCTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAA
CCAACTGGGGAATTGATTGATACTGCTCTAACCATTTTCAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAATGTTATTATTGGATTGTGTAAATTTGGTAGATT
TAGTACAGCTATTGAAGTGTTTGATGAAATGAATAGGGCTGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCGTTGAGTGCCAAAGATG
GGGCTGTAGAACAAGTTAGGTTCAGAAGTACTCGTAGACCTTTTACCGTTCTAGTTCCAAATGTGAATCCGAAGAGCGGAGCCATTGAACCTGCAGTGGGAGTTTTTTGG
GCAGCTAATAGGCTGGCTTTAATTCCCAGTGCTTTTGTAATAGTTCAGCTCATCTTGGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAAAGTATTGAAGGTTGT
TGAGGGTGACAAGCTAAGATGTACTGAAGAGTGTTATACTGTTGTGATGCAAGCATTGTGTGAACATCGTCATGTAGAAGAAGCTAGTGATCTGTTTGGGAGGATGCTTT
CTCAGGGCATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAAGAAA
AGATGCACACCTGATCATGTTACTTATTCGGCGTTAATCCATGCCTACGGTGAAACTAAGAATTGGTCGGCAGCCTACAGTTTATTGAAGGAAATGCTGAGTTTTGGCAT
GTCTCCTCATTTTCATGTGTATAGTATAGTGGATAAACTAATGAGGGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGCAATGAAATGGGAAGCTCAAATTTTGCAGA
AGCTTTGTAAACAAGGACAACTGGAGGCCGCATATGAAAAGATGAAGTCAATGCTTGAAAAGGGTTTTTATCCTCCTATCTACATAAGAGATGCTTTTGAGAGTGCATTT
CAAAAGAAGGGTAAGTTTAAGATTGCACGGGAGTTGCTACAGAAGATGGATGGAGTCCACCAACATGAGTCAGGAACCAGAAATTCATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGCACCGGCACGATTAGAAATGCTTCGGCGTTGCTCAAATTTGTTCCTCTTCATTTTTATGGCTTCTCTTCACACTTCTTTAGCACTTCAGCGACAACAAAGCA
CATTGCCATAGCTCCAAGACCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTCCAGACACCCTCGGCTCTACCGATGTCGTCAACGTATGTGCTT
TACTTTCAAACAAAAATCACCAAATACCTAATCTCGATCTTGATCATTTATTGAAAAGGTTCAAAGACACCTTAAGTTCGGATCTCGTTCTTCAAATTCTTATGAATTAT
AGGCTGTTGGGTCGGGCTAAAACGCTAGAATTCTTCTCTTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTATATGGCTGATTTCTTAGGTAG
GAGGAAATTGTTTGATGATATGAAGTGTCTTTTGGTGACTGTGTCATCTCATGAGGGTCGGCTTTCTTGTCGAACATTTTCAATTTGTATCAGATTTTTGGGTAGGCAGG
GGAGGGTTAGAGAAGCACTTTGCTTGTTTGAAGAAATGGAACCTAAATTTGGGTGTAAACCTGATAATCTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAA
CCAACTGGGGAATTGATTGATACTGCTCTAACCATTTTCAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAATGTTATTATTGGATTGTGTAAATTTGGTAGATT
TAGTACAGCTATTGAAGTGTTTGATGAAATGAATAGGGCTGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCGTTGAGTGCCAAAGATG
GGGCTGTAGAACAAGTTAGGTTCAGAAGTACTCGTAGACCTTTTACCGTTCTAGTTCCAAATGTGAATCCGAAGAGCGGAGCCATTGAACCTGCAGTGGGAGTTTTTTGG
GCAGCTAATAGGCTGGCTTTAATTCCCAGTGCTTTTGTAATAGTTCAGCTCATCTTGGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAAAGTATTGAAGGTTGT
TGAGGGTGACAAGCTAAGATGTACTGAAGAGTGTTATACTGTTGTGATGCAAGCATTGTGTGAACATCGTCATGTAGAAGAAGCTAGTGATCTGTTTGGGAGGATGCTTT
CTCAGGGCATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAAGAAA
AGATGCACACCTGATCATGTTACTTATTCGGCGTTAATCCATGCCTACGGTGAAACTAAGAATTGGTCGGCAGCCTACAGTTTATTGAAGGAAATGCTGAGTTTTGGCAT
GTCTCCTCATTTTCATGTGTATAGTATAGTGGATAAACTAATGAGGGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGCAATGAAATGGGAAGCTCAAATTTTGCAGA
AGCTTTGTAAACAAGGACAACTGGAGGCCGCATATGAAAAGATGAAGTCAATGCTTGAAAAGGGTTTTTATCCTCCTATCTACATAAGAGATGCTTTTGAGAGTGCATTT
CAAAAGAAGGGTAAGTTTAAGATTGCACGGGAGTTGCTACAGAAGATGGATGGAGTCCACCAACATGAGTCAGGAACCAGAAATTCATCATGA
Protein sequenceShow/hide protein sequence
MLSTGTIRNASALLKFVPLHFYGFSSHFFSTSATTKHIAIAPRPLARRPTSRTAPIPRAPDTLGSTDVVNVCALLSNKNHQIPNLDLDHLLKRFKDTLSSDLVLQILMNY
RLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHEGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKE
PTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFSTAIEVFDEMNRAGLVPTRSAVNILIGDLCSLSAKDGAVEQVRFRSTRRPFTVLVPNVNPKSGAIEPAVGVFW
AANRLALIPSAFVIVQLILELCRLGQMQEAIKVLKVVEGDKLRCTEECYTVVMQALCEHRHVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNKK
RCTPDHVTYSALIHAYGETKNWSAAYSLLKEMLSFGMSPHFHVYSIVDKLMREHGQIDLCLKLAMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFYPPIYIRDAFESAF
QKKGKFKIARELLQKMDGVHQHESGTRNSS