; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018621 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018621
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153206:1091077..1092699
RNA-Seq ExpressionSgr018621
SyntenySgr018621
Gene Ontology termsGO:0006378 - mRNA polyadenylation (biological process)
GO:0006379 - mRNA cleavage (biological process)
GO:0005847 - mRNA cleavage and polyadenylation specificity factor complex (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146877.1 pentatricopeptide repeat-containing protein At2g01740 [Momordica charantia]8.6e-28287.27Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVK+ALQF AYLRR+SRFPSP+TCNKLLHSL+NSGCGELSAKLLFHFLSKGY+PHPSSFNSIISFFCRLG V++A++IL+SMPRFGCSPDIVSYNSLL G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS
        +C NY+IRKACFLVNRVRGSEL+PDLVMFNILFNGFAK+YMK+EAFM+LGLMWK+CLPNVVTYGTF+DMFCKMGDM+MGN M LDM+KVG+VPN VAFSS
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS

Query:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL
        LIDGYCKAGSLDIAF+YFEKME CSV  NEFTY+TLIDGCCK+GMLG+AD LFEKMLN GI PNCTVYTSIIDGHFKKGNVDDA KYIN MFDREI LDL
Subjt:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL

Query:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE
        TAYTVVISGFRRVGRLQKAMEAA+ VVKNGLLPD IILTAIMD+HFKAGNLKEALNSY+KLLARGFEPDA+TLSTLMDGLCKHGY+QEARRYL  EKANE
Subjt:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE

Query:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG
        +LYTVLIDALCKEGSLDEVERII+EMSEAGFVPDKYVYTSWIAELCKQG+LLKAF++KKRMV+EHIEPDLLTYSSLI G+AEKGLMIEAKQVF+DMLNKG
Subjt:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG

Query:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDVN
        I PDSV  NILIRGYH  GN+ AI GLHDEMRKRGIVI+D N
Subjt:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDVN

XP_022943027.1 pentatricopeptide repeat-containing protein At2g01740 [Cucurbita moschata]4.7e-27285.53Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVKEALQ  A+LRR+SRFPSP+TCNKL+HSLINSGCGELSAK+LFHFLSKGYTPH SSFNSIISFFC+LGN+KYAERILNSMPRFGCSPDIVSYNSLL G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS
        YC +YKIR+ CFLVNR+R   LNPDLVMFNILFNGFAK+YMK EAFM+LGLMWK CLPNVVTYGT VDMFCKMGDM++GN+M  DMMKVG+VPNLVAFSS
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS

Query:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL
        LIDGYCKAGSLD+AF Y E+M+ CSV  NEFTYSTLIDGCCK+GML +AD LFE+ML+ GILPNCTV+TSIIDGHFKKGNVD+A+KYIN MFDREI LDL
Subjt:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL

Query:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE
        TAYTVVISGFRRVGRL KAMEAA+ VVKNGLLPDRIILTAIMDVHFKAGNLKEALN+YR LLARGFEPD VTLS L+DGLCK+GYL EAR+Y+  EKANE
Subjt:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE

Query:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG
        ILYTV IDALCKEG+LDE ER IKEM EAGFVPDKYVYTSWIAELCKQGNLLKAF VKKRMVQE+IEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLN G
Subjt:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG

Query:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ
        I PDSV Y+ILIRGYHNQGN VAISGLHDEMR RGIVI+
Subjt:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ

XP_022995400.1 pentatricopeptide repeat-containing protein At2g01740 [Cucurbita maxima]4.6e-27586.46Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVKEALQF A+LRR+SRFPSP+TCNKL+HSLINSGCGELSAK+LFHFLSKGYTPH SSFNSIISFFC+LGN+KYAERILNSMPRFGCSPDIVSYNSLL G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS
        YC +YKIR+ACFLVNRVRG  LNPDLVMFNILFNGFAK+YMK EAFM+LGLMWK CLPNVVTYGT VDMFCKMGDM++GN+M  DMMKVG+VPNLVAFSS
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS

Query:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL
        LIDGYCKAGSL++AF Y E+M+ CSV  NEFTYSTLIDGCCK+GML +AD LFE+ML+ GILPNCTV+TSIIDGHFKKGNVD+A+KYIN MFDREI LDL
Subjt:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL

Query:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE
        TAYTVVI+GFRRVGRL KAMEAA+ VVKNGLLPDRIILTAIMDVHFKAGNLKEALN+YR LLARGFEPD VTLS L+DGLCK+GYLQEARRY+  EKANE
Subjt:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE

Query:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG
        ILYTVLIDALCKEG+LDE ER IKEM EAGFVPDKYVYTSWIAELCKQGNLLKAF VKKRMVQE+IEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLN G
Subjt:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG

Query:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ
        I PDSV Y+ILIRGYHNQGN VAISGLHDEMR RGIVI+
Subjt:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ

XP_023537260.1 pentatricopeptide repeat-containing protein At2g01740 [Cucurbita pepo subsp. pepo]1.2e-27586.83Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVKEALQF A+LRR+S FPSP+TCNKL+HSLINSGCGELSAK+LFHFLSKGYTPH SSFNSIISFFC+LGN+KYAERILNSMPRFGCSPDIVSYNSLL G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS
        YC +YKIR+ACFLVNRVRG  LNPDLVMFNILFNGFAK YMK EAFM+LGLMWK CLPNVVTYGT VDMFCKMGDM++GN+M  DMMKVG+VPNLVAFSS
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS

Query:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL
        LIDGYCKAGSLD+AF Y E+M+ CSV  NEFTYSTLIDGCCK+GML +AD LFE+ML+ GILPNCTV+TSIIDGHFKKGNVD+A+KYIN MFDREI LDL
Subjt:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL

Query:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE
        TAYTVVISGFRRVGRL KAMEAA+ VVKNGLLPDRIILTAIMDVHFKAGNLKEALN+YR LLARGFEPD VTLSTL+DGLCK+GYLQEARRY+  EKANE
Subjt:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE

Query:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG
        ILYTVLIDALCKEG+LDE ER IKEM EAGFVPDKYVYTSWIAELCKQGNLLKAF VKKRMVQE+IEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLN G
Subjt:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG

Query:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ
        I PDSV Y+ILIRGYHNQGN VAISGLHDEMR RGIVI+
Subjt:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ

XP_038891429.1 pentatricopeptide repeat-containing protein At2g01740 [Benincasa hispida]1.1e-27686.32Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        M+KEAL+F A+LRR+SRFP+P+TCNKLLHSLINSGCG+LSAKLLFHFLS  YTPHPSSFNSIISFFCRLGNVK+AE+ILNSMP FGCSPDIVSYNSLLDG
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELN-PDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
        YC +Y+I+KACFLV+RVRG ELN PDLVMFNILFNG AK+YMK EAFM+LGLMWKYCLP+VVTYGTFVDMFCKMGDM+MGNRM LDMMKVG++PNLV FS
Subjt:  YCTNYKIRKACFLVNRVRGSELN-PDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        SLIDGYCKAGSLD+AFEYFE+ME CSV  NEFTYS LIDGCCK GML +ADSLFEKML+ GILPNCTVYTSIIDGHFKKGNVDD IKYINGMFDREI LD
Subjt:  SLIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKAN
        LTAYTV+ISGF RVGRL K+MEAA+ VVKNGLLPDRIILTAIMDVHFKAGN+KEALN+Y+ LL++GFEPD VT S L+DGLCKHGYLQEARRYL  EKAN
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKAN

Query:  EILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNK
        EILYTV IDALCKEG+LDE ER IKEMSEAGFV DKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDM+NK
Subjt:  EILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNK

Query:  GIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQD
        GI PDSV+Y+ILIRGYHNQGN  AISGLHDEMRKRGI I+D
Subjt:  GIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQD

TrEMBL top hitse value%identityAlignment
A0A1S3BTQ4 pentatricopeptide repeat-containing protein At2g01740 isoform X24.6e-25773.81Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVKEALQF A+LRR+SRFPSP+TCNKLLHSLINSGCG LSAKLL H LSKGYTPHPSSFNSIISFFCR GNVK+AE+I  SM RFGCSPDIVSYNSLLDG
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELN-PDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
        YC++ +I+KACFLVNRVRG ELN PDLVMFNILF GFAK+YMK EAFM+LGLMWKY LP++VTYGTFVDMFCKMGDM+MGNRM LDMMKVG+VPNL+ FS
Subjt:  YCTNYKIRKACFLVNRVRGSELN-PDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKMEGCSV------------------------------------------------------------------------N
        SLIDGYCKAGSLD+AFEYFE+M+ CSV                                                                        N
Subjt:  SLIDGYCKAGSLDIAFEYFEKMEGCSV------------------------------------------------------------------------N

Query:  EFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDLTAYTVVISGFRRVGRLQKAMEAADTVVKN
        EFTYS LIDGC K GML +ADSLFEKML+  ILPNCTVYTSIIDGHFKKGNVDDAIKYIN MFD++I LDLTAYTV+ISGF RVGR  K+MEAA+ V K 
Subjt:  EFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDLTAYTVVISGFRRVGRLQKAMEAADTVVKN

Query:  GLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANEILYTVLIDALCKEGSLDEVERIIKEMSEA
        GLLPDRIILTAIMDVHFKAGN+KEALN+Y+ LLA+GFE D  TLS LMDGL KHGYLQ+ARRY   EKANEILYTV IDALCKEG+LDE E++IKEMSEA
Subjt:  GLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANEILYTVLIDALCKEGSLDEVERIIKEMSEA

Query:  GFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHD
        GFVPDK+VYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GLAEKGLMIEAKQVFDDMLNKGI PD V Y+ILIRGYHNQGN  AISGLHD
Subjt:  GFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHD

Query:  EMRKRGIVIQD
        EMRKRGI ++D
Subjt:  EMRKRGIVIQD

A0A1S3BUW7 pentatricopeptide repeat-containing protein At2g01740 isoform X47.1e-26683.55Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVKEALQF A+LRR+SRFPSP+TCNKLLHSLINSGCG LSAKLL H LSKGYTPHPSSFNSIISFFCR GNVK+AE+I  SM RFGCSPDIVSYNSLLDG
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELN-PDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
        YC++ +I+KACFLVNRVRG ELN PDLVMFNILF GFAK+YMK EAFM+LGLMWKY LP++VTYGTFVDMFCKMGDM+MGNRM LDMMKVG+VPNL+ FS
Subjt:  YCTNYKIRKACFLVNRVRGSELN-PDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        SLIDGYCKAGSLD+AFEYFE+M+ CSV  NEFTYSTLIDGC K GML +ADSLFEKML+  ILPNCTVYTSIIDGHFKKGNVDDAIKYIN MFD++I LD
Subjt:  SLIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKAN
        LTAYTV+ISGF RVGR  K+MEAA+ V K GLLPDRIILTAIMDVHFKAGN+KEALN+Y+ LLA+GFE D  TLS LMDGL KHGYLQ+ARRY   EKAN
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKAN

Query:  EILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNK
        EILYTV IDALCKEG+LDE E++IKEMSEAGFVPDK+VYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GLAEKGLMIEAKQVFDDMLNK
Subjt:  EILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNK

Query:  GIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQD
        GI PD V Y+ILIRGYHNQGN  AISGLHDEMRKRGI ++D
Subjt:  GIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQD

A0A6J1D0T3 pentatricopeptide repeat-containing protein At2g017404.2e-28287.27Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVK+ALQF AYLRR+SRFPSP+TCNKLLHSL+NSGCGELSAKLLFHFLSKGY+PHPSSFNSIISFFCRLG V++A++IL+SMPRFGCSPDIVSYNSLL G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS
        +C NY+IRKACFLVNRVRGSEL+PDLVMFNILFNGFAK+YMK+EAFM+LGLMWK+CLPNVVTYGTF+DMFCKMGDM+MGN M LDM+KVG+VPN VAFSS
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS

Query:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL
        LIDGYCKAGSLDIAF+YFEKME CSV  NEFTY+TLIDGCCK+GMLG+AD LFEKMLN GI PNCTVYTSIIDGHFKKGNVDDA KYIN MFDREI LDL
Subjt:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL

Query:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE
        TAYTVVISGFRRVGRLQKAMEAA+ VVKNGLLPD IILTAIMD+HFKAGNLKEALNSY+KLLARGFEPDA+TLSTLMDGLCKHGY+QEARRYL  EKANE
Subjt:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE

Query:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG
        +LYTVLIDALCKEGSLDEVERII+EMSEAGFVPDKYVYTSWIAELCKQG+LLKAF++KKRMV+EHIEPDLLTYSSLI G+AEKGLMIEAKQVF+DMLNKG
Subjt:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG

Query:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDVN
        I PDSV  NILIRGYH  GN+ AI GLHDEMRKRGIVI+D N
Subjt:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDVN

A0A6J1FRV7 pentatricopeptide repeat-containing protein At2g017402.3e-27285.53Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVKEALQ  A+LRR+SRFPSP+TCNKL+HSLINSGCGELSAK+LFHFLSKGYTPH SSFNSIISFFC+LGN+KYAERILNSMPRFGCSPDIVSYNSLL G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS
        YC +YKIR+ CFLVNR+R   LNPDLVMFNILFNGFAK+YMK EAFM+LGLMWK CLPNVVTYGT VDMFCKMGDM++GN+M  DMMKVG+VPNLVAFSS
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS

Query:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL
        LIDGYCKAGSLD+AF Y E+M+ CSV  NEFTYSTLIDGCCK+GML +AD LFE+ML+ GILPNCTV+TSIIDGHFKKGNVD+A+KYIN MFDREI LDL
Subjt:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL

Query:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE
        TAYTVVISGFRRVGRL KAMEAA+ VVKNGLLPDRIILTAIMDVHFKAGNLKEALN+YR LLARGFEPD VTLS L+DGLCK+GYL EAR+Y+  EKANE
Subjt:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE

Query:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG
        ILYTV IDALCKEG+LDE ER IKEM EAGFVPDKYVYTSWIAELCKQGNLLKAF VKKRMVQE+IEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLN G
Subjt:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG

Query:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ
        I PDSV Y+ILIRGYHNQGN VAISGLHDEMR RGIVI+
Subjt:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ

A0A6J1K3Y4 pentatricopeptide repeat-containing protein At2g017402.2e-27586.46Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MVKEALQF A+LRR+SRFPSP+TCNKL+HSLINSGCGELSAK+LFHFLSKGYTPH SSFNSIISFFC+LGN+KYAERILNSMPRFGCSPDIVSYNSLL G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS
        YC +YKIR+ACFLVNRVRG  LNPDLVMFNILFNGFAK+YMK EAFM+LGLMWK CLPNVVTYGT VDMFCKMGDM++GN+M  DMMKVG+VPNLVAFSS
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSS

Query:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL
        LIDGYCKAGSL++AF Y E+M+ CSV  NEFTYSTLIDGCCK+GML +AD LFE+ML+ GILPNCTV+TSIIDGHFKKGNVD+A+KYIN MFDREI LDL
Subjt:  LIDGYCKAGSLDIAFEYFEKMEGCSV--NEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDL

Query:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE
        TAYTVVI+GFRRVGRL KAMEAA+ VVKNGLLPDRIILTAIMDVHFKAGNLKEALN+YR LLARGFEPD VTLS L+DGLCK+GYLQEARRY+  EKANE
Subjt:  TAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANE

Query:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG
        ILYTVLIDALCKEG+LDE ER IKEM EAGFVPDKYVYTSWIAELCKQGNLLKAF VKKRMVQE+IEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLN G
Subjt:  ILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKG

Query:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ
        I PDSV Y+ILIRGYHNQGN VAISGLHDEMR RGIVI+
Subjt:  IAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQ

SwissProt top hitse value%identityAlignment
P0C894 Putative pentatricopeptide repeat-containing protein At2g021504.4e-7932.13Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        M++EA+Q F+ ++R   FP   +CN LLH     G  +   +     +  G  P   ++N +I   C+ G+V+ A  +   M   G  PD V+YNS++DG
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
        +    ++         ++     PD++ +N L N F K         F   M    L PNVV+Y T VD FCK G M    +  +DM +VG+VPN   ++
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        SLID  CK G+L  AF    +M   G   N  TY+ LIDG C    + +A+ LF KM   G++PN   Y ++I G  K  N+D A++ +N +  R I  D
Subjt:  SLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL------
        L  Y   I G   + +++ A    + + + G+  + +I T +MD +FK+GN  E L+   ++     E   VT   L+DGLCK+  + +A  Y       
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL------

Query:  FGEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVF
        FG +AN  ++T +ID LCK+  ++    + ++M + G VPD+  YTS +    KQGN+L+A  ++ +M +  ++ DLL Y+SL+ GL+    + +A+   
Subjt:  FGEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVF

Query:  DDMLNKGIAPDSVTYNILIRGYHNQG
        ++M+ +GI PD V    +++ ++  G
Subjt:  DDMLNKGIAPDSVTYNILIRGYHNQG

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397109.2e-6929.21Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLF-HFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLD
        ++ +AL      +     P   + N +L + I S      A+ +F   L    +P+  ++N +I  FC  GN+  A  + + M   GC P++V+YN+L+D
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLF-HFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLD

Query:  GYCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKL-YMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAF
        GYC   KI     L+  +    L P+L+ +N++ NG  +   MK+ +F+   +  +    + VTY T +  +CK G+      M  +M++ G+ P+++ +
Subjt:  GYCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKL-YMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAF

Query:  SSLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISL
        +SLI   CKAG+++ A E+ ++M   G   NE TY+TL+DG  ++G + +A  +  +M ++G  P+   Y ++I+GH   G ++DAI  +  M ++ +S 
Subjt:  SSLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISL

Query:  DLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKA
        D+ +Y+ V+SGF R   + +A+     +V+ G+ PD I  ++++    +    KEA + Y ++L  G  PD                             
Subjt:  DLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKA

Query:  NEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS---------------SLIGGLAEK
         E  YT LI+A C EG L++  ++  EM E G +PD   Y+  I  L KQ    +A  +  ++  E   P  +TY                SLI G   K
Subjt:  NEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS---------------SLIGGLAEK

Query:  GLMIEAKQVFDDMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDV
        G+M EA QVF+ ML K   PD   YNI+I G+   G+      L+ EM K G ++  V
Subjt:  GLMIEAKQVFDDMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDV

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655603.9e-6729.87Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        +V E  Q +  +      P+ YT NK+++     G  E + + +   +  G  P   ++ S+I  +C+  ++  A ++ N MP  GC  + V+Y  L+ G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
         C   +I +A  L  +++  E  P +  + +L         K EA   +  M +  + PN+ TY   +D  C     +    +L  M++ G++PN++ ++
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKMEG--CSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        +LI+GYCK G ++ A +  E ME    S N  TY+ LI G CK   + KA  +  KML   +LP+   Y S+IDG  + GN D A + ++ M DR +  D
Subjt:  SLIDGYCKAGSLDIAFEYFEKMEG--CSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEA-----RRYLF
           YT +I    +  R+++A +  D++ + G+ P+ ++ TA++D + KAG + EA     K+L++   P+++T + L+ GLC  G L+EA     +    
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEA-----RRYLF

Query:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD
        G +      T+LI  L K+G  D      ++M  +G  PD + YT++I   C++G LL A  +  +M +  + PDL TYSSLI G  + G    A  V  
Subjt:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD

Query:  DMLNKGIAPDSVTYNILIR
         M + G  P   T+  LI+
Subjt:  DMLNKGIAPDSVTYNILIR

Q9LVQ5 Pentatricopeptide repeat-containing protein At5g558401.4e-6928.86Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        M++++L+ F  +      PS YTCN +L S++ SG        L   L +   P  ++FN +I+  C  G+ + +  ++  M + G +P IV+YN++L  
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
        YC   + + A  L++ ++   ++ D+  +N+L +   +     + ++ L  M K  + PN VTY T ++ F   G + + +++L +M+  G+ PN V F+
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        +LIDG+   G+   A + F  ME  G + +E +Y  L+DG CK      A   + +M  +G+      YT +IDG  K G +D+A+  +N M    I  D
Subjt:  SLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL-----F
        +  Y+ +I+GF +VGR + A E    + + GL P+ II + ++    + G LKEA+  Y  ++  G   D  T + L+  LCK G + EA  ++      
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL-----F

Query:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD
        G   N + +  LI+     G   +   +  EM++ G  P  + Y S +  LCK G+L +A    K +       D + Y++L+  + + G + +A  +F 
Subjt:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD

Query:  DMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVI
        +M+ + I PDS TY  LI G   +G  V       E   RG V+
Subjt:  DMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVI

Q9ZUA2 Pentatricopeptide repeat-containing protein At2g017405.4e-17052.41Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MV+EALQF + LR+ S  P P+TCNK +H LINS CG LS K L + +S+GYTPH SSFNS++SF C+LG VK+AE I++SMPRFGC PD++SYNSL+DG
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSE---LNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVA
        +C N  IR A  ++  +R S      PD+V FN LFNGF+K+ M  E F+++G+M K C PNVVTY T++D FCK G++ +  +    M +  + PN+V 
Subjt:  YCTNYKIRKACFLVNRVRGSE---LNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVA

Query:  FSSLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREIS
        F+ LIDGYCKAG L++A   +++M     S+N  TY+ LIDG CK+G + +A+ ++ +M+ D + PN  VYT+IIDG F++G+ D+A+K++  M ++ + 
Subjt:  FSSLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREIS

Query:  LDLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEK
        LD+TAY V+ISG    G+L++A E  + + K+ L+PD +I T +M+ +FK+G +K A+N Y KL+ RGFEPD V LST++DG+ K+G L EA  Y   EK
Subjt:  LDLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEK

Query:  ANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDML
        AN+++YTVLIDALCKEG   EVER+  ++SEAG VPDK++YTSWIA LCKQGNL+ AF +K RMVQE +  DLL Y++LI GLA KGLM+EA+QVFD+ML
Subjt:  ANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDML

Query:  NKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIV
        N GI+PDS  +++LIR Y  +GN  A S L  +M++RG+V
Subjt:  NKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIV

Arabidopsis top hitse value%identityAlignment
AT2G01740.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-17152.41Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        MV+EALQF + LR+ S  P P+TCNK +H LINS CG LS K L + +S+GYTPH SSFNS++SF C+LG VK+AE I++SMPRFGC PD++SYNSL+DG
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSE---LNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVA
        +C N  IR A  ++  +R S      PD+V FN LFNGF+K+ M  E F+++G+M K C PNVVTY T++D FCK G++ +  +    M +  + PN+V 
Subjt:  YCTNYKIRKACFLVNRVRGSE---LNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVA

Query:  FSSLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREIS
        F+ LIDGYCKAG L++A   +++M     S+N  TY+ LIDG CK+G + +A+ ++ +M+ D + PN  VYT+IIDG F++G+ D+A+K++  M ++ + 
Subjt:  FSSLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREIS

Query:  LDLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEK
        LD+TAY V+ISG    G+L++A E  + + K+ L+PD +I T +M+ +FK+G +K A+N Y KL+ RGFEPD V LST++DG+ K+G L EA  Y   EK
Subjt:  LDLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEK

Query:  ANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDML
        AN+++YTVLIDALCKEG   EVER+  ++SEAG VPDK++YTSWIA LCKQGNL+ AF +K RMVQE +  DLL Y++LI GLA KGLM+EA+QVFD+ML
Subjt:  ANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDML

Query:  NKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIV
        N GI+PDS  +++LIR Y  +GN  A S L  +M++RG+V
Subjt:  NKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIV

AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-8032.13Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        M++EA+Q F+ ++R   FP   +CN LLH     G  +   +     +  G  P   ++N +I   C+ G+V+ A  +   M   G  PD V+YNS++DG
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
        +    ++         ++     PD++ +N L N F K         F   M    L PNVV+Y T VD FCK G M    +  +DM +VG+VPN   ++
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        SLID  CK G+L  AF    +M   G   N  TY+ LIDG C    + +A+ LF KM   G++PN   Y ++I G  K  N+D A++ +N +  R I  D
Subjt:  SLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL------
        L  Y   I G   + +++ A    + + + G+  + +I T +MD +FK+GN  E L+   ++     E   VT   L+DGLCK+  + +A  Y       
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL------

Query:  FGEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVF
        FG +AN  ++T +ID LCK+  ++    + ++M + G VPD+  YTS +    KQGN+L+A  ++ +M +  ++ DLL Y+SL+ GL+    + +A+   
Subjt:  FGEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVF

Query:  DDMLNKGIAPDSVTYNILIRGYHNQG
        ++M+ +GI PD V    +++ ++  G
Subjt:  DDMLNKGIAPDSVTYNILIRGYHNQG

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.5e-7029.21Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLF-HFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLD
        ++ +AL      +     P   + N +L + I S      A+ +F   L    +P+  ++N +I  FC  GN+  A  + + M   GC P++V+YN+L+D
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLF-HFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLD

Query:  GYCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKL-YMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAF
        GYC   KI     L+  +    L P+L+ +N++ NG  +   MK+ +F+   +  +    + VTY T +  +CK G+      M  +M++ G+ P+++ +
Subjt:  GYCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKL-YMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAF

Query:  SSLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISL
        +SLI   CKAG+++ A E+ ++M   G   NE TY+TL+DG  ++G + +A  +  +M ++G  P+   Y ++I+GH   G ++DAI  +  M ++ +S 
Subjt:  SSLIDGYCKAGSLDIAFEYFEKM--EGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISL

Query:  DLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKA
        D+ +Y+ V+SGF R   + +A+     +V+ G+ PD I  ++++    +    KEA + Y ++L  G  PD                             
Subjt:  DLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKA

Query:  NEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS---------------SLIGGLAEK
         E  YT LI+A C EG L++  ++  EM E G +PD   Y+  I  L KQ    +A  +  ++  E   P  +TY                SLI G   K
Subjt:  NEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS---------------SLIGGLAEK

Query:  GLMIEAKQVFDDMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDV
        G+M EA QVF+ ML K   PD   YNI+I G+   G+      L+ EM K G ++  V
Subjt:  GLMIEAKQVFDDMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDV

AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-7028.86Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        M++++L+ F  +      PS YTCN +L S++ SG        L   L +   P  ++FN +I+  C  G+ + +  ++  M + G +P IV+YN++L  
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
        YC   + + A  L++ ++   ++ D+  +N+L +   +     + ++ L  M K  + PN VTY T ++ F   G + + +++L +M+  G+ PN V F+
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        +LIDG+   G+   A + F  ME  G + +E +Y  L+DG CK      A   + +M  +G+      YT +IDG  K G +D+A+  +N M    I  D
Subjt:  SLIDGYCKAGSLDIAFEYFEKME--GCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL-----F
        +  Y+ +I+GF +VGR + A E    + + GL P+ II + ++    + G LKEA+  Y  ++  G   D  T + L+  LCK G + EA  ++      
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYL-----F

Query:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD
        G   N + +  LI+     G   +   +  EM++ G  P  + Y S +  LCK G+L +A    K +       D + Y++L+  + + G + +A  +F 
Subjt:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD

Query:  DMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVI
        +M+ + I PDS TY  LI G   +G  V       E   RG V+
Subjt:  DMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVI

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein2.7e-6829.87Show/hide
Query:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG
        +V E  Q +  +      P+ YT NK+++     G  E + + +   +  G  P   ++ S+I  +C+  ++  A ++ N MP  GC  + V+Y  L+ G
Subjt:  MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDG

Query:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS
         C   +I +A  L  +++  E  P +  + +L         K EA   +  M +  + PN+ TY   +D  C     +    +L  M++ G++PN++ ++
Subjt:  YCTNYKIRKACFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCL-PNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFS

Query:  SLIDGYCKAGSLDIAFEYFEKMEG--CSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD
        +LI+GYCK G ++ A +  E ME    S N  TY+ LI G CK   + KA  +  KML   +LP+   Y S+IDG  + GN D A + ++ M DR +  D
Subjt:  SLIDGYCKAGSLDIAFEYFEKMEG--CSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLD

Query:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEA-----RRYLF
           YT +I    +  R+++A +  D++ + G+ P+ ++ TA++D + KAG + EA     K+L++   P+++T + L+ GLC  G L+EA     +    
Subjt:  LTAYTVVISGFRRVGRLQKAMEAADTVVKNGLLPDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEA-----RRYLF

Query:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD
        G +      T+LI  L K+G  D      ++M  +G  PD + YT++I   C++G LL A  +  +M +  + PDL TYSSLI G  + G    A  V  
Subjt:  GEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFD

Query:  DMLNKGIAPDSVTYNILIR
         M + G  P   T+  LI+
Subjt:  DMLNKGIAPDSVTYNILIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAAAGAAGCCCTCCAATTCTTTGCTTATTTGCGACGAGTCTCCCGGTTTCCCTCCCCTTACACCTGCAACAAGCTTCTGCACTCTCTTATCAACTCCGGCTGCGG
CGAGCTCTCGGCCAAATTACTTTTCCACTTCCTCTCCAAAGGGTACACTCCCCATCCATCTTCTTTCAATTCTATCATCTCCTTCTTCTGTAGATTAGGGAACGTGAAAT
ATGCGGAACGGATTTTGAATTCAATGCCCAGATTTGGTTGCTCGCCTGATATTGTATCTTACAATTCTTTGTTAGATGGGTACTGTACAAATTATAAAATCCGGAAGGCT
TGTTTTCTAGTGAACAGAGTTCGTGGGAGCGAGTTGAATCCTGATTTGGTTATGTTCAATATACTGTTTAATGGGTTTGCTAAGCTTTATATGAAGAAAGAAGCATTTAT
GTTTCTGGGTTTGATGTGGAAATACTGTTTGCCTAATGTTGTTACTTATGGAACATTTGTTGATATGTTCTGCAAGATGGGGGATATGGATATGGGTAACAGAATGTTAT
TGGATATGATGAAGGTGGGGATGGTGCCAAACTTGGTTGCTTTTAGCTCCTTGATTGATGGCTATTGCAAGGCTGGGAGTTTGGATATTGCATTTGAATACTTTGAGAAA
ATGGAGGGATGTTCGGTGAATGAGTTCACATATTCTACTTTGATTGACGGTTGCTGCAAGGAAGGGATGTTGGGAAAAGCCGACTCATTGTTCGAAAAGATGTTGAATGA
TGGTATTCTGCCTAATTGTACTGTTTACACTTCGATAATAGATGGACATTTTAAGAAGGGAAATGTAGACGATGCGATAAAGTATATAAATGGGATGTTTGATCGAGAGA
TAAGCCTAGATTTAACAGCATATACGGTAGTTATATCGGGCTTTCGTAGAGTTGGTAGGTTGCAGAAGGCAATGGAAGCTGCAGATACTGTGGTGAAGAATGGATTACTT
CCTGATAGAATAATACTGACAGCTATTATGGATGTGCATTTCAAAGCTGGAAACTTAAAAGAAGCTTTGAATTCATACAGAAAATTGCTCGCAAGGGGTTTTGAGCCTGA
TGCTGTGACTCTTTCAACTCTGATGGACGGCCTATGCAAGCACGGGTATTTGCAGGAGGCAAGACGGTATTTGTTCGGAGAAAAGGCCAATGAAATCTTGTATACAGTGC
TTATAGATGCACTCTGCAAGGAGGGGAGTTTAGATGAAGTTGAGAGAATCATTAAGGAGATGTCTGAGGCAGGGTTTGTTCCAGATAAATATGTGTACACTTCTTGGATT
GCAGAGCTTTGCAAGCAAGGAAATTTGCTGAAGGCTTTCATGGTCAAGAAAAGGATGGTTCAAGAGCACATTGAACCTGATTTATTAACCTATAGCTCCCTAATTGGCGG
TTTGGCTGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATAAAGGAATCGCTCCAGATTCTGTCACTTATAACATTCTCATAAGAGGGTACC
ATAATCAGGGTAATAAAGTTGCAATTTCAGGTCTACACGATGAAATGAGAAAGAGAGGAATTGTTATTCAAGATGTAAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCAAAGAAGCCCTCCAATTCTTTGCTTATTTGCGACGAGTCTCCCGGTTTCCCTCCCCTTACACCTGCAACAAGCTTCTGCACTCTCTTATCAACTCCGGCTGCGG
CGAGCTCTCGGCCAAATTACTTTTCCACTTCCTCTCCAAAGGGTACACTCCCCATCCATCTTCTTTCAATTCTATCATCTCCTTCTTCTGTAGATTAGGGAACGTGAAAT
ATGCGGAACGGATTTTGAATTCAATGCCCAGATTTGGTTGCTCGCCTGATATTGTATCTTACAATTCTTTGTTAGATGGGTACTGTACAAATTATAAAATCCGGAAGGCT
TGTTTTCTAGTGAACAGAGTTCGTGGGAGCGAGTTGAATCCTGATTTGGTTATGTTCAATATACTGTTTAATGGGTTTGCTAAGCTTTATATGAAGAAAGAAGCATTTAT
GTTTCTGGGTTTGATGTGGAAATACTGTTTGCCTAATGTTGTTACTTATGGAACATTTGTTGATATGTTCTGCAAGATGGGGGATATGGATATGGGTAACAGAATGTTAT
TGGATATGATGAAGGTGGGGATGGTGCCAAACTTGGTTGCTTTTAGCTCCTTGATTGATGGCTATTGCAAGGCTGGGAGTTTGGATATTGCATTTGAATACTTTGAGAAA
ATGGAGGGATGTTCGGTGAATGAGTTCACATATTCTACTTTGATTGACGGTTGCTGCAAGGAAGGGATGTTGGGAAAAGCCGACTCATTGTTCGAAAAGATGTTGAATGA
TGGTATTCTGCCTAATTGTACTGTTTACACTTCGATAATAGATGGACATTTTAAGAAGGGAAATGTAGACGATGCGATAAAGTATATAAATGGGATGTTTGATCGAGAGA
TAAGCCTAGATTTAACAGCATATACGGTAGTTATATCGGGCTTTCGTAGAGTTGGTAGGTTGCAGAAGGCAATGGAAGCTGCAGATACTGTGGTGAAGAATGGATTACTT
CCTGATAGAATAATACTGACAGCTATTATGGATGTGCATTTCAAAGCTGGAAACTTAAAAGAAGCTTTGAATTCATACAGAAAATTGCTCGCAAGGGGTTTTGAGCCTGA
TGCTGTGACTCTTTCAACTCTGATGGACGGCCTATGCAAGCACGGGTATTTGCAGGAGGCAAGACGGTATTTGTTCGGAGAAAAGGCCAATGAAATCTTGTATACAGTGC
TTATAGATGCACTCTGCAAGGAGGGGAGTTTAGATGAAGTTGAGAGAATCATTAAGGAGATGTCTGAGGCAGGGTTTGTTCCAGATAAATATGTGTACACTTCTTGGATT
GCAGAGCTTTGCAAGCAAGGAAATTTGCTGAAGGCTTTCATGGTCAAGAAAAGGATGGTTCAAGAGCACATTGAACCTGATTTATTAACCTATAGCTCCCTAATTGGCGG
TTTGGCTGAGAAGGGACTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATAAAGGAATCGCTCCAGATTCTGTCACTTATAACATTCTCATAAGAGGGTACC
ATAATCAGGGTAATAAAGTTGCAATTTCAGGTCTACACGATGAAATGAGAAAGAGAGGAATTGTTATTCAAGATGTAAACTGA
Protein sequenceShow/hide protein sequence
MVKEALQFFAYLRRVSRFPSPYTCNKLLHSLINSGCGELSAKLLFHFLSKGYTPHPSSFNSIISFFCRLGNVKYAERILNSMPRFGCSPDIVSYNSLLDGYCTNYKIRKA
CFLVNRVRGSELNPDLVMFNILFNGFAKLYMKKEAFMFLGLMWKYCLPNVVTYGTFVDMFCKMGDMDMGNRMLLDMMKVGMVPNLVAFSSLIDGYCKAGSLDIAFEYFEK
MEGCSVNEFTYSTLIDGCCKEGMLGKADSLFEKMLNDGILPNCTVYTSIIDGHFKKGNVDDAIKYINGMFDREISLDLTAYTVVISGFRRVGRLQKAMEAADTVVKNGLL
PDRIILTAIMDVHFKAGNLKEALNSYRKLLARGFEPDAVTLSTLMDGLCKHGYLQEARRYLFGEKANEILYTVLIDALCKEGSLDEVERIIKEMSEAGFVPDKYVYTSWI
AELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIGGLAEKGLMIEAKQVFDDMLNKGIAPDSVTYNILIRGYHNQGNKVAISGLHDEMRKRGIVIQDVN