CuGenDBv2

Sgr029964 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029964
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153554:1516909..1531959
RNA-Seq Expression: Sgr029964
Synteny: Sgr029964
Gene Ontology terms:
    GO:0005515 - protein binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR032867 - DYW domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576861.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 75.67
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF
        ++ LPCKWR+ISLFRPSFQACC LYS  TT+PKYY + VE EKKEIDFNRLF +C KVHLAKRLHALL+VSGK QS+FLSAK INLYAFLGD+SF+R TF
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF

Query:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------
        DQI+AKDVYTWNSMISAYARIGHFHEA+DCF+EFMSTS LQPDYYTFPPVIRACGNLDDGKKIHCL LKLGFE DVF+AASLIHFYSR            
Subjt:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------

Query:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF
                                 AL VFDEMRFK V MDSVT SSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFV NALINMYAKFGELGSA+ IF
Subjt:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF

Query:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS---------------------------------------
        N++E +D+VSWNSLIAAFEQNKEP+VALG+++KMHA    PDLLTLVSLASVAAE GN LS                                       
Subjt:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS---------------------------------------

Query:  -----------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA
                          GYSQNG ANEAI+VYHLM DYSDAVPNQGTWVSILTAYS IGALKQGMKTHG LIK+ LYFDIFVGTCL+D+YGKCGRL DA
Subjt:  -----------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA

Query:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------
        +SLFYE+PHKSSVSWNAIISCHGLHG GLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E+YGIRPSLKHYGCM           
Subjt:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------

Query:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT
                    PDASVWGALLGACRIHENVEL RTVSDHLLEVES+NVGYYVLLSNIYAK GQW+GVDEVRSLARDRGL+KTPGWSSIE+DKKIDVFYT
Subjt:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT

Query:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
         N+THP+CEEIY ELR LTAKMKSLGYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPP+TTL+IFKNLRVCGDCHNATKFISKITEREIIVRD
Subjt:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD

Query:  SNRFHHFKDGVCSCGDYW
        SNRFHHFKDGVCSCGDYW
Subjt:  SNRFHHFKDGVCSCGDYW

KAG7014884.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 72.42
Query:  MVLQLKDNIWNDFTENDDYIVPHLGDELWDLFEVQ------SREVVGFTYDPYTATKFSFHDKEKVRIQTSIMKDTIVEKDSWSHTPDGVPSPLNGDSFK
        MVLQLKDN+W DFTE DDYIVPH+GDELWDLFEV+      S   VGFT   Y+A+KFSF  KEKV+ QTSIMK+T++EKDSWSHTPDG PS LN  + K
Subjt:  MVLQLKDNIWNDFTENDDYIVPHLGDELWDLFEVQ------SREVVGFTYDPYTATKFSFHDKEKVRIQTSIMKDTIVEKDSWSHTPDGVPSPLNGDSFK

Query:  DMKMESSSLSMSNHCFKTGVGTDLEYCTDDPIVTDNSAAEENDMYQYSVSHISQTENDISFLDDDHENKENNDLLYYGWQDIESFEDVDRMFRNCDSTFG
        DMKMESSSLSMSNHCFKT VGTDL+YCTDD IVTDNSAA+ENDMYQYSVSH+SQT+NDISFLDDD EN ENNDLLYYGWQDI SFEDVDRMFRNCDSTFG
Subjt:  DMKMESSSLSMSNHCFKTGVGTDLEYCTDDPIVTDNSAAEENDMYQYSVSHISQTENDISFLDDDHENKENNDLLYYGWQDIESFEDVDRMFRNCDSTFG

Query:  LGNLSNEDELRWFSPSHGSEKLEDPSKSNFKFSCCEGSTINDASECNEDSNPVNSDPSSDGLNRNNILTGCKVNDGIADMCDSAAISHLSTADMSDTKSN
        LGNL+NED+LRWFSPSHGSEK EDPSKSNFKFSCCEGSTI DASE NE+SNPVNS+PS DGLNRNNIL GCKVNDG  D+ DS AISHLS ADMSD KSN
Subjt:  LGNLSNEDELRWFSPSHGSEKLEDPSKSNFKFSCCEGSTINDASECNEDSNPVNSDPSSDGLNRNNILTGCKVNDGIADMCDSAAISHLSTADMSDTKSN

Query:  SRGDLIPKKQESSYASNQLLSIRSSHYPSLDAPAIAANENREKLYHQDLQASFNKNFTFMSTPSSETFNTSFPVRKQAPRSESDIDDGHSETGVVSRGSR
        SRGDLIP+KQESSYAS+QL S+ SSHYPS DAP  AANENREKLYHQDL ASFNKNFTFMS PSSETFNTSFPVRKQAPRSES+IDDGHSETGVVSRG+R
Subjt:  SRGDLIPKKQESSYASNQLLSIRSSHYPSLDAPAIAANENREKLYHQDLQASFNKNFTFMSTPSSETFNTSFPVRKQAPRSESDIDDGHSETGVVSRGSR

Query:  EELDSSNARDKSCR-STVLNGISLEATSFCQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANLNENIGEDKLGRIAPSIDQDTNR-----------
         ELDSSNA+DKSCR  T+L+GISLEATSF QLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANLNENIGEDKL R   SIDQD NR           
Subjt:  EELDSSNARDKSCR-STVLNGISLEATSFCQLQQVMEQLDIRTKLCIRDSLYRLARSAEQRHNCANLNENIGEDKLGRIAPSIDQDTNR-----------

Query:  -----------------------SGGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLNHLQFEKLERAFSSQCASRKTNFPGRNRWSC
                               SG FLDLETDTNPIDRSVAHLLFHRPSDPS+MP GGNTLPLKS+KL                A+ K NF      + 
Subjt:  -----------------------SGGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLNHLQFEKLERAFSSQCASRKTNFPGRNRWSC

Query:  FLCRSKATGKWEETMSNKVEDSLSNLGIFCEENFPASESIERKSMRMIKMRESTGFGAVKRKFRDFSDWNNSGEAEERLLHTRVLMGMDSS-----GLVK
          C   A  K  E +  +VEDSL NLGI CEE+   S  +ERK M MI+  ESTG GA+K K +DF  WNNSGEAEE+++  +      SS      ++K
Subjt:  FLCRSKATGKWEETMSNKVEDSLSNLGIFCEENFPASESIERKSMRMIKMRESTGFGAVKRKFRDFSDWNNSGEAEERLLHTRVLMGMDSS-----GLVK

Query:  SSSSVQISSWSFNYLLKTLDAIKLLPCKWRQISLFRPSFQACCSLYSVATT--TPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSV
           S+     S   L ++    K L  +   +    PSFQACC LYS  TT  TPKYYL+ VE EKKEIDFNRLFL+C KVHLAKRLHALLVVSGK QS+
Subjt:  SSSSVQISSWSFNYLLKTLDAIKLLPCKWRQISLFRPSFQACCSLYSVATT--TPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSV

Query:  FLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVF
        FLSAK INLYAFLGD+SF+R TFDQI+AKDVYTWNSMISAYARIGHFHEA+DCF+EFMSTS LQPDYYTFPPVIRACGNLDDGKKIHCL LKLGFE DVF
Subjt:  FLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVF

Query:  VAASLIHFYSR-------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDL
        +AASLIHFYSR                                     AL VFDEMRFK V MDSVT SSLLPICAQLDDIISGVLIHVYAIKLGLEFDL
Subjt:  VAASLIHFYSR-------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDL

Query:  FVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS----------------
        FV NALINMYAKFGELGSA+ IFN++E +D+VSWNSLIAAFEQNKEP+VALG+++KMHA    PDLLTLVSLASVAAE GN LS                
Subjt:  FVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS----------------

Query:  ----------------------------------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRL
                                                 GYSQNG ANEAI+VYHLM DYSDAVPNQGTWVSILTAYS IGALKQGMKTHG LIK+ L
Subjt:  ----------------------------------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRL

Query:  YFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMY
        YFDIFVGTCL+D+YGKCGRL DA+SLFYE+PHKSSVSWNAIISCHGLHG GLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E+Y
Subjt:  YFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMY

Query:  GIRPSLKHYGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARD
        GIRPSLKHYGCM                       PDASVWGALLGACRIHENVEL RTVSDHLLEVES+NVGYYVLLSNIYAK GQW+GVDEVRSLARD
Subjt:  GIRPSLKHYGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARD

Query:  RGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRV
        RGL+KTPGWSSIE+DKKIDVFYT N+THP+CEEIY ELR LTAKMKSLGYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPP+TTL+IFKNLRV
Subjt:  RGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRV

Query:  CGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
        CGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
Subjt:  CGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW

XP_022141202.1 pentatricopeptide repeat-containing protein At4g33990 [Momordica charantia] | e value: 0.0e+00 | % identity: 78.36
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF
        ++LLPCKWRQISLFRPSFQACCSLYS  TTTPKYYL+GVENE+KEIDFNRLFL CTKVHLAK LH LLVVSGKAQS+FLSAK  NLYAFLGDIS SR TF
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF

Query:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------
        DQI+AKDVYTWN MISAYARIGHFHEA+DCFYEFMSTS LQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFE DVFVAASLIHFYSR            
Subjt:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------

Query:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF
                                 AL VFDEMR K VTMD VTISSLLPICAQLDDIISG+LIHVYAIKLGLEFDLFVSNALI +YAKFGELGSA  IF
Subjt:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF

Query:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG-------------------------------------------
        N+MEVRDVVSWNSLIAAFEQNKEPMVAL V++KMHA+ + PDLLTLVSLASVAAE G                                           
Subjt:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG-------------------------------------------

Query:  -------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA
                     NSL  GYSQNGFANEAIEVYHLMK+YSDAVPNQGTWVSILTAYS +G LKQGMKTHGQLIK+ LYFDIFV TCL+DMYGKCGRL+DA
Subjt:  -------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA

Query:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------
        +SLFYEVPHKSSVSWNAIISCHGLHG GLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQ M E YGI PSLKHYGCM           
Subjt:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------

Query:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT
                    PDASVWGALLGACRIHENVELVRTVSDHLLEVES+NVGYYVLLSNIYAK GQWEGVDEVRSLARDRGLKKTPGWSSIEV+KKIDVFYT
Subjt:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT

Query:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
         NQTHPKCEEIYKELR+LTAKMKSLGYV DY FVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
Subjt:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD

Query:  SNRFHHFKDGVCSCGDYW
        SNRFHHFKDGVCSCGDYW
Subjt:  SNRFHHFKDGVCSCGDYW

XP_022923150.1 pentatricopeptide repeat-containing protein At4g33990 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 76.22
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATT--TPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRL
        ++ LPCKWR+ISLFRPSFQACC LYS  TT  TPKYYL+ VE EKKEIDFNRLFL+C KVHLAKRLHALLVVSGK QS+FLSAK INLYAFLGD+SF+R 
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATT--TPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRL

Query:  TFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR----------
        TFDQI+AKDVYTWNSMISAYARIGHFHEA+DCF+EFMSTS LQPDYYTFPPVIRACGNLDDGKKIHCL LKLGFE DVF+AASLIHFYSR          
Subjt:  TFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR----------

Query:  ---------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQI
                                   AL VFDEMRFK VTMDSVT SSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFV NALINMYAKFGELGSA+ 
Subjt:  ---------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQI

Query:  IFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS-------------------------------------
        IFN++E +D+VSWNSLIAAFEQNKEP+VALG+++KMHA    PDLLTLVSLASVAAE GN LS                                     
Subjt:  IFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS-------------------------------------

Query:  -------------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLD
                            GYSQNG ANEAI+VYHLM DYSDAVPNQGTWVSILTAYS IGALKQGMKTHG LIK+ LYFDIFVGTCL+DMYGKCGRL 
Subjt:  -------------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLD

Query:  DAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM---------
        DA+SLFYE+PHKSSVSWNAIISCHGLHG GLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E+YGIRPSLKHYGCM         
Subjt:  DAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM---------

Query:  --------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVF
                      PDASVWGALLGACRIHENVEL RTVSDHLLEVES+NVGYYVLLSNIYAK GQW+GVDEVRSLARDRGL+KTPGWSSIE+DKKIDVF
Subjt:  --------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVF

Query:  YTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIV
        YT N+THP+CEEIY ELR LTAKMKSLGYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPP+TTL+IFKNLRVCGDCHNATKFISKITEREIIV
Subjt:  YTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIV

Query:  RDSNRFHHFKDGVCSCGDYW
        RDSNRFHHFKDGVCSCGDYW
Subjt:  RDSNRFHHFKDGVCSCGDYW

XP_038903939.1 pentatricopeptide repeat-containing protein At4g33990 [Benincasa hispida] | e value: 0.0e+00 | % identity: 76.04
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF
        ++ L CKWR++SLFRPSFQ CCSLYS  TT PKYYL+GVENEK+EIDFNRLFL CTKVH A+RLHALLVVSGKAQ++FLSAK +NLYAFLGDISF+  TF
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF

Query:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------
        DQI+ KDVYTWNSMISAYARIG FHEA+DCFY+F+STS LQPDYYTFPPVIRACG LDDGKKIHCLVLKLGFE DVF+AASL+HFYSR            
Subjt:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------

Query:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF
                                 AL VFDEMRFK VTMDSVTIS+LLPICAQLDDIISGVLIHVYAIKLGLEFDLFV NALINMYAKFGEL SA+ IF
Subjt:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF

Query:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS---------------------------------------
        N+MEVRD+VSWNSLIAAFEQNKEP+VALGV+ KMH++ V PDLLTLVSLASVAAE GN LS                                       
Subjt:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS---------------------------------------

Query:  -----------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA
                          GYSQNG AN+AI+VYH MKDYSDAVPNQGTWVSILTA+S IGALKQGMKTHGQLIK  LYFDIFVGTCL+DMYGKCGRL DA
Subjt:  -----------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA

Query:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------
        +SLFYEVPHKSSVSWNAIISCHGLHG GLKAVELFREMQ+EGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM + +GI+P LKHYGCM           
Subjt:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------

Query:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT
                    PDASVWGALLGACRIHENVELVRT+SDHLLEVESENVGYYVLLSNIYAK GQWEGV+EVRSLARDRGLKKTPGWSSIEVDKKIDVFYT
Subjt:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT

Query:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
         NQTHPKCEEIY ELR LTAKMK LGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPP+TTL+IFKNLRVCGDCHNATKFISKITEREIIVRD
Subjt:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD

Query:  SNRFHHFKDGVCSCGDYW
        SNRFHHFKDGVCSCGDYW
Subjt:  SNRFHHFKDGVCSCGDYW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L0N9 DYW_deaminase domain-containing protein | e value: 0.0e+00 | % identity: 73.11
Query:  LLKTLD--------AIKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFIN
        LLK +D         ++ L CKWR++SLF+PSFQA CSLYS AT  PK YL+GVENEK+EIDFNR+FL CTKVHLAK+LHALLVVSGK QS+FLSAK IN
Subjt:  LLKTLD--------AIKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFIN

Query:  LYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHF
         YAFLGDI  +RLTFDQI+ KDVYTWNSMISAYARIGHFH A+DCF EF+STS LQ D+YTFPPVIRACGNLDDG+K+HCLVLKLGFE DV++AAS IHF
Subjt:  LYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHF

Query:  YSR-------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALIN
        YSR                                     AL VFDEMRFK V+MDSVTISSLLPIC QLDDIISGVLIHVYAIKLGLEFDLFV NALIN
Subjt:  YSR-------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALIN

Query:  MYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG----------------------------
        MYAKFGEL SA+ IFN+M+VRD+VSWNSL+AAFEQNK+P++ALGV+ KMH++ V PDLLTLVSLASVAAE G                            
Subjt:  MYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG----------------------------

Query:  ----------------------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGT
                                    NSL  GYSQNG ANEAI+VY  M+ YS AVPNQGTWVSILTA+S +GALKQGMK HGQLIK+ LYFDIFV T
Subjt:  ----------------------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGT

Query:  CLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKH
        CLVDMYGKCG+L DA+SLFYEVPH+SSVSWNAIISCHGLHG GLKAV+LF+EMQ+EGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E YGIRPSLKH
Subjt:  CLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKH

Query:  YGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPG
        YGCM                       PD SVWGALLGACRIHENVELVRTVSDHLL+VESENVGYYVLLSNIYAK G WEGVDEVRSLARDRGLKKTPG
Subjt:  YGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPG

Query:  WSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNAT
        WSSIEVDKKIDVFYT NQTHPKCEEIY ELR LTAKMKS+GYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPP+TTL+IFKNLRVCGDCHNAT
Subjt:  WSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNAT

Query:  KFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
        KFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
Subjt:  KFISKITEREIIVRDSNRFHHFKDGVCSCGDYW

A0A1S3C233 pentatricopeptide repeat-containing protein At4g33990 | e value: 0.0e+00 | % identity: 74.08
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF
        ++ L CKWRQ+SLF+PSFQACCSLYS ATT PKYYL+GVENEK+EIDFNRLFL CTKVHLAK+LH LLVVSGK QS+FLSAK IN YAFLGDIS +RLTF
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF

Query:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------
        DQI+ KDVYTWNSMISAYARIGHFH A+DCF EF+STS LQ D+YTFPPVIRACGNLDDG+KIHCLVLKLGFE DV++AAS IHFYSR            
Subjt:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------

Query:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF
                                 AL VFDEMR K VTMDSVTISSLLPICAQLDDII GVLIHVYAIKLGLEFDLFV NALINMYAKFGEL SA+ IF
Subjt:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF

Query:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG-------------------------------------------
        N+M+VRD+VSWNSL+AAFEQNK+P++ALGV+ KMH++ + PDLLTLVSLASV AE G                                           
Subjt:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG-------------------------------------------

Query:  -------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA
                     NSL  GYSQNG ANEAI+VY  M+DYS+AVPNQGTWVSILTA S +GALKQGMKTHGQLIK+ LYFDIFV TCL+DMYGKCGRL DA
Subjt:  -------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA

Query:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------
        +SLFYEVPHKSSVSWNAIISCHGLHG GLKAV+LF+EMQ+EGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM   Y IRPSLKHYGCM           
Subjt:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------

Query:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT
                    PD SVWGALLGACRIHENVELVRTVSDHLL+VES+NVGYYVLLSNIYAKFGQWEG D VRS AR+RGLKKTPGWSSIEVDKKIDVFYT
Subjt:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT

Query:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
         NQTHPKCEEIY ELR LTAKMKS+GYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPP+TTL+IFKNLRVCGDCHNATKFISKITEREIIVRD
Subjt:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD

Query:  SNRFHHFKDGVCSCGDYW
        SNRFHHFKDG CSCGDYW
Subjt:  SNRFHHFKDGVCSCGDYW

A0A6J1CJ79 pentatricopeptide repeat-containing protein At4g33990 | e value: 0.0e+00 | % identity: 78.36
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF
        ++LLPCKWRQISLFRPSFQACCSLYS  TTTPKYYL+GVENE+KEIDFNRLFL CTKVHLAK LH LLVVSGKAQS+FLSAK  NLYAFLGDIS SR TF
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF

Query:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------
        DQI+AKDVYTWN MISAYARIGHFHEA+DCFYEFMSTS LQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFE DVFVAASLIHFYSR            
Subjt:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------

Query:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF
                                 AL VFDEMR K VTMD VTISSLLPICAQLDDIISG+LIHVYAIKLGLEFDLFVSNALI +YAKFGELGSA  IF
Subjt:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF

Query:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG-------------------------------------------
        N+MEVRDVVSWNSLIAAFEQNKEPMVAL V++KMHA+ + PDLLTLVSLASVAAE G                                           
Subjt:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERG-------------------------------------------

Query:  -------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA
                     NSL  GYSQNGFANEAIEVYHLMK+YSDAVPNQGTWVSILTAYS +G LKQGMKTHGQLIK+ LYFDIFV TCL+DMYGKCGRL+DA
Subjt:  -------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA

Query:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------
        +SLFYEVPHKSSVSWNAIISCHGLHG GLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQ M E YGI PSLKHYGCM           
Subjt:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------

Query:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT
                    PDASVWGALLGACRIHENVELVRTVSDHLLEVES+NVGYYVLLSNIYAK GQWEGVDEVRSLARDRGLKKTPGWSSIEV+KKIDVFYT
Subjt:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT

Query:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
         NQTHPKCEEIYKELR+LTAKMKSLGYV DY FVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
Subjt:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD

Query:  SNRFHHFKDGVCSCGDYW
        SNRFHHFKDGVCSCGDYW
Subjt:  SNRFHHFKDGVCSCGDYW

A0A6J1EAW2 pentatricopeptide repeat-containing protein At4g33990 | e value: 0.0e+00 | % identity: 76.22
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATT--TPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRL
        ++ LPCKWR+ISLFRPSFQACC LYS  TT  TPKYYL+ VE EKKEIDFNRLFL+C KVHLAKRLHALLVVSGK QS+FLSAK INLYAFLGD+SF+R 
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATT--TPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRL

Query:  TFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR----------
        TFDQI+AKDVYTWNSMISAYARIGHFHEA+DCF+EFMSTS LQPDYYTFPPVIRACGNLDDGKKIHCL LKLGFE DVF+AASLIHFYSR          
Subjt:  TFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR----------

Query:  ---------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQI
                                   AL VFDEMRFK VTMDSVT SSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFV NALINMYAKFGELGSA+ 
Subjt:  ---------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQI

Query:  IFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS-------------------------------------
        IFN++E +D+VSWNSLIAAFEQNKEP+VALG+++KMHA    PDLLTLVSLASVAAE GN LS                                     
Subjt:  IFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS-------------------------------------

Query:  -------------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLD
                            GYSQNG ANEAI+VYHLM DYSDAVPNQGTWVSILTAYS IGALKQGMKTHG LIK+ LYFDIFVGTCL+DMYGKCGRL 
Subjt:  -------------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLD

Query:  DAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM---------
        DA+SLFYE+PHKSSVSWNAIISCHGLHG GLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E+YGIRPSLKHYGCM         
Subjt:  DAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM---------

Query:  --------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVF
                      PDASVWGALLGACRIHENVEL RTVSDHLLEVES+NVGYYVLLSNIYAK GQW+GVDEVRSLARDRGL+KTPGWSSIE+DKKIDVF
Subjt:  --------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVF

Query:  YTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIV
        YT N+THP+CEEIY ELR LTAKMKSLGYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPP+TTL+IFKNLRVCGDCHNATKFISKITEREIIV
Subjt:  YTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIV

Query:  RDSNRFHHFKDGVCSCGDYW
        RDSNRFHHFKDGVCSCGDYW
Subjt:  RDSNRFHHFKDGVCSCGDYW

A0A6J1J817 pentatricopeptide repeat-containing protein At4g33990 | e value: 0.0e+00 | % identity: 75.31
Query:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF
        ++ LPCKWR ISLFRPSFQACC LYS     PKYY + VE EKKEIDFNRLFL+C KVHLAKRLHALLVVSGK QS+FLSAK INLYAFLGD+SF+R TF
Subjt:  IKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTF

Query:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------
        DQI+AKDVYTWNSMISAYARIGHFHEA+DCF+EFMSTS LQPDYYTFPPVIRACGNLDDGKKIHCL LKLGFE DVF+AASLIHFYSR            
Subjt:  DQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR------------

Query:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF
                                 AL VFDEMRFK V MDSVT SSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFV NALINMYAKFGELGSA+ IF
Subjt:  -------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIF

Query:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS---------------------------------------
        N++E +D+VSWNSLIAAFEQNKEP+VALG+++KMHA    PDLLTLVSLASVAAE GN LS                                       
Subjt:  NEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLS---------------------------------------

Query:  -----------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA
                          GYSQNG ANEAI+VYH M DYSDAVPNQGTWVSILTAYS IGALKQGMKTHG LIK+ LYFDIFVGTCL+DMYGKCGRL DA
Subjt:  -----------------VGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDA

Query:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------
        +SLFYE+PHKSSVSWN+IISCHG+HG GL+AVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E+YGIRPSLKHYGCM           
Subjt:  ISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------

Query:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT
                    PDASVWGALLGACRIHENVEL RTVSDHLLEVES+NVGYYVLLSNIYAK GQW+GVDEVRSLARDRGL+KTPGWSSIE+DKKIDVFYT
Subjt:  ------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYT

Query:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD
         N+THP+CEEIY ELR LTAKMKSLGYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPP+TTL+IFKNLRVCGDCHNATKFISKITEREIIVRD
Subjt:  SNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRD

Query:  SNRFHHFKDGVCSCGDYW
        SNRFHHFKDGVCSCGDYW
Subjt:  SNRFHHFKDGVCSCGDYW

SwissProt top hits (e value, % identity, alignment)
O81767 Pentatricopeptide repeat-containing protein At4g33990 | e value: 7.2e-243 | % identity: 54.58
Query:  NEKKEI-DFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSG
        NE KEI D + LF  CT +  AK LHA LVVS + Q+V +SAK +NLY +LG+++ +R TFD I+ +DVY WN MIS Y R G+  E + CF  FM +SG
Subjt:  NEKKEI-DFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSG

Query:  LQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAV------FDEMRFKRV---------------------------TMDSV
        L PDY TFP V++AC  + DG KIHCL LK GF +DV+VAASLIH YSR  AV      FDEM  + +                            MDSV
Subjt:  LQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAV------FDEMRFKRV---------------------------TMDSV

Query:  TISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDL
        T+ SLL  C +  D   GV IH Y+IK GLE +LFVSN LI++YA+FG L   Q +F+ M VRD++SWNS+I A+E N++P+ A+ +F++M   R+ PD 
Subjt:  TISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDL

Query:  LTLVSLASVAAERG--------------------------------------------------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAV
        LTL+SLAS+ ++ G                                                        N++  GY+QNGFA+EAIE+Y++M++  +  
Subjt:  LTLVSLASVAAERG--------------------------------------------------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAV

Query:  PNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGV
         NQGTWVS+L A S  GAL+QGMK HG+L+K+ LY D+FV T L DMYGKCGRL+DA+SLFY++P  +SV WN +I+CHG HG G KAV LF+EM  EGV
Subjt:  PNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGV

Query:  KPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLE
        KPDHITFV+LLSACSHSGLVDEGQWCF++M   YGI PSLKHYGCM                       PDAS+WGALL ACR+H NV+L +  S+HL E
Subjt:  KPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLE

Query:  VESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEK
        VE E+VGY+VLLSN+YA  G+WEGVDE+RS+A  +GL+KTPGWSS+EVD K++VFYT NQTHP  EE+Y+EL  L AK+K +GYVPD+ FVLQDVEDDEK
Subjt:  VESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEK

Query:  ENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
        E+IL SHSERLA+AF +I+TP +TT+RIFKNLRVCGDCH+ TKFISKITEREIIVRDSNRFHHFK+GVCSCGDYW
Subjt:  ENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
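
The match line printed between each Query/Subjt pair appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. Assuming that convention holds here, the sketch below tallies identities and positives for one block of the listing. The helper names `block_counts` and `percent_identity` and the 8-character label width are illustrative assumptions. The %identity reported in the hit table may be computed over a different alignment length, so this sketch is not guaranteed to reproduce it.

```python
PREFIX = len("Query:  ")  # label width used in this listing (assumed: 8 characters)


def block_counts(query_line, mid_line):
    """Return (columns, identical, positive) for one alignment block.

    query_line: a line starting with 'Query:  '
    mid_line:   the match line printed directly below it
    """
    query = query_line[PREFIX:].rstrip("\n")
    # The match line may have lost trailing blanks; pad it to the query width.
    mid = mid_line[PREFIX:].rstrip("\n").ljust(len(query))[:len(query)]
    identical = sum(ch.isalpha() for ch in mid)   # letters mark identical residues
    positive = identical + mid.count("+")         # '+' marks conservative changes
    return len(query), identical, positive


def percent_identity(blocks):
    """Percent identity over a list of (query_line, mid_line) pairs."""
    columns = identical = 0
    for query_line, mid_line in blocks:
        c, i, _ = block_counts(query_line, mid_line)
        columns += c
        identical += i
    return 100.0 * identical / columns if columns else 0.0


if __name__ == "__main__":
    # First 12 columns of the final block of the O81767 alignment.
    print(block_counts("Query:  ENILTSHSERLA", "        E+IL SHSERLA"))
```

Gap and padding columns (runs of `-` in the Query line) are counted as alignment columns here; a stricter variant could skip them.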

P0C899 Putative pentatricopeptide repeat-containing protein At3g49142 | 3.6e-125 | 35.4
Query:  VATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHE
        V+++ PK  L+    ++      ++      +   + +H+ +++     +  L  K +  YA L D++ +R  FD+I  ++V   N MI +Y   G + E
Subjt:  VATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHE

Query:  ALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVL
         +  F   M    ++PD+YTFP V++AC                                                            C+    I+ G  
Subjt:  ALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVL

Query:  IHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGN-----
        IH  A K+GL   LFV N L++MY K G L  A+++ +EM  RDVVSWNSL+  + QN+    AL V R+M +++++ D  T+ SL    +         
Subjt:  IHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGN-----

Query:  ----------------SLSVG-YSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRL
                        ++ +G Y +N    EA+E+Y  M +     P+  +  S+L A     AL  G K HG + + +L  ++ +   L+DMY KCG L
Subjt:  ----------------SLSVG-YSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRL

Query:  DDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM--------
        + A  +F  +  +  VSW A+IS +G  G G  AV LF ++Q  G+ PD I FV+ L+ACSH+GL++EG+ CF+LM + Y I P L+H  CM        
Subjt:  DDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM--------

Query:  ---------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDV
                       P+  VWGALLGACR+H + ++    +D L ++  E  GYYVLLSNIYAK G+WE V  +R++ + +GLKK PG S++EV++ I  
Subjt:  ---------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDV

Query:  FYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIIST-----PPRTTLRIFKNLRVCGDCHNATKFISKIT
        F   +++HP+ +EIY+EL VL  KMK LGYVPD    L DVE+++KE  L  HSE+LA+ F +++T         T+RI KNLR+CGDCH A K IS+IT
Subjt:  FYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIIST-----PPRTTLRIFKNLRVCGDCHNATKFISKIT

Query:  EREIIVRDSNRFHHFKDGVCSCGDYW
         REII+RD+NRFH F+ GVCSCGDYW
Subjt:  EREIIVRDSNRFHHFKDGVCSCGDYW

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | 2.6e-123 | 34.49
Query:  DFNRLFLLC---TKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPD
        +F  L  +C    ++ + K +H LLV SG +  +F      N+YA    ++ +R  FD++  +D+ +WN++++ Y++ G    AL+   + M    L+P 
Subjt:  DFNRLFLLC---TKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPD

Query:  YYTFP---PVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR-------------------------------------ALAVFDEMRFKRVTM
        + T     P + A   +  GK+IH   ++ GF+  V ++ +L+  Y++                                     A+ +F +M  + V  
Subjt:  YYTFP---PVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR-------------------------------------ALAVFDEMRFKRVTM

Query:  DSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVT
          V++   L  CA L D+  G  IH  +++LGL+ ++ V N+LI+MY K  E+ +A  +F +++ R +VSWN++I  F QN  P+ AL  F +M +  V 
Subjt:  DSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVT

Query:  PDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGR
        PD  T VS+ +  AE                              ++ +   W+                  HG +++S L  ++FV T LVDMY KCG 
Subjt:  PDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGR

Query:  LDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-------
        +  A  +F  +  +   +WNA+I  +G HG G  A+ELF EMQ   +KP+ +TF+S++SACSHSGLV+ G  CF +M E Y I  S+ HYG M       
Subjt:  LDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-------

Query:  ----------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKID
                        P  +V+GA+LGAC+IH+NV      ++ L E+  ++ GY+VLL+NIY     WE V +VR     +GL+KTPG S +E+  ++ 
Subjt:  ----------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKID

Query:  VFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREI
         F++ +  HP  ++IY  L  L   +K  GYVPD N VL  VE+D KE +L++HSE+LA++FG+++T   TT+ + KNLRVC DCHNATK+IS +T REI
Subjt:  VFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREI

Query:  IVRDSNRFHHFKDGVCSCGDYW
        +VRD  RFHHFK+G CSCGDYW
Subjt:  IVRDSNRFHHFKDGVCSCGDYW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | 1.8e-124 | 34.99
Query:  KRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDD--
        K++HA L+V G   S FL  K I+  +  GDI+F+R  FD +    ++ WN++I  Y+R  HF +AL   Y  M  + + PD +TFP +++AC  L    
Subjt:  KRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDD--

Query:  -GKKIHCLVLKLGFEYDVFVAASLIHFYSR---------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLD
         G+ +H  V +LGF+ DVFV   LI  Y++                                       AL +F +MR   V  D V + S+L     L 
Subjt:  -GKKIHCLVLKLGFEYDVFVAASLIHFYSR---------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLD

Query:  DIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAER
        D+  G  IH   +K+GLE +  +  +L  MYAK G++ +A+I+F++M+  +++ WN++I+                                        
Subjt:  DIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAER

Query:  GNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSS
              GY++NG+A EAI+++H M +  D  P+  +  S ++A + +G+L+Q    +  + +S    D+F+ + L+DM+ KCG ++ A  +F     +  
Subjt:  GNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSS

Query:  VSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------
        V W+A+I  +GLHG   +A+ L+R M+  GV P+ +TF+ LL AC+HSG+V EG W F  M + + I P  +HY C+                       
Subjt:  VSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------

Query:  PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIY
        P  +VWGALL AC+ H +VEL    +  L  ++  N G+YV LSN+YA    W+ V EVR   +++GL K  G S +EV  +++ F   +++HP+ EEI 
Subjt:  PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIY

Query:  KELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVC
        +++  + +++K  G+V + +  L D+ D+E E  L SHSER+A+A+G+ISTP  T LRI KNLR C +CH ATK ISK+ +REI+VRD+NRFHHFKDGVC
Subjt:  KELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVC

Query:  SCGDYW
        SCGDYW
Subjt:  SCGDYW

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170 | 2.0e-123 | 34.42
Query:  KYYLEGVENEKKEIDFNRLFLLCTKVH-----LAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEA
        K + + VE++ +      + +L T V      L +++H + +  G    + +S   IN+Y  L    F+R  FD +  +D+ +WNS+I+  A+ G   EA
Subjt:  KYYLEGVENEKKEIDFNRLFLLCTKVH-----LAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEA

Query:  LDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDG----KKIHCLVLKLGFEYDVFVAASLIHFYSR-----------------------------------
        + C +  +   GL+PD YT   V++A  +L +G    K++H   +K+    D FV+ +LI  YSR                                   
Subjt:  LDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDG----KKIHCLVLKLGFEYDVFVAASLIHFYSR-----------------------------------

Query:  -ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEP
          L +F  M  +    D  T++++   C  L  I  G  +H YAIK G + DL+VS+ +++MY K G++ +AQ  F+ + V D V+W ++I+   +N E 
Subjt:  -ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEP

Query:  MVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFD
          A  VF +M  M V PD  T+ +LA                                                A S + AL+QG + H   +K     D
Subjt:  MVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFD

Query:  IFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIR
         FVGT LVDMY KCG +DDA  LF  +   +  +WNA++     HG G + ++LF++M++ G+KPD +TF+ +LSACSHSGLV E     + M   YGI+
Subjt:  IFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIR

Query:  PSLKHYGCMPD-----------------------ASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGL
        P ++HY C+ D                       AS++  LL ACR+  + E  + V+  LLE+E  +   YVLLSN+YA   +W+ +   R++ +   +
Subjt:  PSLKHYGCMPD-----------------------ASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGL

Query:  KKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGD
        KK PG+S IEV  KI +F   ++++ + E IY++++ +   +K  GYVP+ +F L DVE++EKE  L  HSE+LA+AFG++STPP T +R+ KNLRVCGD
Subjt:  KKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGD

Query:  CHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
        CHNA K+I+K+  REI++RD+NRFH FKDG+CSCGDYW
Subjt:  CHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW

Arabidopsis top hits | e value | % identity
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.8e-124 | 34.49
Query:  DFNRLFLLC---TKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPD
        +F  L  +C    ++ + K +H LLV SG +  +F      N+YA    ++ +R  FD++  +D+ +WN++++ Y++ G    AL+   + M    L+P 
Subjt:  DFNRLFLLC---TKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPD

Query:  YYTFP---PVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR-------------------------------------ALAVFDEMRFKRVTM
        + T     P + A   +  GK+IH   ++ GF+  V ++ +L+  Y++                                     A+ +F +M  + V  
Subjt:  YYTFP---PVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSR-------------------------------------ALAVFDEMRFKRVTM

Query:  DSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVT
          V++   L  CA L D+  G  IH  +++LGL+ ++ V N+LI+MY K  E+ +A  +F +++ R +VSWN++I  F QN  P+ AL  F +M +  V 
Subjt:  DSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVT

Query:  PDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGR
        PD  T VS+ +  AE                              ++ +   W+                  HG +++S L  ++FV T LVDMY KCG 
Subjt:  PDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGR

Query:  LDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-------
        +  A  +F  +  +   +WNA+I  +G HG G  A+ELF EMQ   +KP+ +TF+S++SACSHSGLV+ G  CF +M E Y I  S+ HYG M       
Subjt:  LDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-------

Query:  ----------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKID
                        P  +V+GA+LGAC+IH+NV      ++ L E+  ++ GY+VLL+NIY     WE V +VR     +GL+KTPG S +E+  ++ 
Subjt:  ----------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKID

Query:  VFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREI
         F++ +  HP  ++IY  L  L   +K  GYVPD N VL  VE+D KE +L++HSE+LA++FG+++T   TT+ + KNLRVC DCHNATK+IS +T REI
Subjt:  VFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREI

Query:  IVRDSNRFHHFKDGVCSCGDYW
        +VRD  RFHHFK+G CSCGDYW
Subjt:  IVRDSNRFHHFKDGVCSCGDYW

AT3G12770.1 mitochondrial editing factor 22 | 1.3e-125 | 34.99
Query:  KRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDD--
        K++HA L+V G   S FL  K I+  +  GDI+F+R  FD +    ++ WN++I  Y+R  HF +AL   Y  M  + + PD +TFP +++AC  L    
Subjt:  KRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDD--

Query:  -GKKIHCLVLKLGFEYDVFVAASLIHFYSR---------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLD
         G+ +H  V +LGF+ DVFV   LI  Y++                                       AL +F +MR   V  D V + S+L     L 
Subjt:  -GKKIHCLVLKLGFEYDVFVAASLIHFYSR---------------------------------------ALAVFDEMRFKRVTMDSVTISSLLPICAQLD

Query:  DIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAER
        D+  G  IH   +K+GLE +  +  +L  MYAK G++ +A+I+F++M+  +++ WN++I+                                        
Subjt:  DIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAER

Query:  GNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSS
              GY++NG+A EAI+++H M +  D  P+  +  S ++A + +G+L+Q    +  + +S    D+F+ + L+DM+ KCG ++ A  +F     +  
Subjt:  GNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSS

Query:  VSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------
        V W+A+I  +GLHG   +A+ L+R M+  GV P+ +TF+ LL AC+HSG+V EG W F  M + + I P  +HY C+                       
Subjt:  VSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------

Query:  PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIY
        P  +VWGALL AC+ H +VEL    +  L  ++  N G+YV LSN+YA    W+ V EVR   +++GL K  G S +EV  +++ F   +++HP+ EEI 
Subjt:  PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIY

Query:  KELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVC
        +++  + +++K  G+V + +  L D+ D+E E  L SHSER+A+A+G+ISTP  T LRI KNLR C +CH ATK ISK+ +REI+VRD+NRFHHFKDGVC
Subjt:  KELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVC

Query:  SCGDYW
        SCGDYW
Subjt:  SCGDYW

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.5e-126 | 35.4
Query:  VATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHE
        V+++ PK  L+    ++      ++      +   + +H+ +++     +  L  K +  YA L D++ +R  FD+I  ++V   N MI +Y   G + E
Subjt:  VATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHE

Query:  ALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVL
         +  F   M    ++PD+YTFP V++AC                                                            C+    I+ G  
Subjt:  ALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVL

Query:  IHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGN-----
        IH  A K+GL   LFV N L++MY K G L  A+++ +EM  RDVVSWNSL+  + QN+    AL V R+M +++++ D  T+ SL    +         
Subjt:  IHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASVAAERGN-----

Query:  ----------------SLSVG-YSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRL
                        ++ +G Y +N    EA+E+Y  M +     P+  +  S+L A     AL  G K HG + + +L  ++ +   L+DMY KCG L
Subjt:  ----------------SLSVG-YSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRL

Query:  DDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM--------
        + A  +F  +  +  VSW A+IS +G  G G  AV LF ++Q  G+ PD I FV+ L+ACSH+GL++EG+ CF+LM + Y I P L+H  CM        
Subjt:  DDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM--------

Query:  ---------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDV
                       P+  VWGALLGACR+H + ++    +D L ++  E  GYYVLLSNIYAK G+WE V  +R++ + +GLKK PG S++EV++ I  
Subjt:  ---------------PDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDV

Query:  FYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIIST-----PPRTTLRIFKNLRVCGDCHNATKFISKIT
        F   +++HP+ +EIY+EL VL  KMK LGYVPD    L DVE+++KE  L  HSE+LA+ F +++T         T+RI KNLR+CGDCH A K IS+IT
Subjt:  FYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIIST-----PPRTTLRIFKNLRVCGDCHNATKFISKIT

Query:  EREIIVRDSNRFHHFKDGVCSCGDYW
         REII+RD+NRFH F+ GVCSCGDYW
Subjt:  EREIIVRDSNRFHHFKDGVCSCGDYW

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.4e-124 | 34.42
Query:  KYYLEGVENEKKEIDFNRLFLLCTKVH-----LAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEA
        K + + VE++ +      + +L T V      L +++H + +  G    + +S   IN+Y  L    F+R  FD +  +D+ +WNS+I+  A+ G   EA
Subjt:  KYYLEGVENEKKEIDFNRLFLLCTKVH-----LAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEA

Query:  LDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDG----KKIHCLVLKLGFEYDVFVAASLIHFYSR-----------------------------------
        + C +  +   GL+PD YT   V++A  +L +G    K++H   +K+    D FV+ +LI  YSR                                   
Subjt:  LDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDG----KKIHCLVLKLGFEYDVFVAASLIHFYSR-----------------------------------

Query:  -ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEP
          L +F  M  +    D  T++++   C  L  I  G  +H YAIK G + DL+VS+ +++MY K G++ +AQ  F+ + V D V+W ++I+   +N E 
Subjt:  -ALAVFDEMRFKRVTMDSVTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEP

Query:  MVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFD
          A  VF +M  M V PD  T+ +LA                                                A S + AL+QG + H   +K     D
Subjt:  MVALGVFRKMHAMRVTPDLLTLVSLASVAAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFD

Query:  IFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIR
         FVGT LVDMY KCG +DDA  LF  +   +  +WNA++     HG G + ++LF++M++ G+KPD +TF+ +LSACSHSGLV E     + M   YGI+
Subjt:  IFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIR

Query:  PSLKHYGCMPD-----------------------ASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGL
        P ++HY C+ D                       AS++  LL ACR+  + E  + V+  LLE+E  +   YVLLSN+YA   +W+ +   R++ +   +
Subjt:  PSLKHYGCMPD-----------------------ASVWGALLGACRIHENVELVRTVSDHLLEVESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGL

Query:  KKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGD
        KK PG+S IEV  KI +F   ++++ + E IY++++ +   +K  GYVP+ +F L DVE++EKE  L  HSE+LA+AFG++STPP T +R+ KNLRVCGD
Subjt:  KKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGD

Query:  CHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
        CHNA K+I+K+  REI++RD+NRFH FKDG+CSCGDYW
Subjt:  CHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 5.1e-244 | 54.58
Query:  NEKKEI-DFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSG
        NE KEI D + LF  CT +  AK LHA LVVS + Q+V +SAK +NLY +LG+++ +R TFD I+ +DVY WN MIS Y R G+  E + CF  FM +SG
Subjt:  NEKKEI-DFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISFSRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSG

Query:  LQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAV------FDEMRFKRV---------------------------TMDSV
        L PDY TFP V++AC  + DG KIHCL LK GF +DV+VAASLIH YSR  AV      FDEM  + +                            MDSV
Subjt:  LQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAV------FDEMRFKRV---------------------------TMDSV

Query:  TISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDL
        T+ SLL  C +  D   GV IH Y+IK GLE +LFVSN LI++YA+FG L   Q +F+ M VRD++SWNS+I A+E N++P+ A+ +F++M   R+ PD 
Subjt:  TISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDL

Query:  LTLVSLASVAAERG--------------------------------------------------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAV
        LTL+SLAS+ ++ G                                                        N++  GY+QNGFA+EAIE+Y++M++  +  
Subjt:  LTLVSLASVAAERG--------------------------------------------------------NSLSVGYSQNGFANEAIEVYHLMKDYSDAV

Query:  PNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGV
         NQGTWVS+L A S  GAL+QGMK HG+L+K+ LY D+FV T L DMYGKCGRL+DA+SLFY++P  +SV WN +I+CHG HG G KAV LF+EM  EGV
Subjt:  PNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAIISCHGLHGCGLKAVELFREMQTEGV

Query:  KPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLE
        KPDHITFV+LLSACSHSGLVDEGQWCF++M   YGI PSLKHYGCM                       PDAS+WGALL ACR+H NV+L +  S+HL E
Subjt:  KPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCM-----------------------PDASVWGALLGACRIHENVELVRTVSDHLLE

Query:  VESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEK
        VE E+VGY+VLLSN+YA  G+WEGVDE+RS+A  +GL+KTPGWSS+EVD K++VFYT NQTHP  EE+Y+EL  L AK+K +GYVPD+ FVLQDVEDDEK
Subjt:  VESENVGYYVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEK

Query:  ENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
        E+IL SHSERLA+AF +I+TP +TT+RIFKNLRVCGDCH+ TKFISKITEREIIVRDSNRFHHFK+GVCSCGDYW
Subjt:  ENILTSHSERLAMAFGIISTPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW


Sequences
CDS sequence
ATGGTGTTGCAGCTCAAGGATAATATTTGGAATGATTTCACCGAGAATGATGATTATATAGTGCCTCATCTTGGTGATGAACTTTGGGATCTATTTGAAGTACAGAGTCG
TGAAGTTGTTGGCTTTACATATGATCCTTACACTGCAACTAAGTTCAGTTTTCATGATAAGGAGAAAGTAAGAATACAAACATCGATCATGAAGGATACAATCGTGGAAA
AAGATTCATGGTCTCATACACCCGATGGTGTTCCTTCACCATTAAATGGTGACTCATTCAAAGATATGAAAATGGAATCTAGTAGTCTTAGTATGTCCAACCATTGCTTT
AAAACAGGCGTGGGTACAGATCTTGAATACTGCACAGATGATCCTATTGTGACTGACAATAGTGCTGCCGAAGAGAATGACATGTATCAATATTCTGTCAGTCACATATC
CCAAACAGAGAACGATATTAGTTTTCTGGATGATGATCATGAAAACAAAGAAAATAATGATCTTCTGTATTATGGGTGGCAAGATATAGAAAGCTTTGAGGATGTCGATA
GGATGTTTAGAAATTGTGATTCAACATTTGGGCTTGGGAATCTAAGCAATGAAGATGAGCTACGCTGGTTTTCCCCATCCCATGGCTCGGAAAAACTTGAAGATCCATCA
AAGTCAAACTTTAAGTTTTCATGCTGTGAAGGAAGTACGATAAATGATGCATCAGAATGTAATGAAGATTCTAATCCCGTGAATTCAGACCCTTCATCTGATGGTTTGAA
CAGAAATAACATTTTAACAGGGTGTAAGGTGAATGATGGGATTGCAGATATGTGTGACTCTGCTGCTATAAGTCACTTGTCAACTGCTGACATGTCAGATACAAAAAGCA
ATTCTAGAGGTGACTTGATACCTAAAAAACAGGAGTCCTCTTATGCATCTAATCAACTACTGTCTATACGTAGCTCTCATTATCCTTCCTTAGATGCTCCAGCAATTGCA
GCAAATGAAAATAGGGAAAAACTGTACCACCAGGATTTACAAGCCTCATTCAATAAGAATTTCACTTTTATGTCTACACCAAGTTCAGAAACATTCAACACTTCATTTCC
AGTTAGAAAGCAGGCGCCAAGGTCTGAAAGTGATATTGATGATGGTCATAGTGAAACTGGAGTAGTTAGCAGAGGAAGTCGAGAAGAATTAGATTCTTCAAATGCACGGG
ATAAGTCTTGCAGGAGCACTGTGCTGAACGGAATCTCACTGGAAGCAACTAGTTTTTGCCAGCTTCAACAAGTAATGGAGCAGTTGGATATCAGGACAAAACTATGCATA
AGGGATAGTCTATATCGCCTGGCTAGAAGTGCAGAGCAGAGACATAATTGTGCTAATCTTAATGAGAATATTGGAGAAGACAAGCTTGGGAGAATTGCGCCGTCAATTGA
TCAAGACACAAACAGGAGTGGAGGTTTTTTGGATTTGGAAACTGATACCAATCCTATAGACCGGTCAGTCGCCCACTTGCTGTTTCACCGGCCTTCGGATCCATCTATAA
TGCCTGTTGGTGGTAACACCTTGCCTCTGAAATCTCACAAACTGAATCATCTCCAGTTTGAGAAACTCGAGAGGGCCTTCTCTTCTCAATGTGCCAGCCGAAAAACAAAC
TTTCCAGGACGAAACCGGTGGAGCTGCTTCTTGTGCAGATCAAAAGCCACTGGCAAATGGGAAGAAACTATGAGCAATAAAGTCGAGGATTCACTCAGCAATCTCGGAAT
TTTTTGCGAGGAAAATTTCCCTGCCAGCGAGAGTATTGAAAGGAAGTCCATGAGGATGATTAAAATGCGAGAAAGTACTGGGTTTGGAGCTGTAAAACGAAAGTTTCGAG
ATTTCAGCGATTGGAATAACTCAGGCGAGGCCGAAGAACGGCTTCTCCACACTCGTGTTCTTATGGGAATGGACTCAAGTGGTCTGGTGAAGTCTTCTAGCTCCGTGCAG
ATTTCATCATGGTCTTTTAACTATCTTTTAAAAACATTGGACGCCATAAAATTGCTGCCATGTAAATGGAGACAGATTTCCTTGTTCAGGCCTTCATTCCAAGCTTGTTG
CTCCCTGTATTCTGTAGCTACAACTACTCCCAAGTATTACTTGGAAGGAGTTGAAAATGAGAAAAAGGAAATTGATTTCAATCGACTATTCCTTCTGTGCACAAAAGTAC
ACCTTGCTAAAAGACTTCATGCACTACTTGTGGTATCTGGGAAGGCTCAGAGCGTTTTTCTTTCCGCTAAATTCATCAATCTTTATGCTTTTCTTGGTGATATATCATTC
TCTCGCCTTACTTTTGACCAAATTAAGGCAAAAGATGTATATACATGGAATTCTATGATATCTGCTTATGCTCGAATTGGTCACTTCCATGAAGCTTTAGATTGTTTTTA
TGAATTTATGTCAACTTCTGGTCTTCAGCCTGATTATTACACATTTCCTCCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGTTCTAA
AGTTGGGTTTTGAATATGATGTATTTGTTGCTGCTTCTCTGATCCATTTTTATTCTCGGGCATTGGCAGTATTTGATGAGATGAGATTCAAGCGCGTAACTATGGATTCT
GTTACAATCTCAAGTCTACTTCCTATTTGTGCACAGTTGGATGATATAATAAGTGGTGTCCTAATTCATGTCTATGCCATCAAGCTTGGGTTAGAATTTGACTTGTTTGT
GTCTAATGCATTGATAAACATGTATGCCAAATTTGGTGAATTGGGAAGTGCGCAAATTATTTTCAATGAAATGGAAGTTAGGGATGTTGTGTCTTGGAACTCTTTGATTG
CTGCATTTGAGCAGAATAAAGAGCCCATGGTGGCACTTGGAGTGTTCAGAAAGATGCATGCTATGAGGGTTACGCCTGACTTGTTGACACTCGTGAGTTTGGCTTCTGTT
GCTGCTGAACGTGGCAATTCTTTAAGTGTAGGTTATTCTCAAAATGGTTTCGCAAATGAGGCAATTGAAGTGTATCATTTGATGAAAGATTATAGCGATGCAGTTCCTAA
CCAGGGCACTTGGGTAAGCATTCTAACAGCATACTCCCATATAGGAGCCTTGAAACAAGGGATGAAAACTCATGGTCAGCTGATCAAGAGTCGTCTGTACTTTGACATCT
TTGTGGGTACTTGTCTTGTTGATATGTATGGAAAATGTGGAAGATTAGACGACGCAATATCTTTATTCTACGAAGTTCCACATAAAAGCTCGGTTTCTTGGAATGCCATC
ATATCATGTCATGGACTCCATGGATGTGGTTTAAAAGCTGTCGAGTTATTTAGGGAAATGCAAACCGAAGGAGTGAAGCCCGATCACATCACTTTTGTATCCTTATTGTC
TGCTTGTAGCCATTCAGGTTTGGTTGATGAGGGTCAGTGGTGCTTCCAATTGATGCTAGAGATGTACGGGATAAGGCCTAGCTTGAAGCATTATGGCTGCATGCCTGATG
CATCTGTGTGGGGTGCACTTCTTGGTGCTTGTAGAATACATGAGAATGTAGAGTTGGTCAGAACTGTTTCAGATCACTTGTTGGAGGTTGAATCAGAAAATGTTGGCTAC
TATGTTTTGTTATCGAATATTTATGCAAAATTTGGACAATGGGAAGGAGTCGATGAAGTGCGATCATTAGCTCGAGACAGGGGATTGAAGAAGACTCCTGGGTGGAGTTC
AATTGAAGTGGACAAGAAAATTGATGTCTTTTACACTAGCAACCAAACACATCCAAAATGTGAGGAGATATACAAGGAATTGAGGGTTCTGACTGCTAAAATGAAGAGTC
TTGGTTATGTTCCAGATTATAACTTTGTATTGCAGGATGTGGAGGATGATGAGAAGGAAAACATCCTTACTAGCCATAGCGAGCGATTGGCTATGGCATTCGGGATTATC
AGCACGCCACCAAGAACTACTCTTCGGATCTTCAAGAACTTGCGGGTTTGTGGAGACTGCCATAACGCTACCAAGTTCATATCTAAAATTACTGAAAGAGAGATCATCGT
GAGGGATTCAAACCGATTCCATCATTTCAAAGACGGAGTTTGTTCTTGTGGTGATTATTGGTGA
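
The CDS above can be checked against the protein sequence of this record by translating it codon by codon. The sketch below is a minimal example that assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and its `to_stop` parameter are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, to_stop=True):
    """Translate a coding sequence into one-letter amino acids.

    Whitespace and line breaks are ignored, so the wrapped listing can be
    pasted in directly. Codons containing unknown symbols become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 39 and last 24 nucleotides of the CDS listed above.
    print(translate("ATGGTGTTGCAGCTCAAGGATAATATTTGGAATGATTTC"))
    print(translate("TGTTCTTGTGGTGATTATTGGTGA"))
```

The two excerpts translate to the start (`MVLQLKDNIWNDF`) and the C-terminal end (`CSCGDYW`) of the protein, the latter closing with the DYW motif seen in the alignments.
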
mRNA sequence
ATGGTGTTGCAGCTCAAGGATAATATTTGGAATGATTTCACCGAGAATGATGATTATATAGTGCCTCATCTTGGTGATGAACTTTGGGATCTATTTGAAGTACAGAGTCG
TGAAGTTGTTGGCTTTACATATGATCCTTACACTGCAACTAAGTTCAGTTTTCATGATAAGGAGAAAGTAAGAATACAAACATCGATCATGAAGGATACAATCGTGGAAA
AAGATTCATGGTCTCATACACCCGATGGTGTTCCTTCACCATTAAATGGTGACTCATTCAAAGATATGAAAATGGAATCTAGTAGTCTTAGTATGTCCAACCATTGCTTT
AAAACAGGCGTGGGTACAGATCTTGAATACTGCACAGATGATCCTATTGTGACTGACAATAGTGCTGCCGAAGAGAATGACATGTATCAATATTCTGTCAGTCACATATC
CCAAACAGAGAACGATATTAGTTTTCTGGATGATGATCATGAAAACAAAGAAAATAATGATCTTCTGTATTATGGGTGGCAAGATATAGAAAGCTTTGAGGATGTCGATA
GGATGTTTAGAAATTGTGATTCAACATTTGGGCTTGGGAATCTAAGCAATGAAGATGAGCTACGCTGGTTTTCCCCATCCCATGGCTCGGAAAAACTTGAAGATCCATCA
AAGTCAAACTTTAAGTTTTCATGCTGTGAAGGAAGTACGATAAATGATGCATCAGAATGTAATGAAGATTCTAATCCCGTGAATTCAGACCCTTCATCTGATGGTTTGAA
CAGAAATAACATTTTAACAGGGTGTAAGGTGAATGATGGGATTGCAGATATGTGTGACTCTGCTGCTATAAGTCACTTGTCAACTGCTGACATGTCAGATACAAAAAGCA
ATTCTAGAGGTGACTTGATACCTAAAAAACAGGAGTCCTCTTATGCATCTAATCAACTACTGTCTATACGTAGCTCTCATTATCCTTCCTTAGATGCTCCAGCAATTGCA
GCAAATGAAAATAGGGAAAAACTGTACCACCAGGATTTACAAGCCTCATTCAATAAGAATTTCACTTTTATGTCTACACCAAGTTCAGAAACATTCAACACTTCATTTCC
AGTTAGAAAGCAGGCGCCAAGGTCTGAAAGTGATATTGATGATGGTCATAGTGAAACTGGAGTAGTTAGCAGAGGAAGTCGAGAAGAATTAGATTCTTCAAATGCACGGG
ATAAGTCTTGCAGGAGCACTGTGCTGAACGGAATCTCACTGGAAGCAACTAGTTTTTGCCAGCTTCAACAAGTAATGGAGCAGTTGGATATCAGGACAAAACTATGCATA
AGGGATAGTCTATATCGCCTGGCTAGAAGTGCAGAGCAGAGACATAATTGTGCTAATCTTAATGAGAATATTGGAGAAGACAAGCTTGGGAGAATTGCGCCGTCAATTGA
TCAAGACACAAACAGGAGTGGAGGTTTTTTGGATTTGGAAACTGATACCAATCCTATAGACCGGTCAGTCGCCCACTTGCTGTTTCACCGGCCTTCGGATCCATCTATAA
TGCCTGTTGGTGGTAACACCTTGCCTCTGAAATCTCACAAACTGAATCATCTCCAGTTTGAGAAACTCGAGAGGGCCTTCTCTTCTCAATGTGCCAGCCGAAAAACAAAC
TTTCCAGGACGAAACCGGTGGAGCTGCTTCTTGTGCAGATCAAAAGCCACTGGCAAATGGGAAGAAACTATGAGCAATAAAGTCGAGGATTCACTCAGCAATCTCGGAAT
TTTTTGCGAGGAAAATTTCCCTGCCAGCGAGAGTATTGAAAGGAAGTCCATGAGGATGATTAAAATGCGAGAAAGTACTGGGTTTGGAGCTGTAAAACGAAAGTTTCGAG
ATTTCAGCGATTGGAATAACTCAGGCGAGGCCGAAGAACGGCTTCTCCACACTCGTGTTCTTATGGGAATGGACTCAAGTGGTCTGGTGAAGTCTTCTAGCTCCGTGCAG
ATTTCATCATGGTCTTTTAACTATCTTTTAAAAACATTGGACGCCATAAAATTGCTGCCATGTAAATGGAGACAGATTTCCTTGTTCAGGCCTTCATTCCAAGCTTGTTG
CTCCCTGTATTCTGTAGCTACAACTACTCCCAAGTATTACTTGGAAGGAGTTGAAAATGAGAAAAAGGAAATTGATTTCAATCGACTATTCCTTCTGTGCACAAAAGTAC
ACCTTGCTAAAAGACTTCATGCACTACTTGTGGTATCTGGGAAGGCTCAGAGCGTTTTTCTTTCCGCTAAATTCATCAATCTTTATGCTTTTCTTGGTGATATATCATTC
TCTCGCCTTACTTTTGACCAAATTAAGGCAAAAGATGTATATACATGGAATTCTATGATATCTGCTTATGCTCGAATTGGTCACTTCCATGAAGCTTTAGATTGTTTTTA
TGAATTTATGTCAACTTCTGGTCTTCAGCCTGATTATTACACATTTCCTCCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGTTCTAA
AGTTGGGTTTTGAATATGATGTATTTGTTGCTGCTTCTCTGATCCATTTTTATTCTCGGGCATTGGCAGTATTTGATGAGATGAGATTCAAGCGCGTAACTATGGATTCT
GTTACAATCTCAAGTCTACTTCCTATTTGTGCACAGTTGGATGATATAATAAGTGGTGTCCTAATTCATGTCTATGCCATCAAGCTTGGGTTAGAATTTGACTTGTTTGT
GTCTAATGCATTGATAAACATGTATGCCAAATTTGGTGAATTGGGAAGTGCGCAAATTATTTTCAATGAAATGGAAGTTAGGGATGTTGTGTCTTGGAACTCTTTGATTG
CTGCATTTGAGCAGAATAAAGAGCCCATGGTGGCACTTGGAGTGTTCAGAAAGATGCATGCTATGAGGGTTACGCCTGACTTGTTGACACTCGTGAGTTTGGCTTCTGTT
GCTGCTGAACGTGGCAATTCTTTAAGTGTAGGTTATTCTCAAAATGGTTTCGCAAATGAGGCAATTGAAGTGTATCATTTGATGAAAGATTATAGCGATGCAGTTCCTAA
CCAGGGCACTTGGGTAAGCATTCTAACAGCATACTCCCATATAGGAGCCTTGAAACAAGGGATGAAAACTCATGGTCAGCTGATCAAGAGTCGTCTGTACTTTGACATCT
TTGTGGGTACTTGTCTTGTTGATATGTATGGAAAATGTGGAAGATTAGACGACGCAATATCTTTATTCTACGAAGTTCCACATAAAAGCTCGGTTTCTTGGAATGCCATC
ATATCATGTCATGGACTCCATGGATGTGGTTTAAAAGCTGTCGAGTTATTTAGGGAAATGCAAACCGAAGGAGTGAAGCCCGATCACATCACTTTTGTATCCTTATTGTC
TGCTTGTAGCCATTCAGGTTTGGTTGATGAGGGTCAGTGGTGCTTCCAATTGATGCTAGAGATGTACGGGATAAGGCCTAGCTTGAAGCATTATGGCTGCATGCCTGATG
CATCTGTGTGGGGTGCACTTCTTGGTGCTTGTAGAATACATGAGAATGTAGAGTTGGTCAGAACTGTTTCAGATCACTTGTTGGAGGTTGAATCAGAAAATGTTGGCTAC
TATGTTTTGTTATCGAATATTTATGCAAAATTTGGACAATGGGAAGGAGTCGATGAAGTGCGATCATTAGCTCGAGACAGGGGATTGAAGAAGACTCCTGGGTGGAGTTC
AATTGAAGTGGACAAGAAAATTGATGTCTTTTACACTAGCAACCAAACACATCCAAAATGTGAGGAGATATACAAGGAATTGAGGGTTCTGACTGCTAAAATGAAGAGTC
TTGGTTATGTTCCAGATTATAACTTTGTATTGCAGGATGTGGAGGATGATGAGAAGGAAAACATCCTTACTAGCCATAGCGAGCGATTGGCTATGGCATTCGGGATTATC
AGCACGCCACCAAGAACTACTCTTCGGATCTTCAAGAACTTGCGGGTTTGTGGAGACTGCCATAACGCTACCAAGTTCATATCTAAAATTACTGAAAGAGAGATCATCGT
GAGGGATTCAAACCGATTCCATCATTTCAAAGACGGAGTTTGTTCTTGTGGTGATTATTGGTGA
Protein sequence
MVLQLKDNIWNDFTENDDYIVPHLGDELWDLFEVQSREVVGFTYDPYTATKFSFHDKEKVRIQTSIMKDTIVEKDSWSHTPDGVPSPLNGDSFKDMKMESSSLSMSNHCF
KTGVGTDLEYCTDDPIVTDNSAAEENDMYQYSVSHISQTENDISFLDDDHENKENNDLLYYGWQDIESFEDVDRMFRNCDSTFGLGNLSNEDELRWFSPSHGSEKLEDPS
KSNFKFSCCEGSTINDASECNEDSNPVNSDPSSDGLNRNNILTGCKVNDGIADMCDSAAISHLSTADMSDTKSNSRGDLIPKKQESSYASNQLLSIRSSHYPSLDAPAIA
ANENREKLYHQDLQASFNKNFTFMSTPSSETFNTSFPVRKQAPRSESDIDDGHSETGVVSRGSREELDSSNARDKSCRSTVLNGISLEATSFCQLQQVMEQLDIRTKLCI
RDSLYRLARSAEQRHNCANLNENIGEDKLGRIAPSIDQDTNRSGGFLDLETDTNPIDRSVAHLLFHRPSDPSIMPVGGNTLPLKSHKLNHLQFEKLERAFSSQCASRKTN
FPGRNRWSCFLCRSKATGKWEETMSNKVEDSLSNLGIFCEENFPASESIERKSMRMIKMRESTGFGAVKRKFRDFSDWNNSGEAEERLLHTRVLMGMDSSGLVKSSSSVQ
ISSWSFNYLLKTLDAIKLLPCKWRQISLFRPSFQACCSLYSVATTTPKYYLEGVENEKKEIDFNRLFLLCTKVHLAKRLHALLVVSGKAQSVFLSAKFINLYAFLGDISF
SRLTFDQIKAKDVYTWNSMISAYARIGHFHEALDCFYEFMSTSGLQPDYYTFPPVIRACGNLDDGKKIHCLVLKLGFEYDVFVAASLIHFYSRALAVFDEMRFKRVTMDS
VTISSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVSNALINMYAKFGELGSAQIIFNEMEVRDVVSWNSLIAAFEQNKEPMVALGVFRKMHAMRVTPDLLTLVSLASV
AAERGNSLSVGYSQNGFANEAIEVYHLMKDYSDAVPNQGTWVSILTAYSHIGALKQGMKTHGQLIKSRLYFDIFVGTCLVDMYGKCGRLDDAISLFYEVPHKSSVSWNAI
ISCHGLHGCGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMLEMYGIRPSLKHYGCMPDASVWGALLGACRIHENVELVRTVSDHLLEVESENVGY
YVLLSNIYAKFGQWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTSNQTHPKCEEIYKELRVLTAKMKSLGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGII
STPPRTTLRIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW