; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg02023 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg02023
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr04:7980772..7991164
RNA-Seq ExpressionCarg02023
SyntenyCarg02023
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domainsIPR000887 - KDPG/KHG aldolase
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR013785 - Aldolase-type TIM barrel


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648639.1 hypothetical protein Csa_007883 [Cucumis sativus]0.0e+0075.78Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFY-SRQPHSLQTYAYLFHACARLRCLREG
        M LTTI+  F   RNLV  PSK+ F  Q R WRS AEGDIV FRTED  + YL  S  IS+RGHL +ALSLFY S+QPHS QTYAYLFH CARLRCL+EG
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFY-SRQPHSLQTYAYLFHACARLRCLREG

Query:  AVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGER
          LHRYM+S +PM SFDLFVTNHLINMYCKCGHLDYA QLFNEMPRRN VSWTVLI+G SQYGHVDECFLIFSRMLVDHRPNEFTV+SLLTSFG+HDGER
Subjt:  AVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGER

Query:  GRQIHGFALK------------------------------------------------------------------------------------------
        GRQIHGFALK                                                                                          
Subjt:  GRQIHGFALK------------------------------------------------------------------------------------------

Query:  --SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDG
          S CN DE    L FC ++HCQALKTAF SEVEIITAL+KTYAELGGDIADS+RLF+EAGYNRDIVLWTSIM AF+DHDPGKTLSLFCQFRQEGLTPDG
Subjt:  --SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDG

Query:  HTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTT
        HTFSIVLKACAGFLTEKHASTYHSLLIKSMSED TVLNNALIHAYGRCGSI+SSKKVFNQMKHHDLVSWNTMMK YA+HGQAEIALQLF+KM VPPD+TT
Subjt:  HTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTT

Query:  FVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSL
        FVSLLSACSHAGLVEEGT+LFNSI NYG+VC+LDHYACMVDILGRSG++QEA DFIS MPIEPD+V+WSSFLGSC+K+GAT LAKLAS KLKELDPSNSL
Subjt:  FVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSL

Query:  AYVQMRISMWSLPATGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRAGSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYP
        AYVQM ISMWSLPA GCW P AFARFRV CASSQLP+ PKD+TLRTI+NSGVIACLRA SAELAMSAACAALNGG+SVLEIVMSTPGVLEVLQQLLQDYP
Subjt:  AYVQMRISMWSLPATGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRAGSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYP

Query:  TRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKG-IMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGIT
        T+TLGVGTVLN+KDAKNAV+AGAKFLMSPTMVKG IM D+EGEFLYIPGVMTPTEVLTAYE+G++IVKVYPVSALGGIKYISALKKPFPHISMVASQGIT
Subjt:  TRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKG-IMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGIT

Query:  I
        I
Subjt:  I

KAG6601094.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.5e-29384.91Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA
        MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA

Query:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
        VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
Subjt:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG

Query:  RQIHGFALK-------------------------------------------------------------------------------------------
        RQIHGFALK                                                                                           
Subjt:  RQIHGFALK-------------------------------------------------------------------------------------------

Query:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
        SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
Subjt:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT

Query:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
        FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
Subjt:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV

Query:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
        SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
Subjt:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY

Query:  VQM
        VQM
Subjt:  VQM

KAG7031898.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MKFFNMITSPSPLSVGSQINAVFSAKNLLRFIDLMNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLE
        MKFFNMITSPSPLSVGSQINAVFSAKNLLRFIDLMNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLE
Subjt:  MKFFNMITSPSPLSVGSQINAVFSAKNLLRFIDLMNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLE

Query:  QALSLFYSRQPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDE
        QALSLFYSRQPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDE
Subjt:  QALSLFYSRQPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDE

Query:  CFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIE
        CFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIE
Subjt:  CFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIE

Query:  AGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFN
        AGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFN
Subjt:  AGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFN

Query:  QMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKM
        QMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKM
Subjt:  QMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKM

Query:  PIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISMWSLPATGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRAG
        PIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISMWSLPATGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRAG
Subjt:  PIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISMWSLPATGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRAG

Query:  SAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKGIMDDLEGEFLYIPGVMTPTEVLTAY
        SAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKGIMDDLEGEFLYIPGVMTPTEVLTAY
Subjt:  SAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKGIMDDLEGEFLYIPGVMTPTEVLTAY

Query:  EAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGITIESTGDYIRGGASSVVLSDAIFNKEFMKQKNFEGISQLSKLAASRAMEALEWVQLDDVKS
        EAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGITIESTGDYIRGGASSVVLSDAIFNKEFMKQKNFEGISQLSKLAASRAMEALEWVQLDDVKS
Subjt:  EAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGITIESTGDYIRGGASSVVLSDAIFNKEFMKQKNFEGISQLSKLAASRAMEALEWVQLDDVKS

Query:  LRSLNG
        LRSLNG
Subjt:  LRSLNG

XP_022957425.1 pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita moschata]2.0e-29084.08Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA
        MNLTTIHFRFLAKRNLVLYPSKYGF SQLRFWRSGAEGDIVSFRTEDFRH YLFGSPVISTRGHLEQALSLFYSRQPHS QTYAYLFHACARLRCLREGA
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA

Query:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
        VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
Subjt:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG

Query:  RQIHGFALK-------------------------------------------------------------------------------------------
        RQIHGFALK                                                                                           
Subjt:  RQIHGFALK-------------------------------------------------------------------------------------------

Query:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
        SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADS+RLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
Subjt:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT

Query:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
        FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
Subjt:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV

Query:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
        SLLSACSHAGLVEEGT LFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
Subjt:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY

Query:  VQM
        VQM
Subjt:  VQM

XP_023511808.1 pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita pepo subsp. pepo]1.5e-28582.92Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA
        MNLTTIHFRFLAKRNLVLYPSKY F SQLRFWRSGAEGDIVSFRTEDFRH YLFGS VISTRGHL QALSLFYSRQPHSLQTYAYLFHACARLRCLREG 
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA

Query:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
         LHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
Subjt:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG

Query:  RQIHGFALK-------------------------------------------------------------------------------------------
        RQ+HGFALK                                                                                           
Subjt:  RQIHGFALK-------------------------------------------------------------------------------------------

Query:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
        SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITAL+KTYAELGGDI DS+RLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLF QFRQEGLTPDGHT
Subjt:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT

Query:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
        FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
Subjt:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV

Query:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
        SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
Subjt:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY

Query:  VQM
        VQM
Subjt:  VQM

TrEMBL top hitse value%identityAlignment
A0A1S3BDV4 pentatricopeptide repeat-containing protein At1g714201.1e-24371.36Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFY-SRQPHSLQTYAYLFHACARLRCLREG
        M LTTI+  F   RNLV  PSK+ F  Q R WRS AEGDIV FRTED  + YL  +  IS+RGHL +ALSLFY SRQPHS QTYAYLFH CARLRCL+EG
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFY-SRQPHSLQTYAYLFHACARLRCLREG

Query:  AVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGER
          LHRYM+S +PM SFDLFVTNHLINMYCKCGHLDYA QLFNEMPRRN VSWTVLISGLSQYGHVDECF IFSRMLVD RPNEFTVASLLTSFG+HDGER
Subjt:  AVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGER

Query:  GRQIHGFALK------------------------------------------------------------------------------------------
        GRQIHGFALK                                                                                          
Subjt:  GRQIHGFALK------------------------------------------------------------------------------------------

Query:  -SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGH
           CN DE    LGFC ++HCQALKTAFTSE+EIITAL+KTYAELGG+IADS++LF+EAGYNRDIVLWTSIM AF+DHDPGKTLSLFCQFRQEGLTPDGH
Subjt:  -SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGH

Query:  TFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTF
        TFS+VLKACAGFLTEKHAS YHSLLIKSMSEDDTVLNNALIHAYGRCGSI+SSKKVFNQMKHHDLVSWNTMMK YA+HGQAEIALQLF+KM VPPD+TTF
Subjt:  TFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTF

Query:  VSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLA
        VSLLSACSHAGLVEEGT+LFNSI NYG+VCQLDHYACMVDILGRSGR+QEA DFISKMPIEPD+V+WSSFLGSC+K+GA  LAKLAS KLKELDPSNSLA
Subjt:  VSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLA

Query:  YVQM
        YVQM
Subjt:  YVQM

A0A5D3D022 Pentatricopeptide repeat-containing protein1.1e-24371.36Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFY-SRQPHSLQTYAYLFHACARLRCLREG
        M LTTI+  F   RNLV  PSK+ F  Q R WRS AEGDIV FRTED  + YL  +  IS+RGHL +ALSLFY SRQPHS QTYAYLFH CARLRCL+EG
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFY-SRQPHSLQTYAYLFHACARLRCLREG

Query:  AVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGER
          LHRYM+S +PM SFDLFVTNHLINMYCKCGHLDYA QLFNEMPRRN VSWTVLISGLSQYGHVDECF IFSRMLVD RPNEFTVASLLTSFG+HDGER
Subjt:  AVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGER

Query:  GRQIHGFALK------------------------------------------------------------------------------------------
        GRQIHGFALK                                                                                          
Subjt:  GRQIHGFALK------------------------------------------------------------------------------------------

Query:  -SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGH
           CN DE    LGFC ++HCQALKTAFTSE+EIITAL+KTYAELGG+IADS++LF+EAGYNRDIVLWTSIM AF+DHDPGKTLSLFCQFRQEGLTPDGH
Subjt:  -SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGH

Query:  TFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTF
        TFS+VLKACAGFLTEKHAS YHSLLIKSMSEDDTVLNNALIHAYGRCGSI+SSKKVFNQMKHHDLVSWNTMMK YA+HGQAEIALQLF+KM VPPD+TTF
Subjt:  TFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTF

Query:  VSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLA
        VSLLSACSHAGLVEEGT+LFNSI NYG+VCQLDHYACMVDILGRSGR+QEA DFISKMPIEPD+V+WSSFLGSC+K+GA  LAKLAS KLKELDPSNSLA
Subjt:  VSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLA

Query:  YVQM
        YVQM
Subjt:  YVQM

A0A6J1CBA2 pentatricopeptide repeat-containing protein At1g71420 isoform X18.8e-24469.44Show/hide
Query:  SVGSQINAVFSAKNLLRFIDLMNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYS-RQPH
        +VG  +  +  A+ + R ID+M L TIH+ FLAKRNLVLYPSK+ F   LR+WRS AE D V  RTED  + YL+ + VISTRGHL  ALSLFYS RQPH
Subjt:  SVGSQINAVFSAKNLLRFIDLMNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYS-RQPH

Query:  SLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDH
        S QTYAYLFHACARLRCL EG  LHRYMMS D M SFDLFVTNHLINMYCKCGHLDYA+QLF+EMPRRNLVSWTVLISGLSQYGHVDECFL+F RMLVD 
Subjt:  SLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDH

Query:  RPNEFTVASLLTSFGDHDGERGRQIHGFALK---------------------------------------------------------------------
        RPNEFTVASLLTSFG+HDGERGRQ+HGFALK                                                                     
Subjt:  RPNEFTVASLLTSFGDHDGERGRQIHGFALK---------------------------------------------------------------------

Query:  ----------------------SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHD
                              +LCN DE  LGL FC ELHC A KTAF SE+E+ TAL+KTYA+LGGDIADS+RLF+EAGY+ DIVLWTSIMTA V+HD
Subjt:  ----------------------SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHD

Query:  PGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHG
        PGKTLSLF QFRQEGLTPDGHTFSIVLKACAG+LTEKHASTYHSLLIKSMSEDD VLNNALIHAYGRCGSIT SKKVF +MK+ DLVSWNTMMK YA+HG
Subjt:  PGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHG

Query:  QAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGA
        QA+ AL LFSKM VPPDSTTFVSLLSACSHAGLVEEGT+LFNSI  YG+VCQLDHYACMVDILGR GR+QEAE FISKMPIEPD+V+WSSFLGSC+KHGA
Subjt:  QAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGA

Query:  TQLAKLASDKLKELDPSNSLAYVQM
        TQLAKLAS+KLKELDPSNSLAYVQM
Subjt:  TQLAKLASDKLKELDPSNSLAYVQM

A0A6J1H0I1 pentatricopeptide repeat-containing protein At1g71420 isoform X19.6e-29184.08Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA
        MNLTTIHFRFLAKRNLVLYPSKYGF SQLRFWRSGAEGDIVSFRTEDFRH YLFGSPVISTRGHLEQALSLFYSRQPHS QTYAYLFHACARLRCLREGA
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA

Query:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
        VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
Subjt:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG

Query:  RQIHGFALK-------------------------------------------------------------------------------------------
        RQIHGFALK                                                                                           
Subjt:  RQIHGFALK-------------------------------------------------------------------------------------------

Query:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
        SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADS+RLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
Subjt:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT

Query:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
        FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
Subjt:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV

Query:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
        SLLSACSHAGLVEEGT LFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
Subjt:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY

Query:  VQM
        VQM
Subjt:  VQM

A0A6J1JQJ9 pentatricopeptide repeat-containing protein At1g71420 isoform X13.4e-28081.26Show/hide
Query:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA
        M LTTIHFRFLAKRNLVLYPSKY F SQLRFWRSG EGDIVSFRTEDFR  YLFGS VISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA
Subjt:  MNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQPHSLQTYAYLFHACARLRCLREGA

Query:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
        VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQY HVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG
Subjt:  VLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERG

Query:  RQIHGFALK-------------------------------------------------------------------------------------------
        RQ+HGFALK                                                                                           
Subjt:  RQIHGFALK-------------------------------------------------------------------------------------------

Query:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT
        SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITAL+KTYAELGGDI DS+RLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLF QFRQEGLTPDGHT
Subjt:  SLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHT

Query:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
        FSIVLKACAGFLTEKHASTYHSLLIKS SEDDTV+NNALIHAYGRCGSITSSKKVF+QMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV
Subjt:  FSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFV

Query:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
        SLLSACSHAGLVEEGT LFNSI NYGLVCQLDHYACMVDILGRSGRI+EAE F+SKMPIEPDYV+WSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
Subjt:  SLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY

Query:  VQM
        VQM
Subjt:  VQM

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148202.4e-6530.93Show/hide
Query:  TRGHLEQALSLFYSRQPH---SLQTYAYL--FHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLI
        +R    +A  LFY R  H    L  +++L    A +++  L EG  LH     +  +   D FV    ++MY  CG ++YA  +F+EM  R++V+W  +I
Subjt:  TRGHLEQALSLFYSRQPH---SLQTYAYL--FHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLI

Query:  SGLSQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWD--------ELDLGLGFCRELHCQALKTAFTSEVEIITA
            ++G VDE F +F  M   +  P+E  + +++++ G     R  R I+ F +++    D         +  G G C ++  +  +      + + TA
Subjt:  SGLSQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWD--------ELDLGLGFCRELHCQALKTAFTSEVEIITA

Query:  LLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHD-PGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVL
        ++  Y++ G    D  ++  +    +D+V WT++++A+V+ D P + L +F +    G+ PD  +   V+ ACA       A   HS +  +  E +  +
Subjt:  LLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHD-PGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVL

Query:  NNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKM---TVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQL
        NNALI+ Y +CG + +++ VF +M   ++VSW++M+   ++HG+A  AL LF++M    V P+  TFV +L  CSH+GLVEEG  +F S+ + Y +  +L
Subjt:  NNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKM---TVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQL

Query:  DHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM
        +HY CMVD+ GR+  ++EA + I  MP+  + VIW S + +C+ HG  +L K A+ ++ EL+P +  A V M
Subjt:  DHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM

Q9C9H9 Pentatricopeptide repeat-containing protein At1g714205.3e-11341.2Show/hide
Query:  YLFGSPVISTRGHLEQALSLFYSR--QPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLV
        ++ G   +   G + +A+SLFYS   +  S Q YA LF ACA  R L +G  LH +M+S     S ++ + N LINMY KCG++ YA Q+F+ MP RN+V
Subjt:  YLFGSPVISTRGHLEQALSLFYSR--QPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLV

Query:  SWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERGRQIHGFAL---------------------------------------KS
        SWT LI+G  Q G+  E F +FS ML    PNEFT++S+LTS      E G+Q+HG AL                                       K+
Subjt:  SWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERGRQIHGFAL---------------------------------------KS

Query:  LCNWDEL---------------------DLGLGF--------------------------CRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRL
        L  W+ +                       G+GF                          C +LH   +K+   ++ E+ TAL+K Y+E+  D  D ++L
Subjt:  LCNWDEL---------------------DLGLGF--------------------------CRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRL

Query:  FIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK
        F+E  + RDIV W  I+TAF  +DP + + LF Q RQE L+PD +TFS VLKACAG +T +HA + H+ +IK     DTVLNN+LIHAY +CGS+    +
Subjt:  FIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK

Query:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSI-ANYGLVCQLDHYACMVDILGRSGRIQEAEDF
        VF+ M   D+VSWN+M+K Y++HGQ +  L +F KM + PDS TF++LLSACSHAG VEEG  +F S+      + QL+HYAC++D+L R+ R  EAE+ 
Subjt:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSI-ANYGLVCQLDHYACMVDILGRSGRIQEAEDF

Query:  ISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKEL-DPSNSLAYVQM
        I +MP++PD V+W + LGSC+KHG T+L KLA+DKLKEL +P+NS++Y+QM
Subjt:  ISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKEL-DPSNSLAYVQM

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220701.3e-6332Show/hide
Query:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKC-------------------------------GHLDYAYQLFNEMPRRNLVS
        T   +  + A  RC+  G  +H +++ L   G  ++ V+N L+NMY KC                               G +D A   F +M  R++V+
Subjt:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKC-------------------------------GHLDYAYQLFNEMPRRNLVS

Query:  WTVLISGLSQYGHVDECFLIFSRMLVDH--RPNEFTVASLLTSFGDHDGE-RGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTA---------FTS
        W  +ISG +Q G+      IFS+ML D    P+ FT+AS+L++  + +    G+QIH   + +  +   + L         C  ++TA            
Subjt:  WTVLISGLSQYGHVDECFLIFSRMLVDH--RPNEFTVASLLTSFGDHDGE-RGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTA---------FTS

Query:  EVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDP-GKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSM
        ++E  TALL  Y +L GD+  +  +F+    +RD+V WT+++  +  H   G+ ++LF      G  P+ +T + +L   +   +  H    H   +KS 
Subjt:  EVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDP-GKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSM

Query:  SEDDTVLNNALIHAYGRCGSITSSKKVFNQMK-HHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVP---PDSTTFVSLLSACSHAGLVEEGTNLFNSIAN
              ++NALI  Y + G+ITS+ + F+ ++   D VSW +M+   A HG AE AL+LF  M +    PD  T+V + SAC+HAGLV +G   F+ + +
Subjt:  SEDDTVLNNALIHAYGRCGSITSSKKVFNQMK-HHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVP---PDSTTFVSLLSACSHAGLVEEGTNLFNSIAN

Query:  YG-LVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISMWSLPATGCWNPFAFAR
           ++  L HYACMVD+ GR+G +QEA++FI KMPIEPD V W S L +C+ H    L K+A+++L  L+P NS AY  +  +++S  A G W   A  R
Subjt:  YG-LVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISMWSLPATGCWNPFAFAR

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial2.7e-6433.92Show/hide
Query:  LEQALSLFYSRQPHSL----QTYAYLFHACARLRCLREGAVL--HRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGL
        L +A+    S Q H L     TY+ L   C   R + EG ++  H Y     PM    +F+ N LINMY K   L+ A+QLF++MP+RN++SWT +IS  
Subjt:  LEQALSLFYSRQPHSL----QTYAYLFHACARLRCLREGAVL--HRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGL

Query:  SQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDI
        S+     +   +   ML D+ RPN +T +S+L S                    CN      G+   R LHC  +K    S+V + +AL+  +A+L G+ 
Subjt:  SQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDI

Query:  ADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGK-TLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCG
         D+  +F E     D ++W SI+  F  +      L LF + ++ G   +  T + VL+AC G    +     H  ++K   + D +LNNAL+  Y +CG
Subjt:  ADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGK-TLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCG

Query:  SITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQLDHYACMVDILGR
        S+  + +VFNQMK  D+++W+TM+   A +G ++ AL+LF +M      P+  T V +L ACSHAGL+E+G   F S+   YG+    +HY CM+D+LG+
Subjt:  SITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQLDHYACMVDILGR

Query:  SGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
        +G++ +A   +++M  EPD V W + LG+C+      LA+ A+ K+  LDP ++  Y
Subjt:  SGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136001.1e-6534.15Show/hide
Query:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDH-RP
        ++A +  AC+ L  + +G  +H  +++  P  S D+++ + L++MY KCG+++ A ++F+EM  RN+VSW  LI+   Q G   E   +F  ML     P
Subjt:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDH-RP

Query:  NEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWDELDLGLGFC-RELHCQALKTA---FTS----EVEIITALLKTYAELGGDIADSFRLFIEAGYNR
        +E T+AS++++       + G+++HG  +K+    +++ L   F      C  +K A   F S     V   T+++  YA        + RL       R
Subjt:  NEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWDELDLGLGFC-RELHCQALKTA---FTS----EVEIITALLKTYAELGGDIADSFRLFIEAGYNR

Query:  DIVLWTSIMTAFVDH-DPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTY-------HSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK
        ++V W +++  +  + +  + LSLFC  ++E + P  ++F+ +LKACA  L E H           H    +S  EDD  + N+LI  Y +CG +     
Subjt:  DIVLWTSIMTAFVDH-DPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTY-------HSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK

Query:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIA-NYGLVCQLDHYACMVDILGRSGRIQEA
        VF +M   D VSWN M+  +A +G    AL+LF +M      PD  T + +LSAC HAG VEEG + F+S+  ++G+    DHY CMVD+LGR+G ++EA
Subjt:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIA-NYGLVCQLDHYACMVDILGRSGRIQEA

Query:  EDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYV
        +  I +MP++PD VIW S L +CK H    L K  ++KL E++PSNS  YV
Subjt:  EDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYV

Arabidopsis top hitse value%identityAlignment
AT1G71420.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-11441.2Show/hide
Query:  YLFGSPVISTRGHLEQALSLFYSR--QPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLV
        ++ G   +   G + +A+SLFYS   +  S Q YA LF ACA  R L +G  LH +M+S     S ++ + N LINMY KCG++ YA Q+F+ MP RN+V
Subjt:  YLFGSPVISTRGHLEQALSLFYSR--QPHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLV

Query:  SWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERGRQIHGFAL---------------------------------------KS
        SWT LI+G  Q G+  E F +FS ML    PNEFT++S+LTS      E G+Q+HG AL                                       K+
Subjt:  SWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVASLLTSFGDHDGERGRQIHGFAL---------------------------------------KS

Query:  LCNWDEL---------------------DLGLGF--------------------------CRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRL
        L  W+ +                       G+GF                          C +LH   +K+   ++ E+ TAL+K Y+E+  D  D ++L
Subjt:  LCNWDEL---------------------DLGLGF--------------------------CRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRL

Query:  FIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK
        F+E  + RDIV W  I+TAF  +DP + + LF Q RQE L+PD +TFS VLKACAG +T +HA + H+ +IK     DTVLNN+LIHAY +CGS+    +
Subjt:  FIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK

Query:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSI-ANYGLVCQLDHYACMVDILGRSGRIQEAEDF
        VF+ M   D+VSWN+M+K Y++HGQ +  L +F KM + PDS TF++LLSACSHAG VEEG  +F S+      + QL+HYAC++D+L R+ R  EAE+ 
Subjt:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDSTTFVSLLSACSHAGLVEEGTNLFNSI-ANYGLVCQLDHYACMVDILGRSGRIQEAEDF

Query:  ISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKEL-DPSNSLAYVQM
        I +MP++PD V+W + LGSC+KHG T+L KLA+DKLKEL +P+NS++Y+QM
Subjt:  ISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKEL-DPSNSLAYVQM

AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-6533.92Show/hide
Query:  LEQALSLFYSRQPHSL----QTYAYLFHACARLRCLREGAVL--HRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGL
        L +A+    S Q H L     TY+ L   C   R + EG ++  H Y     PM    +F+ N LINMY K   L+ A+QLF++MP+RN++SWT +IS  
Subjt:  LEQALSLFYSRQPHSL----QTYAYLFHACARLRCLREGAVL--HRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGL

Query:  SQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDI
        S+     +   +   ML D+ RPN +T +S+L S                    CN      G+   R LHC  +K    S+V + +AL+  +A+L G+ 
Subjt:  SQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDI

Query:  ADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGK-TLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCG
         D+  +F E     D ++W SI+  F  +      L LF + ++ G   +  T + VL+AC G    +     H  ++K   + D +LNNAL+  Y +CG
Subjt:  ADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGK-TLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCG

Query:  SITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQLDHYACMVDILGR
        S+  + +VFNQMK  D+++W+TM+   A +G ++ AL+LF +M      P+  T V +L ACSHAGL+E+G   F S+   YG+    +HY CM+D+LG+
Subjt:  SITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQLDHYACMVDILGR

Query:  SGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY
        +G++ +A   +++M  EPD V W + LG+C+      LA+ A+ K+  LDP ++  Y
Subjt:  SGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAY

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein7.7e-6734.15Show/hide
Query:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDH-RP
        ++A +  AC+ L  + +G  +H  +++  P  S D+++ + L++MY KCG+++ A ++F+EM  RN+VSW  LI+   Q G   E   +F  ML     P
Subjt:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDH-RP

Query:  NEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWDELDLGLGFC-RELHCQALKTA---FTS----EVEIITALLKTYAELGGDIADSFRLFIEAGYNR
        +E T+AS++++       + G+++HG  +K+    +++ L   F      C  +K A   F S     V   T+++  YA        + RL       R
Subjt:  NEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWDELDLGLGFC-RELHCQALKTA---FTS----EVEIITALLKTYAELGGDIADSFRLFIEAGYNR

Query:  DIVLWTSIMTAFVDH-DPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTY-------HSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK
        ++V W +++  +  + +  + LSLFC  ++E + P  ++F+ +LKACA  L E H           H    +S  EDD  + N+LI  Y +CG +     
Subjt:  DIVLWTSIMTAFVDH-DPGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTY-------HSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKK

Query:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIA-NYGLVCQLDHYACMVDILGRSGRIQEA
        VF +M   D VSWN M+  +A +G    AL+LF +M      PD  T + +LSAC HAG VEEG + F+S+  ++G+    DHY CMVD+LGR+G ++EA
Subjt:  VFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMT---VPPDSTTFVSLLSACSHAGLVEEGTNLFNSIA-NYGLVCQLDHYACMVDILGRSGRIQEA

Query:  EDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYV
        +  I +MP++PD VIW S L +CK H    L K  ++KL E++PSNS  YV
Subjt:  EDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYV

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein9.4e-6532Show/hide
Query:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKC-------------------------------GHLDYAYQLFNEMPRRNLVS
        T   +  + A  RC+  G  +H +++ L   G  ++ V+N L+NMY KC                               G +D A   F +M  R++V+
Subjt:  TYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKC-------------------------------GHLDYAYQLFNEMPRRNLVS

Query:  WTVLISGLSQYGHVDECFLIFSRMLVDH--RPNEFTVASLLTSFGDHDGE-RGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTA---------FTS
        W  +ISG +Q G+      IFS+ML D    P+ FT+AS+L++  + +    G+QIH   + +  +   + L         C  ++TA            
Subjt:  WTVLISGLSQYGHVDECFLIFSRMLVDH--RPNEFTVASLLTSFGDHDGE-RGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTA---------FTS

Query:  EVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDP-GKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSM
        ++E  TALL  Y +L GD+  +  +F+    +RD+V WT+++  +  H   G+ ++LF      G  P+ +T + +L   +   +  H    H   +KS 
Subjt:  EVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDP-GKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSM

Query:  SEDDTVLNNALIHAYGRCGSITSSKKVFNQMK-HHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVP---PDSTTFVSLLSACSHAGLVEEGTNLFNSIAN
              ++NALI  Y + G+ITS+ + F+ ++   D VSW +M+   A HG AE AL+LF  M +    PD  T+V + SAC+HAGLV +G   F+ + +
Subjt:  SEDDTVLNNALIHAYGRCGSITSSKKVFNQMK-HHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVP---PDSTTFVSLLSACSHAGLVEEGTNLFNSIAN

Query:  YG-LVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISMWSLPATGCWNPFAFAR
           ++  L HYACMVD+ GR+G +QEA++FI KMPIEPD V W S L +C+ H    L K+A+++L  L+P NS AY  +  +++S  A G W   A  R
Subjt:  YG-LVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISMWSLPATGCWNPFAFAR

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-6630.93Show/hide
Query:  TRGHLEQALSLFYSRQPH---SLQTYAYL--FHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLI
        +R    +A  LFY R  H    L  +++L    A +++  L EG  LH     +  +   D FV    ++MY  CG ++YA  +F+EM  R++V+W  +I
Subjt:  TRGHLEQALSLFYSRQPH---SLQTYAYL--FHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLI

Query:  SGLSQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWD--------ELDLGLGFCRELHCQALKTAFTSEVEIITA
            ++G VDE F +F  M   +  P+E  + +++++ G     R  R I+ F +++    D         +  G G C ++  +  +      + + TA
Subjt:  SGLSQYGHVDECFLIFSRMLVDH-RPNEFTVASLLTSFGDHDGER-GRQIHGFALKSLCNWD--------ELDLGLGFCRELHCQALKTAFTSEVEIITA

Query:  LLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHD-PGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVL
        ++  Y++ G    D  ++  +    +D+V WT++++A+V+ D P + L +F +    G+ PD  +   V+ ACA       A   HS +  +  E +  +
Subjt:  LLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHD-PGKTLSLFCQFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVL

Query:  NNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKM---TVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQL
        NNALI+ Y +CG + +++ VF +M   ++VSW++M+   ++HG+A  AL LF++M    V P+  TFV +L  CSH+GLVEEG  +F S+ + Y +  +L
Subjt:  NNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKM---TVPPDSTTFVSLLSACSHAGLVEEGTNLFNSIAN-YGLVCQL

Query:  DHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM
        +HY CMVD+ GR+  ++EA + I  MP+  + VIW S + +C+ HG  +L K A+ ++ EL+P +  A V M
Subjt:  DHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTTTTTAATATGATAACCAGTCCATCGCCTCTCTCTGTGGGTTCGCAAATCAATGCCGTTTTCTCTGCTAAAAATCTGTTAAGGTTCATTGACCTCATGAATCT
CACGACAATTCATTTTCGATTTCTTGCCAAACGGAATTTGGTTTTGTACCCGAGTAAGTATGGTTTTGATTCCCAACTTAGATTCTGGAGGTCGGGGGCAGAAGGCGATA
TCGTGTCTTTTAGGACAGAAGATTTTCGTCATGGCTATCTATTTGGATCCCCCGTGATTTCCACGCGTGGCCACCTTGAGCAGGCTCTTTCACTGTTTTACTCTAGACAG
CCTCATTCCCTCCAGACCTATGCGTATCTCTTCCATGCTTGTGCACGCCTCCGCTGCCTCCGGGAAGGTGCGGTACTACACCGTTACATGATGTCCCTGGATCCCATGGG
CTCATTTGACCTCTTTGTTACCAATCACCTTATCAACATGTACTGTAAATGTGGTCATTTAGACTATGCCTACCAATTATTCAATGAGATGCCTAGGAGAAACCTTGTCT
CTTGGACTGTGCTTATCTCGGGACTTTCTCAATATGGCCATGTGGATGAGTGCTTCCTTATATTTTCGAGAATGTTGGTAGATCACAGGCCAAATGAGTTTACAGTTGCA
AGTTTGCTTACCTCGTTTGGTGACCATGATGGTGAGCGTGGGCGGCAGATACATGGGTTTGCCTTGAAAAGTCTCTGCAATTGGGATGAACTTGATCTTGGTTTGGGCTT
TTGTCGTGAATTACACTGTCAAGCATTAAAAACTGCTTTCACATCAGAAGTTGAAATAATTACTGCATTGTTAAAAACTTATGCTGAACTTGGAGGGGACATTGCGGATA
GTTTTAGGCTTTTTATTGAAGCAGGATATAATCGGGATATAGTTTTATGGACCAGCATCATGACAGCTTTTGTAGACCATGACCCTGGGAAAACACTTTCCCTTTTTTGT
CAGTTCCGACAAGAAGGCTTAACTCCAGATGGACACACTTTTTCGATTGTATTAAAGGCTTGTGCTGGATTCCTAACAGAGAAGCATGCTTCAACATATCATTCACTGCT
AATTAAATCTATGTCTGAGGATGACACTGTCCTTAACAATGCCTTGATTCATGCTTATGGGAGGTGTGGTTCAATTACTTCATCCAAGAAAGTATTCAATCAAATGAAAC
ATCACGATTTGGTTTCTTGGAACACAATGATGAAGGTCTATGCTGTCCATGGCCAAGCTGAGATTGCTTTGCAGCTTTTTTCAAAGATGACTGTGCCACCTGATTCTACT
ACATTTGTCTCTCTTCTTTCAGCATGTAGCCATGCAGGGCTCGTGGAAGAAGGGACCAACCTTTTTAATTCAATTGCAAATTATGGACTTGTTTGCCAACTAGATCACTA
TGCTTGCATGGTTGACATTTTGGGGAGATCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAAATGCCTATAGAACCTGATTATGTTATTTGGAGTTCATTCCTGG
GATCATGTAAAAAGCATGGCGCAACACAATTGGCCAAATTAGCATCTGATAAATTGAAGGAGTTAGATCCTAGCAATTCCTTAGCTTATGTGCAAATGAGAATCAGCATG
TGGTCTTTGCCTGCTACTGGGTGTTGGAACCCTTTCGCATTTGCTCGCTTTAGAGTTTCTTGTGCTTCTAGTCAGCTACCAGTTTCTCCCAAAGACAAGACCCTGAGAAC
AATTTACAATTCTGGAGTCATTGCTTGCCTTCGTGCTGGCAGTGCGGAGCTGGCAATGAGTGCTGCTTGTGCTGCGCTAAATGGGGGAGTATCAGTTCTCGAGATTGTTA
TGTCGACACCAGGTGTGCTTGAGGTTCTACAACAGTTGCTGCAAGACTATCCTACAAGAACATTGGGAGTTGGGACCGTTCTTAATGTTAAGGATGCAAAGAATGCTGTC
GAAGCTGGAGCCAAGTTTCTAATGAGTCCCACTATGGTGAAGGGTATCATGGATGATCTTGAAGGGGAATTCTTGTATATACCTGGTGTGATGACCCCGACAGAAGTACT
GACTGCATATGAAGCTGGCGCTCAGATTGTCAAAGTTTATCCAGTTTCTGCATTAGGTGGTATCAAATATATATCAGCCCTCAAGAAGCCATTTCCTCATATCTCAATGG
TTGCTTCTCAAGGCATAACTATTGAATCTACTGGGGACTATATTAGAGGAGGAGCATCTTCGGTAGTTTTATCTGATGCAATATTTAATAAGGAGTTTATGAAGCAAAAG
AACTTTGAAGGAATATCTCAACTTTCTAAGTTGGCTGCTTCCCGGGCGATGGAAGCTTTAGAATGGGTTCAACTTGATGACGTTAAAAGCTTAAGGTCGTTGAACGGATG
A
mRNA sequenceShow/hide mRNA sequence
ATAATGTAAAAAAAATTAAATAAGGAAAGAAGAAAATAAAAAGATGAAATTTTTTAATATGATAACCAGTCCATCGCCTCTCTCTGTGGGTTCGCAAATCAATGCCGTTT
TCTCTGCTAAAAATCTGTTAAGGTTCATTGACCTCATGAATCTCACGACAATTCATTTTCGATTTCTTGCCAAACGGAATTTGGTTTTGTACCCGAGTAAGTATGGTTTT
GATTCCCAACTTAGATTCTGGAGGTCGGGGGCAGAAGGCGATATCGTGTCTTTTAGGACAGAAGATTTTCGTCATGGCTATCTATTTGGATCCCCCGTGATTTCCACGCG
TGGCCACCTTGAGCAGGCTCTTTCACTGTTTTACTCTAGACAGCCTCATTCCCTCCAGACCTATGCGTATCTCTTCCATGCTTGTGCACGCCTCCGCTGCCTCCGGGAAG
GTGCGGTACTACACCGTTACATGATGTCCCTGGATCCCATGGGCTCATTTGACCTCTTTGTTACCAATCACCTTATCAACATGTACTGTAAATGTGGTCATTTAGACTAT
GCCTACCAATTATTCAATGAGATGCCTAGGAGAAACCTTGTCTCTTGGACTGTGCTTATCTCGGGACTTTCTCAATATGGCCATGTGGATGAGTGCTTCCTTATATTTTC
GAGAATGTTGGTAGATCACAGGCCAAATGAGTTTACAGTTGCAAGTTTGCTTACCTCGTTTGGTGACCATGATGGTGAGCGTGGGCGGCAGATACATGGGTTTGCCTTGA
AAAGTCTCTGCAATTGGGATGAACTTGATCTTGGTTTGGGCTTTTGTCGTGAATTACACTGTCAAGCATTAAAAACTGCTTTCACATCAGAAGTTGAAATAATTACTGCA
TTGTTAAAAACTTATGCTGAACTTGGAGGGGACATTGCGGATAGTTTTAGGCTTTTTATTGAAGCAGGATATAATCGGGATATAGTTTTATGGACCAGCATCATGACAGC
TTTTGTAGACCATGACCCTGGGAAAACACTTTCCCTTTTTTGTCAGTTCCGACAAGAAGGCTTAACTCCAGATGGACACACTTTTTCGATTGTATTAAAGGCTTGTGCTG
GATTCCTAACAGAGAAGCATGCTTCAACATATCATTCACTGCTAATTAAATCTATGTCTGAGGATGACACTGTCCTTAACAATGCCTTGATTCATGCTTATGGGAGGTGT
GGTTCAATTACTTCATCCAAGAAAGTATTCAATCAAATGAAACATCACGATTTGGTTTCTTGGAACACAATGATGAAGGTCTATGCTGTCCATGGCCAAGCTGAGATTGC
TTTGCAGCTTTTTTCAAAGATGACTGTGCCACCTGATTCTACTACATTTGTCTCTCTTCTTTCAGCATGTAGCCATGCAGGGCTCGTGGAAGAAGGGACCAACCTTTTTA
ATTCAATTGCAAATTATGGACTTGTTTGCCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGATCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAA
ATGCCTATAGAACCTGATTATGTTATTTGGAGTTCATTCCTGGGATCATGTAAAAAGCATGGCGCAACACAATTGGCCAAATTAGCATCTGATAAATTGAAGGAGTTAGA
TCCTAGCAATTCCTTAGCTTATGTGCAAATGAGAATCAGCATGTGGTCTTTGCCTGCTACTGGGTGTTGGAACCCTTTCGCATTTGCTCGCTTTAGAGTTTCTTGTGCTT
CTAGTCAGCTACCAGTTTCTCCCAAAGACAAGACCCTGAGAACAATTTACAATTCTGGAGTCATTGCTTGCCTTCGTGCTGGCAGTGCGGAGCTGGCAATGAGTGCTGCT
TGTGCTGCGCTAAATGGGGGAGTATCAGTTCTCGAGATTGTTATGTCGACACCAGGTGTGCTTGAGGTTCTACAACAGTTGCTGCAAGACTATCCTACAAGAACATTGGG
AGTTGGGACCGTTCTTAATGTTAAGGATGCAAAGAATGCTGTCGAAGCTGGAGCCAAGTTTCTAATGAGTCCCACTATGGTGAAGGGTATCATGGATGATCTTGAAGGGG
AATTCTTGTATATACCTGGTGTGATGACCCCGACAGAAGTACTGACTGCATATGAAGCTGGCGCTCAGATTGTCAAAGTTTATCCAGTTTCTGCATTAGGTGGTATCAAA
TATATATCAGCCCTCAAGAAGCCATTTCCTCATATCTCAATGGTTGCTTCTCAAGGCATAACTATTGAATCTACTGGGGACTATATTAGAGGAGGAGCATCTTCGGTAGT
TTTATCTGATGCAATATTTAATAAGGAGTTTATGAAGCAAAAGAACTTTGAAGGAATATCTCAACTTTCTAAGTTGGCTGCTTCCCGGGCGATGGAAGCTTTAGAATGGG
TTCAACTTGATGACGTTAAAAGCTTAAGGTCGTTGAACGGATGATACAACTTGCAGCTCTAGCAAGGAGAATCAAAGTAGCTAGAGGTTAGTTCGGCCTATTCAATAGCG
TTGTTAAAAATAATTAAGGTCTCGACTCGTATAAGACTAAAGATGATTTGTTTTTCTCTATTTCATACTGTATAAGATTAAAGAGGTTAATTTGTTTAAATCTTGAGCCA
ATATAGTTATGTATATAGGGGAAAAAATATCAATTTCATATCCATTAATTACAT
Protein sequenceShow/hide protein sequence
MKFFNMITSPSPLSVGSQINAVFSAKNLLRFIDLMNLTTIHFRFLAKRNLVLYPSKYGFDSQLRFWRSGAEGDIVSFRTEDFRHGYLFGSPVISTRGHLEQALSLFYSRQ
PHSLQTYAYLFHACARLRCLREGAVLHRYMMSLDPMGSFDLFVTNHLINMYCKCGHLDYAYQLFNEMPRRNLVSWTVLISGLSQYGHVDECFLIFSRMLVDHRPNEFTVA
SLLTSFGDHDGERGRQIHGFALKSLCNWDELDLGLGFCRELHCQALKTAFTSEVEIITALLKTYAELGGDIADSFRLFIEAGYNRDIVLWTSIMTAFVDHDPGKTLSLFC
QFRQEGLTPDGHTFSIVLKACAGFLTEKHASTYHSLLIKSMSEDDTVLNNALIHAYGRCGSITSSKKVFNQMKHHDLVSWNTMMKVYAVHGQAEIALQLFSKMTVPPDST
TFVSLLSACSHAGLVEEGTNLFNSIANYGLVCQLDHYACMVDILGRSGRIQEAEDFISKMPIEPDYVIWSSFLGSCKKHGATQLAKLASDKLKELDPSNSLAYVQMRISM
WSLPATGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRAGSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAV
EAGAKFLMSPTMVKGIMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGITIESTGDYIRGGASSVVLSDAIFNKEFMKQK
NFEGISQLSKLAASRAMEALEWVQLDDVKSLRSLNG