; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015412 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015412
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold586:180208..181938
RNA-Seq ExpressionMS015412
SyntenyMS015412
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152299.1 pentatricopeptide repeat-containing protein At4g20090 isoform X1 [Cucumis sativus]5.2e-26982.54Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        R+ S LLK  +LHF   SS+FF TS T N IAIAPR  ARRPTSR+AP PR+ +TL S+DVVNSVCSLLSNKN QT NLDLDHLLKRF +NLSSD VL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNY++LGRAKTLEFFSWSGLQMG+RFD SVVEYMADF GRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM R+G VPTR++VNILIG+LCSLSAKEGA+EKVRV ST R
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAIEPAVG+FWAAN+++LVPSSFV VQLISELCRLGQMQEAI VLKVVE  KLRC EEC+S+VM+ALCE+R V+EASDLFGRMLSQG
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YN VICMLCKLGN+  AERVF IMN+KRC PDHVTYSALIHAY E  +WSAAY LLKEMLSLGMSPHFH+YS VDKLMREHGQ+DLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE
        WEAQIL KLCKQGQLEAAYEK+KSMLEKG+ PP YVRDAFE+AFQK GK+KIARELL+K+ GVH  E
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE

XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]2.6e-26882.19Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        R+ S LLK  +LHF  FSS+FF TS T   IAIAPR   RRPTSR+AP PR+ +T+ S+DVVNSVCSLLSNKN QT NLD++HLLKRF +NLSSDLVL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNY++LGRAKTLEFFSWSGLQMG+RFD SVVEYMADF GRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM R+G VPTRS+ NILIG+LCSLSAKEGA+EKVRVRST R
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAIEPAVG+FWAAN++ LVPSSFV VQLISELCR+GQMQEAI+VLKVVE  KLRC EEC+S+VM+ALCE+R ++EASDLFGRMLSQG
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YN VICMLCKLGN+  AERVF IMN+KRC PDHVTYSALIHAY E  NWSAAY LLKEMLSLGMSPHFH+YS VDKLMREHGQVDLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE
        WEAQIL KLCKQGQLEAAYEK+KSMLEKG+ PP YVRDAFE+AFQK GK+KIARELL+K+ GVH  E
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE

XP_022145163.1 pentatricopeptide repeat-containing protein At5g39710-like [Momordica charantia]0.0e+0099.13Show/hide
Query:  MLSRNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLV
        MLSRNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLD LLKRFNENLSSDLV
Subjt:  MLSRNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLV

Query:  LRILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
        LRILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
Subjt:  LRILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK

Query:  PDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRS
        PDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMER GSVPTRS+VNILIGDLCSLSAKEGAIEKVRVRS
Subjt:  PDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRS

Query:  TRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRML
        TRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRML
Subjt:  TRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRML

Query:  SQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKL
        SQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPD VTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKL
Subjt:  SQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKL

Query:  EMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHNREESVIRHSP
        EMKWEAQIL KLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHNREESVIRHSP
Subjt:  EMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHNREESVIRHSP

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata]6.8e-26983.95Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        RN S  LKF    FS +SS+   TS+TR   AIAPR  ARRPTSR+AP+PRALD    TD V+SVCSLLSNKNHQTTNL+LDHLLKRF E LSSD VL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNYR+ GRAKTLEFFSWSGLQMGYRFDESVVEYMADF GRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVF+EM R+G VPTRS+VNILIGDLCSLSAKEGA+E+VRVRSTRR
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAI+ AVGVFWAANR+ALVPS+FV+V+LISELCRLGQMQEAI VLKVVE  KLRC EEC+SIVMQALCE+R+V+EASDLFGRMLSQ 
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YNSVICMLCKLGN+ DAERVFKIMNRKRCVPDHVTYSALIHAY E  NWSAAYSLLKEMLSLG+SPHFH+YS VDKLMRE GQ DLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE
        WE+QIL KLCKQGQL  AYEKLKSMLEKG +PP YVRDAFE+AFQK GK+KIARELL+ + GVH  E
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida]2.3e-28085.19Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        RN S  LKF  LHF  FSS+FF TST    IAIAPR  ARRPTSR+AP+PRA DTL S+DVVNSVCSLLSNKNHQT NLDLDHLLKRF + LSSDLVL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNYR+LGRAKTLEFFSWSGLQMGYRFDE+VVEYMADF GRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTAL+IFRRIELPDKYSYSN+IIGLCKFGRF TA+EVF+EM R+G VPTRS+VNILIGDLCSLSAKEGA+E+VRVRSTRR
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAIEPAVG+FWAAN++ALVPS+FV+VQLISELCRLGQMQEAI+VLKVVE  KLRC EEC+S+VM+ALCE+R VEEASDLFGR+LSQG
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YNS+ICMLCK+GN+ DAERVFKIMNRKRC PDHVTYS+LIHAY E  NWSAAYSLLKEMLSLGMSPHFHLYS VDKLMREHGQ+DLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE
        WEAQIL KLCK GQL+AAYEK+KSMLEKG +PP YVRD+FE+AFQK GK+KIARELL+KI GVH  E
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE

TrEMBL top hitse value%identityAlignment
A0A0A0KU61 Uncharacterized protein1.0e-26280.85Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        R+ S LLK  +LHF   SS+FF TS T N IAIAPR  ARRPTSR+AP PR+ +TL S+DVVNSVCSLLSNKN QT NLDLDHLLKRF +NLSSD VL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNY++LGRAKTLEFFSWSGLQMG+RFD SVVEYMADF GRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM R+G VPTR++VNILIG+LCSLSAKEGA+EKVRV ST R
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAIEPAVG+FWAAN+++LVPSSFV VQLISELCRLGQMQEAI VLKVVE  KLRC EEC+S+VM+ALCE+R V+EASDLFGRMLSQG
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YN VICMLCKLGN+  AERVF IMN+KRC PDHVTYSALIHAY E  +WSAAY LLKEMLSLGMSPHFH+YS VDKLMREHGQ+DLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHN
        WEAQIL KLCKQGQLEAAYEK+KSMLEKG+ PP YVRDAFE+AFQK      + ++    G+ N
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHN

A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.3e-26882.19Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        R+ S LLK  +LHF  FSS+FF TS T   IAIAPR   RRPTSR+AP PR+ +T+ S+DVVNSVCSLLSNKN QT NLD++HLLKRF +NLSSDLVL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNY++LGRAKTLEFFSWSGLQMG+RFD SVVEYMADF GRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM R+G VPTRS+ NILIG+LCSLSAKEGA+EKVRVRST R
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAIEPAVG+FWAAN++ LVPSSFV VQLISELCR+GQMQEAI+VLKVVE  KLRC EEC+S+VM+ALCE+R ++EASDLFGRMLSQG
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YN VICMLCKLGN+  AERVF IMN+KRC PDHVTYSALIHAY E  NWSAAY LLKEMLSLGMSPHFH+YS VDKLMREHGQVDLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE
        WEAQIL KLCKQGQLEAAYEK+KSMLEKG+ PP YVRDAFE+AFQK GK+KIARELL+K+ GVH  E
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE

A0A6J1CV78 pentatricopeptide repeat-containing protein At5g39710-like0.0e+0099.13Show/hide
Query:  MLSRNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLV
        MLSRNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLD LLKRFNENLSSDLV
Subjt:  MLSRNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLV

Query:  LRILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
        LRILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
Subjt:  LRILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK

Query:  PDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRS
        PDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMER GSVPTRS+VNILIGDLCSLSAKEGAIEKVRVRS
Subjt:  PDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRS

Query:  TRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRML
        TRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRML
Subjt:  TRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRML

Query:  SQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKL
        SQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPD VTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKL
Subjt:  SQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKL

Query:  EMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHNREESVIRHSP
        EMKWEAQIL KLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHNREESVIRHSP
Subjt:  EMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKIGVHNREESVIRHSP

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like3.3e-26983.95Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        RN S  LKF    FS +SS+   TS+TR   AIAPR  ARRPTSR+AP+PRALD    TD V+SVCSLLSNKNHQTTNL+LDHLLKRF E LSSD VL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNYR+ GRAKTLEFFSWSGLQMGYRFDESVVEYMADF GRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVF+EM R+G VPTRS+VNILIGDLCSLSAKEGA+E+VRVRSTRR
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAI+ AVGVFWAANR+ALVPS+FV+V+LISELCRLGQMQEAI VLKVVE  KLRC EEC+SIVMQALCE+R+V+EASDLFGRMLSQ 
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YNSVICMLCKLGN+ DAERVFKIMNRKRCVPDHVTYSALIHAY E  NWSAAYSLLKEMLSLG+SPHFH+YS VDKLMRE GQ DLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE
        WE+QIL KLCKQGQL  AYEKLKSMLEKG +PP YVRDAFE+AFQK GK+KIARELL+ + GVH  E
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNRE

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like8.1e-26882.93Show/hide
Query:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI
        RN S  LKF    FS +SS+   TS+T    AIAPR  ARRPTSR+A +PRALD    TD V+SVCSLLSNK+HQTTNL+LDHLLKRF E LSSD VL+I
Subjt:  RNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRI

Query:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN
        LMNYR+ GRAKTLEFFSWSGLQMGYRFDESVVEYMADF GRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEP FGCKPDN
Subjt:  LMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDN

Query:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR
        LVFNN+LYALCKKE TGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVF+EM R+  VPTRS+VNILIGDLCSLSAKEGA+E+VRVRSTRR
Subjt:  LVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRR

Query:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG
        PFTVLVPNVN KSGAIEPAVGVFWAANRMALVPS+FV+V+LISELCRLGQMQEAI VLKVVE  KLRC EEC+SIVMQALCE+R+V+EASDLFGRMLSQ 
Subjt:  PFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQG

Query:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK
        MKPKLA+YNSVICMLCKLGN+ DAERVFKIMNRKRCVPDHVTYSALIHAY E  NWSAAYSLLKEMLSLG+SPHFH+YS VDKLMRE GQ DLCLKLEMK
Subjt:  MKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMK

Query:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNREESVIRHS
        WE+QIL KLCKQGQL AAYEKLKSMLEKG +PP YVRDAFE+AFQK GK+KIARELL+ + GVH  E    + S
Subjt:  WEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI-GVHNREESVIRHS

SwissProt top hitse value%identityAlignment
Q3EDF8 Pentatricopeptide repeat-containing protein At1g099006.9e-3024.34Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSG
        T+++ I    + G +  AL + + M       PD + +N IL +LC      + ++    + +R   PD  +Y+ +I   C+      A+++ +EM   G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSG

Query:  SVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIE
          P   + N+L+  +C    KEG +++        P +   PNV +          +G    A  +     R    PS      LI+ LCR G +  AI+
Subjt:  SVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIE

Query:  VLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNW
        +L+ +     +     ++ ++   C+ ++++ A +   RM+S+G  P +  YN+++  LCK G V DA  +   ++ K C P  +TY+ +I   ++ G  
Subjt:  VLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNW

Query:  SAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEA-----------QILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQ
          A  LL EM +  + P    YSS+   +   G+VD  +K   ++E             I+  LCK  Q + A + L  M+ +G  P             
Subjt:  SAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEA-----------QILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQ

Query:  KNGKYKIARELLEKI
          G  K A ELL ++
Subjt:  KNGKYKIARELLEKI

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial1.8e-3023.9Show/hide
Query:  RILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        ++L  + VLGRA       W   ++GY  D      + + F       +   L+  +   K R    T S  I  L  +GRV EAL L + M  ++G +P
Subjt:  RILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNILYALCKKETTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVR
        D + +  +L  LCK   +      AL +FR++E          YS +I  LCK G F  AL +F EME  G      + + LIG LC+    +   + +R
Subjt:  DNLVFNNILYALCKKETTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVR

Query:  VRSTRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFG
            R                               ++P       LI    + G++ EA E+   +    +      ++ ++   C+   + EA+ +F 
Subjt:  VRSTRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFG

Query:  RMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLC
         M+S+G +P +  Y+ +I   CK   V D  R+F+ ++ K  +P+ +TY+ L+  + + G  +AA  L +EM+S G+ P    Y  +   + ++G+++  
Subjt:  RMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLC

Query:  LKLEMKWEAQ-----------ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI
        L++  K +             I+H +C   +++ A+    S+ +KG+ P     +       K G    A  L  K+
Subjt:  LKLEMKWEAQ-----------ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.5e-2926.23Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFRTALEVFEEM
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID    + R + L    P+  SY+ +I GLC+ GR +    V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFRTALEVFEEM

Query:  ERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQ
         R G      + N LI   C    KEG   +  V         L P+V +         K+G +  A+          L P+      L+    + G M 
Subjt:  ERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQ

Query:  EAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSE
        EA  VL+ + D         ++ ++   C   ++E+A  +   M  +G+ P +  Y++V+   C+  +V +A RV + M  K   PD +TYS+LI  + E
Subjt:  EAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSE

Query:  IGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKI
              A  L +EML +G+ P    Y++                        +++  C +G LE A +    M+EKG+ P         N   K  + + 
Subjt:  IGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKI

Query:  ARELLEKI
        A+ LL K+
Subjt:  ARELLEKI

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic7.4e-3226.2Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G  + A+EV ++M      P   + N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNI

Query:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSKSGAIE---------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKL
        LI  LC  +  E A E  RV +++     ++P+V + +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSKSGAIE---------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKL

Query:  RCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEM
              ++ ++   C+  +  EA ++F  M   G+      YN++I  LCK   V DA ++   M  +   PD  TY++L+  +   G+   A  +++ M
Subjt:  RCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEM

Query:  LSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIH-PPTYVRDAFENAFQK---NGKYKIARELLEK
         S G  P    Y +                        ++  LCK G++E A + L+S+  KGI+  P       +  F+K        + RE+LE+
Subjt:  LSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIH-PPTYVRDAFENAFQK---NGKYKIARELLEK

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655607.4e-3224.12Show/hide
Query:  EFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCK
        +FF+++ L MGY           D     K+F++M          KG R +   ++  I  L    R+ EA+ LF +M+    C P    +  ++ +LC 
Subjt:  EFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCK

Query:  KETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSK
         E   E ++    +      P+ ++Y+ +I  LC   +F  A E+  +M   G +P   + N LI   C     E A++ V +  +R+    L PN  + 
Subjt:  KETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSK

Query:  SGAIE--------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPK
        +  I+         A+GV        ++P       LI   CR G    A  +L ++ D  L   +  ++ ++ +LC++++VEEA DLF  +  +G+ P 
Subjt:  SGAIE--------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPK

Query:  LAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQ
        + +Y ++I   CK G V +A  + + M  K C+P+ +T++ALIH     G    A  L ++M+ +G+ P                         +  +  
Subjt:  LAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQ

Query:  ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI
        ++H+L K G  + AY + + ML  G  P  +    F   + + G+   A +++ K+
Subjt:  ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.9e-3124.34Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSG
        T+++ I    + G +  AL + + M       PD + +N IL +LC      + ++    + +R   PD  +Y+ +I   C+      A+++ +EM   G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSG

Query:  SVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIE
          P   + N+L+  +C    KEG +++        P +   PNV +          +G    A  +     R    PS      LI+ LCR G +  AI+
Subjt:  SVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIE

Query:  VLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNW
        +L+ +     +     ++ ++   C+ ++++ A +   RM+S+G  P +  YN+++  LCK G V DA  +   ++ K C P  +TY+ +I   ++ G  
Subjt:  VLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNW

Query:  SAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEA-----------QILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQ
          A  LL EM +  + P    YSS+   +   G+VD  +K   ++E             I+  LCK  Q + A + L  M+ +G  P             
Subjt:  SAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEA-----------QILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQ

Query:  KNGKYKIARELLEKI
          G  K A ELL ++
Subjt:  KNGKYKIARELLEKI

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-3123.9Show/hide
Query:  RILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        ++L  + VLGRA       W   ++GY  D      + + F       +   L+  +   K R    T S  I  L  +GRV EAL L + M  ++G +P
Subjt:  RILMNYRVLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNILYALCKKETTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVR
        D + +  +L  LCK   +      AL +FR++E          YS +I  LCK G F  AL +F EME  G      + + LIG LC+    +   + +R
Subjt:  DNLVFNNILYALCKKETTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVR

Query:  VRSTRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFG
            R                               ++P       LI    + G++ EA E+   +    +      ++ ++   C+   + EA+ +F 
Subjt:  VRSTRRPFTVLVPNVNSKSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFG

Query:  RMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLC
         M+S+G +P +  Y+ +I   CK   V D  R+F+ ++ K  +P+ +TY+ L+  + + G  +AA  L +EM+S G+ P    Y  +   + ++G+++  
Subjt:  RMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLC

Query:  LKLEMKWEAQ-----------ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI
        L++  K +             I+H +C   +++ A+    S+ +KG+ P     +       K G    A  L  K+
Subjt:  LKLEMKWEAQ-----------ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein5.2e-3326.2Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G  + A+EV ++M      P   + N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNI

Query:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSKSGAIE---------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKL
        LI  LC  +  E A E  RV +++     ++P+V + +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSKSGAIE---------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKL

Query:  RCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEM
              ++ ++   C+  +  EA ++F  M   G+      YN++I  LCK   V DA ++   M  +   PD  TY++L+  +   G+   A  +++ M
Subjt:  RCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEM

Query:  LSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIH-PPTYVRDAFENAFQK---NGKYKIARELLEK
         S G  P    Y +                        ++  LCK G++E A + L+S+  KGI+  P       +  F+K        + RE+LE+
Subjt:  LSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIH-PPTYVRDAFENAFQK---NGKYKIARELLEK

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-3026.23Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFRTALEVFEEM
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID    + R + L    P+  SY+ +I GLC+ GR +    V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFRTALEVFEEM

Query:  ERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQ
         R G      + N LI   C    KEG   +  V         L P+V +         K+G +  A+          L P+      L+    + G M 
Subjt:  ERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNS---------KSGAIEPAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQ

Query:  EAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSE
        EA  VL+ + D         ++ ++   C   ++E+A  +   M  +G+ P +  Y++V+   C+  +V +A RV + M  K   PD +TYS+LI  + E
Subjt:  EAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSE

Query:  IGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKI
              A  L +EML +G+ P    Y++                        +++  C +G LE A +    M+EKG+ P         N   K  + + 
Subjt:  IGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKI

Query:  ARELLEKI
        A+ LL K+
Subjt:  ARELLEKI

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein5.2e-3324.12Show/hide
Query:  EFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCK
        +FF+++ L MGY           D     K+F++M          KG R +   ++  I  L    R+ EA+ LF +M+    C P    +  ++ +LC 
Subjt:  EFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCK

Query:  KETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSK
         E   E ++    +      P+ ++Y+ +I  LC   +F  A E+  +M   G +P   + N LI   C     E A++ V +  +R+    L PN  + 
Subjt:  KETTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSK

Query:  SGAIE--------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPK
        +  I+         A+GV        ++P       LI   CR G    A  +L ++ D  L   +  ++ ++ +LC++++VEEA DLF  +  +G+ P 
Subjt:  SGAIE--------PAVGVFWAANRMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPK

Query:  LAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQ
        + +Y ++I   CK G V +A  + + M  K C+P+ +T++ALIH     G    A  L ++M+ +G+ P                         +  +  
Subjt:  LAVYNSVICMLCKLGNVVDAERVFKIMNRKRCVPDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQ

Query:  ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI
        ++H+L K G  + AY + + ML  G  P  +    F   + + G+   A +++ K+
Subjt:  ILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKNGKYKIARELLEKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCAGGAATGGTTCGGTGTTACTCAAATTCAGTACTCTCCATTTTTCTCGCTTCTCTTCAAACTTTTTCGGAACTTCAACGACGAGAAATGACATTGCCATAGC
TCCAAGAACTTTTGCAAGAAGACCCACTTCGCGTTCTGCCCCAGTCCCTCGCGCTCTGGACACTCTCAGCTCTACCGATGTCGTCAATTCAGTATGTTCTCTACTTTCAA
ACAAAAATCACCAAACAACTAATCTCGATCTCGATCATTTGTTGAAAAGATTCAATGAAAACTTAAGTTCGGATCTCGTTCTTCGAATTCTGATGAATTATAGGGTGTTG
GGTAGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCGGTGGTTGAGTACATGGCTGATTTCTTTGGTAGAAGGAAACT
GTTTGATGATATGAAGTGTCTTCTGGTGACGGTATCTTCTCATAAGGGTCGGCTTTCTTGTCGAACGTTTTCAATTTGTATCAGATTTTTGGGTAGGCAAGGGAGGGTTA
GAGAAGCCCTTTGCTTGTTCGAAGAAATGGAGCCAAAATTTGGGTGTAAACCTGATAATCTTGTCTTTAACAACATTCTTTATGCCCTTTGTAAGAAGGAAACAACTGGG
GAATTGATTGATACTGCTCTTACAATTTTCAGAAGAATCGAATTGCCTGATAAATATTCATACAGTAATATCATTATAGGATTGTGTAAATTTGGAAGGTTTCGTACAGC
TCTTGAAGTGTTTGAGGAAATGGAAAGGTCGGGCTCGGTTCCTACTCGATCTTCAGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCCAAGGAAGGGGCTATAG
AAAAAGTTAGAGTTAGAAGTACTCGTAGGCCTTTTACCGTTCTAGTTCCAAATGTGAATTCGAAGAGCGGTGCCATTGAACCTGCAGTTGGGGTTTTTTGGGCAGCTAAT
AGGATGGCTTTGGTTCCAAGTTCATTTGTAATGGTTCAGCTTATCTCGGAACTTTGTCGATTAGGTCAAATGCAAGAAGCCATTGAAGTATTGAAGGTTGTTGAGGATGG
AAAGCTAAGATGTGGAGAAGAGTGTCACTCCATTGTGATGCAAGCACTGTGTGAAAATCGTCAGGTTGAAGAAGCTAGTGATCTGTTTGGGAGGATGCTTTCTCAGGGTA
TGAAGCCAAAGTTGGCTGTTTACAATTCTGTTATTTGCATGCTATGCAAATTGGGGAATGTGGTTGATGCTGAAAGGGTTTTTAAGATTATGAATAGGAAAAGATGCGTA
CCTGATCATGTTACTTATTCGGCGCTAATCCATGCCTATAGTGAAATTGGGAATTGGTCAGCAGCCTACAGTTTATTGAAGGAAATGTTGAGTTTAGGCATGTCTCCTCA
TTTTCATTTGTATAGTTCAGTGGATAAACTAATGAGGGAACATGGGCAAGTTGATCTGTGTTTGAAGCTGGAAATGAAATGGGAAGCCCAAATTTTGCACAAGCTTTGTA
AGCAAGGTCAACTGGAGGCTGCTTATGAAAAGCTCAAGTCAATGCTTGAAAAGGGCATTCACCCTCCTACCTATGTGAGAGATGCTTTTGAGAACGCATTTCAGAAGAAC
GGTAAGTATAAGATTGCTCGTGAGTTGCTGGAGAAGATCGGAGTCCACAACCGTGAGGAGTCTGTAATCAGACATTCACCA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGCAGGAATGGTTCGGTGTTACTCAAATTCAGTACTCTCCATTTTTCTCGCTTCTCTTCAAACTTTTTCGGAACTTCAACGACGAGAAATGACATTGCCATAGC
TCCAAGAACTTTTGCAAGAAGACCCACTTCGCGTTCTGCCCCAGTCCCTCGCGCTCTGGACACTCTCAGCTCTACCGATGTCGTCAATTCAGTATGTTCTCTACTTTCAA
ACAAAAATCACCAAACAACTAATCTCGATCTCGATCATTTGTTGAAAAGATTCAATGAAAACTTAAGTTCGGATCTCGTTCTTCGAATTCTGATGAATTATAGGGTGTTG
GGTAGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCGGTGGTTGAGTACATGGCTGATTTCTTTGGTAGAAGGAAACT
GTTTGATGATATGAAGTGTCTTCTGGTGACGGTATCTTCTCATAAGGGTCGGCTTTCTTGTCGAACGTTTTCAATTTGTATCAGATTTTTGGGTAGGCAAGGGAGGGTTA
GAGAAGCCCTTTGCTTGTTCGAAGAAATGGAGCCAAAATTTGGGTGTAAACCTGATAATCTTGTCTTTAACAACATTCTTTATGCCCTTTGTAAGAAGGAAACAACTGGG
GAATTGATTGATACTGCTCTTACAATTTTCAGAAGAATCGAATTGCCTGATAAATATTCATACAGTAATATCATTATAGGATTGTGTAAATTTGGAAGGTTTCGTACAGC
TCTTGAAGTGTTTGAGGAAATGGAAAGGTCGGGCTCGGTTCCTACTCGATCTTCAGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCCAAGGAAGGGGCTATAG
AAAAAGTTAGAGTTAGAAGTACTCGTAGGCCTTTTACCGTTCTAGTTCCAAATGTGAATTCGAAGAGCGGTGCCATTGAACCTGCAGTTGGGGTTTTTTGGGCAGCTAAT
AGGATGGCTTTGGTTCCAAGTTCATTTGTAATGGTTCAGCTTATCTCGGAACTTTGTCGATTAGGTCAAATGCAAGAAGCCATTGAAGTATTGAAGGTTGTTGAGGATGG
AAAGCTAAGATGTGGAGAAGAGTGTCACTCCATTGTGATGCAAGCACTGTGTGAAAATCGTCAGGTTGAAGAAGCTAGTGATCTGTTTGGGAGGATGCTTTCTCAGGGTA
TGAAGCCAAAGTTGGCTGTTTACAATTCTGTTATTTGCATGCTATGCAAATTGGGGAATGTGGTTGATGCTGAAAGGGTTTTTAAGATTATGAATAGGAAAAGATGCGTA
CCTGATCATGTTACTTATTCGGCGCTAATCCATGCCTATAGTGAAATTGGGAATTGGTCAGCAGCCTACAGTTTATTGAAGGAAATGTTGAGTTTAGGCATGTCTCCTCA
TTTTCATTTGTATAGTTCAGTGGATAAACTAATGAGGGAACATGGGCAAGTTGATCTGTGTTTGAAGCTGGAAATGAAATGGGAAGCCCAAATTTTGCACAAGCTTTGTA
AGCAAGGTCAACTGGAGGCTGCTTATGAAAAGCTCAAGTCAATGCTTGAAAAGGGCATTCACCCTCCTACCTATGTGAGAGATGCTTTTGAGAACGCATTTCAGAAGAAC
GGTAAGTATAAGATTGCTCGTGAGTTGCTGGAGAAGATCGGAGTCCACAACCGTGAGGAGTCTGTAATCAGACATTCACCA
Protein sequenceShow/hide protein sequence
MLSRNGSVLLKFSTLHFSRFSSNFFGTSTTRNDIAIAPRTFARRPTSRSAPVPRALDTLSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNENLSSDLVLRILMNYRVL
GRAKTLEFFSWSGLQMGYRFDESVVEYMADFFGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNILYALCKKETTG
ELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFRTALEVFEEMERSGSVPTRSSVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNSKSGAIEPAVGVFWAAN
RMALVPSSFVMVQLISELCRLGQMQEAIEVLKVVEDGKLRCGEECHSIVMQALCENRQVEEASDLFGRMLSQGMKPKLAVYNSVICMLCKLGNVVDAERVFKIMNRKRCV
PDHVTYSALIHAYSEIGNWSAAYSLLKEMLSLGMSPHFHLYSSVDKLMREHGQVDLCLKLEMKWEAQILHKLCKQGQLEAAYEKLKSMLEKGIHPPTYVRDAFENAFQKN
GKYKIARELLEKIGVHNREESVIRHSP