; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014102 (gene) of Snake gourd v1 genome

Gene IDTan0014102
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG10:15266136..15270088
RNA-Seq ExpressionTan0014102
SyntenyTan0014102
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]2.3e-28084.2Show/hide
Query:  TTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVL
        T R++SALLK ++LHF+GFSS+FF TS TTKHIAIAP    RRPTSRT PT R+ +   S++VVNSVCSLLSN+N  T NLD++HLLKRFK+ LSSDLVL
Subjt:  TTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM RAGLVPTRSA NILIG+LCSLSAKEGA+EKVRV+ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L LVPS+FV VQL+SELCR+GQMQEAI++LKVVE  KLRCAEECYS VM+ALCEHR ++E SDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLS

Query:  QGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLE
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDH TYSALIHAYGE RNWS+AY LLKEMLSLGMSPHFH++SLVDKLMREHGQ DLCLKLE
Subjt:  QGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLE

Query:  MKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        MKWEAQILQKLCKQGQLEAAYEK+KSMLEKG  PPIYV++AFESAFQK GK KIARELLQK+DGVH+HESG RN S
Subjt:  MKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata]2.7e-28186.23Show/hide
Query:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS
        MLSTST RNASA LK V    YGFSS    TS+T K  AIAP    RRPTSRT P  RALD    T+ V+SVCSLLSN+NH TTNL+LDHLLKRFKETLS
Subjt:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEME K
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF
        RV+STRRPFTVLVPNVNPKSGAI+ AVGVFWAANRLALVPS FVIV+L+SELCRLGQMQEAIR+LKVVE  KLRC EECYS VMQALCEHR+V+E SDLF
Subjt:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF

Query:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL
        GRMLSQ MKPKLAIYN+VICMLCKLGNLDDAERVFKIMNRKRCVPDH TYSALIHAYGE RNWS+AYSLLKEMLSLG+SPHFH++S+VDKLMRE GQTDL
Subjt:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        CLKLEMKWE+QILQKLCKQGQL  AYEKLKSMLEKGFYPPIYV++AFESAFQK GK KIARELLQ +DGVH+HES  R  S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima]4.6e-28186.23Show/hide
Query:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS
        MLSTST RNASA LK V    YGFSS    TS+TTK  AIAP    RRPTSRT    RALD    T+ V+SVCSLLSN++H TTNL+LDHLLKRFKETLS
Subjt:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEME  
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVFDEM RA LVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF
        RV+STRRPFTVLVPNVNPKSGAIEPAVGVFWAANR+ALVPSAFVIV+L+SELCRLGQMQEAIR+LKVVE  KLRC EECYS VMQALCEHR+V+E SDLF
Subjt:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF

Query:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL
        GRMLSQ MKPKLAIYN+VICMLCKLGNLDDAERVFKIMNRKRCVPDH TYSALIHAYGE RNWS+AYSLLKEMLSLG+SPHFH++S+VDKLMRE GQTDL
Subjt:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        CLKLEMKWE+QILQKLCKQGQL AAYEKLKSMLEKGFYPPIYV++AFESAFQK GK KIARELLQ +DGVH+HES  R  S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

XP_023526018.1 pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Cucurbita pepo subsp. pepo]3.2e-28286.06Show/hide
Query:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS
        MLSTST RNASA LK V    YGFSSN F TS+TTK  AIAP    RRPTSRT P  RALD    T+ V+SVCSLLSN+NH T NL+LDHLLKRFKET+S
Subjt:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK
        SD VLQILMNYRL GRAKTLEFFSWS LQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEME K
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF
        RV+STRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIV+L+ ELCRLGQMQEAIR+LKVVE  KLRC EECYS VMQALCEHR+V+E SDL 
Subjt:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF

Query:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL
        GRMLSQ MKPKLAIYN+VICMLCKLGNLDDAERVFKIMNRK+CVPDH TYSALIHAYGE RNWS+ YSLLK+MLSLG+SPHFH++S+VDKLMRE GQTDL
Subjt:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        CLKLEMKWE+QILQKLCKQGQL  AYEKLKSMLEKGFYPPIYV++AFESAFQK GK KIARELLQ +DGVH+HESG R  S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida]1.1e-29587.95Show/hide
Query:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS
        MLS +T RNASA LK   LHFYGFSS+FF TST TKHIAIAP    RRPTSRT P  RA D   S++VVNSVCSLLSN+NH T NLDLDHLLKRFK+TLS
Subjt:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK
        SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDE+VVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEME K
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTAL+IFRRIELPDKYSYSN+IIGLCKFGRF TA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF
        RV+STRRPFTVLVPNVNPKSGAIEPAVG+FWAAN+LALVPSAFVIVQL+SELCRLGQMQEAI++LKVVEG KLRCAEECYS VM+ALCEHR VEE SDLF
Subjt:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF

Query:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL
        GR+LSQGMKPKLAIYN++ICMLCK+GNL+DAERVFKIMNRKRC PDH TYS+LIHAYGETRNWS+AYSLLKEMLSLGMSPHFHL+SLVDKLMREHGQ DL
Subjt:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        CLKLEMKWEAQILQKLCK GQL+AAYEK+KSMLEKGFYPPIYV+++FESAFQK GK KIARELLQK+DGVH+HESG RN S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

TrEMBL top hitse value%identityAlignment
A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.1e-28084.2Show/hide
Query:  TTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVL
        T R++SALLK ++LHF+GFSS+FF TS TTKHIAIAP    RRPTSRT PT R+ +   S++VVNSVCSLLSN+N  T NLD++HLLKRFK+ LSSDLVL
Subjt:  TTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM RAGLVPTRSA NILIG+LCSLSAKEGA+EKVRV+ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L LVPS+FV VQL+SELCR+GQMQEAI++LKVVE  KLRCAEECYS VM+ALCEHR ++E SDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLS

Query:  QGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLE
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDH TYSALIHAYGE RNWS+AY LLKEMLSLGMSPHFH++SLVDKLMREHGQ DLCLKLE
Subjt:  QGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLE

Query:  MKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        MKWEAQILQKLCKQGQLEAAYEK+KSMLEKG  PPIYV++AFESAFQK GK KIARELLQK+DGVH+HESG RN S
Subjt:  MKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

A0A5A7TN12 Pentatricopeptide repeat-containing protein1.8e-26784.49Show/hide
Query:  TTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVL
        T R++SALLK ++LHF+GFSS+FF TS TTKHIAIAP    RRPTSRT PT R+ +   S++VVNSVCSLLSN+N  T NLD++HLLKRFK+ LSSDLVL
Subjt:  TTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM RAGLVPTRSA NILIG+LCSLSAKEGA+EKVRV+ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L LVPS+FV VQL+SELCR+GQMQEAI++LKVVE  KLRCAEECYS VM+ALCEHR ++E SDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLS

Query:  QGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLE
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDH TYSALIHAYGE RNWS+AY LLKEMLSLGMSPHFH++SLVDKLMREHGQ DLCLKLE
Subjt:  QGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLE

Query:  MKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQK
        MKWEAQILQKLCKQGQLEAAYEK+KSMLEKG  PPIYV++AFESAFQK
Subjt:  MKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQK

A0A6J1CV78 pentatricopeptide repeat-containing protein At5g39710-like6.7e-27886.62Show/hide
Query:  TRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVLQ
        +RN S LLK  TLHF  FSSNFFGTSTT   IAIAP    RRPTSR+ P  RALD  SST+VVNSVCSLLSN+NH TTNLDLD LLKRF E LSSDLVL+
Subjt:  TRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVLQ

Query:  ILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPD
        ILMNYR+LGRAKTLEFFSWSGLQMGYRFDESVVEYMADF GRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEME KFGCKPD
Subjt:  ILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPD

Query:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTR
        NLVFNN+LYALCKKE TGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TALEVF+EM R G VPTRSAVNILIGDLCSLSAKEGAIEKVRV+STR
Subjt:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTR

Query:  RPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQ
        RPFTVLVPNVN KSGAIEPAVGVFWAANR+ALVPS+FV+VQL+SELCRLGQMQEAI +LKVVE GKLRC EEC+S VMQALCE+RQVEE SDLFGRMLSQ
Subjt:  RPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQ

Query:  GMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEM
        GMKPKLA+YN+VICMLCKLGN+ DAERVFKIMNRKRCVPD  TYSALIHAY E  NWS+AYSLLKEMLSLGMSPHFHL+S VDKLMREHGQ DLCLKLEM
Subjt:  GMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEM

Query:  KWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHE
        KWEAQILQKLCKQGQLEAAYEKLKSMLEKG +PP YV++AFE+AFQKNGK KIARELL+K+ GVH  E
Subjt:  KWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHE

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like1.3e-28186.23Show/hide
Query:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS
        MLSTST RNASA LK V    YGFSS    TS+T K  AIAP    RRPTSRT P  RALD    T+ V+SVCSLLSN+NH TTNL+LDHLLKRFKETLS
Subjt:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEME K
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF
        RV+STRRPFTVLVPNVNPKSGAI+ AVGVFWAANRLALVPS FVIV+L+SELCRLGQMQEAIR+LKVVE  KLRC EECYS VMQALCEHR+V+E SDLF
Subjt:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF

Query:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL
        GRMLSQ MKPKLAIYN+VICMLCKLGNLDDAERVFKIMNRKRCVPDH TYSALIHAYGE RNWS+AYSLLKEMLSLG+SPHFH++S+VDKLMRE GQTDL
Subjt:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        CLKLEMKWE+QILQKLCKQGQL  AYEKLKSMLEKGFYPPIYV++AFESAFQK GK KIARELLQ +DGVH+HES  R  S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like2.2e-28186.23Show/hide
Query:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS
        MLSTST RNASA LK V    YGFSS    TS+TTK  AIAP    RRPTSRT    RALD    T+ V+SVCSLLSN++H TTNL+LDHLLKRFKETLS
Subjt:  MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAP----RRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEME  
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TA+EVFDEM RA LVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF
        RV+STRRPFTVLVPNVNPKSGAIEPAVGVFWAANR+ALVPSAFVIV+L+SELCRLGQMQEAIR+LKVVE  KLRC EECYS VMQALCEHR+V+E SDLF
Subjt:  RVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLF

Query:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL
        GRMLSQ MKPKLAIYN+VICMLCKLGNLDDAERVFKIMNRKRCVPDH TYSALIHAYGE RNWS+AYSLLKEMLSLG+SPHFH++S+VDKLMRE GQTDL
Subjt:  GRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS
        CLKLEMKWE+QILQKLCKQGQL AAYEKLKSMLEKGFYPPIYV++AFESAFQK GK KIARELLQ +DGVH+HES  R  S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKLDGVHKHESGFRNPS

SwissProt top hitse value%identityAlignment
Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745807.6e-2924.88Show/hide
Query:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEM
        L   TF+  +R L ++G V+E   L +++  K G  P+   +N  +  LC++      +     +  +   PD  +Y+N+I GLCK  +F  A     +M
Subjt:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEM

Query:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVV
           GL P     N LI   C                              K G ++ A  +   A     VP  F    L+  LC  G+   A+ L    
Subjt:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVV

Query:  EGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYS
         G  ++     Y+T+++ L     + E + L   M  +G+ P++  +N ++  LCK+G + DA+ + K+M  K   PD FT++ LIH Y       +A  
Subjt:  EGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYS

Query:  LLKEMLSLGMSPHFHLF-SLVDKLMREHGQTDLCLKLEMKWEAQ----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKL
        +L  ML  G+ P  + + SL++ L +     D+    +   E            +L+ LC+  +L+ A   L+ M  K   P           F KNG L
Subjt:  LLKEMLSLGMSPHFHLF-SLVDKLMREHGQTDLCLKLEMKWEAQ----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKL

Query:  KIARELLQKLDGVHKHES
          A  L +K++  +K  S
Subjt:  KIARELLQKLDGVHKHES

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.7e-2826.23Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFCTALEVFDEM
        T++I IR     G +  AL LF++METK GC P+ + +N ++   CK       ID    + R + L    P+  SY+ +I GLC+ GR      V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFCTALEVFDEM

Query:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQ
         R G        N LI   C    KEG   +  V         L P+V           K+G +  A+          L P+      L+    + G M 
Subjt:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQ

Query:  EAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGE
        EA R+L+ +       +   Y+ ++   C   ++E+   +   M  +G+ P +  Y+ V+   C+  ++D+A RV + M  K   PD  TYS+LI  + E
Subjt:  EAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGE

Query:  TRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKI
         R    A  L +EML +G+ P    ++                         ++   C +G LE A +    M+EKG  P +   +   +   K  + + 
Subjt:  TRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKI

Query:  ARELLQKL
        A+ LL KL
Subjt:  ARELLQKL

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011102.9e-2823.89Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAG
        T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ +I GLCK G++  A EVF EM R+G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAG

Query:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRL
        L P  +    L+ + C    K   +E  +V S  R   V+        + ++  +SG ++ A+  F +     L+P   +   L+   CR G +  A+ L
Subjt:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRL

Query:  L-KVVEGGKLRCAEE--CYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETR
          ++++ G   CA +   Y+T++  LC+ + + E   LF  M  + + P       +I   CKLGNL +A  +F+ M  KR   D  TY+ L+  +G+  
Subjt:  L-KVVEGGKLRCAEE--CYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETR

Query:  NWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIAR
        +  +A  +  +M+S  + P    +S+                        ++  LC +G L  A+     M+ K   P + + N+    + ++G      
Subjt:  NWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIAR

Query:  ELLQKL
          L+K+
Subjt:  ELLQKL

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic1.8e-3024.57Show/hide
Query:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNI

Query:  LIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKL
        LI  LC  +  E A E  RV +++     ++P+V   +  I+          A+ +F         P  F    L+  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKL

Query:  RCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEM
          +   Y+T++   C+  +  E  ++F  M   G+      YN +I  LCK   ++DA ++   M  +   PD +TY++L+  +    +   A  +++ M
Subjt:  RCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEM

Query:  LSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPP--IYVKNAFESAFQKNGKLKIA
         S G  P    +  +   + + G+ ++  KL    + +           ++Q L ++ +   A    + MLE+   PP  +  +  F       G ++ A
Subjt:  LSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPP--IYVKNAFESAFQKNGKLKIA

Query:  RELLQKL
         + L +L
Subjt:  RELLQKL

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655602.1e-3124.34Show/hide
Query:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCK
        +FF+++ L MGY           D     K+F++M          KG R +   ++  I  L    R+ EA+ LF +M+    C P    +  ++ +LC 
Subjt:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCK

Query:  KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPK
         E   E ++    +      P+ ++Y+ +I  LC   +F  A E+  +M   GL+P     N LI   C     E A++ V +  +R+    L PN    
Subjt:  KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPK

Query:  SGAIE--------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPK
        +  I+         A+GV        ++P       L+   CR G    A RLL ++    L   +  Y++++ +LC+ ++VEE  DLF  +  +G+ P 
Subjt:  SGAIE--------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPK

Query:  LAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ
        + +Y A+I   CK G +D+A  + + M  K C+P+  T++ALIH          A  L ++M+ +G+ P                         +  +  
Subjt:  LAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ

Query:  ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKL
        ++ +L K G  + AY + + ML  G  P  +    F   + + G+L  A +++ K+
Subjt:  ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKL

Arabidopsis top hitse value%identityAlignment
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein5.4e-3024.88Show/hide
Query:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEM
        L   TF+  +R L ++G V+E   L +++  K G  P+   +N  +  LC++      +     +  +   PD  +Y+N+I GLCK  +F  A     +M
Subjt:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEM

Query:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVV
           GL P     N LI   C                              K G ++ A  +   A     VP  F    L+  LC  G+   A+ L    
Subjt:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVV

Query:  EGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYS
         G  ++     Y+T+++ L     + E + L   M  +G+ P++  +N ++  LCK+G + DA+ + K+M  K   PD FT++ LIH Y       +A  
Subjt:  EGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYS

Query:  LLKEMLSLGMSPHFHLF-SLVDKLMREHGQTDLCLKLEMKWEAQ----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKL
        +L  ML  G+ P  + + SL++ L +     D+    +   E            +L+ LC+  +L+ A   L+ M  K   P           F KNG L
Subjt:  LLKEMLSLGMSPHFHLF-SLVDKLMREHGQTDLCLKLEMKWEAQ----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKL

Query:  KIARELLQKLDGVHKHES
          A  L +K++  +K  S
Subjt:  KIARELLQKLDGVHKHES

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-3124.57Show/hide
Query:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNI

Query:  LIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKL
        LI  LC  +  E A E  RV +++     ++P+V   +  I+          A+ +F         P  F    L+  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKL

Query:  RCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEM
          +   Y+T++   C+  +  E  ++F  M   G+      YN +I  LCK   ++DA ++   M  +   PD +TY++L+  +    +   A  +++ M
Subjt:  RCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEM

Query:  LSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPP--IYVKNAFESAFQKNGKLKIA
         S G  P    +  +   + + G+ ++  KL    + +           ++Q L ++ +   A    + MLE+   PP  +  +  F       G ++ A
Subjt:  LSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKSMLEKGFYPP--IYVKNAFESAFQKNGKLKIA

Query:  RELLQKL
         + L +L
Subjt:  RELLQKL

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-2923.89Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAG
        T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ +I GLCK G++  A EVF EM R+G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAG

Query:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRL
        L P  +    L+ + C    K   +E  +V S  R   V+        + ++  +SG ++ A+  F +     L+P   +   L+   CR G +  A+ L
Subjt:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRL

Query:  L-KVVEGGKLRCAEE--CYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETR
          ++++ G   CA +   Y+T++  LC+ + + E   LF  M  + + P       +I   CKLGNL +A  +F+ M  KR   D  TY+ L+  +G+  
Subjt:  L-KVVEGGKLRCAEE--CYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETR

Query:  NWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIAR
        +  +A  +  +M+S  + P    +S+                        ++  LC +G L  A+     M+ K   P + + N+    + ++G      
Subjt:  NWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIAR

Query:  ELLQKL
          L+K+
Subjt:  ELLQKL

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-2926.23Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFCTALEVFDEM
        T++I IR     G +  AL LF++METK GC P+ + +N ++   CK       ID    + R + L    P+  SY+ +I GLC+ GR      V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNIIIGLCKFGRFCTALEVFDEM

Query:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQ
         R G        N LI   C    KEG   +  V         L P+V           K+G +  A+          L P+      L+    + G M 
Subjt:  ARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQ

Query:  EAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGE
        EA R+L+ +       +   Y+ ++   C   ++E+   +   M  +G+ P +  Y+ V+   C+  ++D+A RV + M  K   PD  TYS+LI  + E
Subjt:  EAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGE

Query:  TRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKI
         R    A  L +EML +G+ P    ++                         ++   C +G LE A +    M+EKG  P +   +   +   K  + + 
Subjt:  TRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKI

Query:  ARELLQKL
        A+ LL KL
Subjt:  ARELLQKL

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-3224.34Show/hide
Query:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCK
        +FF+++ L MGY           D     K+F++M          KG R +   ++  I  L    R+ EA+ LF +M+    C P    +  ++ +LC 
Subjt:  EFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCK

Query:  KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPK
         E   E ++    +      P+ ++Y+ +I  LC   +F  A E+  +M   GL+P     N LI   C     E A++ V +  +R+    L PN    
Subjt:  KEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPK

Query:  SGAIE--------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPK
        +  I+         A+GV        ++P       L+   CR G    A RLL ++    L   +  Y++++ +LC+ ++VEE  DLF  +  +G+ P 
Subjt:  SGAIE--------PAVGVFWAANRLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPK

Query:  LAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ
        + +Y A+I   CK G +D+A  + + M  K C+P+  T++ALIH          A  L ++M+ +G+ P                         +  +  
Subjt:  LAIYNAVICMLCKLGNLDDAERVFKIMNRKRCVPDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQ

Query:  ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKL
        ++ +L K G  + AY + + ML  G  P  +    F   + + G+L  A +++ K+
Subjt:  ILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKNGKLKIARELLQKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCACGAGCACGACTAGGAATGCTTCGGCGTTGCTCAAATCCGTTACTCTTCACTTTTATGGCTTCTCTTCAAACTTTTTCGGCACTTCAACGACCACAAAGCA
CATTGCCATAGCTCCAAGAAGACCCACTTCGCGAACTGTCCCAACCTCTCGGGCCTTGGACGCCTTCAGCTCTACCAATGTCGTCAATTCAGTATGTTCTTTACTTTCAA
ACCAAAATCACCATACAACTAATCTCGATCTTGATCATTTATTGAAAAGGTTCAAAGAAACTTTAAGTTCCGATCTCGTTCTTCAAATTCTAATGAATTATAGGCTGTTG
GGTAGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTACATGGCTGATTTTTTAGGTAGAAGGAAACT
GTTTGATGATATGAAGTGTCTTTTGGTGACGGTGTCGTCTCACAAGGGTCGTCTTTCTTGTCGGACATTTTCGATTTGTATCAGATTTTTGGGTAGGCAAGGGAGGGTTA
GAGAAGCGCTTTGCTTGTTCGAAGAAATGGAGACAAAATTTGGGTGTAAACCAGATAATCTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAACCAACTGGG
GAATTGATTGATACTGCTCTAACAATTTTTAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAATATAATTATAGGATTGTGTAAATTTGGTAGGTTTTGTACAGC
TCTTGAAGTGTTTGATGAAATGGCTAGGGCAGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCTAAAGAAGGGGCTATAG
AAAAAGTTAGGGTCAAAAGTACTCGTAGACCTTTTACCGTTCTAGTTCCAAATGTGAACCCAAAGAGCGGTGCGATTGAACCTGCAGTTGGAGTTTTTTGGGCTGCTAAT
AGGCTGGCTTTAGTTCCCAGTGCATTTGTAATAGTTCAGCTCCTATCGGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAGATTATTGAAAGTTGTCGAGGGTGG
AAAGCTAAGATGTGCTGAAGAGTGTTACTCCACTGTGATGCAAGCATTGTGTGAACATCGTCAGGTCGAAGAAGTTAGTGATCTGTTTGGGAGGATGCTTTCTCAGGGTA
TGAAGCCAAAGTTGGCTATTTACAATGCTGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGGAAAAGATGTGTA
CCTGATCATTTTACGTATTCAGCGCTAATCCATGCCTATGGTGAAACTAGGAATTGGTCGTCAGCCTACAGTTTATTGAAGGAAATGTTGAGTTTAGGCATGTCTCCTCA
TTTTCATTTGTTTAGTTTAGTGGATAAACTAATGAGGGAACATGGGCAAACTGATCTGTGCTTGAAGCTGGAAATGAAGTGGGAAGCCCAAATTTTGCAGAAGCTTTGTA
AACAAGGACAACTGGAGGCCGCGTATGAAAAGCTAAAGTCAATGCTTGAAAAAGGTTTTTATCCTCCTATCTATGTGAAGAATGCTTTTGAAAGTGCATTTCAAAAGAAT
GGTAAGTTGAAGATTGCACGGGAGTTGCTGCAGAAGCTAGACGGAGTCCACAAACATGAATCAGGATTCAGAAATCCATCATGA
mRNA sequenceShow/hide mRNA sequence
TCTAAATCAAAAACTTCGGTGCAAATTTTACAATTTCCCTAGAGGAATCCAGGTTGAACCTGTTGGTTCTGCAACTCGTCGATGTTGAGCACGAGCACGACTAGGAATGC
TTCGGCGTTGCTCAAATCCGTTACTCTTCACTTTTATGGCTTCTCTTCAAACTTTTTCGGCACTTCAACGACCACAAAGCACATTGCCATAGCTCCAAGAAGACCCACTT
CGCGAACTGTCCCAACCTCTCGGGCCTTGGACGCCTTCAGCTCTACCAATGTCGTCAATTCAGTATGTTCTTTACTTTCAAACCAAAATCACCATACAACTAATCTCGAT
CTTGATCATTTATTGAAAAGGTTCAAAGAAACTTTAAGTTCCGATCTCGTTCTTCAAATTCTAATGAATTATAGGCTGTTGGGTAGGGCTAAAACGTTGGAATTCTTTTC
TTGGTCTGGATTGCAAATGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTACATGGCTGATTTTTTAGGTAGAAGGAAACTGTTTGATGATATGAAGTGTCTTTTGGTGA
CGGTGTCGTCTCACAAGGGTCGTCTTTCTTGTCGGACATTTTCGATTTGTATCAGATTTTTGGGTAGGCAAGGGAGGGTTAGAGAAGCGCTTTGCTTGTTCGAAGAAATG
GAGACAAAATTTGGGTGTAAACCAGATAATCTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAACCAACTGGGGAATTGATTGATACTGCTCTAACAATTTT
TAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAATATAATTATAGGATTGTGTAAATTTGGTAGGTTTTGTACAGCTCTTGAAGTGTTTGATGAAATGGCTAGGG
CAGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCTAAAGAAGGGGCTATAGAAAAAGTTAGGGTCAAAAGTACTCGTAGA
CCTTTTACCGTTCTAGTTCCAAATGTGAACCCAAAGAGCGGTGCGATTGAACCTGCAGTTGGAGTTTTTTGGGCTGCTAATAGGCTGGCTTTAGTTCCCAGTGCATTTGT
AATAGTTCAGCTCCTATCGGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAGATTATTGAAAGTTGTCGAGGGTGGAAAGCTAAGATGTGCTGAAGAGTGTTACT
CCACTGTGATGCAAGCATTGTGTGAACATCGTCAGGTCGAAGAAGTTAGTGATCTGTTTGGGAGGATGCTTTCTCAGGGTATGAAGCCAAAGTTGGCTATTTACAATGCT
GTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGGAAAAGATGTGTACCTGATCATTTTACGTATTCAGCGCTAAT
CCATGCCTATGGTGAAACTAGGAATTGGTCGTCAGCCTACAGTTTATTGAAGGAAATGTTGAGTTTAGGCATGTCTCCTCATTTTCATTTGTTTAGTTTAGTGGATAAAC
TAATGAGGGAACATGGGCAAACTGATCTGTGCTTGAAGCTGGAAATGAAGTGGGAAGCCCAAATTTTGCAGAAGCTTTGTAAACAAGGACAACTGGAGGCCGCGTATGAA
AAGCTAAAGTCAATGCTTGAAAAAGGTTTTTATCCTCCTATCTATGTGAAGAATGCTTTTGAAAGTGCATTTCAAAAGAATGGTAAGTTGAAGATTGCACGGGAGTTGCT
GCAGAAGCTAGACGGAGTCCACAAACATGAATCAGGATTCAGAAATCCATCATGAGTTATCAAATGGAAATCTTTTCCTTTCATTATCAAAGAAATCTTTCATTTTAATG
TGGCAAAATTGCATGGGGTCAACCATGTCCTGATATCCAAAAAGAAACGTACAAACGTTTATTGTTGATGTTAACATATTTTGAGCCTTGTCAAATCCTTAGAAGTTTGA
GCTTCTCTTGAAGGCCCTTCCAGTGAAAGTTTACCCAGCCAACCAAGATCTTTCAAGTCTAAAATGTTGTACTAGCTTTGAGGAGGATCATATTGGATGCACCCCAGGAC
AATAACAATGGCGACGAGAAGAGCAAGAAGGCAAAGAATTCCAACTAGTTCCACATGAAAATGGAGTAGGAAGAATACTAAAAGAAAAGGTAAGGGTAGTGGCTGACTTC
AGGTCTCTTGGAAAGCAAAAACTTGTACAAAAGTTGACCTTCCAAGTTCCAACCATCAGATCATCAATCATATCCAAGCTTGGTCCTCAGCCAGTCAAGAAAATGGCCGA
CGCACCTCGTCAGCATTTATGCATATGTGGTGAGTCTTGTGGACAAGATATTGGAGATTTTCGACCTACAGTGGACGGAAATCCAAGGTTTCGCTGGTCCAAGATTTTAA
AAGTGCAATAAGGTTTTGTGGTCAAATACTTTTTGTGCTCTTCTTCGGAATGCCTGATTGTAAGCACCAAACAACTTTTCAGGACGAAGAGAAGAATGGGAAGGAAAAAA
ATGTTGTTGTGTATCTGACATCTTCATGGTGTGTTAATTCTATCATTTTTCTCTAATTAGCCATTGAGGCTGTTGAGTGGATTATAATAACATAGGTTCTAATGTTCATA
TGTGGAACTATATAATATTATTTAAAATGCAGAGTTGTTTAGTCTGGAGTTATAATAGTTTGTGTTTGGAGTGC
Protein sequenceShow/hide protein sequence
MLSTSTTRNASALLKSVTLHFYGFSSNFFGTSTTTKHIAIAPRRPTSRTVPTSRALDAFSSTNVVNSVCSLLSNQNHHTTNLDLDHLLKRFKETLSSDLVLQILMNYRLL
GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTG
ELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFCTALEVFDEMARAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVKSTRRPFTVLVPNVNPKSGAIEPAVGVFWAAN
RLALVPSAFVIVQLLSELCRLGQMQEAIRLLKVVEGGKLRCAEECYSTVMQALCEHRQVEEVSDLFGRMLSQGMKPKLAIYNAVICMLCKLGNLDDAERVFKIMNRKRCV
PDHFTYSALIHAYGETRNWSSAYSLLKEMLSLGMSPHFHLFSLVDKLMREHGQTDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKSMLEKGFYPPIYVKNAFESAFQKN
GKLKIARELLQKLDGVHKHESGFRNPS