; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G012690 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G012690
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr09:11111984..11114122
RNA-Seq ExpressionCmoCh09G012690
SyntenyCmoCh09G012690
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025166.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0098.88Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLR SYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHGRII SGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFT+A
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDI RAYILFR+NDDKSIITWNSMLSGLSHVDRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALRLFRELLL+GVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
        RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
        RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
        M DDYDIGDDIF
Subjt:  MIDDYDIGDDIF

XP_022141804.1 pentatricopeptide repeat-containing protein At1g71490-like [Momordica charantia]0.0e+0087.5Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSS +YI RGLS+YKL TFIPKQWKN PVSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR  YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHGRII SGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+G+RPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDIM AYILFRLN DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL++AK IITRMPYRPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD LN+VMKHG+LV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
          D +DI +D F
Subjt:  MIDDYDIGDDIF

XP_022925519.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
        RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
        RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
        MIDDYDIGDDIF
Subjt:  MIDDYDIGDDIF

XP_022973516.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxima]0.0e+0098.17Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMI SIF SLKSFASHGQLSKAFEAFSLVQLR SYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFT+A
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALRLFRE LL+GVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
        RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSP LEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
        RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLD LNDVMKHGTLV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
        M DDYDIG+D+F
Subjt:  MIDDYDIGDDIF

XP_023535485.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0098.03Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSSPEYILRGLSIYKLLTF+PKQWKNV VSNGREFMI SIFDSLKSFASHGQLSKAFEAFSLVQLR SYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHG IISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDI RAYILFRLNDDKSIITWNSMLSGLSHVDRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALRLFRELLL+GVEPNYVT ASILPLCARVADLQHGREFHCYITKRQDF+DYLLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
        RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMP RPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
        RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLD LNDVMKHGTLV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
        M DDYDIGDDIF
Subjt:  MIDDYDIGDDIF

TrEMBL top hitse value%identityAlignment
A0A1S3CB12 pentatricopeptide repeat-containing protein At1g714900.0e+0084.41Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MS S    IL+GLSI KL TFIPK WK +PVSN  EFMI SIF SLK FASHGQLSK FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHG IISSGL EDS LV KLV FYSS + L EAHTLVE SNLF PC WN+L+TSYVRN+L+E+AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHK IN+ S  WSLFV NALISMYGRCGE+DTAR LFD ML+RD VSWNSMISCY+S GMW+EAFELF+ MQSK LEIN+VTWNIIAGGCLR+G FT+A
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        L LLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRH +H  STVQNAL+TMYARCKDI  AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL+GVEPNYVTFASILPLCARVA+LQHGREFHCYITKR DF+D+LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMK   IKPDHITMVAVLSACSHSGL+ QGE+LFAEMQSVHGLSPRLEHY+CMADLFGRVGLL++AKEIITRMPYRPTSA+WATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
         NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLA+IRT MRD GVAK PGCSWV+VGSEFVSF VGDTS+PQALESK LLD L DV+KH +L+
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
          D+YD GD+IF
Subjt:  MIDDYDIGDDIF

A0A5D3BN10 Pentatricopeptide repeat-containing protein0.0e+0084.41Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MS S    IL+GLSI KL TFIPK WK +PVSN  EFMI SIF SLK FASHGQLSK FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHG IISSGL EDS LV KLV FYSS + L EAHTLVE SNLF PC WN+L+TSYVRN+L+E+AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHK IN+ S  WSLFV NALISMYGRCGE+DTAR LFD ML+RD VSWNSMISCY+S GMW+EAFELF+ MQSK LEIN+VTWNIIAGGCLR+G FT+A
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        L LLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRH +H  STVQNAL+TMYARCKDI  AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL+GVEPNYVTFASILPLCARVA+LQHGREFHCYITKR DF+D+LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMK   IKPDHITMVAVLSACSHSGL+ QGE+LFAEMQSVHGLSPRLEHY+CMADLFGRVGLL++AKEIITRMPYRPTSA+WATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
         NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLA+IRT MRD GVAK PGCSWV+VGSEFVSF VGDTS+PQALESK LLD L DV+KH +L+
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
          D+YD GD+IF
Subjt:  MIDDYDIGDDIF

A0A6J1CJU8 pentatricopeptide repeat-containing protein At1g71490-like0.0e+0087.5Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSS +YI RGLS+YKL TFIPKQWKN PVSNG EFMI  +F SLKSFA HGQLSKAFEAFSL+QLR  YNDSFDLI+QS SILLVSCT  SSLP G+Q
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHGRII SGLE+DSILVPKLVTFYSSFKLLAEAHTLVENSN+FHPCPWNLLITSYVRN LHE+AIL YKQMLS+G+RPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFD MQSKC+EINIVTWNIIAGGCLR+G F  A
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRHCYH+ S VQNAL+TMYARCKDIM AYILFRLN DKSIITWNSMLSG +H+DRVE+
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMK   IKPDHITMVAVLSACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL++AK IITRMPYRPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
         NT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAKAPGCSWV+VGS FVSFLVGDTSNPQALE+  LLD LN+VMKHG+LV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
          D +DI +D F
Subjt:  MIDDYDIGDDIF

A0A6J1EI84 pentatricopeptide repeat-containing protein At1g71490 isoform X10.0e+00100Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
        RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
        RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
        MIDDYDIGDDIF
Subjt:  MIDDYDIGDDIF

A0A6J1I8V4 pentatricopeptide repeat-containing protein At1g71490 isoform X10.0e+0098.17Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
        MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMI SIF SLKSFASHGQLSKAFEAFSLVQLR SYNDSFDLIVQSISILLVSCTTCSSLPSGKQ
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQ

Query:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
        LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL
Subjt:  LHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGL

Query:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA
        EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYAS GMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFT+A
Subjt:  EVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQA

Query:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
        LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED
Subjt:  LKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVED

Query:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALRLFRE LL+GVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
        RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSP LEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH
Subjt:  RKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIH

Query:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
        RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLD LNDVMKHGTLV
Subjt:  RNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Query:  MIDDYDIGDDIF
        M DDYDIG+D+F
Subjt:  MIDDYDIGDDIF

SwissProt top hitse value%identityAlignment
Q4V389 Pentatricopeptide repeat-containing protein At1g228306.0e-20150.87Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVP-------VSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCS
        M SS    ILRGL++ ++  FIP+ WK +P        ++  E +   +F+S +   SHGQL +AF  FSL++ +   + S + ++ S + LL +C   +
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVP-------VSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCS

Query:  SLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGET
            G+QLH   ISSGLE DS+LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SY+RN+  + ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGET

Query:  QNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLR
         +  +G  VH  I   S++ +L+V NALISMY R G++D AR LFD M +RDAVSWN++I+CY S     EAF+L D M    +E +IVTWN IAGGCL 
Subjt:  QNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLR

Query:  LGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSG
         G +  AL  +  MRN  + +  VAMI GL ACSHIGA++ GK  H   IR C   H    V+N+L+TMY+RC D+  A+I+F+  +  S+ TWNS++SG
Subjt:  LGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSG

Query:  LSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
         ++ +R E+   L +E+LL G  PN++T ASILPL ARV +LQHG+EFHCYI +RQ ++D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWA
         GYG  G+G  AL  F++M    IKPDH+TMVAVLSACSHS LV++G  LF +M+ V G+  RLEHY+CM DL+ R G LD+A++I   +PY P+SAM A
Subjt:  AGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWA

Query:  TLIGACCIHRNTDIGEWAAEK-LLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNP
        TL+ AC IH NT+IGEWAA+K LLE KPEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV KA   + +E  SE    L G+ + P
Subjt:  TLIGACCIHRNTDIGEWAAEK-LLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNP

Q9C9I6 Pentatricopeptide repeat-containing protein At1g714902.3e-23258.06Show/hide
Query:  SIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENS
        S+F SL   ASHG L  AF+ FSL++L+ S   S DL++ S + LL +C    +  +G Q+H   ISSG+E  S+LVPKLVTFYS+F L  EA +++ENS
Subjt:  SIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENS

Query:  NLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDN
        ++ HP PWN+LI SY +NEL E  I AYK+M+SKG+RPD FT+PS+LKACGET ++ FG  VH  I   S + SL+V NALISMY R   +  AR LFD 
Subjt:  NLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDN

Query:  MLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHG
        M +RDAVSWN++I+CYAS GMW EAFELFD M    +E++++TWNII+GGCL+ G +  AL L+S+MRNF   LD VAMIIGL ACS IGAIRLGKEIHG
Subjt:  MLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHG

Query:  FTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFH
          I   Y     V+N L+TMY++CKD+  A I+FR  ++ S+ TWNS++SG + +++ E+A  L RE+L+ G +PN +T ASILPLCAR+A+LQHG+EFH
Subjt:  FTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFH

Query:  CYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEV
        CYI +R+ F+DY +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEG  AL LF+EM    IKPDH+T+VAVLSACSHS LV +GE 
Subjt:  CYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEV

Query:  LFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKL
        LF +MQ  +G+ P L+H++CM DL+GR G L +AK+II  MPY+P+ A WATL+ AC IH NT IG+WAAEKLLEMKPE+ GYYVLIANMYAAAGSWSKL
Subjt:  LFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKL

Query:  AKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLVMIDDYDIGDD
        A++RT+MRD GV K PGC+W++  S F  F VGDTS+P+A  +  LLDGLN +MK      I+     D+
Subjt:  AKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLVMIDDYDIGDD

Q9LFL5 Pentatricopeptide repeat-containing protein At5g168602.2e-11834.25Show/hide
Query:  LKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHP
        ++S+  +G  +K    F L+       D++     +   +  +C   SS+  G+  H   + +G   +  +   LV  YS  + L++A  + +  +++  
Subjt:  LKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHP

Query:  CPWNLLITSYVRNELHESAILAYKQMLSK-GVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDR
          WN +I SY +    + A+  + +M ++ G RPDN T  ++L  C        G ++H    +     ++FV N L+ MY +CG +D A  +F NM  +
Subjt:  CPWNLLITSYVRNELHESAILAYKQMLSK-GVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDR

Query:  DAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIR
        D VSWN+M++ Y+  G +++A  LF+ MQ + +++++VTW+    G  + G   +AL +  QM + GI  ++V +I  L  C+ +GA+  GKEIH + I+
Subjt:  DAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIR

Query:  H-------CYHKSSTVQNALLTMYARCKDIMRAYILF--RLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVE--PNYVTFASILPLCARVADL
        +        +   + V N L+ MYA+CK +  A  +F      ++ ++TW  M+ G S       AL L  E+     +  PN  T +  L  CA +A L
Subjt:  H-------CYHKSSTVQNALLTMYARCKDIMRAYILF--RLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVE--PNYVTFASILPLCARVADL

Query:  QHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSG
        + G++ H Y  + Q     L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +AL +F+EM+ +  K D +T++ VL ACSHSG
Subjt:  QHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSG

Query:  LVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAA
        ++ QG   F  M++V G+SP  EHYAC+ DL GR G L+ A  +I  MP  P   +W   +  C IH   ++GE+AAEK+ E+   H G Y L++N+YA 
Subjt:  LVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAA

Query:  AGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLL
        AG W  + +IR+LMR  GV K PGCSWVE      +F VGD ++P A E   +L
Subjt:  AGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLL

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.0e-11233.49Show/hide
Query:  LRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNLFHPCPWNLLIT
        L  S +  +D I    S+ L+    C +L S + +H ++I  GL   +  + KL+ F              S FK + E + L+          WN +  
Subjt:  LRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNLFHPCPWNLLIT

Query:  SYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMI
         +  +    SA+  Y  M+S G+ P+++TFP +LK+C +++    G ++H  +        L+V  +LISMY + G L+ A  +FD    RD VS+ ++I
Subjt:  SYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMI

Query:  SCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTV
          YAS G  + A +LFD +  K    ++V+WN +  G    G + +AL+L   M    +  D+  M+  + AC+  G+I LG+++H +   H +  +  +
Subjt:  SCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTV

Query:  QNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFQDY
         NAL+ +Y++C ++  A  LF     K +I+WN+++ G +H++  ++AL LF+E+L  G  PN VT  SILP CA +  +  GR  H YI KR +   + 
Subjt:  QNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFQDY

Query:  LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLS
          L  +L+DMYA+ G +  A +VF+S+  K   ++ ++I G+ M G    +  LF  M+ + I+PD IT V +LSACSHSG++  G  +F  M   + ++
Subjt:  LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLS

Query:  PRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGV
        P+LEHY CM DL G  GL   A+E+I  M   P   +W +L+ AC +H N ++GE  AE L++++PE+ G YVL++N+YA+AG W+++AK R L+ D G+
Subjt:  PRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGV

Query:  AKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
         K PGCS +E+ S    F++GD  +P+  E   +L+ +  +++    V
Subjt:  AKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202301.5e-10631.92Show/hide
Query:  SSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGE
        SSL    Q H RI+ SG + D  +  KL+  YS++    +A  ++++        ++ LI +  + +L   +I  + +M S G+ PD+   P++ K C E
Subjt:  SSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGE

Query:  TQNLGFGLEVHKCINSWSN-QWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGC
              G ++H C++  S      FVQ ++  MY RCG +  AR +FD M D+D V+ ++++  YA  G  +E   +   M+S  +E NIV+WN I  G 
Subjt:  TQNLGFGLEVHKCINSWSN-QWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGC

Query:  LRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARC-----------------KDIMRAYI--
         R G   +A+ +  ++ + G   D V +   L +      + +G+ IHG+ I+    K   V +A++ MY +                    +  AYI  
Subjt:  LRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARC-----------------KDIMRAYI--

Query:  ------------LFRLNDDK----SIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLW
                    +F L  ++    ++++W S+++G +   +  +AL LFRE+ + GV+PN+VT  S+LP C  +A L HGR  H +   R    D + + 
Subjt:  ------------LFRLNDDK----SIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLW

Query:  NALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLE
        +AL+DMYA+ G++  ++ VF+ +  K+ V + SL+ G+ M G+ ++ + +FE +    +KPD I+  ++LSAC   GL  +G   F  M   +G+ PRLE
Subjt:  NALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLE

Query:  HYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAP
        HY+CM +L GR G L  A ++I  MP+ P S +W  L+ +C +  N D+ E AAEKL  ++PE+ G YVL++N+YAA G W+++  IR  M   G+ K P
Subjt:  HYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAP

Query:  GCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMK
        GCSW++V +   + L GD S+PQ  +    +D ++  M+
Subjt:  GCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMK

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.4e-11433.49Show/hide
Query:  LRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNLFHPCPWNLLIT
        L  S +  +D I    S+ L+    C +L S + +H ++I  GL   +  + KL+ F              S FK + E + L+          WN +  
Subjt:  LRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTF-------------YSSFKLLAEAHTLVENSNLFHPCPWNLLIT

Query:  SYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMI
         +  +    SA+  Y  M+S G+ P+++TFP +LK+C +++    G ++H  +        L+V  +LISMY + G L+ A  +FD    RD VS+ ++I
Subjt:  SYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMI

Query:  SCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTV
          YAS G  + A +LFD +  K    ++V+WN +  G    G + +AL+L   M    +  D+  M+  + AC+  G+I LG+++H +   H +  +  +
Subjt:  SCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHCYHKSSTV

Query:  QNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFQDY
         NAL+ +Y++C ++  A  LF     K +I+WN+++ G +H++  ++AL LF+E+L  G  PN VT  SILP CA +  +  GR  H YI KR +   + 
Subjt:  QNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFQDY

Query:  LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLS
          L  +L+DMYA+ G +  A +VF+S+  K   ++ ++I G+ M G    +  LF  M+ + I+PD IT V +LSACSHSG++  G  +F  M   + ++
Subjt:  LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLS

Query:  PRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGV
        P+LEHY CM DL G  GL   A+E+I  M   P   +W +L+ AC +H N ++GE  AE L++++PE+ G YVL++N+YA+AG W+++AK R L+ D G+
Subjt:  PRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGV

Query:  AKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV
         K PGCS +E+ S    F++GD  +P+  E   +L+ +  +++    V
Subjt:  AKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLV

AT1G22830.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.3e-20250.87Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVP-------VSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCS
        M SS    ILRGL++ ++  FIP+ WK +P        ++  E +   +F+S +   SHGQL +AF  FSL++ +   + S + ++ S + LL +C   +
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVP-------VSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCS

Query:  SLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGET
            G+QLH   ISSGLE DS+LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SY+RN+  + ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGET

Query:  QNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLR
         +  +G  VH  I   S++ +L+V NALISMY R G++D AR LFD M +RDAVSWN++I+CY S     EAF+L D M    +E +IVTWN IAGGCL 
Subjt:  QNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLR

Query:  LGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSG
         G +  AL  +  MRN  + +  VAMI GL ACSHIGA++ GK  H   IR C   H    V+N+L+TMY+RC D+  A+I+F+  +  S+ TWNS++SG
Subjt:  LGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSG

Query:  LSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
         ++ +R E+   L +E+LL G  PN++T ASILPL ARV +LQHG+EFHCYI +RQ ++D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWA
         GYG  G+G  AL  F++M    IKPDH+TMVAVLSACSHS LV++G  LF +M+ V G+  RLEHY+CM DL+ R G LD+A++I   +PY P+SAM A
Subjt:  AGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWA

Query:  TLIGACCIHRNTDIGEWAAEK-LLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNP
        TL+ AC IH NT+IGEWAA+K LLE KPEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV KA   + +E  SE    L G+ + P
Subjt:  TLIGACCIHRNTDIGEWAAEK-LLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNP

AT1G22830.2 Tetratricopeptide repeat (TPR)-like superfamily protein4.3e-20250.87Show/hide
Query:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVP-------VSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCS
        M SS    ILRGL++ ++  FIP+ WK +P        ++  E +   +F+S +   SHGQL +AF  FSL++ +   + S + ++ S + LL +C   +
Subjt:  MSSSSPEYILRGLSIYKLLTFIPKQWKNVP-------VSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCS

Query:  SLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGET
            G+QLH   ISSGLE DS+LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SY+RN+  + ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGET

Query:  QNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLR
         +  +G  VH  I   S++ +L+V NALISMY R G++D AR LFD M +RDAVSWN++I+CY S     EAF+L D M    +E +IVTWN IAGGCL 
Subjt:  QNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLR

Query:  LGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSG
         G +  AL  +  MRN  + +  VAMI GL ACSHIGA++ GK  H   IR C   H    V+N+L+TMY+RC D+  A+I+F+  +  S+ TWNS++SG
Subjt:  LGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIRHC--YHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSG

Query:  LSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
         ++ +R E+   L +E+LL G  PN++T ASILPL ARV +LQHG+EFHCYI +RQ ++D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWA
         GYG  G+G  AL  F++M    IKPDH+TMVAVLSACSHS LV++G  LF +M+ V G+  RLEHY+CM DL+ R G LD+A++I   +PY P+SAM A
Subjt:  AGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWA

Query:  TLIGACCIHRNTDIGEWAAEK-LLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNP
        TL+ AC IH NT+IGEWAA+K LLE KPEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV KA   + +E  SE    L G+ + P
Subjt:  TLIGACCIHRNTDIGEWAAEK-LLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNP

AT1G71490.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-23358.06Show/hide
Query:  SIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENS
        S+F SL   ASHG L  AF+ FSL++L+ S   S DL++ S + LL +C    +  +G Q+H   ISSG+E  S+LVPKLVTFYS+F L  EA +++ENS
Subjt:  SIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENS

Query:  NLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDN
        ++ HP PWN+LI SY +NEL E  I AYK+M+SKG+RPD FT+PS+LKACGET ++ FG  VH  I   S + SL+V NALISMY R   +  AR LFD 
Subjt:  NLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDN

Query:  MLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHG
        M +RDAVSWN++I+CYAS GMW EAFELFD M    +E++++TWNII+GGCL+ G +  AL L+S+MRNF   LD VAMIIGL ACS IGAIRLGKEIHG
Subjt:  MLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHG

Query:  FTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFH
          I   Y     V+N L+TMY++CKD+  A I+FR  ++ S+ TWNS++SG + +++ E+A  L RE+L+ G +PN +T ASILPLCAR+A+LQHG+EFH
Subjt:  FTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFH

Query:  CYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEV
        CYI +R+ F+DY +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEG  AL LF+EM    IKPDH+T+VAVLSACSHS LV +GE 
Subjt:  CYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEV

Query:  LFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKL
        LF +MQ  +G+ P L+H++CM DL+GR G L +AK+II  MPY+P+ A WATL+ AC IH NT IG+WAAEKLLEMKPE+ GYYVLIANMYAAAGSWSKL
Subjt:  LFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKL

Query:  AKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLVMIDDYDIGDD
        A++RT+MRD GV K PGC+W++  S F  F VGDTS+P+A  +  LLDGLN +MK      I+     D+
Subjt:  AKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLVMIDDYDIGDD

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-11934.25Show/hide
Query:  LKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHP
        ++S+  +G  +K    F L+       D++     +   +  +C   SS+  G+  H   + +G   +  +   LV  YS  + L++A  + +  +++  
Subjt:  LKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHP

Query:  CPWNLLITSYVRNELHESAILAYKQMLSK-GVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDR
          WN +I SY +    + A+  + +M ++ G RPDN T  ++L  C        G ++H    +     ++FV N L+ MY +CG +D A  +F NM  +
Subjt:  CPWNLLITSYVRNELHESAILAYKQMLSK-GVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNALISMYGRCGELDTARNLFDNMLDR

Query:  DAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIR
        D VSWN+M++ Y+  G +++A  LF+ MQ + +++++VTW+    G  + G   +AL +  QM + GI  ++V +I  L  C+ +GA+  GKEIH + I+
Subjt:  DAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKEIHGFTIR

Query:  H-------CYHKSSTVQNALLTMYARCKDIMRAYILF--RLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVE--PNYVTFASILPLCARVADL
        +        +   + V N L+ MYA+CK +  A  +F      ++ ++TW  M+ G S       AL L  E+     +  PN  T +  L  CA +A L
Subjt:  H-------CYHKSSTVQNALLTMYARCKDIMRAYILF--RLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVE--PNYVTFASILPLCARVADL

Query:  QHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSG
        + G++ H Y  + Q     L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +AL +F+EM+ +  K D +T++ VL ACSHSG
Subjt:  QHGREFHCYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSG

Query:  LVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAA
        ++ QG   F  M++V G+SP  EHYAC+ DL GR G L+ A  +I  MP  P   +W   +  C IH   ++GE+AAEK+ E+   H G Y L++N+YA 
Subjt:  LVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAA

Query:  AGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLL
        AG W  + +IR+LMR  GV K PGCSWVE      +F VGD ++P A E   +L
Subjt:  AGSWSKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCTTCCTCTCCTGAATACATCCTCAGAGGTCTTTCTATATATAAGCTCCTGACATTCATACCTAAACAATGGAAAAATGTACCTGTGAGCAATGGTAGAGAATT
TATGATTACTTCTATTTTTGATTCCCTTAAAAGCTTTGCCTCTCATGGTCAGTTGTCCAAAGCATTTGAAGCCTTCTCCCTTGTTCAATTGCGCGGTAGTTATAATGATT
CATTTGACCTCATCGTGCAATCCATCTCCATTCTTCTTGTATCATGCACCACTTGTAGCTCACTCCCGTCAGGTAAGCAACTTCATGGTCGCATTATCTCGTCAGGTCTT
GAGGAAGACTCCATTTTGGTCCCCAAGCTTGTCACATTCTACTCGAGCTTTAAACTTCTGGCCGAGGCTCATACCCTTGTTGAGAATTCTAATTTATTTCACCCCTGTCC
TTGGAATCTACTCATCACATCATATGTCAGAAATGAACTTCATGAGTCAGCCATTTTAGCTTATAAGCAGATGTTGAGTAAAGGGGTCAGACCAGATAATTTCACTTTTC
CCTCCATTTTGAAGGCTTGTGGTGAAACACAAAATCTGGGATTTGGTTTAGAAGTTCACAAGTGTATTAATTCTTGGTCAAATCAATGGAGTTTGTTTGTTCAGAATGCT
CTGATATCTATGTATGGAAGATGTGGCGAGCTGGACACTGCACGTAACTTGTTCGACAATATGCTTGACCGGGATGCAGTATCGTGGAATTCCATGATCTCTTGTTATGC
CTCCAACGGTATGTGGAAGGAGGCATTTGAACTATTTGACATCATGCAGAGTAAGTGTCTTGAAATTAACATTGTAACTTGGAATATTATAGCTGGAGGTTGCTTGCGCC
TTGGTAAGTTTACTCAAGCTCTTAAGTTACTGTCTCAAATGAGAAATTTTGGTATTCATTTGGACGATGTAGCAATGATAATAGGTTTAGGTGCTTGTTCACACATTGGT
GCCATTAGATTAGGGAAGGAAATTCATGGCTTTACTATTAGACATTGTTATCATAAGTCATCCACTGTTCAAAACGCTTTACTTACCATGTATGCTCGTTGTAAAGACAT
CATGCGTGCATATATTTTGTTTCGATTAAACGACGACAAAAGTATAATCACGTGGAACTCCATGCTTTCTGGCCTCTCACATGTGGACCGGGTTGAGGATGCCCTGCGTC
TGTTTAGAGAATTGTTACTATATGGTGTAGAACCCAACTATGTGACATTTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCAGATTTACAACATGGGAGAGAATTTCAC
TGTTACATTACTAAACGCCAAGATTTTCAAGATTATTTGTTATTGTGGAATGCTTTGGTGGACATGTATGCAAGGTCGGGCAAGGTCTTAGAAGCTAAAAGAGTTTTCGA
TTCATTAAGCAAAAAGGACGAAGTGACATATACTTCCCTGATTGCAGGCTATGGCATGCAAGGAGAGGGGAGGAAAGCGCTAAGGCTGTTCGAAGAAATGAAAAGTGTTG
ATATCAAACCAGATCATATAACTATGGTTGCTGTCCTATCAGCTTGTAGTCACTCCGGTCTTGTGAAACAGGGTGAAGTTTTATTTGCAGAGATGCAAAGTGTGCATGGA
CTAAGCCCTCGTTTGGAACATTATGCTTGTATGGCAGACCTTTTTGGGAGGGTTGGTCTGTTGGACAGAGCAAAGGAGATTATCACGAGAATGCCTTATAGACCAACATC
GGCTATGTGGGCGACTCTTATCGGAGCATGTTGCATCCATCGAAACACGGATATCGGGGAATGGGCTGCAGAGAAACTTCTGGAAATGAAGCCTGAACATTCTGGTTACT
ATGTCTTGATTGCTAACATGTATGCTGCTGCAGGTTCTTGGAGTAAGTTGGCAAAAATAAGGACTCTCATGAGAGATTATGGTGTGGCAAAAGCTCCTGGTTGTTCTTGG
GTCGAAGTTGGCTCGGAATTCGTCTCGTTCTTGGTTGGGGACACTTCTAATCCTCAAGCCCTTGAATCTAAACACTTGTTAGATGGTTTGAACGATGTAATGAAACACGG
TACTCTAGTGATGATAGATGATTACGATATCGGCGATGACATTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCTTCCTCTCCTGAATACATCCTCAGAGGTCTTTCTATATATAAGCTCCTGACATTCATACCTAAACAATGGAAAAATGTACCTGTGAGCAATGGTAGAGAATT
TATGATTACTTCTATTTTTGATTCCCTTAAAAGCTTTGCCTCTCATGGTCAGTTGTCCAAAGCATTTGAAGCCTTCTCCCTTGTTCAATTGCGCGGTAGTTATAATGATT
CATTTGACCTCATCGTGCAATCCATCTCCATTCTTCTTGTATCATGCACCACTTGTAGCTCACTCCCGTCAGGTAAGCAACTTCATGGTCGCATTATCTCGTCAGGTCTT
GAGGAAGACTCCATTTTGGTCCCCAAGCTTGTCACATTCTACTCGAGCTTTAAACTTCTGGCCGAGGCTCATACCCTTGTTGAGAATTCTAATTTATTTCACCCCTGTCC
TTGGAATCTACTCATCACATCATATGTCAGAAATGAACTTCATGAGTCAGCCATTTTAGCTTATAAGCAGATGTTGAGTAAAGGGGTCAGACCAGATAATTTCACTTTTC
CCTCCATTTTGAAGGCTTGTGGTGAAACACAAAATCTGGGATTTGGTTTAGAAGTTCACAAGTGTATTAATTCTTGGTCAAATCAATGGAGTTTGTTTGTTCAGAATGCT
CTGATATCTATGTATGGAAGATGTGGCGAGCTGGACACTGCACGTAACTTGTTCGACAATATGCTTGACCGGGATGCAGTATCGTGGAATTCCATGATCTCTTGTTATGC
CTCCAACGGTATGTGGAAGGAGGCATTTGAACTATTTGACATCATGCAGAGTAAGTGTCTTGAAATTAACATTGTAACTTGGAATATTATAGCTGGAGGTTGCTTGCGCC
TTGGTAAGTTTACTCAAGCTCTTAAGTTACTGTCTCAAATGAGAAATTTTGGTATTCATTTGGACGATGTAGCAATGATAATAGGTTTAGGTGCTTGTTCACACATTGGT
GCCATTAGATTAGGGAAGGAAATTCATGGCTTTACTATTAGACATTGTTATCATAAGTCATCCACTGTTCAAAACGCTTTACTTACCATGTATGCTCGTTGTAAAGACAT
CATGCGTGCATATATTTTGTTTCGATTAAACGACGACAAAAGTATAATCACGTGGAACTCCATGCTTTCTGGCCTCTCACATGTGGACCGGGTTGAGGATGCCCTGCGTC
TGTTTAGAGAATTGTTACTATATGGTGTAGAACCCAACTATGTGACATTTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCAGATTTACAACATGGGAGAGAATTTCAC
TGTTACATTACTAAACGCCAAGATTTTCAAGATTATTTGTTATTGTGGAATGCTTTGGTGGACATGTATGCAAGGTCGGGCAAGGTCTTAGAAGCTAAAAGAGTTTTCGA
TTCATTAAGCAAAAAGGACGAAGTGACATATACTTCCCTGATTGCAGGCTATGGCATGCAAGGAGAGGGGAGGAAAGCGCTAAGGCTGTTCGAAGAAATGAAAAGTGTTG
ATATCAAACCAGATCATATAACTATGGTTGCTGTCCTATCAGCTTGTAGTCACTCCGGTCTTGTGAAACAGGGTGAAGTTTTATTTGCAGAGATGCAAAGTGTGCATGGA
CTAAGCCCTCGTTTGGAACATTATGCTTGTATGGCAGACCTTTTTGGGAGGGTTGGTCTGTTGGACAGAGCAAAGGAGATTATCACGAGAATGCCTTATAGACCAACATC
GGCTATGTGGGCGACTCTTATCGGAGCATGTTGCATCCATCGAAACACGGATATCGGGGAATGGGCTGCAGAGAAACTTCTGGAAATGAAGCCTGAACATTCTGGTTACT
ATGTCTTGATTGCTAACATGTATGCTGCTGCAGGTTCTTGGAGTAAGTTGGCAAAAATAAGGACTCTCATGAGAGATTATGGTGTGGCAAAAGCTCCTGGTTGTTCTTGG
GTCGAAGTTGGCTCGGAATTCGTCTCGTTCTTGGTTGGGGACACTTCTAATCCTCAAGCCCTTGAATCTAAACACTTGTTAGATGGTTTGAACGATGTAATGAAACACGG
TACTCTAGTGATGATAGATGATTACGATATCGGCGATGACATTTTTTGA
Protein sequenceShow/hide protein sequence
MSSSSPEYILRGLSIYKLLTFIPKQWKNVPVSNGREFMITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPSGKQLHGRIISSGL
EEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVRNELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFVQNA
LISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCLEINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIG
AIRLGKEIHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDRVEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFH
CYITKRQDFQDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKPDHITMVAVLSACSHSGLVKQGEVLFAEMQSVHG
LSPRLEHYACMADLFGRVGLLDRAKEIITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDYGVAKAPGCSW
VEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHGTLVMIDDYDIGDDIF