; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017796 (gene) of Snake gourd v1 genome

Gene IDTan0017796
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG01:116369372..116378162
RNA-Seq ExpressionTan0017796
SyntenyTan0017796
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591227.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0083.91Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISK---------------------------GSEFLGG-AESTKFMQMQIVDALRLGDRSSA
        MHRAR RL SIADSLYRF+PHEHGR+QDA+K+   RALLIS+                           G E+LG  AESTKFMQ QIVDALR+GDRSSA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISK---------------------------GSEFLGG-AESTKFMQMQIVDALRLGDRSSA

Query:  SNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKR
        SNLLMELGQEKHSLTADNFV ILSYCARSPDPLFVMETWKIMEERG+FL+NTC+LLMI+ALCKGGYLDEAFGLI+ L ES VMFPVLPVYN FLRAC KR
Subjt:  SNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKR

Query:  QSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELD
        QSTVHVSQCLD+MD RMVGKNEATY ELLK+AVCQKNLSSVHEIWTDFVKNYSPSVLSLR FIW Y+RLGDLKSA+ ALQKMVAL IGAAG KLPSLELD
Subjt:  QSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELD

Query:  IPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHE
        IP+P RTEFY +NFNFEENG STDE+YCKKMVP  GDI +FSVN MKC EVESG  T  +NY+S++VMKVLRWSFNDVI ACALTRNCGLAEQLMQQMHE
Subjt:  IPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHE

Query:  LRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKM
        L LQPSSHTFDGFVRSVVSERGFSDG+KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISAC +PHPFNAFLSACD MDQ ERAMRML KM
Subjt:  LRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKM

Query:  KQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYN
        KQM+VLPDVKTYELLYSLFGNVNAPYE+GNRLSQVDAA+RIR+IE+DM KHGIQHSH SM NLLKALGAEGMTKELLQYL+VAENLF+YNNT LGTPIYN
Subjt:  KQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYN

Query:  TVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELD
        T LHFLVESK+IHMAIELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGFE+FDDALNLLDQA+SEGIELD
Subjt:  TVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELD

Query:  VVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFF
        VVIMNTI+QKACEKGRIDVIEF VE+M REKIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRMLC++ DTSP +TEYVE+FVLAEDSE +S ILEFF
Subjt:  VVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFF

Query:  KCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY-DGY
        KCSE++LSFAL NLRWSAMLGYSLCSSP QSPWA RLA+SY DGY
Subjt:  KCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY-DGY

XP_022132690.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Momordica charantia]0.0e+0086.83Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRA  RL SIADSLYRFKPHEHGR+Q +SKL  HR LLISK SEFLG GAE+TKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
         SPDPLFVMETW+IME+RGIFLNNTCSLLMIEALCKGGYLDEAFGLIN LAES VMFPVLPVYNCFLRAC K QSTVHV QCLDLMDHRMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLKLAV Q+NLSSVHEIWTDFVKNYSPSVLSLR FIWSY+RLGDLKSA I+LQKMVAL +GAAGGKLPSLELDIPIPS TEFYRNNF+FE+N HS+DELY
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
         KK+V  + DIG+FSVN MKC + ESGPLT QNN +SS+VMKVLRWSFNDVIHACA TR+CGLAEQLMQQM +L LQPS HTFDGFVRSVVSERGFSDGM
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISACPYPHPFNAFL ACD MDQ ERAMRMLVKMKQ+KVLP+V TYE LYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +GNRLSQ DA +RIR+IE+DM KHGIQHS+LSM NLLKALGAEGMTKELLQYL VAENLF+YNNT+LGTP+YNTVLHFLVESK+IHMAIELFNNMKHSGF
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFEMM+DCCSVM CLKSAFALLSMMVRTGFCPQILTYTSLVKIVL  E FDDALNLLDQA+SEGI+LDVVIMNTIL KACEKGR+DVIEF++ERM
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS
        NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRML +++D SP+LTEYVENFVLAED   D  ILEFFKCSE++LSFALFNLRWSAMLGYSLCSS
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS

Query:  PYQSPWATRLANSYDGYRTS
        P QSPWA RLANSYD  R+S
Subjt:  PYQSPWATRLANSYDGYRTS

XP_022937086.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita moschata]0.0e+0086.8Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRAR RL SIADSLYRF+PHEHGR+QDA+K+   RALLIS+G E+LG  AESTKFMQ QIVDALR+GDRSSASNLLMELGQEKHSLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
        RSPDPLFVMETWKIMEERG+FL+NTC+LLMI+ALCKGGYLDEAFGLI+ LAES VMFPVLPVYN FLRAC KRQSTVHVSQCLD+MD RMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLK+AVCQKNLSSVHEIWTDFVKNYSPSVLSLR FIW Y+RLGDLKSA+ ALQKMVAL IGAAG KLPSLELDIP+P RTEFY +NFNFEENG STDE+Y
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
        CKKMVP  GDI +FSVN MKC EVESG  T  +NY+S++VMKVLRWSFNDVI ACALTRNCGLAEQLMQQMHEL LQPSSHTFDGFVRSVVSERGFSDG+
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISAC YPHPFNAFLSACD MDQ ERAMRML KMKQM+VLPDVKTYELLYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +GNRLSQVDAA+RIR+IE+DM KHGIQHSH SM NLLKALGAEGMTKELLQYL+VAENLF+YNNT LGTPIYNT LHFLVESK+IHMAIELFNNMKHSG 
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGFE+FDDALNLLDQA+SEGIELDVVIMNTI+QKACEKGRIDVIEF+VE+M
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS
         R+KIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRMLC++ DTSP +TEYVE+FVLAEDSE +S ILEFFKCSE++LSFAL NLRWSAMLGYSLCSS
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS

Query:  PYQSPWATRLANSY-DGY
        P QSPWA RLA+SY DGY
Subjt:  PYQSPWATRLANSY-DGY

XP_022976056.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita maxima]0.0e+0086.55Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRAR RL SIADSLYRF+PHEHGR+QDA+K+   RALLIS+G E+LG  AESTKFMQ QIVDALR+GDRSSASNLLMELGQEKHSLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
        RSPDPLFVMETWKIMEERG+FL+NTC+LLMI+ALCKGGYLDEAFGLI+ LAES VMFPVLPVYN FLRAC KRQSTVHVSQCLD+MD RMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLK+AV QKNLSSVHEIWTDFVKNYSPSVLSLR FIWSY+RLGDLKSAY ALQKMV L IGAAG KL SLELDIP+P RTEFY +NFNFEENG STDELY
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
        CKK+VP  GDI +FSVN MKC EVESG LT  +NY+S++VMKVLRWSFNDVI ACA TRNCGLAEQLMQQMHEL LQPSSHTFDGFVRSVVSERGFSDG+
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISAC YPHPFNAFLSACD MDQ ERAMRML KMKQM+VLPDVKTYELLYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +GNRLSQVDAA+RIR+IE+DM KHGIQHSH SM NLLKALGAEGMTKELLQYL+VAENLF+YNNT LGTPIYNT LHFLVESK+IHMA ELFNNMKHSG 
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGFE+FDDALNLLDQA+SEGIELDVVIMNTI+QKACEKGRIDVIEF+VE+M
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS
         REKIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRMLC++ DTSP +TEYVE+FVLAEDSE +S ILEFFKCSE++LSFAL NLRWSAMLGYSLCSS
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS

Query:  PYQSPWATRLANSY-DGY
          QSPWA RLA+SY DGY
Subjt:  PYQSPWATRLANSY-DGY

XP_023536089.1 pentatricopeptide repeat-containing protein At1g76280 [Cucurbita pepo subsp. pepo]0.0e+0086.8Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRAR+RL SIADSLYRF+PHEHGR+QDA+K+   RALLIS+G E+LG  AESTKFMQ QIVDALR+GDRSSASNLLMELGQEKHSLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
        RSPDPLFVMETWKIMEERG+FL+NTC+LLMI+ALCKGGYLDEAFGLI+ LAES VMFPVLPVYN FLRAC KRQSTVHVSQCLD+MD RMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLK+AVCQKNLSSVHEIWTDFVKNYSPSVLSLR FIWSY+RLGDLKSAY ALQKMVAL IGAAG KLPSLELDIP+P RTE Y  NFNFEENG STDELY
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
        CKKMVP  GDIG+FSVN MKC EVESG LT  +NY+S++VMKVLRWSFNDVI ACALTRNCGLAEQLMQQMHEL LQPSSHTFDGFVRSVVSERGFSDG+
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISAC +PHPFNAFLSACD MDQ ERAMRMLVKMKQM+VLPDVKTYELLYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +GNRLSQVDAA+RIR+IE+DM KHGIQHSH SM NLLKALGAEGMTKELLQYL+VAENLF+Y+NT LGTPIYNT LHFLVESK+IHMAIELFNNMKHSG 
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFEMMI+CCSV+GCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGFE+FDDALNLLDQA+SEGIELDVVIMNTI+QKACEK   DVIEF+VE+M
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS
         REKIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRMLC++ DTSP +TEYVE+FVLAEDSE +S ILEFFKCSE++LSFAL NLRWSAMLGYSLCSS
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS

Query:  PYQSPWATRLANSY-DGY
        P QSPWA RLA+SY DGY
Subjt:  PYQSPWATRLANSY-DGY

TrEMBL top hitse value%identityAlignment
A0A1S3CEF2 pentatricopeptide repeat-containing protein At1g76280 isoform X10.0e+0082.54Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRA +RL SIADS+YRFKPHE  R+QDASKL  HRALLISKGSE  G GAEST FMQ+QIVDALRLGDRS ASNLLM LGQEK SLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
        +SPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLIN LAESHVMFPVLPVYNCFLRACA RQSTVH SQCLDLMDHRMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLKLAVCQ+N SSVHEIWTDFVKNYSPSV SLR FIWS++RLGDL SAY ALQKMVAL  GA G KL S  LDIPIP RTEFY NNFNFEE   S DE +
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
        CKKMVP+NGD+G  SVNDMKC   E+GPLT  NN++SS+V KVLRWS NDV+ +C+L  NCGLAEQLMQQMH+L LQPSSHTFDGFVRSVVSERGFS GM
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        +ILK+MQ+R L+PYDSTLAAVS+SCSKALELDLAEALLE +SACPYP+PFNAFLSAC  MDQ ERAMRMLVKMKQMKV+PDV+TYELLYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +G++LSQVDAA+RIR+IE+DM KHGIQ+SH SM NLLKALGAEGM KE+LQYL++AENLF+YNNT LG P+YNTVLHFLV+SK+I+MAIELFNNMK+SGF
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFE+M+DCCSVMGCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGF +FDDALNLLDQA+SEGIELDV+IMNTI++KACEK RIDVIEFLVE+M
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRM-LCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCS
        NREKIQPDPSTCH+VFSAYVNLGYHSTAMEALQVLSMRM LCE+DD S  +TEY+ENFVLAED+  DS I EFFKCS + L FALFNLRW AMLGYS+C 
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRM-LCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCS

Query:  SPYQSPWATRLANSYDGYR
        SP QSPWA RLA+SYDGY+
Subjt:  SPYQSPWATRLANSYDGYR

A0A5A7V601 Pentatricopeptide repeat-containing protein0.0e+0082.25Show/hide
Query:  RARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARS
        +A +RL SIADS+YRFKPHE  R+QDASKL  HRALLISKGSE  G GAEST FMQ+QIVDALRLGDR+ ASNLLM LGQEK SLTADNFV ILSYCA+S
Subjt:  RARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARS

Query:  PDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELL
        PDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLIN LAESHVMFPVLPVYNCFLRACA RQSTVH SQCLDLMDHRMVGKNEATY ELL
Subjt:  PDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELL

Query:  KLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCK
        KLAVCQ+N SSVHEIWTDFVKNYSPSV SLR FIWS++RLGDL SAY ALQKMVAL  GA G KL S  LDIPIP RTEFY NNFNFEE   S DE +CK
Subjt:  KLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCK

Query:  KMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKI
        KMVP+NGD+G  SVNDMKC   E+GPLT  NN++SS+V KVLRWS NDV+ +C+L  NCGLAEQLMQQMH+L LQPSSHTFDGFVRSVVSERGFS GM+I
Subjt:  KMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKI

Query:  LKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDG
        LK+MQ+R L+PYDSTLAAVS+SCSKALELDLAEALLE +SACPYP+PFNAFLSAC  MDQ ERAMRMLVKMKQMKV+PDV+TYELLYSLFGNVNAPYE+G
Subjt:  LKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDG

Query:  NRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFP
        ++LSQVDAA+RIR+IE+DM KHGIQ+SH SM NLLKALGAEGM KE+LQYL++AENLF+YNNT LG P+YNTVLHFLV+SK+I+MAIELFNNMK+SGFFP
Subjt:  NRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFP

Query:  DAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNR
        DAATFE+M+DCCSVMGCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGF +FDDALNLLDQA+SEGIELDV+IMNTI++KACEK RIDVIEFLVE+MNR
Subjt:  DAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNR

Query:  EKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRM-LCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSP
        EKIQPDPSTCH+VFSAYVNLGYHSTAMEALQVLSMRM LCE+DD S  +TEY+ENFVLAED+  DS I EFFKCS + L FALFNLRW AMLGYS+C SP
Subjt:  EKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRM-LCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSP

Query:  YQSPWATRLANSYDGYR
         QSPWA RLA+SYDGY+
Subjt:  YQSPWATRLANSYDGYR

A0A6J1BT64 pentatricopeptide repeat-containing protein At1g76280 isoform X10.0e+0086.83Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRA  RL SIADSLYRFKPHEHGR+Q +SKL  HR LLISK SEFLG GAE+TKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLG-GAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
         SPDPLFVMETW+IME+RGIFLNNTCSLLMIEALCKGGYLDEAFGLIN LAES VMFPVLPVYNCFLRAC K QSTVHV QCLDLMDHRMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLKLAV Q+NLSSVHEIWTDFVKNYSPSVLSLR FIWSY+RLGDLKSA I+LQKMVAL +GAAGGKLPSLELDIPIPS TEFYRNNF+FE+N HS+DELY
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
         KK+V  + DIG+FSVN MKC + ESGPLT QNN +SS+VMKVLRWSFNDVIHACA TR+CGLAEQLMQQM +L LQPS HTFDGFVRSVVSERGFSDGM
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISACPYPHPFNAFL ACD MDQ ERAMRMLVKMKQ+KVLP+V TYE LYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +GNRLSQ DA +RIR+IE+DM KHGIQHS+LSM NLLKALGAEGMTKELLQYL VAENLF+YNNT+LGTP+YNTVLHFLVESK+IHMAIELFNNMKHSGF
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFEMM+DCCSVM CLKSAFALLSMMVRTGFCPQILTYTSLVKIVL  E FDDALNLLDQA+SEGI+LDVVIMNTIL KACEKGR+DVIEF++ERM
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS
        NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRML +++D SP+LTEYVENFVLAED   D  ILEFFKCSE++LSFALFNLRWSAMLGYSLCSS
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS

Query:  PYQSPWATRLANSYDGYRTS
        P QSPWA RLANSYD  R+S
Subjt:  PYQSPWATRLANSYDGYRTS

A0A6J1F9C6 pentatricopeptide repeat-containing protein At1g76280 isoform X10.0e+0086.8Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRAR RL SIADSLYRF+PHEHGR+QDA+K+   RALLIS+G E+LG  AESTKFMQ QIVDALR+GDRSSASNLLMELGQEKHSLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
        RSPDPLFVMETWKIMEERG+FL+NTC+LLMI+ALCKGGYLDEAFGLI+ LAES VMFPVLPVYN FLRAC KRQSTVHVSQCLD+MD RMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLK+AVCQKNLSSVHEIWTDFVKNYSPSVLSLR FIW Y+RLGDLKSA+ ALQKMVAL IGAAG KLPSLELDIP+P RTEFY +NFNFEENG STDE+Y
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
        CKKMVP  GDI +FSVN MKC EVESG  T  +NY+S++VMKVLRWSFNDVI ACALTRNCGLAEQLMQQMHEL LQPSSHTFDGFVRSVVSERGFSDG+
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISAC YPHPFNAFLSACD MDQ ERAMRML KMKQM+VLPDVKTYELLYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +GNRLSQVDAA+RIR+IE+DM KHGIQHSH SM NLLKALGAEGMTKELLQYL+VAENLF+YNNT LGTPIYNT LHFLVESK+IHMAIELFNNMKHSG 
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGFE+FDDALNLLDQA+SEGIELDVVIMNTI+QKACEKGRIDVIEF+VE+M
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS
         R+KIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRMLC++ DTSP +TEYVE+FVLAEDSE +S ILEFFKCSE++LSFAL NLRWSAMLGYSLCSS
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS

Query:  PYQSPWATRLANSY-DGY
        P QSPWA RLA+SY DGY
Subjt:  PYQSPWATRLANSY-DGY

A0A6J1IFV2 pentatricopeptide repeat-containing protein At1g76280 isoform X10.0e+0086.55Show/hide
Query:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA
        MHRAR RL SIADSLYRF+PHEHGR+QDA+K+   RALLIS+G E+LG  AESTKFMQ QIVDALR+GDRSSASNLLMELGQEKHSLTADNFV ILSYCA
Subjt:  MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGG-AESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCA

Query:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE
        RSPDPLFVMETWKIMEERG+FL+NTC+LLMI+ALCKGGYLDEAFGLI+ LAES VMFPVLPVYN FLRAC KRQSTVHVSQCLD+MD RMVGKNEATY E
Subjt:  RSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCE

Query:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY
        LLK+AV QKNLSSVHEIWTDFVKNYSPSVLSLR FIWSY+RLGDLKSAY ALQKMV L IGAAG KL SLELDIP+P RTEFY +NFNFEENG STDELY
Subjt:  LLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELY

Query:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM
        CKK+VP  GDI +FSVN MKC EVESG LT  +NY+S++VMKVLRWSFNDVI ACA TRNCGLAEQLMQQMHEL LQPSSHTFDGFVRSVVSERGFSDG+
Subjt:  CKKMVPYNGDIGKFSVNDMKCEEVESGPLTSQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGM

Query:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE
        KILKIMQ+RKLKPYDSTLAAVSISCSKALELDLAEALLE ISAC YPHPFNAFLSACD MDQ ERAMRML KMKQM+VLPDVKTYELLYSLFGNVNAPYE
Subjt:  KILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYE

Query:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF
        +GNRLSQVDAA+RIR+IE+DM KHGIQHSH SM NLLKALGAEGMTKELLQYL+VAENLF+YNNT LGTPIYNT LHFLVESK+IHMA ELFNNMKHSG 
Subjt:  DGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGF

Query:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM
        FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R+GFCPQILTYTSLVKIVLGFE+FDDALNLLDQA+SEGIELDVVIMNTI+QKACEKGRIDVIEF+VE+M
Subjt:  FPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERM

Query:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS
         REKIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRMLC++ DTSP +TEYVE+FVLAEDSE +S ILEFFKCSE++LSFAL NLRWSAMLGYSLCSS
Subjt:  NREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSS

Query:  PYQSPWATRLANSY-DGY
          QSPWA RLA+SY DGY
Subjt:  PYQSPWATRLANSY-DGY

SwissProt top hitse value%identityAlignment
Q76C99 Protein Rf1, mitochondrial8.0e-1623.58Show/hide
Query:  DGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELI---SACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLP
        +GF +   S++ +S   ++L    +R + P   T  ++  +  KA  +D A  +L  +      P    +N+ L    +  Q + A+  L KM+   V P
Subjt:  DGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELI---SACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLP

Query:  DVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLV
        DV TY LL              + L +       R I   MTK G++    +   LL+    +G   E    +H   +L   N  H    +++ ++    
Subjt:  DVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLV

Query:  ESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTI
        +   +  A+ +F+ M+  G  P+A T+  +I      G ++ A      M+  G  P  + Y SL+  +    K++ A  L+ +    GI L+ +  N+I
Subjt:  ESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTI

Query:  LQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEAL
        +   C++GR+   E L E M R  ++P+  T +++ + Y   G    AM+ L
Subjt:  LQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEAL

Q84ZD2 Pentatricopeptide repeat-containing protein CRP1 homolog, chloroplastic2.2e-1321.26Show/hide
Query:  NCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEAL-LELISA---CPYPHPFNAFLS
        +  L ++L+  + E RL+P +  F   + +    R     +++L   Q   L P  + + A+  S   A  +  AEAL LE   A    P    +NA L 
Subjt:  NCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEAL-LELISA---CPYPHPFNAFLS

Query:  ACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHV
            +   + A ++L +M Q  V PD  TY LL   +           R  + ++A   R++  +M   G++ S      +L      G  ++    L  
Subjt:  ACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHV

Query:  AENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCP-QILTYTSLVKIVLGFEK
                + H     YN ++    +   +  A++ F+ M+  G  PD  T+  +ID     G    A  L   M R   CP    TY  ++ ++   ++
Subjt:  AENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCP-QILTYTSLVKIVLGFEK

Query:  FDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVL
        ++    +L +   +G+  +++   T++      GR       +E M  + ++P P+  H++ +AY   G    A+  ++ +
Subjt:  FDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVL

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655604.4e-1418.5Show/hide
Query:  WSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELI---S
        +++N +++      N   A Q + ++ E  L P   T+   +      +      K+   M  +  +  +     +      A  +D A  L   +    
Subjt:  WSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELI---S

Query:  ACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGA
          P    +   + +    ++   A+ ++ +M++  + P++ TY +L              + L       + R +   M + G+  + ++   L+     
Subjt:  ACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGA

Query:  EGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTY
         GM ++ +  + + E+     NT      YN ++    +S ++H A+ + N M      PD  T+  +ID     G   SA+ LLS+M   G  P   TY
Subjt:  EGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTY

Query:  TSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSV
        TS++  +   ++ ++A +L D    +G+  +VV+   ++   C+ G++D    ++E+M  +   P+  T +++
Subjt:  TSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSV

Q9SGQ6 Pentatricopeptide repeat-containing protein At1g762809.3e-20649.36Show/hide
Query:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK
        R++    G+EF+   + +K +Q+QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARSPDP+FVMET+ +M ++ I L++   L ++++LC 
Subjt:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK

Query:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI
        GG+LD+A   I+ + E   + P+LP+YN FL ACA+ +S  H S+CL+LMD R VGKN  TY  LLKLAV Q+NLS+V++IW  +V +Y+  +LSLR FI
Subjt:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI

Query:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT
        WS++RLGDLKSAY  LQ MV L       + +  GKL S  L IP+PS+ E     F                          F V D   +   S  + 
Subjt:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT

Query:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE
            +     ++VLRWSFNDVIHAC  ++N  LAEQLM Q                                LK+MQ++ LKPYDSTLA V+  CSKAL+
Subjt:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE

Query:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH
        +DLAE LL+ IS C Y +PFN  L+A D++DQ ERA+R+L +MK++K+ PD++TYELL+SLFGNVNAPYE+GN LSQVD  +RI  IE+DM ++G QHS 
Subjt:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH

Query:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM
        +S  N+L+ALGAEGM  E++++L  AENL  ++N +LGTP YN VLH L+E+ +  M I +F  MK  G   D AT+ +MIDCCS++   KSA AL+SMM
Subjt:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM

Query:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME
        +R GF P+ +T+T+L+KI+L    F++ALNLLDQA  E I LDV+  NTIL+KA EKG IDVIE++VE+M+REK+ PDP+TCH VFS YV  GYH+TA+E
Subjt:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME

Query:  ALQVLSMRMLCEDDDTS--PNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY
        AL VLS+RML E+D  S      E  ENFV++ED E ++ I+E F+ SE++L+ AL NLRW AMLG  +  S  QSPWA  L+N Y
Subjt:  ALQVLSMRMLCEDDDTS--PNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY

Q9SUD8 Pentatricopeptide repeat-containing protein At4g280101.7e-1324.3Show/hide
Query:  AEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLEL-ISACPYPHP----FNAFLSACD
        A QL+  M E   +P++ T++  +  +  +   +D ++I+++M++R+ +P + T   +        +LD A  LL L +    Y  P    +NA +    
Subjt:  AEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLEL-ISACPYPHP----FNAFLSACD

Query:  AMDQAERAMRML-VKMKQMKVLPDVKTYELLYSLF--GNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHV
          ++  +A+ +  + ++++     V T  LL S    G+VN   E   ++S     R                ++ +M +     G   + K LL  + V
Subjt:  AMDQAERAMRML-VKMKQMKVLPDVKTYELLYSLF--GNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHV

Query:  AENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKF
        +E              YN +L  L +   +  A  LF  M+    FPD  +F +MID     G +KSA +LL  M R G  P + TY+ L+   L     
Subjt:  AENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKF

Query:  DDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPS-TC
        D+A++  D+    G E D  I +++L+    +G  D +  LV+++  + I  D   TC
Subjt:  DDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPS-TC

Arabidopsis top hitse value%identityAlignment
AT1G76280.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.6e-20749.36Show/hide
Query:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK
        R++    G+EF+   + +K +Q+QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARSPDP+FVMET+ +M ++ I L++   L ++++LC 
Subjt:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK

Query:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI
        GG+LD+A   I+ + E   + P+LP+YN FL ACA+ +S  H S+CL+LMD R VGKN  TY  LLKLAV Q+NLS+V++IW  +V +Y+  +LSLR FI
Subjt:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI

Query:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT
        WS++RLGDLKSAY  LQ MV L       + +  GKL S  L IP+PS+ E     F                          F V D   +   S  + 
Subjt:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT

Query:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE
            +     ++VLRWSFNDVIHAC  ++N  LAEQLM Q                                LK+MQ++ LKPYDSTLA V+  CSKAL+
Subjt:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE

Query:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH
        +DLAE LL+ IS C Y +PFN  L+A D++DQ ERA+R+L +MK++K+ PD++TYELL+SLFGNVNAPYE+GN LSQVD  +RI  IE+DM ++G QHS 
Subjt:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH

Query:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM
        +S  N+L+ALGAEGM  E++++L  AENL  ++N +LGTP YN VLH L+E+ +  M I +F  MK  G   D AT+ +MIDCCS++   KSA AL+SMM
Subjt:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM

Query:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME
        +R GF P+ +T+T+L+KI+L    F++ALNLLDQA  E I LDV+  NTIL+KA EKG IDVIE++VE+M+REK+ PDP+TCH VFS YV  GYH+TA+E
Subjt:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME

Query:  ALQVLSMRMLCEDDDTS--PNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY
        AL VLS+RML E+D  S      E  ENFV++ED E ++ I+E F+ SE++L+ AL NLRW AMLG  +  S  QSPWA  L+N Y
Subjt:  ALQVLSMRMLCEDDDTS--PNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY

AT1G76280.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-18149.93Show/hide
Query:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK
        R++    G+EF+   + +K +Q+QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARSPDP+FVMET+ +M ++ I L++   L ++++LC 
Subjt:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK

Query:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI
        GG+LD+A   I+ + E   + P+LP+YN FL ACA+ +S  H S+CL+LMD R VGKN  TY  LLKLAV Q+NLS+V++IW  +V +Y+  +LSLR FI
Subjt:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI

Query:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT
        WS++RLGDLKSAY  LQ MV L       + +  GKL S  L IP+PS+ E     F                          F V D   +   S  + 
Subjt:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT

Query:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE
            +     ++VLRWSFNDVIHAC  ++N  LAEQLM QM  L L PSSHT+DGF+R+V    G+  GM +LK+MQ++ LKPYDSTLA V+  CSKAL+
Subjt:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE

Query:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH
        +DLAE LL+ IS C Y +PFN  L+A D++DQ ERA+R+L +MK++K+ PD++TYELL+SLFGNVNAPYE+GN LSQVD  +RI  IE+DM ++G QHS 
Subjt:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH

Query:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM
        +S  N+L+ALGAEGM  E++++L  AENL  ++N +LGTP YN VLH L+E+ +  M I +F  MK  G   D AT+ +MIDCCS++   KSA AL+SMM
Subjt:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM

Query:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFL-VERMNR
        +R GF P+ +T+T+L+KI+L    F++ALNLLDQA  E I LDV+  NTIL+KA EK +I V++ L V ++N+
Subjt:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFL-VERMNR

AT1G76280.3 Tetratricopeptide repeat (TPR)-like superfamily protein8.3e-21850.89Show/hide
Query:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK
        R++    G+EF+   + +K +Q+QIVDALR G+R  AS LL +L Q  +SL+AD+F  IL YCARSPDP+    T+ +M ++ I L++   L ++++LC 
Subjt:  RALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCK

Query:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI
        GG+LD+A   I+ + E   + P+LP+YN FL ACA+ +S  H S+CL+LMD R VGKN  TY  LLKLAV Q+NLS+V++IW  +V +Y+  +LSLR FI
Subjt:  GGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDFVKNYSPSVLSLRNFI

Query:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT
        WS++RLGDLKSAY  LQ MV L       + +  GKL S  L IP+PS+ E     F                          F V D   +   S  + 
Subjt:  WSYSRLGDLKSAYIALQKMVALT------IGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLT

Query:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE
            +     ++VLRWSFNDVIHAC  ++N  LAEQLM QM  L L PSSHT+DGF+R+V    G+  GM +LK+MQ++ LKPYDSTLA V+  CSKAL+
Subjt:  SQNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALE

Query:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH
        +DLAE LL+ IS C Y +PFN  L+A D++DQ ERA+R+L +MK++K+ PD++TYELL+SLFGNVNAPYE+GN LSQVD  +RI  IE+DM ++G QHS 
Subjt:  LDLAEALLELISACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSH

Query:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM
        +S  N+L+ALGAEGM  E++++L  AENL  ++N +LGTP YN VLH L+E+ +  M I +F  MK  G   D AT+ +MIDCCS++   KSA AL+SMM
Subjt:  LSMKNLLKALGAEGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMM

Query:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME
        +R GF P+ +T+T+L+KI+L    F++ALNLLDQA  E I LDV+  NTIL+KA EKG IDVIE++VE+M+REK+ PDP+TCH VFS YV  GYH+TA+E
Subjt:  VRTGFCPQILTYTSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME

Query:  ALQVLSMRMLCEDDDTS--PNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY
        AL VLS+RML E+D  S      E  ENFV++ED E ++ I+E F+ SE++L+ AL NLRW AMLG  +  S  QSPWA  L+N Y
Subjt:  ALQVLSMRMLCEDDDTS--PNLTEYVENFVLAEDSEVDSHILEFFKCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSY

AT4G28010.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-1424.3Show/hide
Query:  AEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLEL-ISACPYPHP----FNAFLSACD
        A QL+  M E   +P++ T++  +  +  +   +D ++I+++M++R+ +P + T   +        +LD A  LL L +    Y  P    +NA +    
Subjt:  AEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLEL-ISACPYPHP----FNAFLSACD

Query:  AMDQAERAMRML-VKMKQMKVLPDVKTYELLYSLF--GNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHV
          ++  +A+ +  + ++++     V T  LL S    G+VN   E   ++S     R                ++ +M +     G   + K LL  + V
Subjt:  AMDQAERAMRML-VKMKQMKVLPDVKTYELLYSLF--GNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQYLHV

Query:  AENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKF
        +E              YN +L  L +   +  A  LF  M+    FPD  +F +MID     G +KSA +LL  M R G  P + TY+ L+   L     
Subjt:  AENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKF

Query:  DDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPS-TC
        D+A++  D+    G E D  I +++L+    +G  D +  LV+++  + I  D   TC
Subjt:  DDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPS-TC

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein3.1e-1518.5Show/hide
Query:  WSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELI---S
        +++N +++      N   A Q + ++ E  L P   T+   +      +      K+   M  +  +  +     +      A  +D A  L   +    
Subjt:  WSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELI---S

Query:  ACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGA
          P    +   + +    ++   A+ ++ +M++  + P++ TY +L              + L       + R +   M + G+  + ++   L+     
Subjt:  ACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGA

Query:  EGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTY
         GM ++ +  + + E+     NT      YN ++    +S ++H A+ + N M      PD  T+  +ID     G   SA+ LLS+M   G  P   TY
Subjt:  EGMTKELLQYLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTY

Query:  TSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSV
        TS++  +   ++ ++A +L D    +G+  +VV+   ++   C+ G++D    ++E+M  +   P+  T +++
Subjt:  TSLVKIVLGFEKFDDALNLLDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAGGGCTAGGTATCGTCTTAGATCAATAGCTGACTCGCTATACAGATTCAAGCCACATGAACATGGGCGAAGACAGGATGCAAGCAAATTGTTTTTACATCGGGC
TCTTCTCATCTCCAAAGGTAGTGAATTTTTGGGAGGAGCAGAGTCAACTAAGTTCATGCAGATGCAGATTGTTGATGCACTTCGTCTGGGTGATAGAAGTAGTGCTTCCA
ACCTGCTTATGGAACTTGGCCAGGAAAAGCACTCTTTAACTGCAGATAATTTTGTTCACATTTTGAGCTACTGTGCAAGATCACCTGATCCATTGTTTGTCATGGAGACT
TGGAAAATAATGGAAGAAAGAGGAATTTTCCTAAATAACACATGCTCCTTACTCATGATAGAAGCACTCTGTAAAGGGGGTTACTTGGATGAGGCATTTGGTCTAATAAA
TCTCCTAGCAGAAAGTCATGTCATGTTTCCTGTTCTGCCTGTGTACAATTGTTTCTTGAGAGCCTGTGCCAAAAGGCAGAGTACGGTTCATGTTAGTCAGTGCTTGGATC
TGATGGATCATAGAATGGTTGGGAAGAATGAAGCTACATATTGTGAGCTACTCAAGCTTGCAGTTTGTCAGAAAAACTTGTCTTCCGTGCATGAGATCTGGACAGACTTT
GTAAAAAATTACAGCCCAAGTGTTTTATCTCTGAGAAACTTTATATGGTCTTATTCAAGGCTGGGAGACCTAAAATCTGCATATATTGCATTGCAAAAGATGGTGGCTTT
GACCATTGGAGCTGCTGGAGGAAAGTTACCCTCTTTAGAATTGGACATTCCTATACCTTCAAGAACTGAATTCTATCGTAACAATTTTAATTTTGAGGAGAATGGACATT
CTACTGATGAGTTATACTGTAAGAAAATGGTCCCCTACAATGGTGACATAGGGAAATTTTCTGTTAATGATATGAAGTGTGAAGAAGTTGAAAGTGGTCCATTAACTTCG
CAGAACAATTACAAAAGCAGTTATGTCATGAAGGTTTTGAGATGGTCTTTCAATGACGTGATACACGCATGTGCACTTACTAGGAACTGTGGTCTTGCAGAGCAGTTAAT
GCAACAGATGCATGAACTCAGATTGCAACCTTCAAGCCACACATTTGACGGTTTTGTTAGATCAGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATGAAAATTTTAAAAA
TAATGCAAGAGAGGAAATTGAAGCCATATGATTCAACTCTTGCTGCTGTCTCCATAAGTTGCAGCAAAGCACTAGAACTTGATTTAGCTGAAGCTCTACTAGAACTAATT
TCTGCTTGTCCTTACCCACACCCCTTCAATGCATTTCTTTCAGCATGTGACGCGATGGATCAGGCTGAACGCGCTATGCGCATGTTGGTTAAAATGAAACAAATGAAGGT
GCTTCCAGATGTCAAGACATATGAACTTTTATATTCTTTATTTGGTAACGTGAATGCTCCATATGAGGATGGAAACAGATTGTCACAGGTGGATGCTGCTAGAAGGATAC
GCGTGATAGAGCTGGATATGACCAAACATGGGATCCAACACAGTCATTTATCTATGAAGAACTTGTTGAAAGCCCTAGGTGCAGAGGGGATGACAAAGGAGCTCCTTCAG
TATTTACATGTGGCAGAAAACCTCTTCTTTTACAATAACACTCATCTGGGAACGCCTATTTACAACACAGTGCTGCATTTTTTAGTTGAATCCAAGGATATTCACATGGC
CATAGAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGATGCTGCCACATTTGAGATGATGATTGACTGTTGTAGTGTTATGGGATGCTTGAAATCAGCTTTTG
CCCTTCTTTCCATGATGGTCCGCACAGGGTTTTGTCCACAGATATTAACTTATACAAGTCTAGTAAAGATTGTGTTGGGGTTTGAGAAATTTGATGATGCCTTGAATCTT
TTGGATCAAGCTAATTCAGAAGGGATTGAACTTGATGTAGTTATAATGAATACCATCTTGCAGAAAGCTTGTGAAAAGGGAAGGATTGATGTCATCGAGTTCCTCGTTGA
AAGGATGAATCGCGAAAAGATCCAACCCGACCCTTCAACGTGTCACAGTGTCTTCTCTGCATATGTGAACCTTGGCTATCACAGCACTGCCATGGAAGCACTGCAAGTAC
TGAGTATGCGTATGTTATGCGAAGACGACGACACCTCTCCAAACTTGACAGAATATGTCGAAAACTTTGTTCTTGCAGAGGACTCCGAAGTTGATTCACACATTTTGGAA
TTCTTCAAATGCTCTGAAGATAACCTAAGTTTTGCCCTCTTCAACTTGAGATGGTCTGCCATGCTGGGATATTCACTTTGTTCTTCCCCTTATCAGAGCCCATGGGCAAC
GAGACTTGCAAATTCCTATGATGGCTACAGAACCTCATAG
mRNA sequenceShow/hide mRNA sequence
ATTCAGTTTTATAAAAGAAGAAAAAAAGAAGATTTAGGATGGGTTTTATTTCTATTGACCAACAGGACTGTGGACAATATCATACCAAGTATAGATATGTGAGGGTCGTT
TATCCCCAACAAGGACTATTATGAGTTGGTGAGATTACTATTTTGTCCTTAAAAAAGCCCTAGGGCTATGGTCCCTCTTCCTTCTTTCTCTCCGGCAGCACGCACAAAAT
AACAACTCCCTCTGCGAAGGCAACGGCGACGACGACGACGACGGTGACCGCCTCCAAGCCGACGGTACTGCAGTTGAAGTATTCGGCTGAAGAGCAAGAATCAAACTCAT
GGCTTTCTTCATATTGCATCCGGAATTTCGTGTTTACTGAGCTCTCGTTGCCGTCTTTGCTTGGAAGACCATGCACAGGGCTAGGTATCGTCTTAGATCAATAGCTGACT
CGCTATACAGATTCAAGCCACATGAACATGGGCGAAGACAGGATGCAAGCAAATTGTTTTTACATCGGGCTCTTCTCATCTCCAAAGGTAGTGAATTTTTGGGAGGAGCA
GAGTCAACTAAGTTCATGCAGATGCAGATTGTTGATGCACTTCGTCTGGGTGATAGAAGTAGTGCTTCCAACCTGCTTATGGAACTTGGCCAGGAAAAGCACTCTTTAAC
TGCAGATAATTTTGTTCACATTTTGAGCTACTGTGCAAGATCACCTGATCCATTGTTTGTCATGGAGACTTGGAAAATAATGGAAGAAAGAGGAATTTTCCTAAATAACA
CATGCTCCTTACTCATGATAGAAGCACTCTGTAAAGGGGGTTACTTGGATGAGGCATTTGGTCTAATAAATCTCCTAGCAGAAAGTCATGTCATGTTTCCTGTTCTGCCT
GTGTACAATTGTTTCTTGAGAGCCTGTGCCAAAAGGCAGAGTACGGTTCATGTTAGTCAGTGCTTGGATCTGATGGATCATAGAATGGTTGGGAAGAATGAAGCTACATA
TTGTGAGCTACTCAAGCTTGCAGTTTGTCAGAAAAACTTGTCTTCCGTGCATGAGATCTGGACAGACTTTGTAAAAAATTACAGCCCAAGTGTTTTATCTCTGAGAAACT
TTATATGGTCTTATTCAAGGCTGGGAGACCTAAAATCTGCATATATTGCATTGCAAAAGATGGTGGCTTTGACCATTGGAGCTGCTGGAGGAAAGTTACCCTCTTTAGAA
TTGGACATTCCTATACCTTCAAGAACTGAATTCTATCGTAACAATTTTAATTTTGAGGAGAATGGACATTCTACTGATGAGTTATACTGTAAGAAAATGGTCCCCTACAA
TGGTGACATAGGGAAATTTTCTGTTAATGATATGAAGTGTGAAGAAGTTGAAAGTGGTCCATTAACTTCGCAGAACAATTACAAAAGCAGTTATGTCATGAAGGTTTTGA
GATGGTCTTTCAATGACGTGATACACGCATGTGCACTTACTAGGAACTGTGGTCTTGCAGAGCAGTTAATGCAACAGATGCATGAACTCAGATTGCAACCTTCAAGCCAC
ACATTTGACGGTTTTGTTAGATCAGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATGAAAATTTTAAAAATAATGCAAGAGAGGAAATTGAAGCCATATGATTCAACTCT
TGCTGCTGTCTCCATAAGTTGCAGCAAAGCACTAGAACTTGATTTAGCTGAAGCTCTACTAGAACTAATTTCTGCTTGTCCTTACCCACACCCCTTCAATGCATTTCTTT
CAGCATGTGACGCGATGGATCAGGCTGAACGCGCTATGCGCATGTTGGTTAAAATGAAACAAATGAAGGTGCTTCCAGATGTCAAGACATATGAACTTTTATATTCTTTA
TTTGGTAACGTGAATGCTCCATATGAGGATGGAAACAGATTGTCACAGGTGGATGCTGCTAGAAGGATACGCGTGATAGAGCTGGATATGACCAAACATGGGATCCAACA
CAGTCATTTATCTATGAAGAACTTGTTGAAAGCCCTAGGTGCAGAGGGGATGACAAAGGAGCTCCTTCAGTATTTACATGTGGCAGAAAACCTCTTCTTTTACAATAACA
CTCATCTGGGAACGCCTATTTACAACACAGTGCTGCATTTTTTAGTTGAATCCAAGGATATTCACATGGCCATAGAATTATTCAATAATATGAAGCATTCTGGTTTCTTT
CCAGATGCTGCCACATTTGAGATGATGATTGACTGTTGTAGTGTTATGGGATGCTTGAAATCAGCTTTTGCCCTTCTTTCCATGATGGTCCGCACAGGGTTTTGTCCACA
GATATTAACTTATACAAGTCTAGTAAAGATTGTGTTGGGGTTTGAGAAATTTGATGATGCCTTGAATCTTTTGGATCAAGCTAATTCAGAAGGGATTGAACTTGATGTAG
TTATAATGAATACCATCTTGCAGAAAGCTTGTGAAAAGGGAAGGATTGATGTCATCGAGTTCCTCGTTGAAAGGATGAATCGCGAAAAGATCCAACCCGACCCTTCAACG
TGTCACAGTGTCTTCTCTGCATATGTGAACCTTGGCTATCACAGCACTGCCATGGAAGCACTGCAAGTACTGAGTATGCGTATGTTATGCGAAGACGACGACACCTCTCC
AAACTTGACAGAATATGTCGAAAACTTTGTTCTTGCAGAGGACTCCGAAGTTGATTCACACATTTTGGAATTCTTCAAATGCTCTGAAGATAACCTAAGTTTTGCCCTCT
TCAACTTGAGATGGTCTGCCATGCTGGGATATTCACTTTGTTCTTCCCCTTATCAGAGCCCATGGGCAACGAGACTTGCAAATTCCTATGATGGCTACAGAACCTCATAG
ATGAAATCATGTTCTACTCTACCTAAGTTAATTGCCCGAATTTTTTTCAATGTATCGGGTTTCGTTCAAATTGTTGCTCGAATGGCTTGAAAAGATCACGACGTGATTCA
ACATGGAAGCAAAACAAGAGGTTTTAAGTTCAAATTGTTGTTCTATGAACTATTCATAGGAGAATGTCAAATATGAATACAACAATTATTTAATGTAGTCTCATAATATA
AATTGAAGCTTATTTCACAGGGTTTGCCGAT
Protein sequenceShow/hide protein sequence
MHRARYRLRSIADSLYRFKPHEHGRRQDASKLFLHRALLISKGSEFLGGAESTKFMQMQIVDALRLGDRSSASNLLMELGQEKHSLTADNFVHILSYCARSPDPLFVMET
WKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINLLAESHVMFPVLPVYNCFLRACAKRQSTVHVSQCLDLMDHRMVGKNEATYCELLKLAVCQKNLSSVHEIWTDF
VKNYSPSVLSLRNFIWSYSRLGDLKSAYIALQKMVALTIGAAGGKLPSLELDIPIPSRTEFYRNNFNFEENGHSTDELYCKKMVPYNGDIGKFSVNDMKCEEVESGPLTS
QNNYKSSYVMKVLRWSFNDVIHACALTRNCGLAEQLMQQMHELRLQPSSHTFDGFVRSVVSERGFSDGMKILKIMQERKLKPYDSTLAAVSISCSKALELDLAEALLELI
SACPYPHPFNAFLSACDAMDQAERAMRMLVKMKQMKVLPDVKTYELLYSLFGNVNAPYEDGNRLSQVDAARRIRVIELDMTKHGIQHSHLSMKNLLKALGAEGMTKELLQ
YLHVAENLFFYNNTHLGTPIYNTVLHFLVESKDIHMAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSMMVRTGFCPQILTYTSLVKIVLGFEKFDDALNL
LDQANSEGIELDVVIMNTILQKACEKGRIDVIEFLVERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLCEDDDTSPNLTEYVENFVLAEDSEVDSHILE
FFKCSEDNLSFALFNLRWSAMLGYSLCSSPYQSPWATRLANSYDGYRTS