; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g36370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g36370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr4:27307915..27310059
RNA-Seq ExpressionMoc04g36370
SyntenyMoc04g36370
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602498.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0084.13Show/hide
Query:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR
        MEVPL  YQNYV+D LQ +S SSS+SY    F+ S+LFR RSLLS Y+LWSNRRKLR+SFCW+KCSSLEQGLRPR +P+PSK+D DVRKGT  ++ TRI 
Subjt:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR

Query:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM
        KS V IC  IEKLVLC K+RDALEMFEI ELEGGYD+GNST+DALINACIGLKSIRG KRLC YMIDNG EPDQY+ NRILLMHV+CGMMIDA +LFDEM
Subjt:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE     PRTFA +IRA AGLELIFPGRQLHSCA+KAGVGQDIFVSCALIDMYSKCG LEDAHCVFDE
Subjt:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD
        MPDKTIVGWNSIIAGYALHGYSEEAL+L Y+MRDSG+K+DHFTFSIIIRICSRLASV RAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARHIFD
Subjt:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD

Query:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL
        RMS KN+ISWNALIAGYGNHGRGEEAI++FERMLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHKIK RAMH+ CMIELLGREGLLDEAYAL
Subjt:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL

Query:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH
        IR APF+PTANMWAALLRACRVHENLELGK  AE LYGMEP+KL NYIVLLNIY SSGKLKEAADVVQTLKRKGL M+PACSWIEVK+QPH+F SGDK H
Subjt:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH

Query:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH
         EIEKVVEKVDE+ML+ISKLGYV E+N LLPDVD  EEKI +YHSEKLAIAYGL++TL  TPLQIVQ HR+CGDCHS IKLIA+IT+REIVVRDASRFHH
Subjt:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        FRDG CSCGDYW
Subjt:  FRDGSCSCGDYW

XP_004148701.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus]0.0e+0085.36Show/hide
Query:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGL--RPRPEPRPSKIDHDVRKGTSSNET
        M ME+PL RYQNYVYDRLQC+ST    S+  +R++DS LF K S L      SN RK RNSFCWIKCSS EQGL  RPRP+P+PSK+D   RK T   E 
Subjt:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGL--RPRPEPRPSKIDHDVRKGTSSNET

Query:  TRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRL
        T ++KS VGICSQIEKLVLCKKYRDALEMFEIFELE G+ +G STYDALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+NR+LLMHVKCGMMIDACRL
Subjt:  TRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRL

Query:  FDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHC
        FDEMP RNAVSW TIISGYVDSGNY+EAFRLFI+M EE  D GPRTFA MIRASAGLE+IFPGRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHC
Subjt:  FDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHC

Query:  VFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDAR
        VFDEMPDKTIVGWNSIIAGYALHGYSEEALDL +EMRDSG+KMDHFTFSIIIRICSRLASVARAKQVHA LVRNGFGLDVVANTALVDFYSKWGK+DDAR
Subjt:  VFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDAR

Query:  HIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDE
        H+FDRMS +NIISWNALIAGYGNHG GEEAI MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHK+KPRAMHFACMIELLGREGLLDE
Subjt:  HIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDE

Query:  AYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSG
        AYALIR APF+PTANMWAALLRACRVH NLELGK AAE LYGMEP+KLSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRM+PACSWIEV NQPH+FLSG
Subjt:  AYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSG

Query:  DKHHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDA
        DKHH +IEKVV KVDE+ML ISKLGYV  EQNF+LPDVDE EEKI MYHSEKLAIAYGLL+TL+KTPLQIVQSHRIC DCHS IKLIA+IT+REIV+RDA
Subjt:  DKHHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDA

Query:  SRFHHFRDGSCSCGDYW
        SRFHHFRDGSCSCGDYW
Subjt:  SRFHHFRDGSCSCGDYW

XP_008459324.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo]0.0e+0084.76Show/hide
Query:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR
        M ME+PL RYQNYVYDRLQC ST     Y  +R++DS LF K S L      SNRRK RNSFCW+KCSS EQGLRPRP+P+PSK+D  VRK     ET  
Subjt:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR

Query:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD
        +RKS VGICSQIEKLVLCK+YRDALEMFEIFELE G+ +GNSTYDALINACIGLKSIRGVKRL NYM+DNGFEPDQYM+NR+LLMHVKCGMMIDACRLFD
Subjt:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSWSTIISGYVDSGNY+EAFRLFI+MWEE    GPRT A MIRASAGLE+IF GRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI
        DEMPDKTIVGWNSIIAGYALHGYSEEALDL +EM  SG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI

Query:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY
        FDRMS +N+ISWNALIAGYGNHGRGEEAI MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHK++PRAMHFACMIELLGREGLLDEAY
Subjt:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY

Query:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK
        ALIR APF+PTANMWAALLRACRVH NLELGK AAE LYGMEP+KLSNYIVLLNIYN+SGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+FLSGDK
Subjt:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK

Query:  HHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASR
        HH ++EKVV KVDE+MLKISKLGYV  EQNF+LPDVDE EEKI MYHSEKLAIAYGLL+TL++TPLQIVQSHRIC DCHS IKLIA+IT+REIV+RDASR
Subjt:  HHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASR

Query:  FHHFRDGSCSCGDYW
        FHHFRDG+CSCGDYW
Subjt:  FHHFRDGSCSCGDYW

XP_022133879.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia]0.0e+00100Show/hide
Query:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR
        MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR
Subjt:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR

Query:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD
        IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD
Subjt:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI
        DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI

Query:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY
        FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY
Subjt:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY

Query:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK
        ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK
Subjt:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK

Query:  HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRF
        HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRF
Subjt:  HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDGSCSCGDYW
Subjt:  HHFRDGSCSCGDYW

XP_022989822.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucurbita maxima]0.0e+0084.55Show/hide
Query:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR
        MEVPL  YQNYV+D L+ +S SSS+SY    F+ S+LFR RSLLS Y+LWSNRRKLRNSFCW+KCSSLEQGLRPR +P+PSK+D DVRKGT S E TRI 
Subjt:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR

Query:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM
        KS V IC  IEKLVLC K+RDALEMFEI ELEGGYD+GNST+DALI ACIGLKSIRG KRLC YMIDNG EPDQY+ NRILLMHV+CGMMIDA +LFDEM
Subjt:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE     PRTFA +IRASAGLELIFPG+QLHSCA+KAGVGQDIFVSCALIDMYSKCG LEDAHCVFDE
Subjt:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD
        MPDKTIVGWNSIIAGYALHG+SEEAL+L ++MRDSG+K+DHFTFSIIIRICSRLASV RAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARHIFD
Subjt:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD

Query:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL
        RMS KN+ISWNALIAGYGNHGRGEEAI++FERMLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHKIK RAMH+ CMIELLGREGLLDEAYAL
Subjt:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL

Query:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH
        IR APF+PTANMWAALLRACRVHENLELGK AAE LYGMEP+KL NYIVLLNIY SSGKLKEAADVV+TLKRKGL M+PACSWIEVK+QPH+FLSGDKHH
Subjt:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH

Query:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH
         EIEKVVEKVDE+ML+ISKLGYV EQN LLPDVD  EEKI +YHSEKLAIAYGL++TLK+TPLQIVQ HR+CGDCHS IKLIA+IT+REIVVRDASRFHH
Subjt:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        FRDG CSCGDYW
Subjt:  FRDGSCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KXD9 DYW_deaminase domain-containing protein0.0e+0085.36Show/hide
Query:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGL--RPRPEPRPSKIDHDVRKGTSSNET
        M ME+PL RYQNYVYDRLQC+ST    S+  +R++DS LF K S L      SN RK RNSFCWIKCSS EQGL  RPRP+P+PSK+D   RK T   E 
Subjt:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGL--RPRPEPRPSKIDHDVRKGTSSNET

Query:  TRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRL
        T ++KS VGICSQIEKLVLCKKYRDALEMFEIFELE G+ +G STYDALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+NR+LLMHVKCGMMIDACRL
Subjt:  TRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRL

Query:  FDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHC
        FDEMP RNAVSW TIISGYVDSGNY+EAFRLFI+M EE  D GPRTFA MIRASAGLE+IFPGRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHC
Subjt:  FDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHC

Query:  VFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDAR
        VFDEMPDKTIVGWNSIIAGYALHGYSEEALDL +EMRDSG+KMDHFTFSIIIRICSRLASVARAKQVHA LVRNGFGLDVVANTALVDFYSKWGK+DDAR
Subjt:  VFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDAR

Query:  HIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDE
        H+FDRMS +NIISWNALIAGYGNHG GEEAI MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHK+KPRAMHFACMIELLGREGLLDE
Subjt:  HIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDE

Query:  AYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSG
        AYALIR APF+PTANMWAALLRACRVH NLELGK AAE LYGMEP+KLSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRM+PACSWIEV NQPH+FLSG
Subjt:  AYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSG

Query:  DKHHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDA
        DKHH +IEKVV KVDE+ML ISKLGYV  EQNF+LPDVDE EEKI MYHSEKLAIAYGLL+TL+KTPLQIVQSHRIC DCHS IKLIA+IT+REIV+RDA
Subjt:  DKHHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDA

Query:  SRFHHFRDGSCSCGDYW
        SRFHHFRDGSCSCGDYW
Subjt:  SRFHHFRDGSCSCGDYW

A0A1S3C9W7 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0084.76Show/hide
Query:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR
        M ME+PL RYQNYVYDRLQC ST     Y  +R++DS LF K S L      SNRRK RNSFCW+KCSS EQGLRPRP+P+PSK+D  VRK     ET  
Subjt:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR

Query:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD
        +RKS VGICSQIEKLVLCK+YRDALEMFEIFELE G+ +GNSTYDALINACIGLKSIRGVKRL NYM+DNGFEPDQYM+NR+LLMHVKCGMMIDACRLFD
Subjt:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSWSTIISGYVDSGNY+EAFRLFI+MWEE    GPRT A MIRASAGLE+IF GRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI
        DEMPDKTIVGWNSIIAGYALHGYSEEALDL +EM  SG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI

Query:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY
        FDRMS +N+ISWNALIAGYGNHGRGEEAI MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHK++PRAMHFACMIELLGREGLLDEAY
Subjt:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY

Query:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK
        ALIR APF+PTANMWAALLRACRVH NLELGK AAE LYGMEP+KLSNYIVLLNIYN+SGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+FLSGDK
Subjt:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK

Query:  HHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASR
        HH ++EKVV KVDE+MLKISKLGYV  EQNF+LPDVDE EEKI MYHSEKLAIAYGLL+TL++TPLQIVQSHRIC DCHS IKLIA+IT+REIV+RDASR
Subjt:  HHAEIEKVVEKVDEIMLKISKLGYV-AEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASR

Query:  FHHFRDGSCSCGDYW
        FHHFRDG+CSCGDYW
Subjt:  FHHFRDGSCSCGDYW

A0A6J1BWH3 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+00100Show/hide
Query:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR
        MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR
Subjt:  MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTR

Query:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD
        IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD
Subjt:  IRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI
        DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHI

Query:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY
        FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY
Subjt:  FDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAY

Query:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK
        ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK
Subjt:  ALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK

Query:  HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRF
        HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRF
Subjt:  HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDGSCSCGDYW
Subjt:  HHFRDGSCSCGDYW

A0A6J1GSZ5 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0083.99Show/hide
Query:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR
        MEVPL  YQNYV+D LQ +S SSS+SY    F+ S+LFR RSLLS Y+LWSN RKLRNSFCW+KCSSLEQGLRPR +P+PSK++ DVRKGT  ++ TRI 
Subjt:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR

Query:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM
        KS V IC  IEKLVLC K+RDALEMFEI ELEGGYD+GNST+DALINACIGLKSIRG KRLC YMIDNG EPDQY+ NRILLMHV+CGMMIDA +LFDEM
Subjt:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE     PRTFA +IRASAGLELIFPGRQLHSCA+KAGVGQDIFVSCALIDMYSKCG LEDAHCVFDE
Subjt:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD
        MPDKTIVGWNSIIAGYALHGYSEEAL+L Y+MRDSG+K+DHFTFSIIIRICSRLASV RAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARHIFD
Subjt:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD

Query:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL
        RMS KN+ISWNALIAGYGNHGRGEEAI++FERMLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHK+K RAMH+ CMIELLGREGLLDEAYAL
Subjt:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL

Query:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH
        IR APF+PTANMWAALLRACRVHENLELGK  AE LYGMEP+KL NYIVLLNIY SSGKLKEAADVVQTLKRKGL M+PACSWIEVK+QPH+F SGDK H
Subjt:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH

Query:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH
         EIEKVVEKVDE+ML+ISKLGYV E+N LLPDVD  EEKI +YHSEKLAIAYGL++TL  TPLQIVQ HR+CGDCHS IKLIA+IT+REIVVRDASRFHH
Subjt:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        FRDG CSCGDYW
Subjt:  FRDGSCSCGDYW

A0A6J1JGW0 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0084.55Show/hide
Query:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR
        MEVPL  YQNYV+D L+ +S SSS+SY    F+ S+LFR RSLLS Y+LWSNRRKLRNSFCW+KCSSLEQGLRPR +P+PSK+D DVRKGT S E TRI 
Subjt:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIR

Query:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM
        KS V IC  IEKLVLC K+RDALEMFEI ELEGGYD+GNST+DALI ACIGLKSIRG KRLC YMIDNG EPDQY+ NRILLMHV+CGMMIDA +LFDEM
Subjt:  KSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE     PRTFA +IRASAGLELIFPG+QLHSCA+KAGVGQDIFVSCALIDMYSKCG LEDAHCVFDE
Subjt:  PERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD
        MPDKTIVGWNSIIAGYALHG+SEEAL+L ++MRDSG+K+DHFTFSIIIRICSRLASV RAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARHIFD
Subjt:  MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFD

Query:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL
        RMS KN+ISWNALIAGYGNHGRGEEAI++FERMLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHKIK RAMH+ CMIELLGREGLLDEAYAL
Subjt:  RMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYAL

Query:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH
        IR APF+PTANMWAALLRACRVHENLELGK AAE LYGMEP+KL NYIVLLNIY SSGKLKEAADVV+TLKRKGL M+PACSWIEVK+QPH+FLSGDKHH
Subjt:  IRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH

Query:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH
         EIEKVVEKVDE+ML+ISKLGYV EQN LLPDVD  EEKI +YHSEKLAIAYGL++TLK+TPLQIVQ HR+CGDCHS IKLIA+IT+REIVVRDASRFHH
Subjt:  AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        FRDG CSCGDYW
Subjt:  FRDGSCSCGDYW

SwissProt top hitse value%identityAlignment
Q9FK33 Pentatricopeptide repeat-containing protein At5g50390, chloroplastic2.9e-25659.19Show/hide
Query:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSS-NETTRI
        ME+PL RYQ+   D ++ SS++      P +F+                    R+ +N F  + CSS+ QGL+P+P+ +P  I  +V++      + T+I
Subjt:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSS-NETTRI

Query:  RKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDE
         KSGV ICSQIEKLVLC ++R+A E+FEI E+   + +G STYDAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFDE
Subjt:  RKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD
        +PERN  S+ +IISG+V+ GNY+EAF LF MMWEE SD    TFA+M+RASAGL  I+ G+QLH CA+K GV  + FVSC LIDMYSKCG +EDA C F+
Subjt:  MPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIF
         MP+KT V WN++IAGYALHGYSEEAL L Y+MRDSG+ +D FT SI+IRI ++LA +   KQ HA L+RNGF  ++VANTALVDFYSKWG++D AR++F
Subjt:  EMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIF

Query:  DRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYA
        D++  KNIISWNAL+ GY NHGRG +A+++FE+M+   + PNHVTFLAVLSAC+ SGL E+GWEIF S++  H IKPRAMH+ACMIELLGR+GLLDEA A
Subjt:  DRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYA

Query:  LIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKH
         IR AP K T NMWAALL ACR+ ENLELG++ AE LYGM P+KL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q HSFLSGD+ 
Subjt:  LIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKH

Query:  HAEIE----KVVEKVDEIMLKISKLGYVAEQNFLLPDVDEK-EEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRD
         +  E    ++ +KVDE+M +IS+ GY  E+  LLPDVDEK EE++  YHSEKLAIAYGL++T +  PLQI Q+HRIC +CH  ++ I+L+T RE+VVRD
Subjt:  HAEIE----KVVEKVDEIMLKISKLGYVAEQNFLLPDVDEK-EEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRD

Query:  ASRFHHFRDGSCSCGDYW
        ASRFHHF++G CSCG YW
Subjt:  ASRFHHFRDGSCSCGDYW

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial3.4e-13538.63Show/hide
Query:  ELEGGY-DIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLF
        +LEG Y       Y+ L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+W+T+ISGY       +A   F
Subjt:  ELEGGY-DIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLF

Query:  IMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL
          M          T + +I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL+L
Subjt:  IMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL

Query:  CYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQ
           M   G +  HF+++ +   CS    + + K VHA ++++G  L   A   L+D Y+K G I DAR IFDR++ ++++SWN+L+  Y  HG G+EA+ 
Subjt:  CYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQ

Query:  MFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLEL
         FE M R G+ PN ++FL+VL+ACS SGL + GW  ++ +  D  I P A H+  +++LLGR G L+ A   I   P +PTA +W ALL ACR+H+N EL
Subjt:  MFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLEL

Query:  GKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNF
        G  AAE+++ ++PD    +++L NIY S G+  +AA V + +K  G++  PACSW+E++N  H F++ D+ H + E++  K +E++ KI +LGYV + + 
Subjt:  GKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNF

Query:  LLPDVDEKEEKIHM-YHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW
        ++  VD++E ++++ YHSEK+A+A+ LL+T   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+DG+CSC DYW
Subjt:  LLPDVDEKEEKIHM-YHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127706.3e-12637.06Show/hide
Query:  YRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD--EMPERNAVSWSTIISGYV
        ++DAL M+   +L       + T+  L+ AC GL  ++  + +   +   GF+ D +++N ++ ++ KC  +  A  +F+   +PER  VSW+ I+S Y 
Subjt:  YRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD--EMPERNAVSWSTIISGYV

Query:  DSGNYIEAFRLFIMMWEECSDSGPRTFAI--MIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIA
         +G  +EA  +F  M +   D  P   A+  ++ A   L+ +  GR +H+  +K G+  +  +  +L  MY+KCG +  A  +FD+M    ++ WN++I+
Subjt:  DSGNYIEAFRLFIMMWEECSDSGPRTFAI--MIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIA

Query:  GYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALI
        GYA +GY+ EA+D+ +EM +  ++ D  + +  I  C+++ S+ +A+ ++  + R+ +  DV  ++AL+D ++K G ++ AR +FDR   ++++ W+A+I
Subjt:  GYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALI

Query:  AGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWA
         GYG HGR  EAI ++  M R G+ PN VTFL +L AC+ SG+   GW  F  +  DHKI P+  H+AC+I+LLGR G LD+AY +I+  P +P   +W 
Subjt:  AGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWA

Query:  ALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIM
        ALL AC+ H ++ELG+ AA+ L+ ++P    +Y+ L N+Y ++      A+V   +K KGL     CSW+EV+ +  +F  GDK H   E++  +V+ I 
Subjt:  ALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIM

Query:  LKISKLGYVAEQNFLLPDV-DEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW
         ++ + G+VA ++  L D+ DE+ E+    HSE++AIAYGL+ST + TPL+I ++ R C +CH+  KLI+ +  REIVVRD +RFHHF+DG CSCGDYW
Subjt:  LKISKLGYVAEQNFLLPDV-DEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233301.3e-12636.5Show/hide
Query:  NSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ + +++ +C  +  +R  + +  +++  G + D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS++TII+GY  SG Y +A R+   M          T + ++   +    +  G+++H   I+ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRM
         +  + WNS++AGY  +G   EAL L  +M  + +K     FS +I  C+ LA++   KQ+H  ++R GFG ++   +ALVD YSK G I  AR IFDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRM

Query:  SHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE M R+G+ PN V F+AVL+ACS  GL +  W  F S+T  + +     H+A + +LLGR G L+EAY  I 
Subjt:  SHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIR

Query:  NAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAE
            +PT ++W+ LL +C VH+NLEL +  AE ++ ++ + +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACSWIE+KN+ H F+SGD+ H  
Subjt:  NAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAE

Query:  IEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMY-HSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHF
        ++K+ E +  +M ++ K GYVA+ + +L DVDE+ ++  ++ HSE+LA+A+G+++T   T +++ ++ RIC DCH  IK I+ IT REI+VRD SRFHHF
Subjt:  IEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMY-HSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHF

Query:  RDGSCSCGDYW
          G+CSCGDYW
Subjt:  RDGSCSCGDYW

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial3.4e-12736.52Show/hide
Query:  VRKGTSSNETTRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK
        +R   SS + T +      +C Q   L    K  D+L+         G    ++TY  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK
Subjt:  VRKGTSSNETTRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK

Query:  CGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYS
          ++ DA +LFD+MP+RN +SW+T+IS Y     + +A  L ++M  +       T++ ++R+  G+  +   R LH   IK G+  D+FV  ALID+++
Subjt:  CGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYS

Query:  KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFY
        K G  EDA  VFDEM     + WNSII G+A +  S+ AL+L   M+ +G   +  T + ++R C+ LA +    Q H  +V+  +  D++ N ALVD Y
Subjt:  KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFY

Query:  SKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIE
         K G ++DA  +F++M  +++I+W+ +I+G   +G  +EA+++FERM   G  PN++T + VL ACS +GL E GW  F+S+   + I P   H+ CMI+
Subjt:  SKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIE

Query:  LLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEV
        LLG+ G LD+A  L+     +P A  W  LL ACRV  N+ L + AA+ +  ++P+    Y +L NIY +S K     ++   ++ +G++  P CSWIEV
Subjt:  LLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEV

Query:  KNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVD-EKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALI
          Q H+F+ GD  H +I +V +K+++++ +++ +GYV E NF+L D++ E+ E    +HSEKLA+A+GL++   +  ++I ++ RICGDCH   KL + +
Subjt:  KNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVD-EKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALI

Query:  TRREIVVRDASRFHHFRDGSCSCGDYW
          R IV+RD  R+HHF+DG CSCGDYW
Subjt:  TRREIVVRDASRFHHFRDGSCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-12836.52Show/hide
Query:  VRKGTSSNETTRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK
        +R   SS + T +      +C Q   L    K  D+L+         G    ++TY  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK
Subjt:  VRKGTSSNETTRIRKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK

Query:  CGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYS
          ++ DA +LFD+MP+RN +SW+T+IS Y     + +A  L ++M  +       T++ ++R+  G+  +   R LH   IK G+  D+FV  ALID+++
Subjt:  CGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYS

Query:  KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFY
        K G  EDA  VFDEM     + WNSII G+A +  S+ AL+L   M+ +G   +  T + ++R C+ LA +    Q H  +V+  +  D++ N ALVD Y
Subjt:  KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFY

Query:  SKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIE
         K G ++DA  +F++M  +++I+W+ +I+G   +G  +EA+++FERM   G  PN++T + VL ACS +GL E GW  F+S+   + I P   H+ CMI+
Subjt:  SKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIE

Query:  LLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEV
        LLG+ G LD+A  L+     +P A  W  LL ACRV  N+ L + AA+ +  ++P+    Y +L NIY +S K     ++   ++ +G++  P CSWIEV
Subjt:  LLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEV

Query:  KNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVD-EKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALI
          Q H+F+ GD  H +I +V +K+++++ +++ +GYV E NF+L D++ E+ E    +HSEKLA+A+GL++   +  ++I ++ RICGDCH   KL + +
Subjt:  KNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVD-EKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALI

Query:  TRREIVVRDASRFHHFRDGSCSCGDYW
          R IV+RD  R+HHF+DG CSCGDYW
Subjt:  TRREIVVRDASRFHHFRDGSCSCGDYW

AT3G12770.1 mitochondrial editing factor 224.5e-12737.06Show/hide
Query:  YRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD--EMPERNAVSWSTIISGYV
        ++DAL M+   +L       + T+  L+ AC GL  ++  + +   +   GF+ D +++N ++ ++ KC  +  A  +F+   +PER  VSW+ I+S Y 
Subjt:  YRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFD--EMPERNAVSWSTIISGYV

Query:  DSGNYIEAFRLFIMMWEECSDSGPRTFAI--MIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIA
         +G  +EA  +F  M +   D  P   A+  ++ A   L+ +  GR +H+  +K G+  +  +  +L  MY+KCG +  A  +FD+M    ++ WN++I+
Subjt:  DSGNYIEAFRLFIMMWEECSDSGPRTFAI--MIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIA

Query:  GYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALI
        GYA +GY+ EA+D+ +EM +  ++ D  + +  I  C+++ S+ +A+ ++  + R+ +  DV  ++AL+D ++K G ++ AR +FDR   ++++ W+A+I
Subjt:  GYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALI

Query:  AGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWA
         GYG HGR  EAI ++  M R G+ PN VTFL +L AC+ SG+   GW  F  +  DHKI P+  H+AC+I+LLGR G LD+AY +I+  P +P   +W 
Subjt:  AGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWA

Query:  ALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIM
        ALL AC+ H ++ELG+ AA+ L+ ++P    +Y+ L N+Y ++      A+V   +K KGL     CSW+EV+ +  +F  GDK H   E++  +V+ I 
Subjt:  ALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIM

Query:  LKISKLGYVAEQNFLLPDV-DEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW
         ++ + G+VA ++  L D+ DE+ E+    HSE++AIAYGL+ST + TPL+I ++ R C +CH+  KLI+ +  REIVVRD +RFHHF+DG CSCGDYW
Subjt:  LKISKLGYVAEQNFLLPDV-DEKEEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.1e-12836.5Show/hide
Query:  NSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ + +++ +C  +  +R  + +  +++  G + D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS++TII+GY  SG Y +A R+   M          T + ++   +    +  G+++H   I+ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRM
         +  + WNS++AGY  +G   EAL L  +M  + +K     FS +I  C+ LA++   KQ+H  ++R GFG ++   +ALVD YSK G I  AR IFDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRM

Query:  SHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE M R+G+ PN V F+AVL+ACS  GL +  W  F S+T  + +     H+A + +LLGR G L+EAY  I 
Subjt:  SHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIR

Query:  NAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAE
            +PT ++W+ LL +C VH+NLEL +  AE ++ ++ + +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACSWIE+KN+ H F+SGD+ H  
Subjt:  NAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAE

Query:  IEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMY-HSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHF
        ++K+ E +  +M ++ K GYVA+ + +L DVDE+ ++  ++ HSE+LA+A+G+++T   T +++ ++ RIC DCH  IK I+ IT REI+VRD SRFHHF
Subjt:  IEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMY-HSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHF

Query:  RDGSCSCGDYW
          G+CSCGDYW
Subjt:  RDGSCSCGDYW

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.8e-13138.06Show/hide
Query:  ELEGGY-DIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLF
        +LEG Y       Y+ L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+W+T+ISGY       +A   F
Subjt:  ELEGGY-DIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLF

Query:  IMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL
          M          T + +I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL+L
Subjt:  IMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL

Query:  CYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQ
           M   G +  HF+++ +   CS    + + K VHA ++++G  L   A   L+D Y+K G I DAR IFDR++ ++++SWN+L+  Y  HG G+EA+ 
Subjt:  CYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQ

Query:  MFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLEL
         FE M R G+ PN ++FL+VL+ACS SGL + GW  ++ +  D  I P A H+  +++LLGR G L+ A   I   P +PTA +W ALL ACR+H+N EL
Subjt:  MFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLEL

Query:  GKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNF
        G  AAE+++ ++PD    +++L NIY S G+  +AA V + +K  G++  PACSW+E++N  H F++ D+ H + E++  K +E++ KI +LGYV + + 
Subjt:  GKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNF

Query:  LLPDVDEKEEKIHM-YHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGS
        ++  VD++E ++++ YHSEK+A+A+ LL+T   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+D S
Subjt:  LLPDVDEKEEKIHM-YHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGS

AT5G50390.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.1e-25759.19Show/hide
Query:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSS-NETTRI
        ME+PL RYQ+   D ++ SS++      P +F+                    R+ +N F  + CSS+ QGL+P+P+ +P  I  +V++      + T+I
Subjt:  MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSS-NETTRI

Query:  RKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDE
         KSGV ICSQIEKLVLC ++R+A E+FEI E+   + +G STYDAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFDE
Subjt:  RKSGVGICSQIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD
        +PERN  S+ +IISG+V+ GNY+EAF LF MMWEE SD    TFA+M+RASAGL  I+ G+QLH CA+K GV  + FVSC LIDMYSKCG +EDA C F+
Subjt:  MPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIF
         MP+KT V WN++IAGYALHGYSEEAL L Y+MRDSG+ +D FT SI+IRI ++LA +   KQ HA L+RNGF  ++VANTALVDFYSKWG++D AR++F
Subjt:  EMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIF

Query:  DRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYA
        D++  KNIISWNAL+ GY NHGRG +A+++FE+M+   + PNHVTFLAVLSAC+ SGL E+GWEIF S++  H IKPRAMH+ACMIELLGR+GLLDEA A
Subjt:  DRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYA

Query:  LIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKH
         IR AP K T NMWAALL ACR+ ENLELG++ AE LYGM P+KL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q HSFLSGD+ 
Subjt:  LIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKH

Query:  HAEIE----KVVEKVDEIMLKISKLGYVAEQNFLLPDVDEK-EEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRD
         +  E    ++ +KVDE+M +IS+ GY  E+  LLPDVDEK EE++  YHSEKLAIAYGL++T +  PLQI Q+HRIC +CH  ++ I+L+T RE+VVRD
Subjt:  HAEIE----KVVEKVDEIMLKISKLGYVAEQNFLLPDVDEK-EEKIHMYHSEKLAIAYGLLSTLKKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRD

Query:  ASRFHHFRDGSCSCGDYW
        ASRFHHF++G CSCG YW
Subjt:  ASRFHHFRDGSCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATGGAAGTCCCCCTCCCGCGCTATCAAAACTATGTTTATGATCGGCTTCAATGTAGCTCCACCTCTAGCTCTAGTTCCTACCTCCCCGTTCGTTTTACGGATTC
GAAGCTTTTTAGGAAGAGATCTTTGCTTTCTGAGTATACTTTGTGGTCTAATAGAAGAAAATTGCGTAATTCGTTTTGTTGGATCAAGTGCTCCTCGTTGGAACAAGGAC
TGCGGCCGCGGCCCGAACCTAGACCTTCGAAAATCGATCATGACGTCCGGAAAGGAACGTCTTCGAACGAGACGACCCGTATTAGAAAATCCGGTGTAGGGATCTGTAGT
CAGATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCACTCGAAATGTTTGAAATTTTTGAACTAGAGGGTGGTTATGATATTGGTAACAGCACCTACGATGCGTT
GATTAATGCCTGTATTGGCTTGAAATCTATAAGAGGAGTGAAGAGGTTGTGTAATTACATGATTGATAATGGATTTGAGCCTGATCAATATATGAAGAACAGGATTCTAC
TTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCCGAAAGGAATGCAGTTTCGTGGAGTACTATAATTTCCGGGTACGTAGACTCT
GGAAATTATATAGAAGCGTTTAGATTGTTCATTATGATGTGGGAAGAGTGTTCTGATAGCGGACCTCGCACCTTTGCGATAATGATACGGGCATCAGCTGGTTTGGAACT
TATTTTTCCTGGTAGGCAATTGCATTCATGTGCGATAAAGGCTGGTGTGGGACAGGATATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTG
AAGATGCTCACTGTGTTTTTGATGAGATGCCAGATAAGACAATAGTTGGATGGAATTCAATTATAGCTGGTTATGCCCTCCATGGCTACAGTGAGGAAGCTTTAGATCTA
TGTTATGAGATGCGTGATTCTGGAATTAAAATGGACCATTTCACCTTTTCTATAATCATAAGGATATGTTCGAGATTAGCCTCTGTAGCACGTGCTAAGCAAGTTCATGC
AGGTTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTCGTGGATTTCTATAGCAAATGGGGGAAAATAGATGATGCCAGGCATATTTTTGACAGGA
TGTCCCATAAAAACATAATATCATGGAATGCTTTGATAGCGGGATATGGGAATCACGGTCGGGGTGAGGAGGCCATCCAGATGTTTGAAAGGATGCTTAGGGAAGGCATG
ACGCCAAACCATGTGACATTCCTTGCTGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTCGAACGCGGGTGGGAAATTTTTCAATCGATAACAACAGATCACAAGATTAA
ACCACGTGCTATGCATTTCGCGTGCATGATCGAATTGCTAGGCCGAGAAGGGCTCCTAGACGAAGCCTATGCCCTCATAAGGAATGCTCCATTCAAACCCACAGCAAATA
TGTGGGCTGCCTTGCTTAGAGCTTGCAGAGTTCATGAAAATCTAGAGCTTGGGAAACTTGCTGCTGAAAACCTCTACGGCATGGAACCCGACAAGCTTAGTAATTATATT
GTGCTCTTAAACATATACAACAGTTCTGGCAAATTAAAGGAAGCAGCCGATGTTGTTCAGACTTTGAAGAGGAAAGGCTTAAGAATGGTTCCAGCATGCAGTTGGATTGA
GGTTAAAAACCAACCTCATTCATTCCTCTCTGGGGACAAACATCATGCCGAAATCGAAAAGGTTGTCGAAAAAGTGGACGAAATAATGTTAAAGATCTCAAAGCTTGGTT
ATGTAGCTGAACAGAACTTCTTGCTCCCAGATGTAGATGAAAAGGAAGAAAAGATACACATGTACCATAGTGAGAAATTGGCCATAGCTTATGGACTACTCAGTACTTTA
AAGAAAACGCCATTGCAAATTGTGCAGAGCCATAGAATTTGTGGTGACTGTCATTCTACGATAAAGTTGATTGCTTTGATCACCAGACGTGAAATTGTGGTCAGAGATGC
TAGCAGATTCCATCATTTTCGAGATGGGAGTTGTTCTTGTGGAGACTATTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCATGGAAGTCCCCCTCCCGCGCTATCAAAACTATGTTTATGATCGGCTTCAATGTAGCTCCACCTCTAGCTCTAGTTCCTACCTCCCCGTTCGTTTTACGGATTC
GAAGCTTTTTAGGAAGAGATCTTTGCTTTCTGAGTATACTTTGTGGTCTAATAGAAGAAAATTGCGTAATTCGTTTTGTTGGATCAAGTGCTCCTCGTTGGAACAAGGAC
TGCGGCCGCGGCCCGAACCTAGACCTTCGAAAATCGATCATGACGTCCGGAAAGGAACGTCTTCGAACGAGACGACCCGTATTAGAAAATCCGGTGTAGGGATCTGTAGT
CAGATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCACTCGAAATGTTTGAAATTTTTGAACTAGAGGGTGGTTATGATATTGGTAACAGCACCTACGATGCGTT
GATTAATGCCTGTATTGGCTTGAAATCTATAAGAGGAGTGAAGAGGTTGTGTAATTACATGATTGATAATGGATTTGAGCCTGATCAATATATGAAGAACAGGATTCTAC
TTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCCGAAAGGAATGCAGTTTCGTGGAGTACTATAATTTCCGGGTACGTAGACTCT
GGAAATTATATAGAAGCGTTTAGATTGTTCATTATGATGTGGGAAGAGTGTTCTGATAGCGGACCTCGCACCTTTGCGATAATGATACGGGCATCAGCTGGTTTGGAACT
TATTTTTCCTGGTAGGCAATTGCATTCATGTGCGATAAAGGCTGGTGTGGGACAGGATATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTG
AAGATGCTCACTGTGTTTTTGATGAGATGCCAGATAAGACAATAGTTGGATGGAATTCAATTATAGCTGGTTATGCCCTCCATGGCTACAGTGAGGAAGCTTTAGATCTA
TGTTATGAGATGCGTGATTCTGGAATTAAAATGGACCATTTCACCTTTTCTATAATCATAAGGATATGTTCGAGATTAGCCTCTGTAGCACGTGCTAAGCAAGTTCATGC
AGGTTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTCGTGGATTTCTATAGCAAATGGGGGAAAATAGATGATGCCAGGCATATTTTTGACAGGA
TGTCCCATAAAAACATAATATCATGGAATGCTTTGATAGCGGGATATGGGAATCACGGTCGGGGTGAGGAGGCCATCCAGATGTTTGAAAGGATGCTTAGGGAAGGCATG
ACGCCAAACCATGTGACATTCCTTGCTGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTCGAACGCGGGTGGGAAATTTTTCAATCGATAACAACAGATCACAAGATTAA
ACCACGTGCTATGCATTTCGCGTGCATGATCGAATTGCTAGGCCGAGAAGGGCTCCTAGACGAAGCCTATGCCCTCATAAGGAATGCTCCATTCAAACCCACAGCAAATA
TGTGGGCTGCCTTGCTTAGAGCTTGCAGAGTTCATGAAAATCTAGAGCTTGGGAAACTTGCTGCTGAAAACCTCTACGGCATGGAACCCGACAAGCTTAGTAATTATATT
GTGCTCTTAAACATATACAACAGTTCTGGCAAATTAAAGGAAGCAGCCGATGTTGTTCAGACTTTGAAGAGGAAAGGCTTAAGAATGGTTCCAGCATGCAGTTGGATTGA
GGTTAAAAACCAACCTCATTCATTCCTCTCTGGGGACAAACATCATGCCGAAATCGAAAAGGTTGTCGAAAAAGTGGACGAAATAATGTTAAAGATCTCAAAGCTTGGTT
ATGTAGCTGAACAGAACTTCTTGCTCCCAGATGTAGATGAAAAGGAAGAAAAGATACACATGTACCATAGTGAGAAATTGGCCATAGCTTATGGACTACTCAGTACTTTA
AAGAAAACGCCATTGCAAATTGTGCAGAGCCATAGAATTTGTGGTGACTGTCATTCTACGATAAAGTTGATTGCTTTGATCACCAGACGTGAAATTGTGGTCAGAGATGC
TAGCAGATTCCATCATTTTCGAGATGGGAGTTGTTCTTGTGGAGACTATTGGTGA
Protein sequenceShow/hide protein sequence
MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIRKSGVGICS
QIEKLVLCKKYRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRILLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDS
GNYIEAFRLFIMMWEECSDSGPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL
CYEMRDSGIKMDHFTFSIIIRICSRLASVARAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYGNHGRGEEAIQMFERMLREGM
TPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMHFACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGMEPDKLSNYI
VLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTL
KKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW