; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015212 (gene) of Snake gourd v1 genome

Gene IDTan0015212
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG11:4316816..4319473
RNA-Seq ExpressionTan0015212
SyntenyTan0015212
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148701.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus]0.0e+0087.11Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPK--PKPSKIDPDIRKGTSSKKTHI
        ME+PLSRYQNY+YDRLQC    +STS+FS+R+SDSDLF K S L      SN RK  NSFCW+KCSS EQGL PRP+  PKPSK+D   RK T  K+TH+
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPK--PKPSKIDPDIRKGTSSKKTHI

Query:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE
        +KS VGICSQIEKLVLCKKYRDALEMFE FELE G+ VG ST+DALINACIGLKS+RGVKRLCNYM+D+G EPDQYMRNRVLLMHVKCGMMIDACRLFDE
Subjt:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD
        MP RNAVSW TIISGYVDSGNY EAFRLFI+M EE+ DCGPRTFATMIRASAGLE+IFPGRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFD
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
        EMPDKTIVGWNSIIAGYALHGYSEEAL+L +EMRDSGVKMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
Subjt:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA
        DRMSC+N+ISWNALIAGYGNHG GEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK+KPRAMH+AC+IELLGREGLLDEAYA
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA

Query:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH
        LIRKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAA+V +TLKRKGLRMLPACSWIEVNNQPHAFLSGDKH
Subjt:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH

Query:  HTEIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRF
        H +IEKVV KVDELML ISKLG+VP EQNF+L DVDE+EEKI+MYHSEKLAIAYGL+NTL++TPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRF
Subjt:  HTEIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRF

Query:  HHFRDGNCSCGDYW
        HHFRDG+CSCGDYW
Subjt:  HHFRDGNCSCGDYW

XP_008459324.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo]0.0e+0087.5Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK
        ME+PLSRYQNY+YDRLQC     ST YFS+R+SDS LF K S L      SNRRK  NSFCWVKCSS EQGL PRP+PKPSK+D  +RK    K+T +RK
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK

Query:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
        S VGICSQIEKLVLCK+YRDALEMFE FELE G+ VGNST+DALINACIGLKS+RGVKRL NYM+D+G EPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
Subjt:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSW+TIISGYVDSGNY EAFRLFI+MWEE   CGPRT ATMIRASAGLE+IF GRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIAGYALHGYSEEAL+L +EM  SGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
Subjt:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI
        MSC+NVISWNALIAGYGNHGRGEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK++PRAMH+AC+IELLGREGLLDEAYALI
Subjt:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI

Query:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT
        RKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEPEKLSNYIVLLNIYN+SGKLKEAA+VV+TLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 
Subjt:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT

Query:  EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH
        ++EKVV KVDELMLKISKLG+VP EQNF+L DVDEHEEKI+MYHSEKLAIAYGL+NTL+RTPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRFHH
Subjt:  EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        FRDGNCSCGDYW
Subjt:  FRDGNCSCGDYW

XP_022133879.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia]0.0e+0089.04Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKK-THIR
        MEVPL RYQNY+YDRLQCSSTSSS+SY  VRF+DS LFRKRSLLS Y+LWSNRRK  NSFCW+KCSSLEQGL PRP+P+PSKID D+RKGTSS + T IR
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKK-THIR

Query:  KSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEM
        KSGVGICSQIEKLVLCKKYRDALEMFE FELEGGYD+GNST+DALINACIGLKS+RGVKRLCNYMID+G EPDQYM+NR+LLMHVKCGMMIDACRLFDEM
Subjt:  KSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE SD GPRTFA MIRASAGLELIFPGRQLHSCA+KAGVGQ+IFVSCALIDMYSKCGSLEDAHCVFDE
Subjt:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD
        MPDKTIVGWNSIIAGYALHGYSEEAL+LCYEMRDSG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FD
Subjt:  MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD

Query:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYAL
        RMS KN+ISWNALIAGYGNHGRGEEAI+MFE+MLREGM PNHVTFL+VLSACSISGLFERGWEIFQ++T DHKIKPRAMH+AC+IELLGREGLLDEAYAL
Subjt:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYAL

Query:  IRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH
        IR APF+PTANMWAALLRACRVHENLELGK AAE LYGMEP+KLSNYIVLLNIYNSSGKLKEAA+VV+TLKRKGLRM+PACSWIEV NQPH+FLSGDKHH
Subjt:  IRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH

Query:  TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH
         EIEKVVEKVDE+MLKISKLG+V EQNFLL DVDE EEKI MYHSEKLAIAYGL++TLK+TPLQIVQSHRICGDCHS IKLIA+IT+REIV+RDASRFHH
Subjt:  TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        FRDG+CSCGDYW
Subjt:  FRDGNCSCGDYW

XP_022989822.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucurbita maxima]0.0e+0087.62Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK
        MEVPL  YQNY++D L+ +S SSSTSYFS  FS S+LFR RSLLS YSLWSNRRK  NSFCWVKCSSLEQGL PR KPKPSK+D D+RKGT SK+T I K
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK

Query:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
        S V IC  IEKLVLC K+RDALEMFE  ELEGGYDVGNSTFDALI ACIGLKS+RG KRLC YMID+GIEPDQY+ NR+LLMHV+CGMMIDA +LFDEMP
Subjt:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEEY  C PRTFAT+IRASAGLELIFPG+QLHSCAVKAGVGQ+IFVSCALIDMYSKCG LEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIAGYALHG+SEEALNL ++MRDSGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDR
Subjt:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI
        MSCKN+ISWNALIAGYGNHGRGEEAIE+FE+MLREGM+PNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHKIK RAMHY C+IELLGREGLLDEAYALI
Subjt:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI

Query:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT
        RKAPFQPTANMWAALLRACRVHENLELGK+AAEKLYGMEPEKL NYIVLLNIY SSGKLKEAA+VVRTLKRKGL MLPACSWIEV +QPHAFLSGDKHH 
Subjt:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT

Query:  EIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF
        EIEKVVEKVDELML+ISKLG+VPEQN LL DVD HEEKIQ+YHSEKLAIAYGLINTLK+TPLQIVQ HR+CGDCHSVIKLIAMITKREIV+RDASRFHHF
Subjt:  EIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF

Query:  RDGNCSCGDYW
        RDG CSCGDYW
Subjt:  RDGNCSCGDYW

XP_038890388.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida]0.0e+0086.94Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK
        ME+PLS YQNYLYDR+QC    +STSY S+RFS  DLFR+R  L       NRRK  NS  W+KCSS EQGL PRP+PKPSK+DP + K T  K+TH+ +
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK

Query:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
        S VGICSQIEKLVLCKKYRDALEMFE FELEGG+  GN+T DALINAC+ LKS+RGVK+LCNYM+D+G EPDQYMRNRVLLMHVKCGMMIDACRLFD+MP
Subjt:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSWNTIISG+VDSGNY EAFRLFI+MWEEY DCGPRTFATMIRASAGLELIFPGRQLHSCA+KA +GQ+IFVSCALIDMYSKCGSLEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIAGYALHGYSEEAL+L YEMRDSG+KMDHFTFSIIIRICSRLASVA AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
Subjt:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI
        MSC+N+ISWNALIAGYGNHGRG EAI+MFEKMLREG +PNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHKIKPRAMHYAC+IELLGREGLLDEAYALI
Subjt:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI

Query:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT
        RKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAA+VV+TLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 
Subjt:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT

Query:  EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH
        +IEKVV KVDELMLKISKLG+VP EQNF+L DVDEHEEKIQMYHSEKLAIAYGL+NTL++TPLQIVQSHRIC DCH VIKLIAMITKREIVIRDASRFHH
Subjt:  EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        FRDG+CSCGDYW
Subjt:  FRDGNCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KXD9 DYW_deaminase domain-containing protein0.0e+0087.11Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPK--PKPSKIDPDIRKGTSSKKTHI
        ME+PLSRYQNY+YDRLQC    +STS+FS+R+SDSDLF K S L      SN RK  NSFCW+KCSS EQGL PRP+  PKPSK+D   RK T  K+TH+
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPK--PKPSKIDPDIRKGTSSKKTHI

Query:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE
        +KS VGICSQIEKLVLCKKYRDALEMFE FELE G+ VG ST+DALINACIGLKS+RGVKRLCNYM+D+G EPDQYMRNRVLLMHVKCGMMIDACRLFDE
Subjt:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD
        MP RNAVSW TIISGYVDSGNY EAFRLFI+M EE+ DCGPRTFATMIRASAGLE+IFPGRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFD
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
        EMPDKTIVGWNSIIAGYALHGYSEEAL+L +EMRDSGVKMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
Subjt:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA
        DRMSC+N+ISWNALIAGYGNHG GEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK+KPRAMH+AC+IELLGREGLLDEAYA
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA

Query:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH
        LIRKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAA+V +TLKRKGLRMLPACSWIEVNNQPHAFLSGDKH
Subjt:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH

Query:  HTEIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRF
        H +IEKVV KVDELML ISKLG+VP EQNF+L DVDE+EEKI+MYHSEKLAIAYGL+NTL++TPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRF
Subjt:  HTEIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRF

Query:  HHFRDGNCSCGDYW
        HHFRDG+CSCGDYW
Subjt:  HHFRDGNCSCGDYW

A0A1S3C9W7 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0087.5Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK
        ME+PLSRYQNY+YDRLQC     ST YFS+R+SDS LF K S L      SNRRK  NSFCWVKCSS EQGL PRP+PKPSK+D  +RK    K+T +RK
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK

Query:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
        S VGICSQIEKLVLCK+YRDALEMFE FELE G+ VGNST+DALINACIGLKS+RGVKRL NYM+D+G EPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
Subjt:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSW+TIISGYVDSGNY EAFRLFI+MWEE   CGPRT ATMIRASAGLE+IF GRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIAGYALHGYSEEAL+L +EM  SGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
Subjt:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI
        MSC+NVISWNALIAGYGNHGRGEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK++PRAMH+AC+IELLGREGLLDEAYALI
Subjt:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI

Query:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT
        RKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEPEKLSNYIVLLNIYN+SGKLKEAA+VV+TLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 
Subjt:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT

Query:  EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH
        ++EKVV KVDELMLKISKLG+VP EQNF+L DVDEHEEKI+MYHSEKLAIAYGL+NTL+RTPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRFHH
Subjt:  EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        FRDGNCSCGDYW
Subjt:  FRDGNCSCGDYW

A0A6J1BWH3 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0089.04Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKK-THIR
        MEVPL RYQNY+YDRLQCSSTSSS+SY  VRF+DS LFRKRSLLS Y+LWSNRRK  NSFCW+KCSSLEQGL PRP+P+PSKID D+RKGTSS + T IR
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKK-THIR

Query:  KSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEM
        KSGVGICSQIEKLVLCKKYRDALEMFE FELEGGYD+GNST+DALINACIGLKS+RGVKRLCNYMID+G EPDQYM+NR+LLMHVKCGMMIDACRLFDEM
Subjt:  KSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE SD GPRTFA MIRASAGLELIFPGRQLHSCA+KAGVGQ+IFVSCALIDMYSKCGSLEDAHCVFDE
Subjt:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD
        MPDKTIVGWNSIIAGYALHGYSEEAL+LCYEMRDSG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FD
Subjt:  MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD

Query:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYAL
        RMS KN+ISWNALIAGYGNHGRGEEAI+MFE+MLREGM PNHVTFL+VLSACSISGLFERGWEIFQ++T DHKIKPRAMH+AC+IELLGREGLLDEAYAL
Subjt:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYAL

Query:  IRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH
        IR APF+PTANMWAALLRACRVHENLELGK AAE LYGMEP+KLSNYIVLLNIYNSSGKLKEAA+VV+TLKRKGLRM+PACSWIEV NQPH+FLSGDKHH
Subjt:  IRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH

Query:  TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH
         EIEKVVEKVDE+MLKISKLG+V EQNFLL DVDE EEKI MYHSEKLAIAYGL++TLK+TPLQIVQSHRICGDCHS IKLIA+IT+REIV+RDASRFHH
Subjt:  TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        FRDG+CSCGDYW
Subjt:  FRDGNCSCGDYW

A0A6J1GSZ5 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0086.8Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGT-SSKKTHIR
        MEVPL  YQNY++D LQ +S SSSTSYFS  FS S+LFR RSLLS YSLWSN RK  NSFCWVKCSSLEQGL PR KPKPSK++ D+RKGT  SK+T I 
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGT-SSKKTHIR

Query:  KSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEM
        KS V IC  IEKLVLC K+RDALEMFE  ELEGGYDVGNSTFDALINACIGLKS+RG KRLC YMID+GIEPDQY+ NR+LLMHV+CGMMIDA +LFDEM
Subjt:  KSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEEY  C PRTFAT+IRASAGLELIFPGRQLHSCAVKAGVGQ+IFVSCALIDMYSKCG LEDAHCVFDE
Subjt:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD
        MPDKTIVGWNSIIAGYALHGYSEEALNL Y+MRDSGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FD
Subjt:  MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD

Query:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYAL
        RMSCKN+ISWNALIAGYGNHGRGEEAIE+FE+MLREGM+PNHVTFL+VLSACSISGLFERGWEIFQ++TRDHK+K RAMHY C+IELLGREGLLDEAYAL
Subjt:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYAL

Query:  IRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH
        IRKAPFQPTANMWAALLRACRVHENLELGK+ AEKLYGMEPEKL NYIVLLNIY SSGKLKEAA+VV+TLKRKGL MLPACSWIEV +QPHAF SGDK H
Subjt:  IRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH

Query:  TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH
         EIEKVVEKVDELML+ISKLG+VPE+N LL DVD HEEKIQ+YHSEKLAIAYGLINTL  TPLQIVQ HR+CGDCHSVIKLIAMITKREIV+RDASRFHH
Subjt:  TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        FRDG CSCGDYW
Subjt:  FRDGNCSCGDYW

A0A6J1JGW0 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0087.62Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK
        MEVPL  YQNY++D L+ +S SSSTSYFS  FS S+LFR RSLLS YSLWSNRRK  NSFCWVKCSSLEQGL PR KPKPSK+D D+RKGT SK+T I K
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRK

Query:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
        S V IC  IEKLVLC K+RDALEMFE  ELEGGYDVGNSTFDALI ACIGLKS+RG KRLC YMID+GIEPDQY+ NR+LLMHV+CGMMIDA +LFDEMP
Subjt:  SGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEEY  C PRTFAT+IRASAGLELIFPG+QLHSCAVKAGVGQ+IFVSCALIDMYSKCG LEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIAGYALHG+SEEALNL ++MRDSGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDR
Subjt:  PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI
        MSCKN+ISWNALIAGYGNHGRGEEAIE+FE+MLREGM+PNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHKIK RAMHY C+IELLGREGLLDEAYALI
Subjt:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALI

Query:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT
        RKAPFQPTANMWAALLRACRVHENLELGK+AAEKLYGMEPEKL NYIVLLNIY SSGKLKEAA+VVRTLKRKGL MLPACSWIEV +QPHAFLSGDKHH 
Subjt:  RKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT

Query:  EIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF
        EIEKVVEKVDELML+ISKLG+VPEQN LL DVD HEEKIQ+YHSEKLAIAYGLINTLK+TPLQIVQ HR+CGDCHSVIKLIAMITKREIV+RDASRFHHF
Subjt:  EIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF

Query:  RDGNCSCGDYW
        RDG CSCGDYW
Subjt:  RDGNCSCGDYW

SwissProt top hitse value%identityAlignment
Q9FK33 Pentatricopeptide repeat-containing protein At5g50390, chloroplastic4.2e-25559.75Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSS--KKTHI
        ME+PLSRYQ+   D ++ SS++     F  +FS          L G       R+  N F  + CSS+ QGL P+PK KP  I  ++++        T I
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSS--KKTHI

Query:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE
         KSGV ICSQIEKLVLC ++R+A E+FE  E+   + VG ST+DAL+ ACI LKS+R VKR+  +M+ +G EP+QYM NR+LLMHVKCGM+IDA RLFDE
Subjt:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD
        +PERN  S+ +IISG+V+ GNY EAF LF MMWEE SDC   TFA M+RASAGL  I+ G+QLH CA+K GV  N FVSC LIDMYSKCG +EDA C F+
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
         MP+KT V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VF
Subjt:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA
        D++  KN+ISWNAL+ GY NHGRG +A+++FEKM+   + PNHVTFL+VLSAC+ SGL E+GWEIF +M+  H IKPRAMHYAC+IELLGR+GLLDEA A
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA

Query:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDK-
         IR+AP + T NMWAALL ACR+ ENLELG+  AEKLYGM PEKL NY+V+ N+YNS GK  EAA V+ TL+ KGL M+PAC+W+EV +Q H+FLSGD+ 
Subjt:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDK-

Query:  ---HHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDE-HEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRD
           + T   ++ +KVDELM +IS+ G+  E+  LL DVDE  EE++  YHSEKLAIAYGL+NT +  PLQI Q+HRIC +CH V++ I+++T RE+V+RD
Subjt:  ---HHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDE-HEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRD

Query:  ASRFHHFRDGNCSCGDYW
        ASRFHHF++G CSCG YW
Subjt:  ASRFHHFRDGNCSCGDYW

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial1.1e-13538.8Show/hide
Query:  ELEGGYDVGNSTF-DALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF
        +LEG Y   +  F + L+  C   K +   + +  +++      D  M N +L M+ KCG + +A ++F++MP+R+ V+W T+ISGY       +A   F
Subjt:  ELEGGYDVGNSTF-DALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF

Query:  IMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNL
          M          T +++I+A+A       G QLH   VK G   N+ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL L
Subjt:  IMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNL

Query:  CYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE
           M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ ++V+SWN+L+  Y  HG G+EA+ 
Subjt:  CYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLEL
         FE+M R G+ PN ++FLSVL+ACS SGL + GW  ++ M +D  I P A HY  +++LLGR G L+ A   I + P +PTA +W ALL ACR+H+N EL
Subjt:  MFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLEL

Query:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNF
        G +AAE ++ ++P+    +++L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H + E++  K +E++ KI +LG+VP+ + 
Subjt:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNF

Query:  LLSDVDEHEEKIQM-YHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
        ++  VD+ E ++ + YHSEK+A+A+ L+NT   + + I ++ R+CGDCH+ IKL + +  REI++RD +RFHHF+DGNCSC DYW
Subjt:  LLSDVDEHEEKIQM-YHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233301.9e-13036.99Show/hide
Query:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ F +++ +C  +  +R  + +  +++  G++ D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS+NTII+GY  SG YE+A R+   M          T ++++   +    +  G+++H   ++ G+  ++++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  + WNS++AGY  +G   EAL L  +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F++VL+ACS  GL +  W  F +MT+ + +     HYA + +LLGR G L+EAY  I 
Subjt:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIR

Query:  KAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTE
        K   +PT ++W+ LL +C VH+NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A++   +++KGLR  PACSWIE+ N+ H F+SGD+ H  
Subjt:  KAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTE

Query:  IEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF
        ++K+ E +  +M ++ K G+V + + +L DVD EH+ ++   HSE+LA+A+G+INT   T +++ ++ RIC DCH  IK I+ IT+REI++RD SRFHHF
Subjt:  IEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF

Query:  RDGNCSCGDYW
          GNCSCGDYW
Subjt:  RDGNCSCGDYW

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g015103.0e-12838.19Show/hide
Query:  YRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS
        Y +++ +F     + G+   + TF  ++ A +GL      ++L    +  G   D  + N++L  + K   +++   LFDEMPE + VS+N +IS Y  +
Subjt:  YRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS

Query:  GNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL
          YE +   F  M     D     FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T V W ++I+GY  
Subjt:  GNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL

Query:  HGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG
         G     L L  +MR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  +N +SWNALI+ + 
Subjt:  HGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ M+  + I P+  HYAC+++LLGR G   EA  L+ + PF+P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLR

Query:  ACRVHENLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKI
        ACR+H+N  L + AAEKL+ ME     + Y+ + NIY ++G+ ++  +V + ++ +G++ +PA SW+EVN++ H F S D+ H   +++V K++EL  +I
Subjt:  ACRVHENLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKI

Query:  SKLGHVPEQNFLLSDVDEHEEKIQ--MYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
         + G+ P+ + ++ DVDE + KI+   YHSE+LA+A+ LI+T +  P+ ++++ R C DCH+ IKLI+ I KREI +RD SRFHHF +G CSCGDYW
Subjt:  SKLGHVPEQNFLLSDVDEHEEKIQ--MYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial6.1e-12938.96Show/hide
Query:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDC
        ++T+  LI  CI  ++V     +C ++  +G  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW T+IS Y     +++A  L ++M  +    
Subjt:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDC

Query:  GPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVK
           T+++++R+  G+  +   R LH   +K G+  ++FV  ALID+++K G  EDA  VFDEM     + WNSII G+A +  S+ AL L   M+ +G  
Subjt:  GPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVK

Query:  MDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM
         +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  ++VI+W+ +I+G   +G  +EA+++FE+M   G 
Subjt:  MDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM

Query:  MPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYG
         PN++T + VL ACS +GL E GW  F++M + + I P   HY C+I+LLG+ G LD+A  L+ +   +P A  W  LL ACRV  N+ L ++AA+K+  
Subjt:  MPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYG

Query:  MEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHE
        ++PE    Y +L NIY +S K     E+   ++ +G++  P CSWIEVN Q HAF+ GD  H +I +V +K+++L+ +++ +G+VPE NF+L D++ E  
Subjt:  MEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHE

Query:  EKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
        E    +HSEKLA+A+GL+       ++I ++ RICGDCH   KL + +  R IVIRD  R+HHF+DG CSCGDYW
Subjt:  EKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein4.3e-13038.96Show/hide
Query:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDC
        ++T+  LI  CI  ++V     +C ++  +G  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW T+IS Y     +++A  L ++M  +    
Subjt:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDC

Query:  GPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVK
           T+++++R+  G+  +   R LH   +K G+  ++FV  ALID+++K G  EDA  VFDEM     + WNSII G+A +  S+ AL L   M+ +G  
Subjt:  GPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVK

Query:  MDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM
         +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  ++VI+W+ +I+G   +G  +EA+++FE+M   G 
Subjt:  MDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM

Query:  MPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYG
         PN++T + VL ACS +GL E GW  F++M + + I P   HY C+I+LLG+ G LD+A  L+ +   +P A  W  LL ACRV  N+ L ++AA+K+  
Subjt:  MPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYG

Query:  MEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHE
        ++PE    Y +L NIY +S K     E+   ++ +G++  P CSWIEVN Q HAF+ GD  H +I +V +K+++L+ +++ +G+VPE NF+L D++ E  
Subjt:  MEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHE

Query:  EKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
        E    +HSEKLA+A+GL+       ++I ++ RICGDCH   KL + +  R IVIRD  R+HHF+DG CSCGDYW
Subjt:  EKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-12938.19Show/hide
Query:  YRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS
        Y +++ +F     + G+   + TF  ++ A +GL      ++L    +  G   D  + N++L  + K   +++   LFDEMPE + VS+N +IS Y  +
Subjt:  YRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS

Query:  GNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL
          YE +   F  M     D     FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T V W ++I+GY  
Subjt:  GNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL

Query:  HGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG
         G     L L  +MR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  +N +SWNALI+ + 
Subjt:  HGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ M+  + I P+  HYAC+++LLGR G   EA  L+ + PF+P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLR

Query:  ACRVHENLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKI
        ACR+H+N  L + AAEKL+ ME     + Y+ + NIY ++G+ ++  +V + ++ +G++ +PA SW+EVN++ H F S D+ H   +++V K++EL  +I
Subjt:  ACRVHENLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKI

Query:  SKLGHVPEQNFLLSDVDEHEEKIQ--MYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
         + G+ P+ + ++ DVDE + KI+   YHSE+LA+A+ LI+T +  P+ ++++ R C DCH+ IKLI+ I KREI +RD SRFHHF +G CSCGDYW
Subjt:  SKLGHVPEQNFLLSDVDEHEEKIQ--MYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-13136.99Show/hide
Query:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ F +++ +C  +  +R  + +  +++  G++ D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS+NTII+GY  SG YE+A R+   M          T ++++   +    +  G+++H   ++ G+  ++++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  + WNS++AGY  +G   EAL L  +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F++VL+ACS  GL +  W  F +MT+ + +     HYA + +LLGR G L+EAY  I 
Subjt:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIR

Query:  KAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTE
        K   +PT ++W+ LL +C VH+NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A++   +++KGLR  PACSWIE+ N+ H F+SGD+ H  
Subjt:  KAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTE

Query:  IEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF
        ++K+ E +  +M ++ K G+V + + +L DVD EH+ ++   HSE+LA+A+G+INT   T +++ ++ RIC DCH  IK I+ IT+REI++RD SRFHHF
Subjt:  IEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF

Query:  RDGNCSCGDYW
          GNCSCGDYW
Subjt:  RDGNCSCGDYW

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-13037.89Show/hide
Query:  ELEGGYDVGNSTF-DALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF
        +LEG Y   +  F + L+  C   K +   + +  +++      D  M N +L M+ KCG + +A ++F++MP+R+ V+W T+ISGY       +A   F
Subjt:  ELEGGYDVGNSTF-DALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF

Query:  IMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNL
          M          T +++I+A+A       G QLH   VK G   N+ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL L
Subjt:  IMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNL

Query:  CYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE
           M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ ++V+SWN+L+  Y  HG G+EA+ 
Subjt:  CYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLEL
         FE+M R G+ PN ++FLSVL+ACS SGL + GW  ++ M +D  I P A HY  +++LLGR G L+ A   I + P +PTA +W ALL ACR+H+N EL
Subjt:  MFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLEL

Query:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNF
        G +AAE ++ ++P+    +++L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H + E++  K +E++ KI +LG+VP+ + 
Subjt:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNF

Query:  LLSDVDEHEEKIQM-YHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGN
        ++  VD+ E ++ + YHSEK+A+A+ L+NT   + + I ++ R+CGDCH+ IKL + +  REI++RD +RFHHF+D +
Subjt:  LLSDVDEHEEKIQM-YHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGN

AT5G50390.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.0e-25659.75Show/hide
Query:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSS--KKTHI
        ME+PLSRYQ+   D ++ SS++     F  +FS          L G       R+  N F  + CSS+ QGL P+PK KP  I  ++++        T I
Subjt:  MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSS--KKTHI

Query:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE
         KSGV ICSQIEKLVLC ++R+A E+FE  E+   + VG ST+DAL+ ACI LKS+R VKR+  +M+ +G EP+QYM NR+LLMHVKCGM+IDA RLFDE
Subjt:  RKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD
        +PERN  S+ +IISG+V+ GNY EAF LF MMWEE SDC   TFA M+RASAGL  I+ G+QLH CA+K GV  N FVSC LIDMYSKCG +EDA C F+
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
         MP+KT V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VF
Subjt:  EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA
        D++  KN+ISWNAL+ GY NHGRG +A+++FEKM+   + PNHVTFL+VLSAC+ SGL E+GWEIF +M+  H IKPRAMHYAC+IELLGR+GLLDEA A
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYA

Query:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDK-
         IR+AP + T NMWAALL ACR+ ENLELG+  AEKLYGM PEKL NY+V+ N+YNS GK  EAA V+ TL+ KGL M+PAC+W+EV +Q H+FLSGD+ 
Subjt:  LIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDK-

Query:  ---HHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDE-HEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRD
           + T   ++ +KVDELM +IS+ G+  E+  LL DVDE  EE++  YHSEKLAIAYGL+NT +  PLQI Q+HRIC +CH V++ I+++T RE+V+RD
Subjt:  ---HHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDE-HEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRD

Query:  ASRFHHFRDGNCSCGDYW
        ASRFHHF++G CSCG YW
Subjt:  ASRFHHFRDGNCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTCCCTCTCTCCCGCTATCAAAACTATCTTTATGATCGCCTTCAATGCAGCTCCACTTCTAGCTCTACTTCCTACTTCTCCGTTCGTTTCTCAGATTCCGACCT
TTTTAGAAAGAGATCTTTGCTTTCTGGGTATTCTCTGTGGTCTAACAGAAGAAAATCGCTTAATTCGTTTTGTTGGGTCAAGTGCTCTTCGTTGGAACAAGGCCTACACC
CACGACCCAAACCTAAACCTTCGAAAATCGATCCGGATATTCGTAAAGGGACCTCTTCGAAGAAGACCCATATCAGAAAATCCGGTGTAGGGATCTGTAGCCAGATAGAG
AAGTTGGTTTTGTGTAAGAAGTACCGAGATGCACTTGAGATGTTTGAATTTTTTGAGCTGGAGGGTGGTTATGATGTTGGTAATAGCACGTTTGATGCGTTGATTAATGC
ATGTATTGGCTTGAAATCTGTAAGAGGGGTGAAGAGGTTGTGTAATTACATGATTGATCATGGAATTGAGCCTGATCAATATATGAGGAACAGGGTTCTACTTATGCATG
TGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCCGAGAGGAATGCGGTTTCGTGGAATACTATAATTTCCGGGTATGTAGACTCTGGAAATTAT
GAAGAAGCGTTTAGATTGTTCATAATGATGTGGGAAGAGTACTCTGATTGTGGTCCTCGCACCTTTGCCACAATGATACGGGCATCGGCTGGTTTAGAACTTATTTTTCC
TGGTAGGCAATTGCATTCATGTGCGGTAAAGGCAGGCGTGGGACAGAACATTTTTGTTTCCTGTGCGTTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTC
ATTGTGTTTTTGATGAGATGCCCGATAAGACGATAGTTGGATGGAATTCAATTATTGCTGGTTACGCACTCCATGGCTACAGTGAGGAAGCTCTGAATCTATGTTATGAG
ATGCGTGACTCTGGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGGATATGTTCGAGATTAGCTTCTGTAGCACGTGCCAAGCAAGCGCATGCGAGCTTAGT
TCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTGTA
AAAACGTAATATCATGGAATGCTCTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTCGGGAAGGCATGATGCCGAAC
CATGTGACATTTCTTTCTGTTCTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATATTTCAAACAATGACTAGGGATCACAAGATTAAACCACGCGC
TATGCATTACGCTTGCTTGATTGAGTTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCCACAGCAAATATGTGGGCTG
CCTTGCTTCGAGCTTGTAGGGTTCATGAAAATCTAGAACTTGGGAAATTTGCTGCTGAAAAGCTTTATGGGATGGAACCCGAGAAGCTCAGTAATTACATTGTGCTTTTA
AACATATATAACAGTTCTGGCAAGTTAAAGGAAGCAGCTGAGGTTGTTCGGACATTGAAAAGAAAGGGCTTGAGAATGCTTCCGGCATGTAGTTGGATTGAAGTTAATAA
TCAACCCCATGCATTCCTCTCTGGGGATAAACACCATACCGAAATAGAAAAAGTCGTCGAGAAAGTGGACGAATTAATGCTAAAGATCTCAAAGCTTGGTCATGTTCCTG
AACAGAACTTCTTGCTTTCAGATGTTGATGAACATGAAGAAAAGATACAAATGTACCACAGTGAAAAACTGGCAATAGCTTATGGACTTATCAATACGTTGAAGCGAACG
CCATTGCAAATTGTGCAGAGCCATCGCATTTGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGATCAGAGATGCTAGCAGATT
CCACCATTTCAGAGATGGCAATTGTTCTTGTGGAGACTATTGGTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAATCAGTCAAAAAAAAAAGAGAGAGAAAAAAAACTGATTATTCGTGGGGGGAAAAATGCAAAACCTAAAACCCCGGGTTCGTTATTGCGAAGAAGTTGAACTACC
AACCAAAAAACTGGCGCTGGCGGAGTACTCGATGCTGATGCTTGCTTTTGAATCCAGATCCCTTCTTCTTCCTCTCCGGCGTCTCCGCCATTGTTTTTTTCCGGAGGATG
AAGATGAACCAACGCCTAATCTACTACACTTTCATCTTTCCAATTAACCACGTTTTCTCCTTCCAATCCTTCTAATTCATCCAATTTCAGGATATTCATCAGCTTCCTAA
TCTCTCGAGGTAACTTCCCCACCCCCCGCCATTCTTGATTTGACGCATCTCTTCTAATTTTTCAGCATGGAAGTCCCTCTCTCCCGCTATCAAAACTATCTTTATGATCG
CCTTCAATGCAGCTCCACTTCTAGCTCTACTTCCTACTTCTCCGTTCGTTTCTCAGATTCCGACCTTTTTAGAAAGAGATCTTTGCTTTCTGGGTATTCTCTGTGGTCTA
ACAGAAGAAAATCGCTTAATTCGTTTTGTTGGGTCAAGTGCTCTTCGTTGGAACAAGGCCTACACCCACGACCCAAACCTAAACCTTCGAAAATCGATCCGGATATTCGT
AAAGGGACCTCTTCGAAGAAGACCCATATCAGAAAATCCGGTGTAGGGATCTGTAGCCAGATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCACTTGAGATGTT
TGAATTTTTTGAGCTGGAGGGTGGTTATGATGTTGGTAATAGCACGTTTGATGCGTTGATTAATGCATGTATTGGCTTGAAATCTGTAAGAGGGGTGAAGAGGTTGTGTA
ATTACATGATTGATCATGGAATTGAGCCTGATCAATATATGAGGAACAGGGTTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAA
ATGCCCGAGAGGAATGCGGTTTCGTGGAATACTATAATTTCCGGGTATGTAGACTCTGGAAATTATGAAGAAGCGTTTAGATTGTTCATAATGATGTGGGAAGAGTACTC
TGATTGTGGTCCTCGCACCTTTGCCACAATGATACGGGCATCGGCTGGTTTAGAACTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGGTAAAGGCAGGCGTGGGAC
AGAACATTTTTGTTTCCTGTGCGTTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTGATGAGATGCCCGATAAGACGATAGTTGGATGG
AATTCAATTATTGCTGGTTACGCACTCCATGGCTACAGTGAGGAAGCTCTGAATCTATGTTATGAGATGCGTGACTCTGGAGTTAAAATGGACCATTTCACCTTTTCTAT
AATTATAAGGATATGTTCGAGATTAGCTTCTGTAGCACGTGCCAAGCAAGCGCATGCGAGCTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTTG
TGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTGTAAAAACGTAATATCATGGAATGCTCTGATTGCTGGATATGGGAAT
CATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTCGGGAAGGCATGATGCCGAACCATGTGACATTTCTTTCTGTTCTATCTGCTTGTAGTATTTCAGG
TTTGTTTGAACGTGGATGGGAAATATTTCAAACAATGACTAGGGATCACAAGATTAAACCACGCGCTATGCATTACGCTTGCTTGATTGAGTTGCTAGGTCGAGAAGGGC
TCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCCACAGCAAATATGTGGGCTGCCTTGCTTCGAGCTTGTAGGGTTCATGAAAATCTAGAACTTGGG
AAATTTGCTGCTGAAAAGCTTTATGGGATGGAACCCGAGAAGCTCAGTAATTACATTGTGCTTTTAAACATATATAACAGTTCTGGCAAGTTAAAGGAAGCAGCTGAGGT
TGTTCGGACATTGAAAAGAAAGGGCTTGAGAATGCTTCCGGCATGTAGTTGGATTGAAGTTAATAATCAACCCCATGCATTCCTCTCTGGGGATAAACACCATACCGAAA
TAGAAAAAGTCGTCGAGAAAGTGGACGAATTAATGCTAAAGATCTCAAAGCTTGGTCATGTTCCTGAACAGAACTTCTTGCTTTCAGATGTTGATGAACATGAAGAAAAG
ATACAAATGTACCACAGTGAAAAACTGGCAATAGCTTATGGACTTATCAATACGTTGAAGCGAACGCCATTGCAAATTGTGCAGAGCCATCGCATTTGCGGTGACTGCCA
TTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGATCAGAGATGCTAGCAGATTCCACCATTTCAGAGATGGCAATTGTTCTTGTGGAGACTATTGGT
GAGGAAGGTAAATCTATATTGTTACTTTGTTTGTTTATTTTCATAGGAACGGATAACCTCTCTTCCATTCGATAAACATGATATATTGTACATTCTTTTTCAAGGAGCTT
ATAATGAAATGAATTCAT
Protein sequenceShow/hide protein sequence
MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIE
KLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNY
EEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYE
MRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPN
HVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLL
NIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRT
PLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW