; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030074 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030074
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr8:44362312..44364453
RNA-Seq ExpressionLag0030074
SyntenyLag0030074
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148701.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus]0.0e+0086.31Show/hide
Query:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGL--RPRPEPKPSKIDPDVRKGTSSKET
        M+ME+PLSRYQNYV DRLQC    +S+S+FSLR+SDS+L  K S L      SN RK RNSFCW+KCSS EQGL  RPRP+PKPSK+D   RK T  KET
Subjt:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGL--RPRPEPKPSKIDPDVRKGTSSKET

Query:  RIRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLF
         ++K S GICSQIEKLVLCKKYRDALEMFEIFELE G+ VG ST+DAL++ACIGLKSIRGVKRLCNYM+DNGFEPDQYMRNR+LLMHVKCGMMIDACRLF
Subjt:  RIRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLF

Query:  DEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCV
        DEMP RNAVSW TIISGYVDSGNY EAFRLFI+M EEF DCGPRTFATM+RASAGLE+IFPGRQLHSCA+KAG+GQDIFVSCALIDMYSKCGSLEDAHCV
Subjt:  DEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCV

Query:  FDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH
        FDEMPDKTI+GWNSIIAGYALHGYSEEALDLY+EMRDSGVK+DHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARH
Subjt:  FDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH

Query:  VFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEA
        VFDRMSC+N+ISWNALIAGYGNHG GEEAI+MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWE FQ+M RD+K+KPRAMH+ACMIELLGREGLLDEA
Subjt:  VFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEA

Query:  YALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGD
        YALIRKAPF PTANMWAALLRACRVH NLELGKFAAEKLYGM P KLSNYIVLLNIYNSSGKLKEAA+V QTLKRKGLRMLPACSWIEV NQPHAFLSGD
Subjt:  YALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGD

Query:  KHHSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDAS
        KHH QI+KV+ KVDELML ISKLGYVPEE NF+LPDVDE+EEK  M+HSEKLAIA+GL+NTLE+TPLQIVQSHRIC DCHSVIKLIA+ITKREIV+RDAS
Subjt:  KHHSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDAS

Query:  RFHHFRDGSCSCGDYW
        RFHHFRDGSCSCGDYW
Subjt:  RFHHFRDGSCSCGDYW

XP_008459324.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo]0.0e+0086.55Show/hide
Query:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRI
        M+ME+PLSRYQNYV DRLQC ST     YFSLR+SDS L  K S L      SNRRK RNSFCWVKCSS EQGLRPRP+PKPSK+D  VRK    KET +
Subjt:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRI

Query:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE
        RK S GICSQIEKLVLCK+YRDALEMFEIFELE G+ VGNST+DAL++ACIGLKSIRGVKRL NYM+DNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDE
Subjt:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD
        MPERNAVSW+TIISGYVDSGNY EAFRLFI+MWEE   CGPRT ATM+RASAGLE+IF GRQLHSCA+KAG+GQDIFVSCALIDMYSKCGSLEDAHCVFD
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
        EMPDKTI+GWNSIIAGYALHGYSEEALDLY+EM  SGVK+DHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
Subjt:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA
        DRMSC+NVISWNALIAGYGNHGRGEEAI+MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWE FQ+M RD+K++PRAMH+ACMIELLGREGLLDEAYA
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA

Query:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH
        LIRKAPF PTANMWAALLRACRVH NLELGKFAAEKLYGM P KLSNYIVLLNIYN+SGKLKEAA+VVQTLKRKGLRMLPACSWIEV NQPHAFLSGDKH
Subjt:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH

Query:  HSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF
        H Q++KV+ KVDELML+ISKLGYVPEE NF+LPDVDEHEEK  M+HSEKLAIA+GL+NTLE+TPLQIVQSHRIC DCHSVIKLIA+ITKREIV+RDASRF
Subjt:  HSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDG+CSCGDYW
Subjt:  HHFRDGSCSCGDYW

XP_022133879.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia]0.0e+0088.8Show/hide
Query:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKE-TR
        M+MEVPL RYQNYV DRLQC STSSSSSY  +RF+DS+L RKRSLLS Y+LWSNRRKLRNSFCW+KCSSLEQGLRPRPEP+PSKID DVRKGTSS E TR
Subjt:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKE-TR

Query:  IRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD
        IRK   GICSQIEKLVLCKKYRDALEMFEIFELEGGYD+GNST+DAL++ACIGLKSIRGVKRLCNYMIDNGFEPDQYM+NRILLMHVKCGMMIDACRLFD
Subjt:  IRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE SD GPRTFA M+RASAGLELIFPGRQLHSCA+KAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV
        DEMPDKTI+GWNSIIAGYALHGYSEEALDL YEMRDSG+K+DHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV

Query:  FDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAY
        FDRMS KN+ISWNALIAGYGNHGRGEEAI+MFE+MLREGM PNHVTFLAVLSACSISGLFERGWE FQ++  D+KIKPRAMH+ACMIELLGREGLLDEAY
Subjt:  FDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAY

Query:  ALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDK
        ALIR APF PTANMWAALLRACRVHENLELGK AAE LYGM P KLSNYIVLLNIYNSSGKLKEAA+VVQTLKRKGLRM+PACSWIEVKNQPH+FLSGDK
Subjt:  ALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDK

Query:  HHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF
        HH++I+KV+EKVDE+ML+ISKLGYV E+NFLLPDVDE EEK  M+HSEKLAIA+GL++TL++TPLQIVQSHRICGDCHS IKLIALIT+REIVVRDASRF
Subjt:  HHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDGSCSCGDYW
Subjt:  HHFRDGSCSCGDYW

XP_022989822.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucurbita maxima]0.0e+0086.92Show/hide
Query:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRIRK
        MEVPL  YQNYV D L+  S SSS+SYFS  FS SEL R RSLLS YSLWSNRRKLRNSFCWVKCSSLEQGLRPR +PKPSK+D DVRKGT SKETRI K
Subjt:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRIRK

Query:  FSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMP
         S  IC  IEKLVLC K+RDALEMFEI ELEGGYDVGNSTFDAL+ ACIGLKSIRG KRLC YMIDNG EPDQY+ NRILLMHV+CGMMIDA +LFDEMP
Subjt:  FSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEE+  C PRTFAT++RASAGLELIFPG+QLHSCAVKAGVGQDIFVSCALIDMYSKCG LEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTI+GWNSIIAGYALHG+SEEAL+LY++MRDSGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDR
Subjt:  PDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALI
        MSCKN+ISWNALIAGYGNHGRGEEAIE+FE+MLREGM+PNHVTFLAVLSACSISGLFERGWE FQ+M RD+KIK RAMHY CMIELLGREGLLDEAYALI
Subjt:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALI

Query:  RKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHS
        RKAPF PTANMWAALLRACRVHENLELGK+AAEKLYGM P KL NYIVLLNIY SSGKLKEAA+VV+TLKRKGL MLPACSWIEVK+QPHAFLSGDKHH 
Subjt:  RKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHS

Query:  QIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF
        +I+KV+EKVDELMLEISKLGYVPE+N LLPDVD HEEK  ++HSEKLAIA+GLINTL+QTPLQIVQ HR+CGDCHSVIKLIA+ITKREIVVRDASRFHHF
Subjt:  QIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
        RDG CSCGDYW
Subjt:  RDGSCSCGDYW

XP_038890388.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida]0.0e+0085.57Show/hide
Query:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRI
        M+ME+PLS YQNY+ DR+QC    +S+SY SLRFS  +L R+R  L       NRRK RNS  W+KCSS EQGLRPRP+PKPSK+DP V K T  KET +
Subjt:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRI

Query:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE
         + S GICSQIEKLVLCKKYRDALEMFEIFELEGG+  GN+T DAL++AC+ LKSIRGVK+LCNYM+DNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+
Subjt:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD
        MPERNAVSWNTIISG+VDSGNY EAFRLFI+MWEE+ DCGPRTFATM+RASAGLELIFPGRQLHSCA+KA +GQDIFVSCALIDMYSKCGSLEDAHCVFD
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
        EMPDKTI+GWNSIIAGYALHGYSEEALDLYYEMRDSG+K+DHFTFSIIIRICSRLASVA AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
Subjt:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA
        DRMSC+N+ISWNALIAGYGNHGRG EAI+MFEKMLREG +PNHVTFLAVLSACSISGLFERGWE FQ+M RD+KIKPRAMHYACMIELLGREGLLDEAYA
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA

Query:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH
        LIRKAPF PTANMWAALLRACRVH NLELGKFAAEKLYGM P KLSNYIVLLNIYNSSGKLKEAA+VVQTLKRKGLRMLPACSWIEV NQPHAFLSGDKH
Subjt:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH

Query:  HSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF
        H QI+KV+ KVDELML+ISKLGYVPEE NF+LPDVDEHEEK  M+HSEKLAIA+GL+NTLEQTPLQIVQSHRIC DCH VIKLIA+ITKREIV+RDASRF
Subjt:  HSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDGSCSCGDYW
Subjt:  HHFRDGSCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KXD9 DYW_deaminase domain-containing protein0.0e+0086.31Show/hide
Query:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGL--RPRPEPKPSKIDPDVRKGTSSKET
        M+ME+PLSRYQNYV DRLQC    +S+S+FSLR+SDS+L  K S L      SN RK RNSFCW+KCSS EQGL  RPRP+PKPSK+D   RK T  KET
Subjt:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGL--RPRPEPKPSKIDPDVRKGTSSKET

Query:  RIRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLF
         ++K S GICSQIEKLVLCKKYRDALEMFEIFELE G+ VG ST+DAL++ACIGLKSIRGVKRLCNYM+DNGFEPDQYMRNR+LLMHVKCGMMIDACRLF
Subjt:  RIRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLF

Query:  DEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCV
        DEMP RNAVSW TIISGYVDSGNY EAFRLFI+M EEF DCGPRTFATM+RASAGLE+IFPGRQLHSCA+KAG+GQDIFVSCALIDMYSKCGSLEDAHCV
Subjt:  DEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCV

Query:  FDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH
        FDEMPDKTI+GWNSIIAGYALHGYSEEALDLY+EMRDSGVK+DHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARH
Subjt:  FDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH

Query:  VFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEA
        VFDRMSC+N+ISWNALIAGYGNHG GEEAI+MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWE FQ+M RD+K+KPRAMH+ACMIELLGREGLLDEA
Subjt:  VFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEA

Query:  YALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGD
        YALIRKAPF PTANMWAALLRACRVH NLELGKFAAEKLYGM P KLSNYIVLLNIYNSSGKLKEAA+V QTLKRKGLRMLPACSWIEV NQPHAFLSGD
Subjt:  YALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGD

Query:  KHHSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDAS
        KHH QI+KV+ KVDELML ISKLGYVPEE NF+LPDVDE+EEK  M+HSEKLAIA+GL+NTLE+TPLQIVQSHRIC DCHSVIKLIA+ITKREIV+RDAS
Subjt:  KHHSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDAS

Query:  RFHHFRDGSCSCGDYW
        RFHHFRDGSCSCGDYW
Subjt:  RFHHFRDGSCSCGDYW

A0A1S3C9W7 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0086.55Show/hide
Query:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRI
        M+ME+PLSRYQNYV DRLQC ST     YFSLR+SDS L  K S L      SNRRK RNSFCWVKCSS EQGLRPRP+PKPSK+D  VRK    KET +
Subjt:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRI

Query:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE
        RK S GICSQIEKLVLCK+YRDALEMFEIFELE G+ VGNST+DAL++ACIGLKSIRGVKRL NYM+DNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDE
Subjt:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD
        MPERNAVSW+TIISGYVDSGNY EAFRLFI+MWEE   CGPRT ATM+RASAGLE+IF GRQLHSCA+KAG+GQDIFVSCALIDMYSKCGSLEDAHCVFD
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
        EMPDKTI+GWNSIIAGYALHGYSEEALDLY+EM  SGVK+DHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
Subjt:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA
        DRMSC+NVISWNALIAGYGNHGRGEEAI+MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWE FQ+M RD+K++PRAMH+ACMIELLGREGLLDEAYA
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA

Query:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH
        LIRKAPF PTANMWAALLRACRVH NLELGKFAAEKLYGM P KLSNYIVLLNIYN+SGKLKEAA+VVQTLKRKGLRMLPACSWIEV NQPHAFLSGDKH
Subjt:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH

Query:  HSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF
        H Q++KV+ KVDELML+ISKLGYVPEE NF+LPDVDEHEEK  M+HSEKLAIA+GL+NTLE+TPLQIVQSHRIC DCHSVIKLIA+ITKREIV+RDASRF
Subjt:  HSQIDKVIEKVDELMLEISKLGYVPEE-NFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDG+CSCGDYW
Subjt:  HHFRDGSCSCGDYW

A0A6J1BWH3 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0088.8Show/hide
Query:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKE-TR
        M+MEVPL RYQNYV DRLQC STSSSSSY  +RF+DS+L RKRSLLS Y+LWSNRRKLRNSFCW+KCSSLEQGLRPRPEP+PSKID DVRKGTSS E TR
Subjt:  MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKE-TR

Query:  IRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD
        IRK   GICSQIEKLVLCKKYRDALEMFEIFELEGGYD+GNST+DAL++ACIGLKSIRGVKRLCNYMIDNGFEPDQYM+NRILLMHVKCGMMIDACRLFD
Subjt:  IRKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE SD GPRTFA M+RASAGLELIFPGRQLHSCA+KAGVGQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV
        DEMPDKTI+GWNSIIAGYALHGYSEEALDL YEMRDSG+K+DHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV

Query:  FDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAY
        FDRMS KN+ISWNALIAGYGNHGRGEEAI+MFE+MLREGM PNHVTFLAVLSACSISGLFERGWE FQ++  D+KIKPRAMH+ACMIELLGREGLLDEAY
Subjt:  FDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAY

Query:  ALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDK
        ALIR APF PTANMWAALLRACRVHENLELGK AAE LYGM P KLSNYIVLLNIYNSSGKLKEAA+VVQTLKRKGLRM+PACSWIEVKNQPH+FLSGDK
Subjt:  ALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDK

Query:  HHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF
        HH++I+KV+EKVDE+ML+ISKLGYV E+NFLLPDVDE EEK  M+HSEKLAIA+GL++TL++TPLQIVQSHRICGDCHS IKLIALIT+REIVVRDASRF
Subjt:  HHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDGSCSCGDYW
Subjt:  HHFRDGSCSCGDYW

A0A6J1GSZ5 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0086.38Show/hide
Query:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGT-SSKETRIR
        MEVPL  YQNYV D LQ  S SSS+SYFS  FS SEL R RSLLS YSLWSN RKLRNSFCWVKCSSLEQGLRPR +PKPSK++ DVRKGT  SKETRI 
Subjt:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGT-SSKETRIR

Query:  KFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEM
        K S  IC  IEKLVLC K+RDALEMFEI ELEGGYDVGNSTFDAL++ACIGLKSIRG KRLC YMIDNG EPDQY+ NRILLMHV+CGMMIDA +LFDEM
Subjt:  KFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE
        PERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEE+  C PRTFAT++RASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCG LEDAHCVFDE
Subjt:  PERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE

Query:  MPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD
        MPDKTI+GWNSIIAGYALHGYSEEAL+LYY+MRDSGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FD
Subjt:  MPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD

Query:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYAL
        RMSCKN+ISWNALIAGYGNHGRGEEAIE+FE+MLREGM+PNHVTFLAVLSACSISGLFERGWE FQ++ RD+K+K RAMHY CMIELLGREGLLDEAYAL
Subjt:  RMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYAL

Query:  IRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHH
        IRKAPF PTANMWAALLRACRVHENLELGK+ AEKLYGM P KL NYIVLLNIY SSGKLKEAA+VVQTLKRKGL MLPACSWIEVK+QPHAF SGDK H
Subjt:  IRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHH

Query:  SQIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHH
         +I+KV+EKVDELMLEISKLGYVPE N LLPDVD HEEK  ++HSEKLAIA+GLINTL  TPLQIVQ HR+CGDCHSVIKLIA+ITKREIVVRDASRFHH
Subjt:  SQIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        FRDG CSCGDYW
Subjt:  FRDGSCSCGDYW

A0A6J1JGW0 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0086.92Show/hide
Query:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRIRK
        MEVPL  YQNYV D L+  S SSS+SYFS  FS SEL R RSLLS YSLWSNRRKLRNSFCWVKCSSLEQGLRPR +PKPSK+D DVRKGT SKETRI K
Subjt:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRIRK

Query:  FSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMP
         S  IC  IEKLVLC K+RDALEMFEI ELEGGYDVGNSTFDAL+ ACIGLKSIRG KRLC YMIDNG EPDQY+ NRILLMHV+CGMMIDA +LFDEMP
Subjt:  FSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEE+  C PRTFAT++RASAGLELIFPG+QLHSCAVKAGVGQDIFVSCALIDMYSKCG LEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTI+GWNSIIAGYALHG+SEEAL+LY++MRDSGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDR
Subjt:  PDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALI
        MSCKN+ISWNALIAGYGNHGRGEEAIE+FE+MLREGM+PNHVTFLAVLSACSISGLFERGWE FQ+M RD+KIK RAMHY CMIELLGREGLLDEAYALI
Subjt:  MSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALI

Query:  RKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHS
        RKAPF PTANMWAALLRACRVHENLELGK+AAEKLYGM P KL NYIVLLNIY SSGKLKEAA+VV+TLKRKGL MLPACSWIEVK+QPHAFLSGDKHH 
Subjt:  RKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHS

Query:  QIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF
        +I+KV+EKVDELMLEISKLGYVPE+N LLPDVD HEEK  ++HSEKLAIA+GLINTL+QTPLQIVQ HR+CGDCHSVIKLIA+ITKREIVVRDASRFHHF
Subjt:  QIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
        RDG CSCGDYW
Subjt:  RDGSCSCGDYW

SwissProt top hitse value%identityAlignment
Q9FK33 Pentatricopeptide repeat-containing protein At5g50390, chloroplastic1.5e-25560.31Show/hide
Query:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSS--KETRI
        ME+PLSRYQ+   D ++  S++     F  +FS          L G       R+ +N F  + CSS+ QGL+P+P+ KP  I  +V++       +T+I
Subjt:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSS--KETRI

Query:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE
         K    ICSQIEKLVLC ++R+A E+FEI E+   + VG ST+DALV ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFDE
Subjt:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD
        +PERN  S+ +IISG+V+ GNY EAF LF MMWEE SDC   TFA M+RASAGL  I+ G+QLH CA+K GV  + FVSC LIDMYSKCG +EDA C F+
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
         MP+KT + WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VF
Subjt:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA
        D++  KN+ISWNAL+ GY NHGRG +A+++FEKM+   + PNHVTFLAVLSAC+ SGL E+GWE F +M+  + IKPRAMHYACMIELLGR+GLLDEA A
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA

Query:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH
         IR+AP   T NMWAALL ACR+ ENLELG+  AEKLYGM P KL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+FLSGD+ 
Subjt:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH

Query:  HSQID----KVIEKVDELMLEISKLGYVPEENFLLPDVDE-HEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRD
         S  +    ++ +KVDELM EIS+ GY  EE  LLPDVDE  EE+   +HSEKLAIA+GL+NT E  PLQI Q+HRIC +CH V++ I+L+T RE+VVRD
Subjt:  HSQID----KVIEKVDELMLEISKLGYVPEENFLLPDVDE-HEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRD

Query:  ASRFHHFRDGSCSCGDYW
        ASRFHHF++G CSCG YW
Subjt:  ASRFHHFRDGSCSCGDYW

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial8.3e-13438.46Show/hide
Query:  ELEGGYDVGNSTF-DALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF
        +LEG Y   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+W T+ISGY       +A   F
Subjt:  ELEGGYDVGNSTF-DALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF

Query:  IMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDL
          M          T +++++A+A       G QLH   VK G   ++ V  AL+D+Y++ G ++DA  VFD +  +  + WN++IAG+A    +E+AL+L
Subjt:  IMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDL

Query:  YYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE
        +  M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ ++V+SWN+L+  Y  HG G+EA+ 
Subjt:  YYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLEL
         FE+M R G+ PN ++FL+VL+ACS SGL + GW +++ M +D  I P A HY  +++LLGR G L+ A   I + P  PTA +W ALL ACR+H+N EL
Subjt:  MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLEL

Query:  GKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENF
        G +AAE ++ + P     +++L NIY S G+  +AA V + +K  G++  PACSW+E++N  H F++ D+ H Q +++  K +E++ +I +LGYVP+ + 
Subjt:  GKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENF

Query:  LLPDVDEHE-EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW
        ++  VD+ E E    +HSEK+A+AF L+NT   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+DG+CSC DYW
Subjt:  LLPDVDEHE-EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233305.9e-13237.97Show/hide
Query:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ F +++ +C  +  +R  + +  +++  G + D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS+NTII+GY  SG YE+A R+   M          T ++++   +    +  G+++H   ++ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  I WNS++AGY  +G   EAL L+ +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F+AVL+ACS  GL +  W +F +M + Y +     HYA + +LLGR G L+EAY  I 
Subjt:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIR

Query:  KAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQ
        K    PT ++W+ LL +C VH+NLEL +  AEK++ +    +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACSWIE+KN+ H F+SGD+ H  
Subjt:  KAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQ

Query:  IDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMF-HSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF
        +DK+ E +  +M ++ K GYV + + +L DVDE  ++  +F HSE+LA+AFG+INT   T +++ ++ RIC DCH  IK I+ IT+REI+VRD SRFHHF
Subjt:  IDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMF-HSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
          G+CSCGDYW
Subjt:  RDGSCSCGDYW

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g015105.0e-13138.76Show/hide
Query:  YRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS
        Y +++ +F +   + G+   + TF  ++ A +GL      ++L    +  GF  D  + N+IL  + K   +++   LFDEMPE + VS+N +IS Y  +
Subjt:  YRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS

Query:  GNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYAL
          YE +   F  M     D     FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T + W ++I+GY  
Subjt:  GNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYAL

Query:  HGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG
         G     L L+ +MR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  +N +SWNALI+ + 
Subjt:  HGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E+FQ M+  Y I P+  HYACM++LLGR G   EA  L+ + PF P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLR

Query:  ACRVHENLELGKFAAEKLYGMVPGK-LSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEI
        ACR+H+N  L + AAEKL+ M   +  + Y+ + NIY ++G+ ++  +V + ++ +G++ +PA SW+EV ++ H F S D+ H   D+++ K++EL  EI
Subjt:  ACRVHENLELGKFAAEKLYGMVPGK-LSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEI

Query:  SKLGYVPEENFLLPDVDEHEE-KAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW
         + GY P+ + ++ DVDE  + ++  +HSE+LA+AF LI+T E  P+ ++++ R C DCH+ IKLI+ I KREI VRD SRFHHF +G CSCGDYW
Subjt:  SKLGYVPEENFLLPDVDEHEE-KAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial1.9e-13039.3Show/hide
Query:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDC
        ++T+  L+  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW T+IS Y     +++A  L ++M  +    
Subjt:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDC

Query:  GPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVK
           T+++++R+  G+  +   R LH   +K G+  D+FV  ALID+++K G  EDA  VFDEM     I WNSII G+A +  S+ AL+L+  M+ +G  
Subjt:  GPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVK

Query:  LDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM
         +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  ++VI+W+ +I+G   +G  +EA+++FE+M   G 
Subjt:  LDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM

Query:  MPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYG
         PN++T + VL ACS +GL E GW +F++M + Y I P   HY CMI+LLG+ G LD+A  L+ +    P A  W  LL ACRV  N+ L ++AA+K+  
Subjt:  MPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYG

Query:  MVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVD-EHE
        + P     Y +L NIY +S K      +   ++ +G++  P CSWIEV  Q HAF+ GD  H QI +V +K+++L+  ++ +GYVPE NF+L D++ E  
Subjt:  MVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVD-EHE

Query:  EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW
        E +   HSEKLA+AFGL+    +  ++I ++ RICGDCH   KL + +  R IV+RD  R+HHF+DG CSCGDYW
Subjt:  EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-13139.3Show/hide
Query:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDC
        ++T+  L+  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW T+IS Y     +++A  L ++M  +    
Subjt:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDC

Query:  GPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVK
           T+++++R+  G+  +   R LH   +K G+  D+FV  ALID+++K G  EDA  VFDEM     I WNSII G+A +  S+ AL+L+  M+ +G  
Subjt:  GPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVK

Query:  LDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM
         +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  ++VI+W+ +I+G   +G  +EA+++FE+M   G 
Subjt:  LDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM

Query:  MPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYG
         PN++T + VL ACS +GL E GW +F++M + Y I P   HY CMI+LLG+ G LD+A  L+ +    P A  W  LL ACRV  N+ L ++AA+K+  
Subjt:  MPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYG

Query:  MVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVD-EHE
        + P     Y +L NIY +S K      +   ++ +G++  P CSWIEV  Q HAF+ GD  H QI +V +K+++L+  ++ +GYVPE NF+L D++ E  
Subjt:  MVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVD-EHE

Query:  EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW
        E +   HSEKLA+AFGL+    +  ++I ++ RICGDCH   KL + +  R IV+RD  R+HHF+DG CSCGDYW
Subjt:  EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-13238.76Show/hide
Query:  YRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS
        Y +++ +F +   + G+   + TF  ++ A +GL      ++L    +  GF  D  + N+IL  + K   +++   LFDEMPE + VS+N +IS Y  +
Subjt:  YRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDS

Query:  GNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYAL
          YE +   F  M     D     FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T + W ++I+GY  
Subjt:  GNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYAL

Query:  HGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG
         G     L L+ +MR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  +N +SWNALI+ + 
Subjt:  HGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E+FQ M+  Y I P+  HYACM++LLGR G   EA  L+ + PF P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLR

Query:  ACRVHENLELGKFAAEKLYGMVPGK-LSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEI
        ACR+H+N  L + AAEKL+ M   +  + Y+ + NIY ++G+ ++  +V + ++ +G++ +PA SW+EV ++ H F S D+ H   D+++ K++EL  EI
Subjt:  ACRVHENLELGKFAAEKLYGMVPGK-LSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEI

Query:  SKLGYVPEENFLLPDVDEHEE-KAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW
         + GY P+ + ++ DVDE  + ++  +HSE+LA+AF LI+T E  P+ ++++ R C DCH+ IKLI+ I KREI VRD SRFHHF +G CSCGDYW
Subjt:  SKLGYVPEENFLLPDVDEHEE-KAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-13337.97Show/hide
Query:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ F +++ +C  +  +R  + +  +++  G + D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS+NTII+GY  SG YE+A R+   M          T ++++   +    +  G+++H   ++ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  I WNS++AGY  +G   EAL L+ +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F+AVL+ACS  GL +  W +F +M + Y +     HYA + +LLGR G L+EAY  I 
Subjt:  SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIR

Query:  KAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQ
        K    PT ++W+ LL +C VH+NLEL +  AEK++ +    +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACSWIE+KN+ H F+SGD+ H  
Subjt:  KAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQ

Query:  IDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMF-HSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF
        +DK+ E +  +M ++ K GYV + + +L DVDE  ++  +F HSE+LA+AFG+INT   T +++ ++ RIC DCH  IK I+ IT+REI+VRD SRFHHF
Subjt:  IDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMF-HSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
          G+CSCGDYW
Subjt:  RDGSCSCGDYW

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-12937.89Show/hide
Query:  ELEGGYDVGNSTF-DALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF
        +LEG Y   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+W T+ISGY       +A   F
Subjt:  ELEGGYDVGNSTF-DALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLF

Query:  IMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDL
          M          T +++++A+A       G QLH   VK G   ++ V  AL+D+Y++ G ++DA  VFD +  +  + WN++IAG+A    +E+AL+L
Subjt:  IMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDL

Query:  YYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE
        +  M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ ++V+SWN+L+  Y  HG G+EA+ 
Subjt:  YYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLEL
         FE+M R G+ PN ++FL+VL+ACS SGL + GW +++ M +D  I P A HY  +++LLGR G L+ A   I + P  PTA +W ALL ACR+H+N EL
Subjt:  MFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLEL

Query:  GKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENF
        G +AAE ++ + P     +++L NIY S G+  +AA V + +K  G++  PACSW+E++N  H F++ D+ H Q +++  K +E++ +I +LGYVP+ + 
Subjt:  GKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENF

Query:  LLPDVDEHE-EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGS
        ++  VD+ E E    +HSEK+A+AF L+NT   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+D S
Subjt:  LLPDVDEHE-EKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGS

AT5G50390.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.0e-25660.31Show/hide
Query:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSS--KETRI
        ME+PLSRYQ+   D ++  S++     F  +FS          L G       R+ +N F  + CSS+ QGL+P+P+ KP  I  +V++       +T+I
Subjt:  MEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSS--KETRI

Query:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE
         K    ICSQIEKLVLC ++R+A E+FEI E+   + VG ST+DALV ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFDE
Subjt:  RKFSAGICSQIEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDE

Query:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD
        +PERN  S+ +IISG+V+ GNY EAF LF MMWEE SDC   TFA M+RASAGL  I+ G+QLH CA+K GV  + FVSC LIDMYSKCG +EDA C F+
Subjt:  MPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFD

Query:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF
         MP+KT + WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VF
Subjt:  EMPDKTIIGWNSIIAGYALHGYSEEALDLYYEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVF

Query:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA
        D++  KN+ISWNAL+ GY NHGRG +A+++FEKM+   + PNHVTFLAVLSAC+ SGL E+GWE F +M+  + IKPRAMHYACMIELLGR+GLLDEA A
Subjt:  DRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYA

Query:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH
         IR+AP   T NMWAALL ACR+ ENLELG+  AEKLYGM P KL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+FLSGD+ 
Subjt:  LIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIVLLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKH

Query:  HSQID----KVIEKVDELMLEISKLGYVPEENFLLPDVDE-HEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRD
         S  +    ++ +KVDELM EIS+ GY  EE  LLPDVDE  EE+   +HSEKLAIA+GL+NT E  PLQI Q+HRIC +CH V++ I+L+T RE+VVRD
Subjt:  HSQID----KVIEKVDELMLEISKLGYVPEENFLLPDVDE-HEEKAWMFHSEKLAIAFGLINTLEQTPLQIVQSHRICGDCHSVIKLIALITKREIVVRD

Query:  ASRFHHFRDGSCSCGDYW
        ASRFHHF++G CSCG YW
Subjt:  ASRFHHFRDGSCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATGGAAGTCCCTCTCTCCCGCTATCAAAACTACGTTTGTGATCGGCTTCAATGTATCTCCACTTCTAGCTCTAGTTCGTACTTCTCCCTTCGTTTCTCAGATTC
CGAGCTTGTTAGGAAGAGATCTTTGCTTTCTGGGTATTCTCTATGGTCCAATAGAAGAAAATTGCGAAATTCCTTTTGTTGGGTCAAGTGCTCTTCGTTGGAACAAGGGC
TACGGCCACGACCCGAACCTAAACCTTCGAAAATCGATCCGGATGTTCGTAAAGGGACGTCTTCGAAGGAGACCCGTATCAGAAAATTCAGTGCGGGGATCTGTAGTCAG
ATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCGCTTGAGATGTTTGAAATTTTTGAGCTGGAGGGTGGTTATGATGTTGGTAACAGCACGTTTGATGCGCTGGT
TAGTGCGTGTATTGGCTTGAAATCTATAAGAGGGGTGAAGAGGTTGTGTAATTACATGATTGATAATGGATTTGAGCCTGATCAATATATGAGGAACAGGATTCTACTTA
TGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCTGAAAGGAATGCGGTTTCGTGGAATACTATAATTTCCGGGTATGTAGACTCTGGA
AATTATGAAGAAGCCTTTAGATTGTTTATTATGATGTGGGAAGAGTTTTCTGATTGTGGTCCTCGCACCTTTGCCACAATGATGCGGGCATCGGCTGGTTTGGAACTAAT
TTTTCCTGGTAGGCAATTGCATTCATGTGCGGTAAAGGCAGGTGTGGGACAGGACATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAG
ATGCTCACTGTGTTTTTGATGAGATGCCCGATAAGACAATAATTGGATGGAATTCAATTATAGCCGGTTACGCCCTCCATGGCTACAGTGAGGAAGCTCTGGATCTATAT
TATGAGATGCGTGACTCTGGAGTTAAATTGGACCATTTCACCTTCTCTATAATTATAAGAATATGCTCGAGATTGGCCTCTGTAGCACGTGCTAAGCAAGCGCATGCTAG
TTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTCGGCATGTGTTTGACAGGATGT
CCTGTAAAAACGTAATATCATGGAATGCTTTGATTGCTGGATATGGGAATCACGGCCGGGGAGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTAGGGAAGGCATGATG
CCAAACCATGTTACATTTCTTGCTGTCTTATCTGCTTGTAGTATTTCAGGTTTGTTCGAACGTGGATGGGAATTTTTTCAAACAATGGCCAGGGATTACAAGATTAAACC
GCGCGCTATGCATTACGCGTGCATGATTGAATTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGAAAAGCTCCATTTCCACCCACAGCAAATATGT
GGGCTGCCTTGCTTAGAGCTTGTAGAGTTCATGAAAATCTAGAACTTGGGAAATTTGCTGCTGAAAAACTTTATGGGATGGTACCCGGTAAGCTTAGTAATTATATTGTG
CTATTAAACATATATAACAGTTCTGGTAAGTTAAAGGAAGCAGCTAATGTTGTTCAGACATTGAAAAGAAAGGGATTGAGAATGCTTCCAGCTTGCAGTTGGATTGAAGT
TAAAAATCAGCCCCATGCCTTCCTGTCTGGGGATAAACATCATTCCCAAATAGACAAAGTTATCGAGAAAGTGGATGAATTAATGTTAGAGATCTCAAAGCTTGGTTATG
TTCCTGAAGAGAACTTCTTGCTTCCAGATGTAGACGAACACGAAGAAAAGGCATGGATGTTCCACAGTGAGAAATTGGCAATAGCTTTTGGACTTATCAATACTTTAGAG
CAAACACCATTGCAAATTGTGCAGAGCCATCGCATTTGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTTTGATAACAAAACGTGAAATTGTGGTCAGAGACGCTAG
CAGATTCCATCATTTCAGAGATGGGAGTTGTTCTTGTGGAGACTATTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCATGGAAGTCCCTCTCTCCCGCTATCAAAACTACGTTTGTGATCGGCTTCAATGTATCTCCACTTCTAGCTCTAGTTCGTACTTCTCCCTTCGTTTCTCAGATTC
CGAGCTTGTTAGGAAGAGATCTTTGCTTTCTGGGTATTCTCTATGGTCCAATAGAAGAAAATTGCGAAATTCCTTTTGTTGGGTCAAGTGCTCTTCGTTGGAACAAGGGC
TACGGCCACGACCCGAACCTAAACCTTCGAAAATCGATCCGGATGTTCGTAAAGGGACGTCTTCGAAGGAGACCCGTATCAGAAAATTCAGTGCGGGGATCTGTAGTCAG
ATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCGCTTGAGATGTTTGAAATTTTTGAGCTGGAGGGTGGTTATGATGTTGGTAACAGCACGTTTGATGCGCTGGT
TAGTGCGTGTATTGGCTTGAAATCTATAAGAGGGGTGAAGAGGTTGTGTAATTACATGATTGATAATGGATTTGAGCCTGATCAATATATGAGGAACAGGATTCTACTTA
TGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCTGAAAGGAATGCGGTTTCGTGGAATACTATAATTTCCGGGTATGTAGACTCTGGA
AATTATGAAGAAGCCTTTAGATTGTTTATTATGATGTGGGAAGAGTTTTCTGATTGTGGTCCTCGCACCTTTGCCACAATGATGCGGGCATCGGCTGGTTTGGAACTAAT
TTTTCCTGGTAGGCAATTGCATTCATGTGCGGTAAAGGCAGGTGTGGGACAGGACATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAG
ATGCTCACTGTGTTTTTGATGAGATGCCCGATAAGACAATAATTGGATGGAATTCAATTATAGCCGGTTACGCCCTCCATGGCTACAGTGAGGAAGCTCTGGATCTATAT
TATGAGATGCGTGACTCTGGAGTTAAATTGGACCATTTCACCTTCTCTATAATTATAAGAATATGCTCGAGATTGGCCTCTGTAGCACGTGCTAAGCAAGCGCATGCTAG
TTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTCGGCATGTGTTTGACAGGATGT
CCTGTAAAAACGTAATATCATGGAATGCTTTGATTGCTGGATATGGGAATCACGGCCGGGGAGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTAGGGAAGGCATGATG
CCAAACCATGTTACATTTCTTGCTGTCTTATCTGCTTGTAGTATTTCAGGTTTGTTCGAACGTGGATGGGAATTTTTTCAAACAATGGCCAGGGATTACAAGATTAAACC
GCGCGCTATGCATTACGCGTGCATGATTGAATTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGAAAAGCTCCATTTCCACCCACAGCAAATATGT
GGGCTGCCTTGCTTAGAGCTTGTAGAGTTCATGAAAATCTAGAACTTGGGAAATTTGCTGCTGAAAAACTTTATGGGATGGTACCCGGTAAGCTTAGTAATTATATTGTG
CTATTAAACATATATAACAGTTCTGGTAAGTTAAAGGAAGCAGCTAATGTTGTTCAGACATTGAAAAGAAAGGGATTGAGAATGCTTCCAGCTTGCAGTTGGATTGAAGT
TAAAAATCAGCCCCATGCCTTCCTGTCTGGGGATAAACATCATTCCCAAATAGACAAAGTTATCGAGAAAGTGGATGAATTAATGTTAGAGATCTCAAAGCTTGGTTATG
TTCCTGAAGAGAACTTCTTGCTTCCAGATGTAGACGAACACGAAGAAAAGGCATGGATGTTCCACAGTGAGAAATTGGCAATAGCTTTTGGACTTATCAATACTTTAGAG
CAAACACCATTGCAAATTGTGCAGAGCCATCGCATTTGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTTTGATAACAAAACGTGAAATTGTGGTCAGAGACGCTAG
CAGATTCCATCATTTCAGAGATGGGAGTTGTTCTTGTGGAGACTATTGGTGA
Protein sequenceShow/hide protein sequence
MSMEVPLSRYQNYVCDRLQCISTSSSSSYFSLRFSDSELVRKRSLLSGYSLWSNRRKLRNSFCWVKCSSLEQGLRPRPEPKPSKIDPDVRKGTSSKETRIRKFSAGICSQ
IEKLVLCKKYRDALEMFEIFELEGGYDVGNSTFDALVSACIGLKSIRGVKRLCNYMIDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSG
NYEEAFRLFIMMWEEFSDCGPRTFATMMRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIIGWNSIIAGYALHGYSEEALDLY
YEMRDSGVKLDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMM
PNHVTFLAVLSACSISGLFERGWEFFQTMARDYKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFPPTANMWAALLRACRVHENLELGKFAAEKLYGMVPGKLSNYIV
LLNIYNSSGKLKEAANVVQTLKRKGLRMLPACSWIEVKNQPHAFLSGDKHHSQIDKVIEKVDELMLEISKLGYVPEENFLLPDVDEHEEKAWMFHSEKLAIAFGLINTLE
QTPLQIVQSHRICGDCHSVIKLIALITKREIVVRDASRFHHFRDGSCSCGDYW