; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G017600 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G017600
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCG_Chr05:29934139..29942758
RNA-Seq ExpressionClCG05G017600
SyntenyClCG05G017600
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000644 - CBS domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044781 - Pentatricopeptide repeat-containing protein At5g10690-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466303.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X3 [Cucumis melo]7.4e-30388.7Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+ +S+ST VFSCSL SRTAT++R RDSPRSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGPTLSAPLIYGLLNAL+EAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA+HFFEEMKE+ADKYDQED+FPDVVTYTTLLK FG+LKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
          L IDRTAYT MIDALVNCGSINGALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLLMEAALND
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQ--------IDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVP
        NQ        IDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVP
Subjt:  NQ--------IDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVP

Query:  IVDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPR
        I+DDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKR+KM++VVRHSKF  Y G SLRA+GVFTIEQLYGF+SPIP+  +PN+PR
Subjt:  IVDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPR

Query:  KT
        KT
Subjt:  KT

XP_008466304.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X4 [Cucumis melo]5.7e-30388.85Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+ +S+ST VFSCSL SRTAT++R RDSPRSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGPTLSAPLIYGLLNAL+EAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA+HFFEEMKE+ADKYDQED+FPDVVTYTTLLK FG+LKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSIN-------GALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLL
          L IDRTAYT MIDALVNCGSIN       GALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLL
Subjt:  QDLLIDRTAYTTMIDALVNCGSIN-------GALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLL

Query:  MEAALNDNQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPI
        MEAALNDNQIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVPI
Subjt:  MEAALNDNQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPI

Query:  VDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRK
        +DDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKR+KM++VVRHSKF  Y G SLRA+GVFTIEQLYGF+SPIP+  +PN+PRK
Subjt:  VDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRK

Query:  T
        T
Subjt:  T

XP_008466305.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X5 [Cucumis melo]4.7e-30589.9Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+ +S+ST VFSCSL SRTAT++R RDSPRSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGPTLSAPLIYGLLNAL+EAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA+HFFEEMKE+ADKYDQED+FPDVVTYTTLLK FG+LKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
          L IDRTAYT MIDALVNCGSINGALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLLMEAALND
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC
        NQIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVPI+DDWGRC
Subjt:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC

Query:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT
        IGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKR+KM++VVRHSKF  Y G SLRA+GVFTIEQLYGF+SPIP+  +PN+PRKT
Subjt:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT

XP_011652532.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucumis sativus]2.8e-30289.73Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+N+S+ST +F CSL SRTAT++ RR SPRSPNLKRLTSRVV LTRRKQLHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGP LSAPLIYGLLNAL+EAGDMRRANGLIARYGFLLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA++FF+EMKE+ADKYDQEDIFPDVVTYTTLLK FG+LKDV LVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
        + L IDRTAYT MIDALVNCGSINGALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLLMEAA ND
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC
        NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVPI+DDWG C
Subjt:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC

Query:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT
        IGLLHREDCTELD PLWKMMRSPPPGVTTT SIGHVANLILQKR+KM++VVRHSK+ TY G SLRA+GVFTIEQLYGFISPIPIQ +PNIP KT
Subjt:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT

XP_023535821.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita pepo subsp. pepo]7.4e-30389.39Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA N SDS  VFSCSL SRTA A+RR DSPRSPNLKRLTSRVVRLTRRKQLHQ+FEEI IAK+RYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYGFLL EGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALA+YNEMLN+ELKPD+LTYNTLISACVKINKLDAA++FFEEMKE+A KY+QEDIFPDVVTYTTLLKGFG+LKDV LVHKIVLEMKS 
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
         DLLIDRTAYT MIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLT MR FSSRGDY MVK LHRRMWLDSSGSISP FQEEADHLLMEAAL+D
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC
        NQIDVA EKLSTIIK+WKGISW+SRGGSVALRIEALLGLTKSFFS PCIFPRVNPGAPIE+VM+PFKAVQPLNG+L LKEVVMRFFDKSVVPI+DDWGRC
Subjt:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC

Query:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT
        IGLLHREDC+EL++PLWKMMRSPPPGVTTTTSIGHV NLIL+KR+KMII+VRHSKF TYD  S RAVGVFTIEQLYGFISPIP+Q QPNIP KT
Subjt:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT

TrEMBL top hitse value%identityAlignment
A0A0A0LHF8 Uncharacterized protein1.4e-30289.73Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+N+S+ST +F CSL SRTAT++ RR SPRSPNLKRLTSRVV LTRRKQLHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGP LSAPLIYGLLNAL+EAGDMRRANGLIARYGFLLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA++FF+EMKE+ADKYDQEDIFPDVVTYTTLLK FG+LKDV LVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
        + L IDRTAYT MIDALVNCGSINGALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLLMEAA ND
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC
        NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVPI+DDWG C
Subjt:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC

Query:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT
        IGLLHREDCTELD PLWKMMRSPPPGVTTT SIGHVANLILQKR+KM++VVRHSK+ TY G SLRA+GVFTIEQLYGFISPIPIQ +PNIP KT
Subjt:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT

A0A1S3CQX2 pentatricopeptide repeat-containing protein At5g10690 isoform X42.8e-30388.85Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+ +S+ST VFSCSL SRTAT++R RDSPRSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGPTLSAPLIYGLLNAL+EAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA+HFFEEMKE+ADKYDQED+FPDVVTYTTLLK FG+LKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSIN-------GALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLL
          L IDRTAYT MIDALVNCGSIN       GALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLL
Subjt:  QDLLIDRTAYTTMIDALVNCGSIN-------GALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLL

Query:  MEAALNDNQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPI
        MEAALNDNQIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVPI
Subjt:  MEAALNDNQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPI

Query:  VDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRK
        +DDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKR+KM++VVRHSKF  Y G SLRA+GVFTIEQLYGF+SPIP+  +PN+PRK
Subjt:  VDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRK

Query:  T
        T
Subjt:  T

A0A1S3CQY3 pentatricopeptide repeat-containing protein At5g10690 isoform X52.3e-30589.9Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+ +S+ST VFSCSL SRTAT++R RDSPRSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGPTLSAPLIYGLLNAL+EAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA+HFFEEMKE+ADKYDQED+FPDVVTYTTLLK FG+LKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
          L IDRTAYT MIDALVNCGSINGALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLLMEAALND
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC
        NQIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVPI+DDWGRC
Subjt:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC

Query:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT
        IGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKR+KM++VVRHSKF  Y G SLRA+GVFTIEQLYGF+SPIP+  +PN+PRKT
Subjt:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT

A0A1S3CS90 pentatricopeptide repeat-containing protein At5g10690 isoform X33.6e-30388.7Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML +ASFSS  LA+ +S+ST VFSCSL SRTAT++R RDSPRSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIGGPTLSAPLIYGLLNAL+EAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLN+ELKPDRLTYNTLISACVKINKLDAA+HFFEEMKE+ADKYDQED+FPDVVTYTTLLK FG+LKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
          L IDRTAYT MIDALVNCGSINGALSLFGELLKLSGWNL+LRPKPHLYLTLMRVFSSRGDY MVK LHRRMWLDSSG+IS  +QEEADHLLMEAALND
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQ--------IDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVP
        NQ        IDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GAPIE+VM+PFKAVQPLNGSL LKEVVMRFFDKSVVP
Subjt:  NQ--------IDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVP

Query:  IVDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPR
        I+DDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKR+KM++VVRHSKF  Y G SLRA+GVFTIEQLYGF+SPIP+  +PN+PR
Subjt:  IVDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPR

Query:  KT
        KT
Subjt:  KT

A0A6J1FEL4 pentatricopeptide repeat-containing protein At5g10690 isoform X11.4e-30289.23Show/hide
Query:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML + SFSS  LA N SDS  VFSCSL SRTA A+RR DSPRSPNLKRLTSRVVRLTRRKQLHQ+FEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCGLD+VSYGTLLKGLGEARKIDEAFQLLESVEEG+AIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYGFLL EGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALA+YNEMLN+ELKPD+LTYNTLISACVKINKLDAA++FFEEMKE+A KY+QEDIFPDVVTYTTLLKGFG+LKDV LVHKIVLEMK+C
Subjt:  MKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSC

Query:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND
         DLLIDRTAYT MIDALVNCGSINGALSLFGELLKLSGW LDLRPKPHLYLT MR FSSRGDY MVK LHRRMWLDSSGSISP FQEEADHLLMEAAL+D
Subjt:  QDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC
        NQIDVA EKLSTIIK+WKGISW+SRGGSVALRIEALLGLTKSFFS PCIFPRVNPGAPIE+VM+PFKAVQPLNG+L LKEVVMRFFDKSVVPI+DDWGRC
Subjt:  NQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRC

Query:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT
        IGLLHREDC+EL+APLWKMMRSPPPGVTTTTSIGHV NLIL+KR+KMII+VR+SKF TYD  S RAVGVFTIEQLYGFISP PIQ QPNIP KT
Subjt:  IGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT

SwissProt top hitse value%identityAlignment
Q8VYD6 Pentatricopeptide repeat-containing protein At5g106905.7e-18158.3Show/hide
Query:  SCS-LSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGT
        SCS + +R    +R     R  NLK LTSR+V LTRR+QL Q+ EE+  AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F+EM++P   G+D +SY T
Subjt:  SCS-LSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGT

Query:  LLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLN
        +LKGLG+AR+IDEAFQ+LE++E G+A G P LS+ LIYGLL+AL+ AGD+RRANGL+ARY  LL + G  S+ +YNLLMKGY++S  PQAA+ + +EML 
Subjt:  LLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLN

Query:  VELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGS
        + L+PDRLTYNTLI AC+K   LDAA+ FF +MKE+A++Y  + + PDVVTYTTL+KGFG   D+  + +I LEMK C+++ IDRTA+T ++DA++ CGS
Subjt:  VELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGS

Query:  INGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALNDNQIDVAIEKLSTIIKKWKGISW
         +GAL +FGE+LK SG N  LRPKPHLYL++MR F+ +GDY MV+ L+ R+W DSSGSIS   Q+EAD+LLMEAALND Q+D A+  L +I+++WK I W
Subjt:  INGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALNDNQIDVAIEKLSTIIKKWKGISW

Query:  ASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRCIGLLHREDCTELDAPLWKMMRS
         + GG  A+R+E LLG +KS   P  +  +V P  PIE++M+ F+A +PL G+L LK V MRFF + VVPIVDD G CIGLLHREDC  LDAPL  MMRS
Subjt:  ASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRCIGLLHREDCTELDAPLWKMMRS

Query:  PPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLY
        PP  V+TTTSIG V +L+L+K+ KM+IVV    F +  G S +AVG FT  QLY
Subjt:  PPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLY

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.6e-1827.55Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLI
        N    N ++      G+ID+AL  F++M +   C  + V+Y TL+ G  + RKID+ F+LL S+    A+ G   +      ++N L   G M+  + ++
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLI

Query:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLK
             + R G +L    YN L+KGY   G    AL M+ EML   L P  +TY +LI +  K   ++ A+ F ++M+ +        + P+  TYTTL+ 
Subjt:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLK

Query:  GFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRM
        GF     ++  ++++ EM            Y  +I+     G +  A+++  E +K  G    L P    Y T++  F    D     R+ R M
Subjt:  GFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRM

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic1.2e-1629.75Show/hide
Query:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLI
        KQ+ + F+E+   +R   + + I  N++L  C   G  + A   F+EM+       D  SY TLL  + +  ++D AF++L  +     +      + +I
Subjt:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLI

Query:  YGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQA
         G      +AG    A  L     +L   G  L    YN L+  Y   G  + AL +  EM +V +K D +TYN L+    K  K D     F EMK   
Subjt:  YGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQA

Query:  DKYDQEDIFPDVVTYTTLLKGF---GLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLK
            +E + P+++TY+TL+ G+   GL K+     +I  E KS   L  D   Y+ +IDAL   G +  A+SL  E+ K
Subjt:  DKYDQEDIFPDVVTYTTLLKGF---GLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLK

Q9SXD8 Pentatricopeptide repeat-containing protein At1g625902.8e-1828.91Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVE-EGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGL
        N +    V+      GD DLAL   N+M +      D V + T++  L + R +D+A  L + +E +G      T S+     L++ L   G    A+ L
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVE-EGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGL

Query:  IARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLL
        ++    ++ +  N ++  +N L+  ++  G    A  +Y++M+   + PD  TYN+L++     ++LD A   FE M         +D FPDVVTY TL+
Subjt:  IARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLL

Query:  KGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELL
        KGF   K V    ++  EM S + L+ D   YTT+I  L + G  + A  +F +++
Subjt:  KGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELL

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic2.1e-1826.43Show/hide
Query:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYG-LLNALVEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL +A  +D+A  L   +        PT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYG-LLNALVEAGDMRRANGLIARYG

Query:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGL
         +L  G   + ++YN+L+ G+  +G   AA A++  M+   ++PD  TY+ L+     + ++D  +H+F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGL

Query:  LKDVHLVHKIVL--EMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRG
         K   L   +VL  EMK+ + +  D   Y ++I  L   G +  A  ++ E+ +       L P    +  L+R +S  G
Subjt:  LKDVHLVHKIVL--EMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRG

Arabidopsis top hitse value%identityAlignment
AT1G62590.1 pentatricopeptide (PPR) repeat-containing protein2.0e-1928.91Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVE-EGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGL
        N +    V+      GD DLAL   N+M +      D V + T++  L + R +D+A  L + +E +G      T S+     L++ L   G    A+ L
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVE-EGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGL

Query:  IARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLL
        ++    ++ +  N ++  +N L+  ++  G    A  +Y++M+   + PD  TYN+L++     ++LD A   FE M         +D FPDVVTY TL+
Subjt:  IARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLL

Query:  KGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELL
        KGF   K V    ++  EM S + L+ D   YTT+I  L + G  + A  +F +++
Subjt:  KGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELL

AT2G31400.1 genomes uncoupled 18.3e-1829.75Show/hide
Query:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLI
        KQ+ + F+E+   +R   + + I  N++L  C   G  + A   F+EM+       D  SY TLL  + +  ++D AF++L  +     +      + +I
Subjt:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLI

Query:  YGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQA
         G      +AG    A  L     +L   G  L    YN L+  Y   G  + AL +  EM +V +K D +TYN L+    K  K D     F EMK   
Subjt:  YGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQA

Query:  DKYDQEDIFPDVVTYTTLLKGF---GLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLK
            +E + P+++TY+TL+ G+   GL K+     +I  E KS   L  D   Y+ +IDAL   G +  A+SL  E+ K
Subjt:  DKYDQEDIFPDVVTYTTLLKGF---GLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLK

AT4G31850.1 proton gradient regulation 31.5e-1926.43Show/hide
Query:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYG-LLNALVEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL +A  +D+A  L   +        PT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYG-LLNALVEAGDMRRANGLIARYG

Query:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGL
         +L  G   + ++YN+L+ G+  +G   AA A++  M+   ++PD  TY+ L+     + ++D  +H+F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGL

Query:  LKDVHLVHKIVL--EMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRG
         K   L   +VL  EMK+ + +  D   Y ++I  L   G +  A  ++ E+ +       L P    +  L+R +S  G
Subjt:  LKDVHLVHKIVL--EMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRG

AT5G10690.1 pentatricopeptide (PPR) repeat-containing protein / CBS domain-containing protein4.1e-18258.3Show/hide
Query:  SCS-LSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGT
        SCS + +R    +R     R  NLK LTSR+V LTRR+QL Q+ EE+  AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F+EM++P   G+D +SY T
Subjt:  SCS-LSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGT

Query:  LLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLN
        +LKGLG+AR+IDEAFQ+LE++E G+A G P LS+ LIYGLL+AL+ AGD+RRANGL+ARY  LL + G  S+ +YNLLMKGY++S  PQAA+ + +EML 
Subjt:  LLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLN

Query:  VELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGS
        + L+PDRLTYNTLI AC+K   LDAA+ FF +MKE+A++Y  + + PDVVTYTTL+KGFG   D+  + +I LEMK C+++ IDRTA+T ++DA++ CGS
Subjt:  VELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGS

Query:  INGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALNDNQIDVAIEKLSTIIKKWKGISW
         +GAL +FGE+LK SG N  LRPKPHLYL++MR F+ +GDY MV+ L+ R+W DSSGSIS   Q+EAD+LLMEAALND Q+D A+  L +I+++WK I W
Subjt:  INGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALNDNQIDVAIEKLSTIIKKWKGISW

Query:  ASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRCIGLLHREDCTELDAPLWKMMRS
         + GG  A+R+E LLG +KS   P  +  +V P  PIE++M+ F+A +PL G+L LK V MRFF + VVPIVDD G CIGLLHREDC  LDAPL  MMRS
Subjt:  ASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRCIGLLHREDCTELDAPLWKMMRS

Query:  PPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLY
        PP  V+TTTSIG V +L+L+K+ KM+IVV    F +  G S +AVG FT  QLY
Subjt:  PPPGVTTTTSIGHVANLILQKRHKMIIVVRHSKFRTYDGLSLRAVGVFTIEQLY

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-1927.55Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLI
        N    N ++      G+ID+AL  F++M +   C  + V+Y TL+ G  + RKID+ F+LL S+    A+ G   +      ++N L   G M+  + ++
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLI

Query:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLK
             + R G +L    YN L+KGY   G    AL M+ EML   L P  +TY +LI +  K   ++ A+ F ++M+ +        + P+  TYTTL+ 
Subjt:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLK

Query:  GFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRM
        GF     ++  ++++ EM            Y  +I+     G +  A+++  E +K  G    L P    Y T++  F    D     R+ R M
Subjt:  GFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTACGAATGGCCTCATTCTCCTCTGGTCCTTTGGCGGTAAATTCATCAGACTCAACGGCCGTATTTTCTTGTTCGCTCTCTTCTCGCACAGCCACCGCGCAACGTCG
GAGAGACTCTCCTCGGAGTCCTAATCTCAAGCGACTGACCTCTCGTGTGGTCAGACTCACCCGTCGCAAACAGCTCCACCAGGTATTTGAGGAAATTGCAATTGCCAAGA
GACGTTATGGAAAACTTAATACAATTGTTATGAATGCCGTCTTGGAGGCTTGTGTTCACTGCGGTGATATTGATTTAGCTCTGAGGACTTTTAATGAAATGTCAAAGCCA
GATAATTGTGGTCTAGACGATGTCAGTTATGGCACGCTATTAAAGGGTTTGGGTGAAGCTCGGAAGATTGATGAAGCATTTCAATTACTTGAATCTGTGGAAGAAGGTTC
CGCTATTGGAGGTCCAACATTGTCAGCACCACTTATTTATGGTCTTCTAAACGCTTTAGTTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATAGCACGATATGGGT
TCTTACTTCGTGAAGGAGGCAATCTCTCTATATCCGTTTACAACTTACTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAAGCTGCTTTAGCAATGTACAATGAGATG
CTAAATGTGGAGTTAAAACCTGATAGGCTCACATATAATACATTAATCTCTGCTTGTGTGAAGATTAACAAACTGGACGCAGCAATACATTTCTTTGAGGAAATGAAGGA
ACAAGCTGATAAGTATGATCAGGAAGATATCTTTCCTGATGTTGTGACGTACACTACTTTACTTAAGGGTTTTGGGCTTCTGAAAGATGTCCATCTTGTTCACAAGATTG
TGCTGGAAATGAAGTCTTGTCAAGATTTATTGATTGATCGAACAGCATACACCACAATGATTGATGCTTTGGTTAACTGTGGCTCTATAAACGGTGCTCTTTCTTTATTT
GGGGAATTATTGAAGCTTTCTGGATGGAATTTGGACTTACGGCCAAAGCCACATCTTTATCTCACTCTTATGAGAGTTTTTTCTAGTAGAGGAGATTATTGGATGGTCAA
ACGTTTGCATAGACGCATGTGGTTGGACTCCTCTGGAAGTATTTCTCCTGTATTTCAAGAAGAAGCAGATCATCTTCTCATGGAGGCAGCTTTAAATGACAATCAGATTG
ATGTGGCAATAGAGAAACTTTCAACAATTATTAAGAAATGGAAGGGAATCTCATGGGCTAGTCGAGGAGGCAGTGTTGCTCTGCGTATTGAAGCATTGCTGGGACTCACC
AAATCCTTTTTTAGTCCTCCTTGCATATTTCCTCGGGTAAATCCAGGTGCACCTATTGAGAATGTTATGTTGCCATTTAAAGCCGTTCAGCCATTAAATGGAAGCTTACA
CTTGAAGGAAGTGGTTATGCGTTTCTTTGACAAATCAGTTGTGCCTATCGTAGACGACTGGGGTAGATGCATTGGACTGCTGCACCGAGAAGACTGTACTGAGTTGGATG
CTCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTGGTGTAACAACTACCACATCCATTGGACATGTTGCGAATCTAATTCTACAAAAGAGGCACAAAATGATTATTGTT
GTAAGACATAGCAAGTTTAGGACATATGATGGCTTGAGTTTGAGGGCTGTCGGCGTTTTTACTATCGAGCAATTGTATGGCTTTATTTCTCCCATTCCGATACAGTCTCA
GCCAAACATTCCACGTAAGACGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTACGAATGGCCTCATTCTCCTCTGGTCCTTTGGCGGTAAATTCATCAGACTCAACGGCCGTATTTTCTTGTTCGCTCTCTTCTCGCACAGCCACCGCGCAACGTCG
GAGAGACTCTCCTCGGAGTCCTAATCTCAAGCGACTGACCTCTCGTGTGGTCAGACTCACCCGTCGCAAACAGCTCCACCAGGTATTTGAGGAAATTGCAATTGCCAAGA
GACGTTATGGAAAACTTAATACAATTGTTATGAATGCCGTCTTGGAGGCTTGTGTTCACTGCGGTGATATTGATTTAGCTCTGAGGACTTTTAATGAAATGTCAAAGCCA
GATAATTGTGGTCTAGACGATGTCAGTTATGGCACGCTATTAAAGGGTTTGGGTGAAGCTCGGAAGATTGATGAAGCATTTCAATTACTTGAATCTGTGGAAGAAGGTTC
CGCTATTGGAGGTCCAACATTGTCAGCACCACTTATTTATGGTCTTCTAAACGCTTTAGTTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATAGCACGATATGGGT
TCTTACTTCGTGAAGGAGGCAATCTCTCTATATCCGTTTACAACTTACTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAAGCTGCTTTAGCAATGTACAATGAGATG
CTAAATGTGGAGTTAAAACCTGATAGGCTCACATATAATACATTAATCTCTGCTTGTGTGAAGATTAACAAACTGGACGCAGCAATACATTTCTTTGAGGAAATGAAGGA
ACAAGCTGATAAGTATGATCAGGAAGATATCTTTCCTGATGTTGTGACGTACACTACTTTACTTAAGGGTTTTGGGCTTCTGAAAGATGTCCATCTTGTTCACAAGATTG
TGCTGGAAATGAAGTCTTGTCAAGATTTATTGATTGATCGAACAGCATACACCACAATGATTGATGCTTTGGTTAACTGTGGCTCTATAAACGGTGCTCTTTCTTTATTT
GGGGAATTATTGAAGCTTTCTGGATGGAATTTGGACTTACGGCCAAAGCCACATCTTTATCTCACTCTTATGAGAGTTTTTTCTAGTAGAGGAGATTATTGGATGGTCAA
ACGTTTGCATAGACGCATGTGGTTGGACTCCTCTGGAAGTATTTCTCCTGTATTTCAAGAAGAAGCAGATCATCTTCTCATGGAGGCAGCTTTAAATGACAATCAGATTG
ATGTGGCAATAGAGAAACTTTCAACAATTATTAAGAAATGGAAGGGAATCTCATGGGCTAGTCGAGGAGGCAGTGTTGCTCTGCGTATTGAAGCATTGCTGGGACTCACC
AAATCCTTTTTTAGTCCTCCTTGCATATTTCCTCGGGTAAATCCAGGTGCACCTATTGAGAATGTTATGTTGCCATTTAAAGCCGTTCAGCCATTAAATGGAAGCTTACA
CTTGAAGGAAGTGGTTATGCGTTTCTTTGACAAATCAGTTGTGCCTATCGTAGACGACTGGGGTAGATGCATTGGACTGCTGCACCGAGAAGACTGTACTGAGTTGGATG
CTCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTGGTGTAACAACTACCACATCCATTGGACATGTTGCGAATCTAATTCTACAAAAGAGGCACAAAATGATTATTGTT
GTAAGACATAGCAAGTTTAGGACATATGATGGCTTGAGTTTGAGGGCTGTCGGCGTTTTTACTATCGAGCAATTGTATGGCTTTATTTCTCCCATTCCGATACAGTCTCA
GCCAAACATTCCACGTAAGACGTAA
Protein sequenceShow/hide protein sequence
MLRMASFSSGPLAVNSSDSTAVFSCSLSSRTATAQRRRDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKP
DNCGLDDVSYGTLLKGLGEARKIDEAFQLLESVEEGSAIGGPTLSAPLIYGLLNALVEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEM
LNVELKPDRLTYNTLISACVKINKLDAAIHFFEEMKEQADKYDQEDIFPDVVTYTTLLKGFGLLKDVHLVHKIVLEMKSCQDLLIDRTAYTTMIDALVNCGSINGALSLF
GELLKLSGWNLDLRPKPHLYLTLMRVFSSRGDYWMVKRLHRRMWLDSSGSISPVFQEEADHLLMEAALNDNQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLT
KSFFSPPCIFPRVNPGAPIENVMLPFKAVQPLNGSLHLKEVVMRFFDKSVVPIVDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRHKMIIV
VRHSKFRTYDGLSLRAVGVFTIEQLYGFISPIPIQSQPNIPRKT