; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G008250 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G008250
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
Genome locationchr02:7682654..7685293
RNA-Seq ExpressionLsi02G008250
SyntenyLsi02G008250
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057054.1 pentatricopeptide repeat-containing protein DOT4 [Cucumis melo var. makuwa]0.0e+0090.44Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN
        M+LL AK P TFW SP  AG+DHR SV+LKFRQSF+F KPNSK SFS+ AYA      P +ETKSY+DVELD+SRKIVEFCEVGDLKNAMELLC SQNSN
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN

Query:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI
         DLD +CSILQLCAE+KSIRDGRRVHS+IES+GV+IDGILGVKLVFMYVKCGDLKEGRM+FD+LSE KVF+WNLMISEY G GNYGESINLFKQMLELGI
Subjt:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI

Query:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS
        KPNSYTFSSVLKCFAA+A VEEGRQVHGLI KLG+ SYNTVVNSLISFYFVGRKVR AQKLFDELTDRDVISWNSMISGYVKNGL+DRGIEIFIKMLV  
Subjt:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS

Query:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
        V+IDLATMVNVLVACAN G+LL GK LHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
Subjt:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS

Query:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH
        RGV+PDVYAVTSILHACA NGNL SG+IVH+YIRENNLE N FVSNALTDMYAKCGSMKDAHDVFSHMK+KDVISWNTMIGGYSKNRLPNEAL LFAE+ 
Subjt:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH

Query:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR
         ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYS+DKYVVNAL+DMYVKCGLLVLARS FDMILNKDLVSWTVMIAGYGMHGFGSEA++TFNQMR
Subjt:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR

Query:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
        + GI+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
Subjt:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE

Query:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD
        RIFELEPENTGYYVLLANIYAEAE+WEEVQKLRK+IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKT YALLNAD
Subjt:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD

Query:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        EREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKS SREI+LRDSSRFHHFKDG+CSCRG+W
Subjt:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

KAG7019566.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.01Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPT-LETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS
        M LLVAK P TFW S   AG DHR  V+LKFRQS  FVKPNS+ SFSNSA+A TESYTPT LE K+YIDVEL+NSRKIV+FCEVGDLKNA+ELLC SQNS
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPT-LETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS

Query:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG
        NLDLDTYC ILQLCAEQKSIRDGRRVHS+IESN V+IDGILG KL+FMYVKCGDL+EGRM+FD+LSE KVFLWNLMISEYSG GNYGESINLFK+MLELG
Subjt:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG

Query:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL
        I PNSYTFSSVLKCFAA+ARVEEG QVHGLICKLGFTSYN VVNSLISFYFVGRKVRSA+KLFDE++DRDVISWNSMISGYVKNGLEDRGIEIF++MLV 
Subjt:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL

Query:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIK-AAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEM
        SVD+DLATMVNVLVACANMG+L LGK LHSYSIK AAALDR+V FNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMI GYVREGLSDGAI+LF+EM
Subjt:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIK-AAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEM

Query:  KSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE
        KSRGVLPDVYAV SILHACATNGNL+SGK +HNYIRENNLE N FVSNAL DMYAKCGSM+DA DVFSHMKRKDVISWNTMIGGYSKNRLPNEAL+LFAE
Subjt:  KSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE

Query:  IHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQ
        + RESKPDGTTVACILPACASLAALD+GREIHGYALRNGYSKDK+V NALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHG+GSEAV  FNQ
Subjt:  IHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQ

Query:  MRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV
        MRIAGIEPDEVSFISILYACSHSGLLDEGW FFNIMKKECQIEPNLEHYACMVDLLARTGNL +AHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV
Subjt:  MRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV

Query:  AERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLN
        AERIFELEPENTGYYVLLANIYAEAE+WEEVQKLR +IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL RLR+KMKEEGYSPKTRYALLN
Subjt:  AERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLN

Query:  ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        ADEREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMA+FMSK   REIVLRDSSRFHHFKDG CSCRGYW
Subjt:  ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

XP_008443463.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis melo]0.0e+0090.33Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN
        M+LL AK P TFW SP  AG+DHR SV+LKFRQSF+F KPNSK SFS+ AYA      P +ETKSY+DVELD+SRKIVEFCEVGDLKNAMELLC SQNSN
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN

Query:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI
         DLD +CSILQLCAE+KSIRDGRRVHS+IES+GV+IDGILGVKLVFMYVKCGDLKEGRM+FD+LSE KVF+WNLMISEY G GNYGESINLFKQMLELGI
Subjt:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI

Query:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS
        KPNSYTFSSVLKCFAA+A VEEGRQVHGLI KLG+ SYNTVVNSLISFYFVGRKVR AQKLFDELTDRDVISWNSMISGYVKNGL+DRGIEIFIKMLV  
Subjt:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS

Query:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
        V+IDLATMVNVLVACAN G+LL GK LHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
Subjt:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS

Query:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH
        RGV+PDVYAVTSILHACA NGNL SG+IVH+YIRENNLE N FVSNALTDMYAKCGSMKDAHDVFSHMK+KDVISWNTMIGGYSKNRLPNEAL LFAE+ 
Subjt:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH

Query:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR
         ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYS+DKYVVNAL+DMYVKCGLLVLARS FDMILNKDLVSWTVMIAGYGMHGFGSEA++TFNQMR
Subjt:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR

Query:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
        + GI+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
Subjt:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE

Query:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD
        RIFELEPENTGYYVLLANIYAEAE+WEEVQKLRK+IGQRGLKKNPGC+WIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKT YALLNAD
Subjt:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD

Query:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        EREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKS SREI+LRDSSRFHHFKDG+CSCRG+W
Subjt:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

XP_011657608.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus]0.0e+0090.9Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN
        M+LL AK P TFW SP  AGNDHR SV+LKFRQSF+FV P+SK SFS+ AYA      P LETKSY+DVELD+SRKIVEFCEVGDLKNAMELLC SQNSN
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN

Query:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI
         DL  YCSILQLCAE+KSIRDGRRV S+IES+GVMIDGILGVKLVFMYVKCGDLKEGRMVFD+LSE K+FLWNLMISEYSG GNYGESINLFKQMLELGI
Subjt:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI

Query:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS
        KPNSYTFSS+LKCFAA+ARVEEGRQVHGLICKLGF SYNTVVNSLISFYFVGRKVR AQKLFDELTDRDVISWNSMISGYVKNGL+DRGIEIFIKMLV  
Subjt:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS

Query:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
        VDIDLATMVNV VACAN+G+LLLGK LHSYSIKAA LDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
Subjt:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS

Query:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH
        RGV+PDVYAVTSIL+ACA NGNL SGKIVH+YIRENNLE N FVSNALTDMYAKCGSMKDAHDVFSHMK+KDVISWNTMIGGY+KN LPNEAL LFAE+ 
Subjt:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH

Query:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR
        RESKPDGTTVACILPACASLAALD+GREIHGYALRNGYS+DKYV NA+VDMYVKCGLLVLARSLFDMI NKDLVSWTVMIAGYGMHG+GSEA++TFNQMR
Subjt:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR

Query:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
        + GIEPDEVSFISILYACSHSGLLDEGWK FNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFI+ MPIKPDATIWGALLCGCRIHHDVKLAEKVAE
Subjt:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE

Query:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD
        RIFELEPENTGYYVLLANIYAEAE+WEEVQKLRKKIGQRGLKKNPGCSWIEIKGK+NIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKT YALLNAD
Subjt:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD

Query:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        EREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREI+LRDSSRFHHFKDG+CSCRGYW
Subjt:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

XP_038893908.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Benincasa hispida]0.0e+0092.95Show/hide
Query:  LLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLET--KSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS
        +LLVAKPPTTFW SP   G DHR  +SLKFRQSFVFVKPNSKFSFSNSA+ACTE YTP LET  KSYIDVELDNS KIVEFCE+GDLKNAMELLCGSQNS
Subjt:  LLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLET--KSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS

Query:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG
          DLDTYCSILQLCAEQKSIRDGRRVHS+IESNGVMIDGILGVKLVFMYVKCGDLKEGR++FD+LSE+KVFLWNLMISEYSG GNYGESINLFKQMLELG
Subjt:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG

Query:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL
        IKPNSYTFSSVLKC AA+ARVEEGRQVHGLICKLGF SYNTVVNSLISFYFV RKVR AQKLFDELTDRDVISWNSMISGYVKNGLED+GIEIFIKML  
Subjt:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL

Query:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMK
        S+D DLATMVNVLVACANMG+LLLGKALHSY+IKAAAL++EV FNNTLLDMYSKCG LNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMK
Subjt:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMK

Query:  SRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEI
        S+G+LPDVYAVTSILHACA NGNL+SGKIVHNYIREN LE N FVSNAL DMYAK GSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE+
Subjt:  SRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEI

Query:  HRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQM
         RE KPD TTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMI NKDLVSWTVMIAGYGMHGFGSEA++TFNQM
Subjt:  HRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQM

Query:  RIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVA
        RIAGIEPDEVSFISILYACSHSGLLDEGWKF+NIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVA
Subjt:  RIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVA

Query:  ERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNA
        E+IFELEPENTGYYVLLANIYAEAE+WEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNA
Subjt:  ERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNA

Query:  DEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        DEREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREI+LRDSSRFH+FKDGNCSCRGYW
Subjt:  DEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

TrEMBL top hitse value%identityAlignment
A0A0A0LV30 DYW_deaminase domain-containing protein0.0e+0090.9Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN
        M+LL AK P TFW SP  AGNDHR SV+LKFRQSF+FV P+SK SFS+ AYA      P LETKSY+DVELD+SRKIVEFCEVGDLKNAMELLC SQNSN
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN

Query:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI
         DL  YCSILQLCAE+KSIRDGRRV S+IES+GVMIDGILGVKLVFMYVKCGDLKEGRMVFD+LSE K+FLWNLMISEYSG GNYGESINLFKQMLELGI
Subjt:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI

Query:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS
        KPNSYTFSS+LKCFAA+ARVEEGRQVHGLICKLGF SYNTVVNSLISFYFVGRKVR AQKLFDELTDRDVISWNSMISGYVKNGL+DRGIEIFIKMLV  
Subjt:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS

Query:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
        VDIDLATMVNV VACAN+G+LLLGK LHSYSIKAA LDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
Subjt:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS

Query:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH
        RGV+PDVYAVTSIL+ACA NGNL SGKIVH+YIRENNLE N FVSNALTDMYAKCGSMKDAHDVFSHMK+KDVISWNTMIGGY+KN LPNEAL LFAE+ 
Subjt:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH

Query:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR
        RESKPDGTTVACILPACASLAALD+GREIHGYALRNGYS+DKYV NA+VDMYVKCGLLVLARSLFDMI NKDLVSWTVMIAGYGMHG+GSEA++TFNQMR
Subjt:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR

Query:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
        + GIEPDEVSFISILYACSHSGLLDEGWK FNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFI+ MPIKPDATIWGALLCGCRIHHDVKLAEKVAE
Subjt:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE

Query:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD
        RIFELEPENTGYYVLLANIYAEAE+WEEVQKLRKKIGQRGLKKNPGCSWIEIKGK+NIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKT YALLNAD
Subjt:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD

Query:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        EREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREI+LRDSSRFHHFKDG+CSCRGYW
Subjt:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

A0A1S3B857 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0090.33Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN
        M+LL AK P TFW SP  AG+DHR SV+LKFRQSF+F KPNSK SFS+ AYA      P +ETKSY+DVELD+SRKIVEFCEVGDLKNAMELLC SQNSN
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN

Query:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI
         DLD +CSILQLCAE+KSIRDGRRVHS+IES+GV+IDGILGVKLVFMYVKCGDLKEGRM+FD+LSE KVF+WNLMISEY G GNYGESINLFKQMLELGI
Subjt:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI

Query:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS
        KPNSYTFSSVLKCFAA+A VEEGRQVHGLI KLG+ SYNTVVNSLISFYFVGRKVR AQKLFDELTDRDVISWNSMISGYVKNGL+DRGIEIFIKMLV  
Subjt:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS

Query:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
        V+IDLATMVNVLVACAN G+LL GK LHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
Subjt:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS

Query:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH
        RGV+PDVYAVTSILHACA NGNL SG+IVH+YIRENNLE N FVSNALTDMYAKCGSMKDAHDVFSHMK+KDVISWNTMIGGYSKNRLPNEAL LFAE+ 
Subjt:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH

Query:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR
         ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYS+DKYVVNAL+DMYVKCGLLVLARS FDMILNKDLVSWTVMIAGYGMHGFGSEA++TFNQMR
Subjt:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR

Query:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
        + GI+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
Subjt:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE

Query:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD
        RIFELEPENTGYYVLLANIYAEAE+WEEVQKLRK+IGQRGLKKNPGC+WIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKT YALLNAD
Subjt:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD

Query:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        EREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKS SREI+LRDSSRFHHFKDG+CSCRG+W
Subjt:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

A0A5A7UPC9 Pentatricopeptide repeat-containing protein DOT40.0e+0090.44Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN
        M+LL AK P TFW SP  AG+DHR SV+LKFRQSF+F KPNSK SFS+ AYA      P +ETKSY+DVELD+SRKIVEFCEVGDLKNAMELLC SQNSN
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSN

Query:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI
         DLD +CSILQLCAE+KSIRDGRRVHS+IES+GV+IDGILGVKLVFMYVKCGDLKEGRM+FD+LSE KVF+WNLMISEY G GNYGESINLFKQMLELGI
Subjt:  LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGI

Query:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS
        KPNSYTFSSVLKCFAA+A VEEGRQVHGLI KLG+ SYNTVVNSLISFYFVGRKVR AQKLFDELTDRDVISWNSMISGYVKNGL+DRGIEIFIKMLV  
Subjt:  KPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLS

Query:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
        V+IDLATMVNVLVACAN G+LL GK LHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS
Subjt:  VDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKS

Query:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH
        RGV+PDVYAVTSILHACA NGNL SG+IVH+YIRENNLE N FVSNALTDMYAKCGSMKDAHDVFSHMK+KDVISWNTMIGGYSKNRLPNEAL LFAE+ 
Subjt:  RGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH

Query:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR
         ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYS+DKYVVNAL+DMYVKCGLLVLARS FDMILNKDLVSWTVMIAGYGMHGFGSEA++TFNQMR
Subjt:  RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMR

Query:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
        + GI+PDEVSFISILYACSHSGLLDEGWK F+IMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKVAE
Subjt:  IAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAE

Query:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD
        RIFELEPENTGYYVLLANIYAEAE+WEEVQKLRK+IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKT YALLNAD
Subjt:  RIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNAD

Query:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        EREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKS SREI+LRDSSRFHHFKDG+CSCRG+W
Subjt:  EREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

A0A6J1EHU9 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0090.01Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPT-LETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS
        M LLVAK P TFW S   AG DHR  V+LKFRQS  FVKPNS+ SFSNSA+A TESYTPT LE K+YIDVEL+NSRKIV+FCEVGDLKNA+ELLC SQNS
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPT-LETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS

Query:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG
        NLDLDTYC ILQLCAEQKSIRDGRRVHS+IESN V+IDGILG KLVFMYVKCGDL+EGRM+FD+LSE KVFLWNLMISEYSG GNYGESINLFK+MLELG
Subjt:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG

Query:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL
        I PNSYTFSSVLKCFAA+ARVEEG QVHGLICKLGFTSYN VVNSLISFYFVGRKVRSA+KLFDE++DRDVISWNSMISGYVKNGLEDRGIEIF++MLV 
Subjt:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL

Query:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIK-AAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEM
        SVD+DLATMVNVLVACANMG+L LGK LHSYSIK AAALDR+V FNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTS+I GYVREGLSDGAI+LF+EM
Subjt:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIK-AAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEM

Query:  KSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE
        KSRGVLPDVYAV SILHACATNGNL+SGK +HNYIRENNLE N FVSNAL DMYAKCGSM+DA DVFSHMKRKDVISWNTMIGGYSKNRLPNEAL+LFAE
Subjt:  KSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE

Query:  IHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQ
        + RESKPDGTTVACILPACASLAALD+GREIHGYALRNGYSKDK+V NALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHG+GSEAV  FNQ
Subjt:  IHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQ

Query:  MRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV
        MRIAGIEPDEVSFISILYACSHSGLLDEGW FFNIMKKECQIEPNLEHYACMVDLLARTGNL +AHKFI+TMPIKPDATIWGALLCGCRIHHDVKLAEKV
Subjt:  MRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV

Query:  AERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLN
        AERIFELEPENTGYYVLLANIYAEAE+WEEVQKLR +IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL RLRSKMKEEGYSPKTRYALLN
Subjt:  AERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLN

Query:  ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        ADEREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMA+FMSK   REIVLRDSSRFHHFKDG CSCRGYW
Subjt:  ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

A0A6J1KNK7 pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0089.2Show/hide
Query:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTP-TLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS
        M LLVAK P TFW S   AG DHR  V+LKFRQS  FVKPNS+ SFSNSAYA TESYTP  LE K+YID EL+NSRKIV+FCEVGDLKNA+ELLC SQNS
Subjt:  MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTP-TLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNS

Query:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG
        NLDLDTYC ILQLCAEQKSIRDGRRVHS+IESN V+IDGILG KLVFMYVKCGDL+EGRM+FD+LSE KVFLWNLMISEYSG GNYGESINLFK+MLELG
Subjt:  NLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELG

Query:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL
        I PNSYTFSSVLKCFAA+ RVEEGRQVHGLICKLGFTSYN VVNSLISFYFVGRKVRSA+KLFDE++DRDVISWNSMISGYVKNGLEDRGIEIF++MLV 
Subjt:  IKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVL

Query:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIK-AAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEM
        SVD+DLATMVNVLVACANMG+L LGK LHSYSIK AAALDR+V FNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMI GYVREGLSDGAI+LF+EM
Subjt:  SVDIDLATMVNVLVACANMGSLLLGKALHSYSIK-AAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEM

Query:  KSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE
        KSRGVLPDVYAV SILHACA NGNL+SGK +HNYI+ENNLE N FVSNAL DMYAKCGSMKDA DVFSH+KRKDVISWNTMIGGYSKNRLPNEAL+LFAE
Subjt:  KSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE

Query:  IHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQ
        + RESKPDGTTVACILPACASLAALD+GREIHGYALRNGYS+DK+V NALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHG+GSEAV+ FNQ
Subjt:  IHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQ

Query:  MRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV
        MRIAGIEPDEVSFISILYACSHSGLLDEGW FF IMKKECQIEP LEHYACMVDLLARTGNL +AHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV
Subjt:  MRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKV

Query:  AERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLN
        AERIFELEPENTGYYVL+ANIYAEAE+WEEVQKLR +IG+ GLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL RLRSKMKEEGYSPKTRYALLN
Subjt:  AERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLN

Query:  ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGY
        ADEREKEVALCGH EKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMA+FMSK   REIVLRDSSRFHHFKDG CSCRGY
Subjt:  ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGY

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.6e-16938.86Show/hide
Query:  ILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFS
        +L+ C+   S+++ R++  ++  NG+  +     KLV ++ + G + E   VF+ +      L++ M+  ++   +  +++  F +M    ++P  Y F+
Subjt:  ILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFS

Query:  SVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATM
         +LK     A +  G+++HGL+ K GF+     +  L + Y   R+V  A+K+FD + +RD++SWN++++GY +NG+    +E+   M   ++     T+
Subjt:  SVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATM

Query:  VNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVY
        V+VL A + +  + +GK +H Y+++ +  D  V  +  L+DMY+KCG L +A ++F+ M E+ VVSW SMI  YV+      A+ +F +M   GV P   
Subjt:  VNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVY

Query:  AVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH-RESKPDG
        +V   LHACA  G+L  G+ +H    E  L+ N  V N+L  MY KC  +  A  +F  ++ + ++SWN MI G+++N  P +ALN F+++  R  KPD 
Subjt:  AVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH-RESKPDG

Query:  TTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPD
         T   ++ A A L+     + IHG  +R+   K+ +V  ALVDMY KCG +++AR +FDM+  + + +W  MI GYG HGFG  A++ F +M+   I+P+
Subjt:  TTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPD

Query:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP
         V+F+S++ ACSHSGL++ G K F +MK+   IE +++HY  MVDLL R G L +A  FI  MP+KP   ++GA+L  C+IH +V  AEK AER+FEL P
Subjt:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP

Query:  ENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVA
        ++ GY+VLLANIY  A  WE+V ++R  + ++GL+K PGCS +EIK +V+ F +G  + P +KKI   L++L   +KE GY P T   +L  +   KE  
Subjt:  ENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVA

Query:  LCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        L  H EKLA++FG+LN   G TI V KNLRVC DCH   K++S    REIV+RD  RFHHFK+G CSC  YW
Subjt:  LCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

Q9LFL5 Pentatricopeptide repeat-containing protein At5g168601.2e-16940Show/hide
Query:  KSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQL--SEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCF
        K+I   + +H  + S G++   +    L+  Y+  G L     +  +   S+  V+ WN +I  Y   G   + + LF  M  L   P++YTF  V K  
Subjt:  KSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQL--SEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCF

Query:  AAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKML-VLSVDIDLATMVNVLV
          I+ V  G   H L    GF S   V N+L++ Y   R +  A+K+FDE++  DV+SWNS+I  Y K G     +E+F +M        D  T+VNVL 
Subjt:  AAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKML-VLSVDIDLATMVNVLV

Query:  ACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMK--------------
         CA++G+  LGK LH +++ +  + + +   N L+DMY+KCG ++ A  VF  M  K VVSW +M+ GY + G  + A++LF++M+              
Subjt:  ACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMK--------------

Query:  ---------------------SRGVLPDVYAVTSILHACATNGNLSSGKIVHNY-------IRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHM--K
                             S G+ P+   + S+L  CA+ G L  GK +H Y       +R+N       V N L DMYAKC  +  A  +F  +  K
Subjt:  ---------------------SRGVLPDVYAVTSILHACATNGNLSSGKIVHNY-------IRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHM--K

Query:  RKDVISWNTMIGGYSKNRLPNEALNLFAEIHRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGYSK-DKYVVNALVDMYVKCGLLVLARSLF
         +DV++W  MIGGYS++   N+AL L +E+  E   ++P+  T++C L ACASLAAL  G++IH YALRN  +    +V N L+DMY KCG +  AR +F
Subjt:  RKDVISWNTMIGGYSKNRLPNEALNLFAEIHRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGYSK-DKYVVNALVDMYVKCGLLVLARSLF

Query:  DMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHK
        D ++ K+ V+WT ++ GYGMHG+G EA+  F++MR  G + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A +
Subjt:  DMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHK

Query:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGKVNIFVAGDC
         IE MP++P   +W A L  CRIH  V+L E  AE+I EL   + G Y LL+N+YA A RW++V ++R  +  +G+KK PGCSW+E IKG    FV GD 
Subjt:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGKVNIFVAGDC

Query:  SKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSR
        + P AK+I  +L     ++K+ GY P+T +AL + D+ EK+  L  H EKLA+A+G+L  P G  IR+TKNLRVCGDCH    +MS+    +I+LRDSSR
Subjt:  SKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSR

Query:  FHHFKDGNCSCRGYW
        FHHFK+G+CSC+GYW
Subjt:  FHHFKDGNCSCRGYW

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic3.0e-17840.1Show/hide
Query:  CEVGDLKNAMELLCGSQNSN--LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISE
        C  G L+ AM+LL   Q     +D D + ++++LC  +++  +G +V+S+  S+   +   LG   + M+V+ G+L +   VF ++SE  +F WN+++  
Subjt:  CEVGDLKNAMELLCGSQNSN--LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISE

Query:  YSGGGNYGESINLFKQMLEL-GIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI
        Y+  G + E++ L+ +ML + G+KP+ YTF  VL+    I  +  G++VH  + + G+     VVN+LI+ Y     V+SA+ LFD +  RD+ISWN+MI
Subjt:  YSGGGNYGESINLFKQMLEL-GIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI

Query:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTS
        SGY +NG+   G+E+F  M  LSVD DL T+ +V+ AC  +G   LG+ +H+Y I       ++   N+L  MY   G    A ++F RM+ K +VSWT+
Subjt:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTS

Query:  MITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWN
        MI+GY    L D AI  +  M    V PD   V ++L ACAT G+L +G  +H    +  L     V+N L +MY+KC  +  A D+F ++ RK+VISW 
Subjt:  MITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWN

Query:  TMIGGYSKNRLPNEALNLFAEIHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWT
        ++I G   N    EAL    ++    +P+  T+   L ACA + AL  G+EIH + LR G   D ++ NAL+DMYV+CG +  A S F+    KD+ SW 
Subjt:  TMIGGYSKNRLPNEALNLFAEIHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWT

Query:  VMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDAT
        +++ GY   G GS  V+ F++M  + + PDE++FIS+L  CS S ++ +G  +F+ M ++  + PNL+HYAC+VDLL R G L +AHKFI+ MP+ PD  
Subjt:  VMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDAT

Query:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLK
        +WGALL  CRIHH + L E  A+ IFEL+ ++ GYY+LL N+YA+  +W EV K+R+ + + GL  + GCSW+E+KGKV+ F++ D   PQ K+I  +L+
Subjt:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLK

Query:  RLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSC
            KM E G +  +  + ++  E  ++   CGH E+ A+AFG++N  PG  I VTKNL +C +CH+  KF+SK+  REI +RD+  FHHFKDG CSC
Subjt:  RLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSC

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic0.0e+0062.84Show/hide
Query:  DNSRKIVEFCEVGDLKNAMELLCGSQNSNLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFL
        D + ++  FCE G+L+NA++LLC S   ++D  T CS+LQLCA+ KS++DG+ V + I  NG +ID  LG KL  MY  CGDLKE   VFD++  +K   
Subjt:  DNSRKIVEFCEVGDLKNAMELLCGSQNSNLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFL

Query:  WNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVI
        WN++++E +  G++  SI LFK+M+  G++ +SYTFS V K F+++  V  G Q+HG I K GF   N+V NSL++FY   ++V SA+K+FDE+T+RDVI
Subjt:  WNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVI

Query:  SWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKT
        SWNS+I+GYV NGL ++G+ +F++MLV  ++IDLAT+V+V   CA+   + LG+A+HS  +K A   RE RF NTLLDMYSKCGDL+SA  VF  M +++
Subjt:  SWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKT

Query:  VVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRK
        VVS+TSMI GY REGL+  A+KLF+EM+  G+ PDVY VT++L+ CA    L  GK VH +I+EN+L  + FVSNAL DMYAKCGSM++A  VFS M+ K
Subjt:  VVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRK

Query:  DVISWNTMIGGYSKNRLPNEALNLFAEIHRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMIL
        D+ISWNT+IGGYSKN   NEAL+LF  +  E +  PD  TVAC+LPACASL+A D+GREIHGY +RNGY  D++V N+LVDMY KCG L+LA  LFD I 
Subjt:  DVISWNTMIGGYSKNRLPNEALNLFAEIHRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMIL

Query:  NKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIET
        +KDLVSWTVMIAGYGMHGFG EA+  FNQMR AGIE DE+SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+LARTG+L+KA++FIE 
Subjt:  NKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIET

Query:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQA
        MPI PDATIWGALLCGCRIHHDVKLAEKVAE++FELEPENTGYYVL+ANIYAEAE+WE+V++LRK+IGQRGL+KNPGCSWIEIKG+VNIFVAGD S P+ 
Subjt:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQA

Query:  KKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFK
        + IE  L+++R++M EEGYSP T+YAL++A+E EKE ALCGH EKLAMA G+++   GK IRVTKNLRVCGDCHEMAKFMSK   REIVLRDS+RFH FK
Subjt:  KKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFK

Query:  DGNCSCRGYW
        DG+CSCRG+W
Subjt:  DGNCSCRGYW

Q9SS60 Pentatricopeptide repeat-containing protein At3g035801.2e-17140.75Show/hide
Query:  FCEVGDLKNAMELLCGSQNSNLDLD--TYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMIS
        F + G    A+E     + S +  D  T+ S+++ CA       G  V+  I   G   D  +G  LV MY + G L   R VFD++    +  WN +IS
Subjt:  FCEVGDLKNAMELLCGSQNSNLDLD--TYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMIS

Query:  EYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI
         YS  G Y E++ ++ ++    I P+S+T SSVL  F  +  V++G+ +HG   K G  S   V N L++ Y   R+   A+++FDE+  RD +S+N+MI
Subjt:  EYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI

Query:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAA-ALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWT
         GY+K  + +  + +F++ L      DL T+ +VL AC ++  L L K +++Y +KA   L+  VR  N L+D+Y+KCGD+ +A  VF  M+ K  VSW 
Subjt:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAA-ALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWT

Query:  SMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISW
        S+I+GY++ G    A+KLF  M       D      ++       +L  GK +H+   ++ + I+  VSNAL DMYAKCG + D+  +FS M   D ++W
Subjt:  SMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISW

Query:  NTMIGGYSKNRLPNEALNLFAEIHR-ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVS
        NT+I    +       L +  ++ + E  PD  T    LP CASLAA   G+EIH   LR GY  +  + NAL++MY KCG L  +  +F+ +  +D+V+
Subjt:  NTMIGGYSKNRLPNEALNLFAEIHR-ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVS

Query:  WTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPD
        WT MI  YGM+G G +A++TF  M  +GI PD V FI+I+YACSHSGL+DEG   F  MK   +I+P +EHYAC+VDLL+R+  + KA +FI+ MPIKPD
Subjt:  WTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPD

Query:  ATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELL
        A+IW ++L  CR   D++ AE+V+ RI EL P++ GY +L +N YA   +W++V  +RK +  + + KNPG SWIE+   V++F +GD S PQ++ I   
Subjt:  ATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELL

Query:  LKRLRSKMKEEGYSPKTRYALLN-ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCS
        L+ L S M +EGY P  R    N  +E EK   +CGH E+LA+AFG+LN  PG  ++V KNLRVCGDCHE+ K +SK   REI++RD++RFH FKDG CS
Subjt:  LKRLRSKMKEEGYSPKTRYALLN-ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCS

Query:  CRGYW
        C+  W
Subjt:  CRGYW

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-17038.86Show/hide
Query:  ILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFS
        +L+ C+   S+++ R++  ++  NG+  +     KLV ++ + G + E   VF+ +      L++ M+  ++   +  +++  F +M    ++P  Y F+
Subjt:  ILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFS

Query:  SVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATM
         +LK     A +  G+++HGL+ K GF+     +  L + Y   R+V  A+K+FD + +RD++SWN++++GY +NG+    +E+   M   ++     T+
Subjt:  SVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATM

Query:  VNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVY
        V+VL A + +  + +GK +H Y+++ +  D  V  +  L+DMY+KCG L +A ++F+ M E+ VVSW SMI  YV+      A+ +F +M   GV P   
Subjt:  VNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVY

Query:  AVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH-RESKPDG
        +V   LHACA  G+L  G+ +H    E  L+ N  V N+L  MY KC  +  A  +F  ++ + ++SWN MI G+++N  P +ALN F+++  R  KPD 
Subjt:  AVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIH-RESKPDG

Query:  TTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPD
         T   ++ A A L+     + IHG  +R+   K+ +V  ALVDMY KCG +++AR +FDM+  + + +W  MI GYG HGFG  A++ F +M+   I+P+
Subjt:  TTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPD

Query:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP
         V+F+S++ ACSHSGL++ G K F +MK+   IE +++HY  MVDLL R G L +A  FI  MP+KP   ++GA+L  C+IH +V  AEK AER+FEL P
Subjt:  EVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEP

Query:  ENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVA
        ++ GY+VLLANIY  A  WE+V ++R  + ++GL+K PGCS +EIK +V+ F +G  + P +KKI   L++L   +KE GY P T   +L  +   KE  
Subjt:  ENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVA

Query:  LCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW
        L  H EKLA++FG+LN   G TI V KNLRVC DCH   K++S    REIV+RD  RFHHFK+G CSC  YW
Subjt:  LCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-17940.1Show/hide
Query:  CEVGDLKNAMELLCGSQNSN--LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISE
        C  G L+ AM+LL   Q     +D D + ++++LC  +++  +G +V+S+  S+   +   LG   + M+V+ G+L +   VF ++SE  +F WN+++  
Subjt:  CEVGDLKNAMELLCGSQNSN--LDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISE

Query:  YSGGGNYGESINLFKQMLEL-GIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI
        Y+  G + E++ L+ +ML + G+KP+ YTF  VL+    I  +  G++VH  + + G+     VVN+LI+ Y     V+SA+ LFD +  RD+ISWN+MI
Subjt:  YSGGGNYGESINLFKQMLEL-GIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI

Query:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTS
        SGY +NG+   G+E+F  M  LSVD DL T+ +V+ AC  +G   LG+ +H+Y I       ++   N+L  MY   G    A ++F RM+ K +VSWT+
Subjt:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTS

Query:  MITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWN
        MI+GY    L D AI  +  M    V PD   V ++L ACAT G+L +G  +H    +  L     V+N L +MY+KC  +  A D+F ++ RK+VISW 
Subjt:  MITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWN

Query:  TMIGGYSKNRLPNEALNLFAEIHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWT
        ++I G   N    EAL    ++    +P+  T+   L ACA + AL  G+EIH + LR G   D ++ NAL+DMYV+CG +  A S F+    KD+ SW 
Subjt:  TMIGGYSKNRLPNEALNLFAEIHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVSWT

Query:  VMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDAT
        +++ GY   G GS  V+ F++M  + + PDE++FIS+L  CS S ++ +G  +F+ M ++  + PNL+HYAC+VDLL R G L +AHKFI+ MP+ PD  
Subjt:  VMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPDAT

Query:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLK
        +WGALL  CRIHH + L E  A+ IFEL+ ++ GYY+LL N+YA+  +W EV K+R+ + + GL  + GCSW+E+KGKV+ F++ D   PQ K+I  +L+
Subjt:  IWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLK

Query:  RLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSC
            KM E G +  +  + ++  E  ++   CGH E+ A+AFG++N  PG  I VTKNL +C +CH+  KF+SK+  REI +RD+  FHHFKDG CSC
Subjt:  RLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSC

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.7e-17340.75Show/hide
Query:  FCEVGDLKNAMELLCGSQNSNLDLD--TYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMIS
        F + G    A+E     + S +  D  T+ S+++ CA       G  V+  I   G   D  +G  LV MY + G L   R VFD++    +  WN +IS
Subjt:  FCEVGDLKNAMELLCGSQNSNLDLD--TYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMIS

Query:  EYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI
         YS  G Y E++ ++ ++    I P+S+T SSVL  F  +  V++G+ +HG   K G  S   V N L++ Y   R+   A+++FDE+  RD +S+N+MI
Subjt:  EYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMI

Query:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAA-ALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWT
         GY+K  + +  + +F++ L      DL T+ +VL AC ++  L L K +++Y +KA   L+  VR  N L+D+Y+KCGD+ +A  VF  M+ K  VSW 
Subjt:  SGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAA-ALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWT

Query:  SMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISW
        S+I+GY++ G    A+KLF  M       D      ++       +L  GK +H+   ++ + I+  VSNAL DMYAKCG + D+  +FS M   D ++W
Subjt:  SMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISW

Query:  NTMIGGYSKNRLPNEALNLFAEIHR-ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVS
        NT+I    +       L +  ++ + E  PD  T    LP CASLAA   G+EIH   LR GY  +  + NAL++MY KCG L  +  +F+ +  +D+V+
Subjt:  NTMIGGYSKNRLPNEALNLFAEIHR-ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILNKDLVS

Query:  WTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPD
        WT MI  YGM+G G +A++TF  M  +GI PD V FI+I+YACSHSGL+DEG   F  MK   +I+P +EHYAC+VDLL+R+  + KA +FI+ MPIKPD
Subjt:  WTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIETMPIKPD

Query:  ATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELL
        A+IW ++L  CR   D++ AE+V+ RI EL P++ GY +L +N YA   +W++V  +RK +  + + KNPG SWIE+   V++F +GD S PQ++ I   
Subjt:  ATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELL

Query:  LKRLRSKMKEEGYSPKTRYALLN-ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCS
        L+ L S M +EGY P  R    N  +E EK   +CGH E+LA+AFG+LN  PG  ++V KNLRVCGDCHE+ K +SK   REI++RD++RFH FKDG CS
Subjt:  LKRLRSKMKEEGYSPKTRYALLN-ADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCS

Query:  CRGYW
        C+  W
Subjt:  CRGYW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein0.0e+0062.84Show/hide
Query:  DNSRKIVEFCEVGDLKNAMELLCGSQNSNLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFL
        D + ++  FCE G+L+NA++LLC S   ++D  T CS+LQLCA+ KS++DG+ V + I  NG +ID  LG KL  MY  CGDLKE   VFD++  +K   
Subjt:  DNSRKIVEFCEVGDLKNAMELLCGSQNSNLDLDTYCSILQLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFL

Query:  WNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVI
        WN++++E +  G++  SI LFK+M+  G++ +SYTFS V K F+++  V  G Q+HG I K GF   N+V NSL++FY   ++V SA+K+FDE+T+RDVI
Subjt:  WNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVI

Query:  SWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKT
        SWNS+I+GYV NGL ++G+ +F++MLV  ++IDLAT+V+V   CA+   + LG+A+HS  +K A   RE RF NTLLDMYSKCGDL+SA  VF  M +++
Subjt:  SWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKT

Query:  VVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRK
        VVS+TSMI GY REGL+  A+KLF+EM+  G+ PDVY VT++L+ CA    L  GK VH +I+EN+L  + FVSNAL DMYAKCGSM++A  VFS M+ K
Subjt:  VVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHMKRK

Query:  DVISWNTMIGGYSKNRLPNEALNLFAEIHRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMIL
        D+ISWNT+IGGYSKN   NEAL+LF  +  E +  PD  TVAC+LPACASL+A D+GREIHGY +RNGY  D++V N+LVDMY KCG L+LA  LFD I 
Subjt:  DVISWNTMIGGYSKNRLPNEALNLFAEIHRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMIL

Query:  NKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIET
        +KDLVSWTVMIAGYGMHGFG EA+  FNQMR AGIE DE+SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+LARTG+L+KA++FIE 
Subjt:  NKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIET

Query:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQA
        MPI PDATIWGALLCGCRIHHDVKLAEKVAE++FELEPENTGYYVL+ANIYAEAE+WE+V++LRK+IGQRGL+KNPGCSWIEIKG+VNIFVAGD S P+ 
Subjt:  MPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQA

Query:  KKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFK
        + IE  L+++R++M EEGYSP T+YAL++A+E EKE ALCGH EKLAMA G+++   GK IRVTKNLRVCGDCHEMAKFMSK   REIVLRDS+RFH FK
Subjt:  KKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFK

Query:  DGNCSCRGYW
        DG+CSCRG+W
Subjt:  DGNCSCRGYW

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-17140Show/hide
Query:  KSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQL--SEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCF
        K+I   + +H  + S G++   +    L+  Y+  G L     +  +   S+  V+ WN +I  Y   G   + + LF  M  L   P++YTF  V K  
Subjt:  KSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQL--SEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCF

Query:  AAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKML-VLSVDIDLATMVNVLV
          I+ V  G   H L    GF S   V N+L++ Y   R +  A+K+FDE++  DV+SWNS+I  Y K G     +E+F +M        D  T+VNVL 
Subjt:  AAIARVEEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKML-VLSVDIDLATMVNVLV

Query:  ACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMK--------------
         CA++G+  LGK LH +++ +  + + +   N L+DMY+KCG ++ A  VF  M  K VVSW +M+ GY + G  + A++LF++M+              
Subjt:  ACANMGSLLLGKALHSYSIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMK--------------

Query:  ---------------------SRGVLPDVYAVTSILHACATNGNLSSGKIVHNY-------IRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHM--K
                             S G+ P+   + S+L  CA+ G L  GK +H Y       +R+N       V N L DMYAKC  +  A  +F  +  K
Subjt:  ---------------------SRGVLPDVYAVTSILHACATNGNLSSGKIVHNY-------IRENNLEINPFVSNALTDMYAKCGSMKDAHDVFSHM--K

Query:  RKDVISWNTMIGGYSKNRLPNEALNLFAEIHRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGYSK-DKYVVNALVDMYVKCGLLVLARSLF
         +DV++W  MIGGYS++   N+AL L +E+  E   ++P+  T++C L ACASLAAL  G++IH YALRN  +    +V N L+DMY KCG +  AR +F
Subjt:  RKDVISWNTMIGGYSKNRLPNEALNLFAEIHRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGYSK-DKYVVNALVDMYVKCGLLVLARSLF

Query:  DMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHK
        D ++ K+ V+WT ++ GYGMHG+G EA+  F++MR  G + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A +
Subjt:  DMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHK

Query:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGKVNIFVAGDC
         IE MP++P   +W A L  CRIH  V+L E  AE+I EL   + G Y LL+N+YA A RW++V ++R  +  +G+KK PGCSW+E IKG    FV GD 
Subjt:  FIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGKVNIFVAGDC

Query:  SKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSR
        + P AK+I  +L     ++K+ GY P+T +AL + D+ EK+  L  H EKLA+A+G+L  P G  IR+TKNLRVCGDCH    +MS+    +I+LRDSSR
Subjt:  SKPQAKKIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSR

Query:  FHHFKDGNCSCRGYW
        FHHFK+G+CSC+GYW
Subjt:  FHHFKDGNCSCRGYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATTACTGGTAGCAAAACCCCCAACAACCTTCTGGCCATCTCCGGCCGGGGCCGGGAACGATCATCGTGACTCAGTGAGCTTGAAATTCCGGCAATCTTTTGTCTT
TGTCAAACCCAATTCGAAATTTTCCTTTTCGAATTCGGCATATGCTTGTACAGAGAGTTACACCCCAACATTGGAAACGAAAAGCTATATCGATGTTGAACTGGATAATT
CCCGCAAAATTGTCGAGTTTTGTGAAGTGGGTGACCTCAAAAATGCTATGGAGCTTCTTTGCGGTTCCCAAAATTCCAACCTTGACTTGGATACTTATTGCTCCATCTTG
CAGCTGTGTGCTGAACAAAAATCTATACGAGATGGAAGAAGGGTTCATTCAGTAATTGAATCTAATGGGGTTATGATAGATGGAATCTTGGGGGTGAAACTAGTTTTTAT
GTATGTTAAATGCGGGGATCTTAAAGAAGGGAGGATGGTTTTTGATCAACTTTCTGAAGATAAGGTTTTCCTCTGGAACCTTATGATCAGTGAGTATTCGGGAGGTGGTA
ACTATGGAGAGAGTATAAATTTATTCAAGCAAATGCTGGAGTTGGGGATAAAACCTAATTCATATACATTTTCTAGTGTTTTGAAATGTTTTGCAGCAATTGCACGTGTA
GAAGAGGGTAGGCAGGTTCATGGGTTGATCTGTAAATTGGGTTTTACTTCTTATAATACAGTCGTTAATTCATTAATTTCTTTCTACTTTGTGGGTAGAAAGGTTAGAAG
TGCACAGAAGCTGTTCGATGAATTGACTGACCGAGATGTCATTTCATGGAACTCTATGATCAGTGGCTATGTTAAGAATGGTTTAGAAGACAGAGGAATTGAGATTTTCA
TAAAGATGTTAGTTTTGAGTGTCGATATTGATTTGGCTACAATGGTCAATGTGCTTGTAGCTTGTGCAAATATGGGCAGTCTTTTGTTGGGTAAGGCACTTCATTCTTAT
TCAATAAAGGCTGCTGCTCTTGACAGAGAAGTTAGGTTCAATAATACTTTACTGGACATGTACTCAAAATGTGGGGATTTGAACAGTGCCATTCGGGTTTTTGAGAGAAT
GGATGAGAAAACTGTTGTATCGTGGACTTCGATGATCACAGGCTATGTTCGTGAAGGTTTGTCCGATGGTGCAATCAAGTTGTTTGATGAAATGAAAAGCAGAGGCGTTC
TTCCAGATGTCTATGCTGTTACAAGCATTCTTCATGCTTGTGCTACTAATGGCAACTTGAGCAGTGGGAAGATTGTACACAACTACATCAGAGAAAACAACTTGGAAATT
AACCCGTTTGTTAGTAATGCTCTTACGGACATGTATGCCAAATGCGGCAGCATGAAGGACGCACATGATGTGTTTTCTCACATGAAAAGGAAGGATGTAATTTCATGGAA
TACTATGATTGGAGGTTACTCGAAGAATCGTCTTCCAAATGAAGCTCTTAACTTGTTCGCAGAGATTCATAGAGAATCAAAGCCCGACGGCACCACAGTGGCGTGCATCC
TTCCAGCCTGTGCGAGCCTTGCAGCCTTGGATAGGGGTAGAGAAATCCATGGATATGCATTAAGAAATGGATATTCTAAAGACAAATATGTCGTTAATGCACTTGTTGAT
ATGTATGTGAAGTGTGGGCTATTAGTTCTTGCCCGGTCACTCTTCGATATGATTCTCAATAAGGACCTTGTCTCATGGACAGTGATGATAGCTGGATATGGGATGCATGG
CTTTGGTAGTGAAGCCGTCGATACGTTTAATCAGATGAGAATTGCTGGGATTGAGCCCGATGAAGTATCCTTCATTTCAATCCTTTATGCCTGCAGCCATTCTGGATTGC
TTGATGAAGGATGGAAATTTTTCAATATTATGAAGAAAGAATGTCAGATTGAACCCAACCTAGAGCACTATGCCTGTATGGTGGATCTTCTTGCCCGAACCGGGAATCTG
GTGAAGGCACATAAATTCATCGAAACGATGCCAATCAAACCAGATGCCACGATTTGGGGTGCATTGTTGTGCGGATGCAGGATACACCATGATGTCAAACTAGCAGAGAA
AGTTGCAGAACGAATCTTTGAGCTAGAACCAGAAAACACTGGCTATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAGAGATGGGAAGAAGTTCAAAAGTTAAGGA
AGAAAATCGGACAACGCGGTTTGAAGAAAAATCCAGGCTGCAGTTGGATAGAGATCAAGGGCAAGGTCAACATCTTTGTTGCTGGAGATTGCTCCAAACCCCAAGCCAAG
AAGATAGAGCTACTTCTGAAAAGACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCAAAAACTAGGTATGCTTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGC
CCTCTGTGGACACGGTGAGAAGTTAGCCATGGCTTTTGGTATGCTGAATCTCCCACCCGGTAAGACTATACGGGTGACTAAAAATCTCCGAGTTTGCGGCGACTGTCATG
AGATGGCTAAGTTCATGTCGAAGTCCGCTTCGAGAGAAATCGTTTTGAGAGATTCTAGTCGTTTTCATCATTTCAAAGATGGAAATTGTTCTTGTAGAGGTTACTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTATTACTGGTAGCAAAACCCCCAACAACCTTCTGGCCATCTCCGGCCGGGGCCGGGAACGATCATCGTGACTCAGTGAGCTTGAAATTCCGGCAATCTTTTGTCTT
TGTCAAACCCAATTCGAAATTTTCCTTTTCGAATTCGGCATATGCTTGTACAGAGAGTTACACCCCAACATTGGAAACGAAAAGCTATATCGATGTTGAACTGGATAATT
CCCGCAAAATTGTCGAGTTTTGTGAAGTGGGTGACCTCAAAAATGCTATGGAGCTTCTTTGCGGTTCCCAAAATTCCAACCTTGACTTGGATACTTATTGCTCCATCTTG
CAGCTGTGTGCTGAACAAAAATCTATACGAGATGGAAGAAGGGTTCATTCAGTAATTGAATCTAATGGGGTTATGATAGATGGAATCTTGGGGGTGAAACTAGTTTTTAT
GTATGTTAAATGCGGGGATCTTAAAGAAGGGAGGATGGTTTTTGATCAACTTTCTGAAGATAAGGTTTTCCTCTGGAACCTTATGATCAGTGAGTATTCGGGAGGTGGTA
ACTATGGAGAGAGTATAAATTTATTCAAGCAAATGCTGGAGTTGGGGATAAAACCTAATTCATATACATTTTCTAGTGTTTTGAAATGTTTTGCAGCAATTGCACGTGTA
GAAGAGGGTAGGCAGGTTCATGGGTTGATCTGTAAATTGGGTTTTACTTCTTATAATACAGTCGTTAATTCATTAATTTCTTTCTACTTTGTGGGTAGAAAGGTTAGAAG
TGCACAGAAGCTGTTCGATGAATTGACTGACCGAGATGTCATTTCATGGAACTCTATGATCAGTGGCTATGTTAAGAATGGTTTAGAAGACAGAGGAATTGAGATTTTCA
TAAAGATGTTAGTTTTGAGTGTCGATATTGATTTGGCTACAATGGTCAATGTGCTTGTAGCTTGTGCAAATATGGGCAGTCTTTTGTTGGGTAAGGCACTTCATTCTTAT
TCAATAAAGGCTGCTGCTCTTGACAGAGAAGTTAGGTTCAATAATACTTTACTGGACATGTACTCAAAATGTGGGGATTTGAACAGTGCCATTCGGGTTTTTGAGAGAAT
GGATGAGAAAACTGTTGTATCGTGGACTTCGATGATCACAGGCTATGTTCGTGAAGGTTTGTCCGATGGTGCAATCAAGTTGTTTGATGAAATGAAAAGCAGAGGCGTTC
TTCCAGATGTCTATGCTGTTACAAGCATTCTTCATGCTTGTGCTACTAATGGCAACTTGAGCAGTGGGAAGATTGTACACAACTACATCAGAGAAAACAACTTGGAAATT
AACCCGTTTGTTAGTAATGCTCTTACGGACATGTATGCCAAATGCGGCAGCATGAAGGACGCACATGATGTGTTTTCTCACATGAAAAGGAAGGATGTAATTTCATGGAA
TACTATGATTGGAGGTTACTCGAAGAATCGTCTTCCAAATGAAGCTCTTAACTTGTTCGCAGAGATTCATAGAGAATCAAAGCCCGACGGCACCACAGTGGCGTGCATCC
TTCCAGCCTGTGCGAGCCTTGCAGCCTTGGATAGGGGTAGAGAAATCCATGGATATGCATTAAGAAATGGATATTCTAAAGACAAATATGTCGTTAATGCACTTGTTGAT
ATGTATGTGAAGTGTGGGCTATTAGTTCTTGCCCGGTCACTCTTCGATATGATTCTCAATAAGGACCTTGTCTCATGGACAGTGATGATAGCTGGATATGGGATGCATGG
CTTTGGTAGTGAAGCCGTCGATACGTTTAATCAGATGAGAATTGCTGGGATTGAGCCCGATGAAGTATCCTTCATTTCAATCCTTTATGCCTGCAGCCATTCTGGATTGC
TTGATGAAGGATGGAAATTTTTCAATATTATGAAGAAAGAATGTCAGATTGAACCCAACCTAGAGCACTATGCCTGTATGGTGGATCTTCTTGCCCGAACCGGGAATCTG
GTGAAGGCACATAAATTCATCGAAACGATGCCAATCAAACCAGATGCCACGATTTGGGGTGCATTGTTGTGCGGATGCAGGATACACCATGATGTCAAACTAGCAGAGAA
AGTTGCAGAACGAATCTTTGAGCTAGAACCAGAAAACACTGGCTATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAGAGATGGGAAGAAGTTCAAAAGTTAAGGA
AGAAAATCGGACAACGCGGTTTGAAGAAAAATCCAGGCTGCAGTTGGATAGAGATCAAGGGCAAGGTCAACATCTTTGTTGCTGGAGATTGCTCCAAACCCCAAGCCAAG
AAGATAGAGCTACTTCTGAAAAGACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCAAAAACTAGGTATGCTTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGC
CCTCTGTGGACACGGTGAGAAGTTAGCCATGGCTTTTGGTATGCTGAATCTCCCACCCGGTAAGACTATACGGGTGACTAAAAATCTCCGAGTTTGCGGCGACTGTCATG
AGATGGCTAAGTTCATGTCGAAGTCCGCTTCGAGAGAAATCGTTTTGAGAGATTCTAGTCGTTTTCATCATTTCAAAGATGGAAATTGTTCTTGTAGAGGTTACTGGTGA
Protein sequenceShow/hide protein sequence
MLLLVAKPPTTFWPSPAGAGNDHRDSVSLKFRQSFVFVKPNSKFSFSNSAYACTESYTPTLETKSYIDVELDNSRKIVEFCEVGDLKNAMELLCGSQNSNLDLDTYCSIL
QLCAEQKSIRDGRRVHSVIESNGVMIDGILGVKLVFMYVKCGDLKEGRMVFDQLSEDKVFLWNLMISEYSGGGNYGESINLFKQMLELGIKPNSYTFSSVLKCFAAIARV
EEGRQVHGLICKLGFTSYNTVVNSLISFYFVGRKVRSAQKLFDELTDRDVISWNSMISGYVKNGLEDRGIEIFIKMLVLSVDIDLATMVNVLVACANMGSLLLGKALHSY
SIKAAALDREVRFNNTLLDMYSKCGDLNSAIRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSRGVLPDVYAVTSILHACATNGNLSSGKIVHNYIRENNLEI
NPFVSNALTDMYAKCGSMKDAHDVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEIHRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVD
MYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGFGSEAVDTFNQMRIAGIEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECQIEPNLEHYACMVDLLARTGNL
VKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAERWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAK
KIELLLKRLRSKMKEEGYSPKTRYALLNADEREKEVALCGHGEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIVLRDSSRFHHFKDGNCSCRGYW