; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028850 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028850
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153207:1299674..1316749
RNA-Seq ExpressionSgr028850
SyntenySgr028850
Gene Ontology termsGO:0016567 - protein ubiquitination (biological process)
GO:1902553 - positive regulation of catalase activity (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0061630 - ubiquitin protein ligase activity (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648628.1 hypothetical protein Csa_009292 [Cucumis sativus]1.0e-22177.29Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LLK+  ETHN RKTLETTCSM+V+CYIKERMVT+AL L+ QMKHL IFPSIWVYKS+I+ALL+T                                 +G
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLG+GWKVLLEL+NFGSKPD VDYT VI+SLCK+SLLKEAT+LLFKM +FGVSPD VTMSS+IDG+CK+GK D+ACKILKYFRLPLNIFI NSFITKL T
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EG+MVKAS+VFLEM+EVGLVPDC+SYTTMIGGYCKVG+IN AF YL KMLKSGIQPSVITYTLF+D  C+  DVEMAEV+F+KMI EGLKPDVV YNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        D YGKKGY+HKAF LLD+MRS NVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRRGF +DVVTYTN I+GYS RGNFEEAFL+WYHM ENCV PDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+E R++EANALF KML+IGL PDLILYNTLIHGFCS+GNVDE CNLVKKMIESSIIPNNVTH+ALVLGFQKKRV +P +SATSKLQEIL++
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        YD
Subjt:  YD

XP_011655513.1 pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativus]1.0e-22177.29Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LLK+  ETHN RKTLETTCSM+V+CYIKERMVT+AL L+ QMKHL IFPSIWVYKS+I+ALL+T                                 +G
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLG+GWKVLLEL+NFGSKPD VDYT VI+SLCK+SLLKEAT+LLFKM +FGVSPD VTMSS+IDG+CK+GK D+ACKILKYFRLPLNIFI NSFITKL T
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EG+MVKAS+VFLEM+EVGLVPDC+SYTTMIGGYCKVG+IN AF YL KMLKSGIQPSVITYTLF+D  C+  DVEMAEV+F+KMI EGLKPDVV YNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        D YGKKGY+HKAF LLD+MRS NVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRRGF +DVVTYTN I+GYS RGNFEEAFL+WYHM ENCV PDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+E R++EANALF KML+IGL PDLILYNTLIHGFCS+GNVDE CNLVKKMIESSIIPNNVTH+ALVLGFQKKRV +P +SATSKLQEIL++
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        YD
Subjt:  YD

XP_022139130.1 pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica charantia]3.9e-23482.24Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        MLLKL +E  N+RKTLET CSMLV CYIKERMVTAAL LM QMKHLKIFPSIWVY+S+IQ LLET                                 +G
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLG GWKVLLEL+NFGSKPDAVDYTIVIDSLCK SLLKEATSLLFKM++FGVSPDSVTMSSVIDGYCK+G LDVACKILKYFRLPLNIF  NSFITKLCT
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EGNMV ASEVFLEMSEVGL+PDCVSYTTM+GGYCKVGDINKAFLYLGKMLKSGIQPSVITYTL IDNLCK G+VEMAE+ FQKM+TEG+KPDVV +NILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        DGYGKKGYLHKAF LLD+MRS NVTPDVVTYNTLINGL  RGFLREAKDILDELIRRGF IDVVTYTNFIYGYSKRGNFEEAFLVWYHMT+NCVKPDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+EHR++EANALFYKML+IGLNPDLILYNTLIHGFCS+GNVDEACNLV KMIESSI+PNNVTH+ALV GFQKK+VI+P ESAT KLQEIL +
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  Y
        Y
Subjt:  Y

XP_022957015.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata]2.9e-22980.88Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LLKL YETH++RKTLETTCSMLVDCYIKERMVTAAL LM QMK   IFPSIWVYKS+IQALL+T                                 KG
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NL RGWKVLLEL+ FGSKPDAVDYTIVI+SLCKISLLKEAT+LLFKMT+FGVSPDSVTMSSVIDGYCK+GKLD+ACKILKYFR PLNIFI NSFITKLC 
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EGN VKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVG+IN+AF YLGKMLKSGI+PSVITYTLFID  CK  DVEMAEV+ QKMI EGL PDVVTYNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        DGYGKKGYLHKAF LLD MRS N+TPDVVTYNTLINGLV RGFL+EAKD+LDEL RRGF IDVVTYTN I+GYSKRGNFEEAFLVW+HMT+NCVKPDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+E RI+EANALF KML+IGLNPDLILYNTLIHGFCS+GNVDE CNLVKKMIE+SI+PNNVTH+ALVLGFQK++VI+P ESATSKLQEILL+
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        YD
Subjt:  YD

XP_038892184.1 pentatricopeptide repeat-containing protein At2g19280 [Benincasa hispida]1.4e-22880.88Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LLKLLYETH +RKTLETTCSMLV CYIKERMVTAAL LM QMKHL IFPSIWVYKS+IQALL+T                                 KG
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLGRGWKVLLEL+NFGSKPDAVDYTI+I+SLCKISLLKEAT+LLFKM +FGVSPDSV MSSVIDGYCK+GK D+ACKILKYFRLPLNIF  NSFIT+LC 
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EGNM KAS+VFLEM EVGLVPD VSYTTMIGGYCKV +INKAF YL KM+KSGIQPS+ITYTLFIDN CK GDVEMAEVLFQK+I EGLKPDVVTYNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        DGYGKKGYLHK F LLD+MRS NVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRR F IDVVTYTN IYGYSKRGNFEEAFL+WYHMT+NCVKPDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+E R++EANALF KML+IGL+PDLILYNTLIHGFCS+GNVDE CN VKKMIESSIIPNNVTH ALVLGFQKKR INP  SATSKLQEILL+
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        Y+
Subjt:  YD

TrEMBL top hitse value%identityAlignment
A0A0A0KV38 Uncharacterized protein4.8e-22277.29Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LLK+  ETHN RKTLETTCSM+V+CYIKERMVT+AL L+ QMKHL IFPSIWVYKS+I+ALL+T                                 +G
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLG+GWKVLLEL+NFGSKPD VDYT VI+SLCK+SLLKEAT+LLFKM +FGVSPD VTMSS+IDG+CK+GK D+ACKILKYFRLPLNIFI NSFITKL T
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EG+MVKAS+VFLEM+EVGLVPDC+SYTTMIGGYCKVG+IN AF YL KMLKSGIQPSVITYTLF+D  C+  DVEMAEV+F+KMI EGLKPDVV YNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        D YGKKGY+HKAF LLD+MRS NVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRRGF +DVVTYTN I+GYS RGNFEEAFL+WYHM ENCV PDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+E R++EANALF KML+IGL PDLILYNTLIHGFCS+GNVDE CNLVKKMIESSIIPNNVTH+ALVLGFQKKRV +P +SATSKLQEIL++
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        YD
Subjt:  YD

A0A1S3BDC2 pentatricopeptide repeat-containing protein At2g192805.7e-21574.3Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LL++ Y+THN RKTLETTC M+++CYIKE MVT+A+ L+ QM+ L +FPSIWVYKS+I+ALL+T                                 +G
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLG+GWKVLLEL+NFGSKPD VDYT VI+SLCKISLLKEAT+LLFKM +FGVSPD VTMSS+IDG+CK+GK D+ACKILKYF++PLNIFI NSFIT+L  
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EG+ VKAS+VFLEMSEVGLVPDCVSYTTMIGGYCKVG+IN AF YL KMLKSGIQPSVITYTLF+D  C+ GDVEMAEV+F+KMI E LKPDVV YNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        D YGKKGY+HKAF LLD+MRS NVTPDVVTYN+LI+GLVMRGFL+EAKDILDELIRRGF IDVVTYTN ++GYSKRGNFEEAFL+WYHM +NCV PDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+   ++EANALF +ML+IGL PDLILYNTLIHGFCS+GNVDE CNLVKKMIESSIIPNNVTH+ALVLGFQKKRV++P +SATSKLQEIL++
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        YD
Subjt:  YD

A0A5D3CZD6 Pentatricopeptide repeat-containing protein5.7e-21574.3Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LL++ Y+THN RKTLETTC M+++CYIKE MVT+A+ L+ QM+ L +FPSIWVYKS+I+ALL+T                                 +G
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLG+GWKVLLEL+NFGSKPD VDYT VI+SLCKISLLKEAT+LLFKM +FGVSPD VTMSS+IDG+CK+GK D+ACKILKYF++PLNIFI NSFIT+L  
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EG+ VKAS+VFLEMSEVGLVPDCVSYTTMIGGYCKVG+IN AF YL KMLKSGIQPSVITYTLF+D  C+ GDVEMAEV+F+KMI E LKPDVV YNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        D YGKKGY+HKAF LLD+MRS NVTPDVVTYN+LI+GLVMRGFL+EAKDILDELIRRGF IDVVTYTN ++GYSKRGNFEEAFL+WYHM +NCV PDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+   ++EANALF +ML+IGL PDLILYNTLIHGFCS+GNVDE CNLVKKMIESSIIPNNVTH+ALVLGFQKKRV++P +SATSKLQEIL++
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        YD
Subjt:  YD

A0A6J1CBR5 pentatricopeptide repeat-containing protein At2g19280-like isoform X11.9e-23482.24Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        MLLKL +E  N+RKTLET CSMLV CYIKERMVTAAL LM QMKHLKIFPSIWVY+S+IQ LLET                                 +G
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NLG GWKVLLEL+NFGSKPDAVDYTIVIDSLCK SLLKEATSLLFKM++FGVSPDSVTMSSVIDGYCK+G LDVACKILKYFRLPLNIF  NSFITKLCT
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EGNMV ASEVFLEMSEVGL+PDCVSYTTM+GGYCKVGDINKAFLYLGKMLKSGIQPSVITYTL IDNLCK G+VEMAE+ FQKM+TEG+KPDVV +NILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        DGYGKKGYLHKAF LLD+MRS NVTPDVVTYNTLINGL  RGFLREAKDILDELIRRGF IDVVTYTNFIYGYSKRGNFEEAFLVWYHMT+NCVKPDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+EHR++EANALFYKML+IGLNPDLILYNTLIHGFCS+GNVDEACNLV KMIESSI+PNNVTH+ALV GFQKK+VI+P ESAT KLQEIL +
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  Y
        Y
Subjt:  Y

A0A6J1GY27 pentatricopeptide repeat-containing protein At2g19280 isoform X11.4e-22980.88Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG
        +LLKL YETH++RKTLETTCSMLVDCYIKERMVTAAL LM QMK   IFPSIWVYKS+IQALL+T                                 KG
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLET---------------------------------KG

Query:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT
        NL RGWKVLLEL+ FGSKPDAVDYTIVI+SLCKISLLKEAT+LLFKMT+FGVSPDSVTMSSVIDGYCK+GKLD+ACKILKYFR PLNIFI NSFITKLC 
Subjt:  NLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCT

Query:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM
        EGN VKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVG+IN+AF YLGKMLKSGI+PSVITYTLFID  CK  DVEMAEV+ QKMI EGL PDVVTYNILM
Subjt:  EGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILM

Query:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT
        DGYGKKGYLHKAF LLD MRS N+TPDVVTYNTLINGLV RGFL+EAKD+LDEL RRGF IDVVTYTN I+GYSKRGNFEEAFLVW+HMT+NCVKPDVVT
Subjt:  DGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVT

Query:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS
        CSALLSGYC+E RI+EANALF KML+IGLNPDLILYNTLIHGFCS+GNVDE CNLVKKMIE+SI+PNNVTH+ALVLGFQK++VI+P ESATSKLQEILL+
Subjt:  CSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILLS

Query:  YD
        YD
Subjt:  YD

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial6.8e-6436.12Show/hide
Query:  VLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNM
        V  E    G   +   Y IVI  +C++  +KEA  LL  M   G +PD ++ S+V++GYC+ G+LD   K+++  +   L  N +I  S I  LC    +
Subjt:  VLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNM

Query:  VKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYG
         +A E F EM   G++PD V YTT+I G+CK GDI  A  +  +M    I P V+TYT  I   C+ GD+  A  LF +M  +GL+PD VT+  L++GY 
Subjt:  VKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYG

Query:  KKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSAL
        K G++  AF + + M  A  +P+VVTY TLI+GL   G L  A ++L E+ + G   ++ TY + + G  K GN EEA  +        +  D VT + L
Subjt:  KKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSAL

Query:  LSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALV
        +  YC+   +++A  +  +ML  GL P ++ +N L++GFC  G +++   L+  M+   I PN  T  +LV
Subjt:  LSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALV

Q6NKW7 Pentatricopeptide repeat-containing protein At2g192801.1e-13347.11Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLE----------------------------------TK
        +++K L+ET  DR+ LET  S+L+DC I+ER V  AL L  ++    IFPS  V  SL++ +L                                   + 
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLE----------------------------------TK

Query:  GNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLC
        G   +GW++L+ ++++G +PD V +T+ ID LCK   LKEATS+LFK+  FG+S DSV++SSVIDG+CK+GK + A K++  FRL  NIF+ +SF++ +C
Subjt:  GNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLC

Query:  TEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNIL
        + G+M++AS +F E+ E+GL+PDCV YTTMI GYC +G  +KAF Y G +LKSG  PS+ T T+ I    +FG +  AE +F+ M TEGLK DVVTYN L
Subjt:  TEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNIL

Query:  MDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVV
        M GYGK   L+K F L+D MRSA ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T+ I G+SKRG+F+EAF++W++M +  +KPDVV
Subjt:  MDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVV

Query:  TCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL
        TCSALL GYC+  R+E+A  LF K+L+ GL PD++LYNTLIHG+CS+G++++AC L+  M++  ++PN  TH ALVLG + KR +N +  A+  L+EI++
Subjt:  TCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL

Query:  S
        +
Subjt:  S

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397109.5e-7432.84Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLETKGNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCK
        ++ K L ET++   +  +   ++V  Y +  ++  AL+++   +     P +  Y +++ A + +K N+     V  E+      P+   Y I+I   C 
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLETKGNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCK

Query:  ISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNMVKASEVFLEMSEVGLVPDCVSYTTMI
           +  A +L  KM + G  P+ VT +++IDGYCK+ K+D   K+L+      L  N+   N  I  LC EG M + S V  EM+  G   D V+Y T+I
Subjt:  ISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNMVKASEVFLEMSEVGLVPDCVSYTTMI

Query:  GGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYGKKGYLHKAFGLLDIMRSANVTPDVVT
         GYCK G+ ++A +   +ML+ G+ PSVITYT  I ++CK G++  A     +M   GL P+  TY  L+DG+ +KGY+++A+ +L  M     +P VVT
Subjt:  GGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYGKKGYLHKAFGLLDIMRSANVTPDVVT

Query:  YNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSALLSGYCQEHRIEEANALFYKMLEIGLN
        YN LING  + G + +A  +L+++  +G   DVV+Y+  + G+ +  + +EA  V   M E  +KPD +T S+L+ G+C++ R +EA  L+ +ML +GL 
Subjt:  YNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSALLSGYCQEHRIEEANALFYKMLEIGLN

Query:  PDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL
        PD   Y  LI+ +C  G++++A  L  +M+E  ++P+ VT+  L+ G         K+S T + + +LL
Subjt:  PDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL

Q9FJE6 Putative pentatricopeptide repeat-containing protein At5g599003.2e-6131.46Show/hide
Query:  TTCSMLVD-CYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLETKGNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTS
        T C+++   C ++E  +   L +M +M  L+  PS     SL++  L  +G +     ++  + +FG  P+   Y  +IDSLCK     EA  L  +M  
Subjt:  TTCSMLVD-CYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLETKGNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTS

Query:  FGVSPDSVTMSSVIDGYCKMGKLDVACKILKYF---RLPLNIFICNSFITKLCTEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYL
         G+ P+ VT S +ID +C+ GKLD A   L       L L+++  NS I   C  G++  A     EM    L P  V+YT+++GGYC  G INKA    
Subjt:  FGVSPDSVTMSSVIDGYCKMGKLDVACKILKYF---RLPLNIFICNSFITKLCTEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYL

Query:  GKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLRE
         +M   GI PS+ T+T  +  L + G +  A  LF +M    +KP+ VTYN++++GY ++G + KAF  L  M    + PD  +Y  LI+GL + G   E
Subjt:  GKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLRE

Query:  AKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSI
        AK  +D L +    ++ + YT  ++G+ + G  EEA  V   M +  V  D+V    L+ G  +    +    L  +M + GL PD ++Y ++I      
Subjt:  AKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSI

Query:  GNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEI-----LLSY----DTGGRGPVANQKAAEIDGVL--GILNSSELLRALI
        G+  EA  +   MI    +PN VT+ A++ G  K   +N  E   SK+Q +      ++Y    D   +G V  QKA E+   +  G+L ++     LI
Subjt:  GNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEI-----LLSY----DTGGRGPVANQKAAEIDGVL--GILNSSELLRALI

Q9M2V1 Protein NCA12.3e-14468.43Show/hide
Query:  TPVCPFVKSARPDDYSSRKHQ--PESACPFAKSGRSDDASSRK--EQPEAACPFAKSARPDDASLRKNQAEAESNEAEKDVADAARTGGKCPFGYDSQTF
        T VCPF K+ARPDD S+RK      S CPF+K+ R DDAS+RK  E   + CPF+KSARPD+    K   E E N   KD  D+A    KCPFGYDSQTF
Subjt:  TPVCPFVKSARPDDYSSRKHQ--PESACPFAKSGRSDDASSRK--EQPEAACPFAKSARPDDASLRKNQAEAESNEAEKDVADAARTGGKCPFGYDSQTF

Query:  KIGPLSCMICQALLFECSRCVPCSHVYCKACISRFKDCPLCGADIEKIEADADLQGMVDRFIEGHARIKRSQVNSDKEQEEVSESKPVIYEDVSLERGAF
        K+GP SCM+CQALL+E SRCVPC+HV+CK C++RFKDCPLCGADIE IE D +LQ MVD+FIEGHARIKRS VN  +++E  +++K VIY DVS+ERG+F
Subjt:  KIGPLSCMICQALLFECSRCVPCSHVYCKACISRFKDCPLCGADIEKIEADADLQGMVDRFIEGHARIKRSQVNSDKEQEEVSESKPVIYEDVSLERGAF

Query:  LIQQAMRAFRAQNIESAKSRLTICVEDIRDQLERMGNSSELCSQLGAVLGILGDCCRAAGDASSAIKHFEESVEFLSKLPTKNHEITHTLSVSLNKIGDL
        L+QQAMRAF AQN ESAKSRL +C EDIRDQL R GN+ ELCSQLGAVLG+LGDC RA GD+SSA+KHFEESVEFL KLP  + EITHTLSVSLNKIGDL
Subjt:  LIQQAMRAFRAQNIESAKSRLTICVEDIRDQLERMGNSSELCSQLGAVLGILGDCCRAAGDASSAIKHFEESVEFLSKLPTKNHEITHTLSVSLNKIGDL

Query:  KYYEGDLQAARSYYFRSLNVRQDASKQHPDDPSQILDVAVSLAKVADVDSGLGNEDVAVNGFEEGIKLLESLTLNSENSGLEQRRQSVLKFLEGQL
        KYY+ DLQAARSYY R+LNVR+DA K HP+ PSQILDVAVSLAKVAD+D  L NE  A +GF+EG++LLESL L+SE+S LEQRR SVL+FL+ Q+
Subjt:  KYYEGDLQAARSYYFRSLNVRQDASKQHPDDPSQILDVAVSLAKVADVDSGLGNEDVAVNGFEEGIKLLESLTLNSENSGLEQRRQSVLKFLEGQL

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.8e-6536.12Show/hide
Query:  VLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNM
        V  E    G   +   Y IVI  +C++  +KEA  LL  M   G +PD ++ S+V++GYC+ G+LD   K+++  +   L  N +I  S I  LC    +
Subjt:  VLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNM

Query:  VKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYG
         +A E F EM   G++PD V YTT+I G+CK GDI  A  +  +M    I P V+TYT  I   C+ GD+  A  LF +M  +GL+PD VT+  L++GY 
Subjt:  VKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYG

Query:  KKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSAL
        K G++  AF + + M  A  +P+VVTY TLI+GL   G L  A ++L E+ + G   ++ TY + + G  K GN EEA  +        +  D VT + L
Subjt:  KKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSAL

Query:  LSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALV
        +  YC+   +++A  +  +ML  GL P ++ +N L++GFC  G +++   L+  M+   I PN  T  +LV
Subjt:  LSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALV

AT2G19280.1 Pentatricopeptide repeat (PPR) superfamily protein7.5e-13547.11Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLE----------------------------------TK
        +++K L+ET  DR+ LET  S+L+DC I+ER V  AL L  ++    IFPS  V  SL++ +L                                   + 
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLE----------------------------------TK

Query:  GNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLC
        G   +GW++L+ ++++G +PD V +T+ ID LCK   LKEATS+LFK+  FG+S DSV++SSVIDG+CK+GK + A K++  FRL  NIF+ +SF++ +C
Subjt:  GNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLC

Query:  TEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNIL
        + G+M++AS +F E+ E+GL+PDCV YTTMI GYC +G  +KAF Y G +LKSG  PS+ T T+ I    +FG +  AE +F+ M TEGLK DVVTYN L
Subjt:  TEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNIL

Query:  MDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVV
        M GYGK   L+K F L+D MRSA ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T+ I G+SKRG+F+EAF++W++M +  +KPDVV
Subjt:  MDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVV

Query:  TCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL
        TCSALL GYC+  R+E+A  LF K+L+ GL PD++LYNTLIHG+CS+G++++AC L+  M++  ++PN  TH ALVLG + KR +N +  A+  L+EI++
Subjt:  TCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL

Query:  S
        +
Subjt:  S

AT2G19280.2 Pentatricopeptide repeat (PPR) superfamily protein7.5e-13547.11Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLE----------------------------------TK
        +++K L+ET  DR+ LET  S+L+DC I+ER V  AL L  ++    IFPS  V  SL++ +L                                   + 
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLE----------------------------------TK

Query:  GNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLC
        G   +GW++L+ ++++G +PD V +T+ ID LCK   LKEATS+LFK+  FG+S DSV++SSVIDG+CK+GK + A K++  FRL  NIF+ +SF++ +C
Subjt:  GNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLC

Query:  TEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNIL
        + G+M++AS +F E+ E+GL+PDCV YTTMI GYC +G  +KAF Y G +LKSG  PS+ T T+ I    +FG +  AE +F+ M TEGLK DVVTYN L
Subjt:  TEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNIL

Query:  MDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVV
        M GYGK   L+K F L+D MRSA ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T+ I G+SKRG+F+EAF++W++M +  +KPDVV
Subjt:  MDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVV

Query:  TCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL
        TCSALL GYC+  R+E+A  LF K+L+ GL PD++LYNTLIHG+CS+G++++AC L+  M++  ++PN  TH ALVLG + KR +N +  A+  L+EI++
Subjt:  TCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL

Query:  S
        +
Subjt:  S

AT3G54360.1 zinc ion binding1.6e-14568.43Show/hide
Query:  TPVCPFVKSARPDDYSSRKHQ--PESACPFAKSGRSDDASSRK--EQPEAACPFAKSARPDDASLRKNQAEAESNEAEKDVADAARTGGKCPFGYDSQTF
        T VCPF K+ARPDD S+RK      S CPF+K+ R DDAS+RK  E   + CPF+KSARPD+    K   E E N   KD  D+A    KCPFGYDSQTF
Subjt:  TPVCPFVKSARPDDYSSRKHQ--PESACPFAKSGRSDDASSRK--EQPEAACPFAKSARPDDASLRKNQAEAESNEAEKDVADAARTGGKCPFGYDSQTF

Query:  KIGPLSCMICQALLFECSRCVPCSHVYCKACISRFKDCPLCGADIEKIEADADLQGMVDRFIEGHARIKRSQVNSDKEQEEVSESKPVIYEDVSLERGAF
        K+GP SCM+CQALL+E SRCVPC+HV+CK C++RFKDCPLCGADIE IE D +LQ MVD+FIEGHARIKRS VN  +++E  +++K VIY DVS+ERG+F
Subjt:  KIGPLSCMICQALLFECSRCVPCSHVYCKACISRFKDCPLCGADIEKIEADADLQGMVDRFIEGHARIKRSQVNSDKEQEEVSESKPVIYEDVSLERGAF

Query:  LIQQAMRAFRAQNIESAKSRLTICVEDIRDQLERMGNSSELCSQLGAVLGILGDCCRAAGDASSAIKHFEESVEFLSKLPTKNHEITHTLSVSLNKIGDL
        L+QQAMRAF AQN ESAKSRL +C EDIRDQL R GN+ ELCSQLGAVLG+LGDC RA GD+SSA+KHFEESVEFL KLP  + EITHTLSVSLNKIGDL
Subjt:  LIQQAMRAFRAQNIESAKSRLTICVEDIRDQLERMGNSSELCSQLGAVLGILGDCCRAAGDASSAIKHFEESVEFLSKLPTKNHEITHTLSVSLNKIGDL

Query:  KYYEGDLQAARSYYFRSLNVRQDASKQHPDDPSQILDVAVSLAKVADVDSGLGNEDVAVNGFEEGIKLLESLTLNSENSGLEQRRQSVLKFLEGQL
        KYY+ DLQAARSYY R+LNVR+DA K HP+ PSQILDVAVSLAKVAD+D  L NE  A +GF+EG++LLESL L+SE+S LEQRR SVL+FL+ Q+
Subjt:  KYYEGDLQAARSYYFRSLNVRQDASKQHPDDPSQILDVAVSLAKVADVDSGLGNEDVAVNGFEEGIKLLESLTLNSENSGLEQRRQSVLKFLEGQL

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-7532.84Show/hide
Query:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLETKGNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCK
        ++ K L ET++   +  +   ++V  Y +  ++  AL+++   +     P +  Y +++ A + +K N+     V  E+      P+   Y I+I   C 
Subjt:  MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLETKGNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCK

Query:  ISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNMVKASEVFLEMSEVGLVPDCVSYTTMI
           +  A +L  KM + G  P+ VT +++IDGYCK+ K+D   K+L+      L  N+   N  I  LC EG M + S V  EM+  G   D V+Y T+I
Subjt:  ISLLKEATSLLFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFR---LPLNIFICNSFITKLCTEGNMVKASEVFLEMSEVGLVPDCVSYTTMI

Query:  GGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYGKKGYLHKAFGLLDIMRSANVTPDVVT
         GYCK G+ ++A +   +ML+ G+ PSVITYT  I ++CK G++  A     +M   GL P+  TY  L+DG+ +KGY+++A+ +L  M     +P VVT
Subjt:  GGYCKVGDINKAFLYLGKMLKSGIQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYGKKGYLHKAFGLLDIMRSANVTPDVVT

Query:  YNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSALLSGYCQEHRIEEANALFYKMLEIGLN
        YN LING  + G + +A  +L+++  +G   DVV+Y+  + G+ +  + +EA  V   M E  +KPD +T S+L+ G+C++ R +EA  L+ +ML +GL 
Subjt:  YNTLINGLVMRGFLREAKDILDELIRRGFCIDVVTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSALLSGYCQEHRIEEANALFYKMLEIGLN

Query:  PDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL
        PD   Y  LI+ +C  G++++A  L  +M+E  ++P+ VT+  L+ G         K+S T + + +LL
Subjt:  PDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQALVLGFQKKRVINPKESATSKLQEILL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCTGAAACTTTTGTATGAAACACATAACGATAGGAAGACTTTAGAAACCACATGTAGCATGCTGGTTGACTGTTATATCAAGGAAAGAATGGTAACAGCTGCTCT
TACATTGATGTGTCAAATGAAGCATCTTAAAATATTTCCTTCTATATGGGTATACAAGTCGCTGATACAAGCTTTATTAGAAACCAAAGGTAATCTTGGTAGGGGGTGGA
AAGTGCTTTTGGAGTTGCAGAATTTTGGATCTAAGCCTGATGCAGTTGATTACACAATTGTAATTGACTCACTTTGCAAAATCTCTCTTTTAAAAGAAGCCACCTCCTTG
TTGTTTAAAATGACTTCTTTCGGTGTTTCCCCTGATTCAGTTACGATGAGTTCTGTTATTGATGGTTATTGTAAAATGGGAAAGTTGGATGTAGCTTGTAAAATATTGAA
GTATTTTAGGCTTCCCCTCAATATTTTCATATGCAATAGCTTTATAACAAAGTTATGTACAGAAGGAAACATGGTAAAAGCTTCTGAAGTTTTTCTTGAAATGTCTGAGG
TGGGCTTAGTTCCAGATTGTGTTAGTTACACTACCATGATAGGAGGCTATTGTAAAGTGGGAGACATAAACAAAGCATTTTTATACCTGGGCAAGATGTTAAAAAGTGGA
ATTCAACCATCTGTTATCACATATACTTTGTTCATTGATAACTTATGCAAGTTTGGAGATGTGGAAATGGCTGAAGTATTGTTCCAAAAGATGATTACCGAGGGTTTAAA
ACCCGATGTTGTCACATATAATATTTTGATGGATGGATATGGAAAGAAGGGCTACTTGCACAAGGCTTTTGGACTCCTTGATATAATGAGATCTGCAAATGTTACACCAG
ATGTTGTGACATATAACACGCTCATTAATGGTCTTGTTATGAGAGGGTTTCTTCGAGAGGCAAAGGATATTTTAGACGAGCTCATCAGGAGGGGTTTCTGTATAGATGTT
GTCACATACACTAATTTCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTGTTTGGTATCATATGACTGAGAACTGTGTGAAGCCTGATGTTGTTAC
TTGCAGTGCTCTTCTTAGTGGGTATTGCCAAGAACATCGCATAGAAGAAGCAAATGCTCTATTTTATAAAATGCTGGAAATTGGATTAAATCCAGACTTGATATTGTATA
ATACTTTAATCCATGGATTTTGCAGTATTGGTAATGTGGATGAAGCTTGCAATTTGGTAAAGAAGATGATCGAAAGCAGTATCATTCCGAACAATGTTACTCATCAGGCA
CTTGTCCTGGGATTTCAGAAAAAGAGGGTTATCAATCCAAAAGAGAGTGCCACTTCTAAGCTTCAAGAAATCTTGCTTTCATATGATACTGGTGGTCGGGGACCCGTCGC
GAATCAGAAAGCGGCAGAGATCGACGGAGTTTTGGGGATTCTCAATTCAAGTGAGCTGCTTAGGGCATTGATTTTCCAGGGAGATCGGTCTAGAATCGAGATGACTCCTG
TTTGTCCCTTCGTCAAATCTGCTCGTCCCGACGACTATTCTTCGAGGAAGCACCAGCCGGAATCTGCTTGTCCTTTCGCGAAATCCGGCCGTTCCGATGATGCTTCTTCG
AGAAAGGAGCAACCGGAGGCTGCTTGTCCTTTTGCGAAATCCGCTCGTCCCGACGATGCTTCTTTGAGGAAGAATCAGGCTGAGGCGGAGAGTAACGAGGCAGAGAAAGA
TGTTGCTGATGCTGCTAGAACCGGCGGCAAGTGTCCCTTTGGATATGATTCTCAAACTTTCAAGATTGGCCCTCTAAGCTGTATGATATGTCAGGCACTTCTCTTTGAAT
GCAGCAGATGTGTCCCCTGTTCTCACGTTTACTGCAAAGCATGTATATCACGTTTCAAAGACTGTCCGCTGTGTGGAGCTGACATCGAGAAGATTGAAGCCGACGCTGAT
CTGCAAGGTATGGTTGATCGCTTCATTGAAGGTCATGCGAGAATCAAGAGATCCCAGGTGAACTCAGACAAAGAGCAAGAGGAAGTTAGTGAGAGTAAACCAGTAATATA
TGAAGACGTGTCCTTGGAGAGAGGTGCTTTCTTGATCCAGCAAGCCATGAGGGCATTTCGTGCCCAGAATATTGAAAGTGCCAAATCCAGGCTCACTATCTGTGTTGAAG
ATATTCGAGATCAATTAGAAAGAATGGGCAATTCATCAGAATTGTGCTCACAGCTTGGAGCGGTTCTTGGCATACTTGGCGATTGCTGTCGAGCAGCTGGAGATGCTAGT
TCCGCAATCAAGCATTTTGAAGAAAGTGTAGAATTTCTCTCAAAATTGCCTACAAAGAATCACGAGATCACTCATACACTTTCTGTATCACTTAATAAAATTGGCGATCT
TAAGTATTATGAAGGAGACCTCCAAGCGGCAAGATCTTATTATTTCCGGTCTCTTAATGTTCGCCAAGATGCTAGCAAGCAACATCCAGATGATCCATCCCAGATCCTAG
ATGTGGCTGTTTCGCTTGCAAAAGTGGCAGATGTGGACAGTGGTCTAGGAAATGAGGATGTGGCGGTTAATGGATTTGAGGAAGGCATAAAGTTGTTGGAATCATTAACA
CTGAATTCTGAGAATTCTGGTCTTGAACAGCGACGTCAATCCGTGCTGAAGTTCCTTGAAGGTCAACTTGCCGAGCGACAAAAGCCAACTCAATCATACAGAAGCATACA
TAGTCGATGTCGCTTTATCTGCATAAGCTTGAATTGGCCTGGCAGTTGTGAAGCTGTGATCATTCTAATCAGCTGGCGGGATTGTACCAACTTAGATGTTATTGAAGTTG
AGTCTCAGCCTTCCATCTGGAGATGGGAGACACCCTACCCTAATCCATTGAACTCAGTTGGAACTTCCCAGCCGGAGATGCAAGAACATCTTAATCAGCTGAATAAAGTT
GGACCTTCCAATCAGAGAAAGATGGAAATCAGATACCATTGGAGTTTCAATGATAAGAGCTTTTGGTTTGACGCACTTAAGTTCAAGGCTCAAAAGGTTTGGTGCACATA
TGCTCAGAGAAACTGGAGCATTCGACACTGTCCATTGACAGAAGCGGAGTGCACCTATGCTACTCATTCTGCTATGGTAGCACCCACCACTATATTACGAACACATCAGA
AGCAATCGTACTTAAGAACTGGATTGCAATGGAGGGCGTATCGTCACAGTACACACACTTCCAAAGTTGACATCCACGACGGCCCATATTTCCGTGCAAGATTGAACCCA
AAAATCGATGATCGACAAGGACGTCAAATCTCTACCAATCGCCGGCAGCCACTCCCTAGCGAAGCCAACATCCGTCAGCGATCCAATGAGCAGAAGAGGTTGAGTGACCT
GACTTCGCTGGAGAGAGCATTGAGAGCTTTGGAAGCCACTCGGCACCGAGCCAAATCGGCGGAATCGGCCAATCGATTGAGGATTTCGAGAACTAACGACGGCGGTAGAT
CATCCATGGAGGAAAAGAAGAACGGTTATGGTGGCTTCTCTCGGAATTTCAGGCGATGGATGAATCCATCTCTCCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGCTGAAACTTTTGTATGAAACACATAACGATAGGAAGACTTTAGAAACCACATGTAGCATGCTGGTTGACTGTTATATCAAGGAAAGAATGGTAACAGCTGCTCT
TACATTGATGTGTCAAATGAAGCATCTTAAAATATTTCCTTCTATATGGGTATACAAGTCGCTGATACAAGCTTTATTAGAAACCAAAGGTAATCTTGGTAGGGGGTGGA
AAGTGCTTTTGGAGTTGCAGAATTTTGGATCTAAGCCTGATGCAGTTGATTACACAATTGTAATTGACTCACTTTGCAAAATCTCTCTTTTAAAAGAAGCCACCTCCTTG
TTGTTTAAAATGACTTCTTTCGGTGTTTCCCCTGATTCAGTTACGATGAGTTCTGTTATTGATGGTTATTGTAAAATGGGAAAGTTGGATGTAGCTTGTAAAATATTGAA
GTATTTTAGGCTTCCCCTCAATATTTTCATATGCAATAGCTTTATAACAAAGTTATGTACAGAAGGAAACATGGTAAAAGCTTCTGAAGTTTTTCTTGAAATGTCTGAGG
TGGGCTTAGTTCCAGATTGTGTTAGTTACACTACCATGATAGGAGGCTATTGTAAAGTGGGAGACATAAACAAAGCATTTTTATACCTGGGCAAGATGTTAAAAAGTGGA
ATTCAACCATCTGTTATCACATATACTTTGTTCATTGATAACTTATGCAAGTTTGGAGATGTGGAAATGGCTGAAGTATTGTTCCAAAAGATGATTACCGAGGGTTTAAA
ACCCGATGTTGTCACATATAATATTTTGATGGATGGATATGGAAAGAAGGGCTACTTGCACAAGGCTTTTGGACTCCTTGATATAATGAGATCTGCAAATGTTACACCAG
ATGTTGTGACATATAACACGCTCATTAATGGTCTTGTTATGAGAGGGTTTCTTCGAGAGGCAAAGGATATTTTAGACGAGCTCATCAGGAGGGGTTTCTGTATAGATGTT
GTCACATACACTAATTTCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTGTTTGGTATCATATGACTGAGAACTGTGTGAAGCCTGATGTTGTTAC
TTGCAGTGCTCTTCTTAGTGGGTATTGCCAAGAACATCGCATAGAAGAAGCAAATGCTCTATTTTATAAAATGCTGGAAATTGGATTAAATCCAGACTTGATATTGTATA
ATACTTTAATCCATGGATTTTGCAGTATTGGTAATGTGGATGAAGCTTGCAATTTGGTAAAGAAGATGATCGAAAGCAGTATCATTCCGAACAATGTTACTCATCAGGCA
CTTGTCCTGGGATTTCAGAAAAAGAGGGTTATCAATCCAAAAGAGAGTGCCACTTCTAAGCTTCAAGAAATCTTGCTTTCATATGATACTGGTGGTCGGGGACCCGTCGC
GAATCAGAAAGCGGCAGAGATCGACGGAGTTTTGGGGATTCTCAATTCAAGTGAGCTGCTTAGGGCATTGATTTTCCAGGGAGATCGGTCTAGAATCGAGATGACTCCTG
TTTGTCCCTTCGTCAAATCTGCTCGTCCCGACGACTATTCTTCGAGGAAGCACCAGCCGGAATCTGCTTGTCCTTTCGCGAAATCCGGCCGTTCCGATGATGCTTCTTCG
AGAAAGGAGCAACCGGAGGCTGCTTGTCCTTTTGCGAAATCCGCTCGTCCCGACGATGCTTCTTTGAGGAAGAATCAGGCTGAGGCGGAGAGTAACGAGGCAGAGAAAGA
TGTTGCTGATGCTGCTAGAACCGGCGGCAAGTGTCCCTTTGGATATGATTCTCAAACTTTCAAGATTGGCCCTCTAAGCTGTATGATATGTCAGGCACTTCTCTTTGAAT
GCAGCAGATGTGTCCCCTGTTCTCACGTTTACTGCAAAGCATGTATATCACGTTTCAAAGACTGTCCGCTGTGTGGAGCTGACATCGAGAAGATTGAAGCCGACGCTGAT
CTGCAAGGTATGGTTGATCGCTTCATTGAAGGTCATGCGAGAATCAAGAGATCCCAGGTGAACTCAGACAAAGAGCAAGAGGAAGTTAGTGAGAGTAAACCAGTAATATA
TGAAGACGTGTCCTTGGAGAGAGGTGCTTTCTTGATCCAGCAAGCCATGAGGGCATTTCGTGCCCAGAATATTGAAAGTGCCAAATCCAGGCTCACTATCTGTGTTGAAG
ATATTCGAGATCAATTAGAAAGAATGGGCAATTCATCAGAATTGTGCTCACAGCTTGGAGCGGTTCTTGGCATACTTGGCGATTGCTGTCGAGCAGCTGGAGATGCTAGT
TCCGCAATCAAGCATTTTGAAGAAAGTGTAGAATTTCTCTCAAAATTGCCTACAAAGAATCACGAGATCACTCATACACTTTCTGTATCACTTAATAAAATTGGCGATCT
TAAGTATTATGAAGGAGACCTCCAAGCGGCAAGATCTTATTATTTCCGGTCTCTTAATGTTCGCCAAGATGCTAGCAAGCAACATCCAGATGATCCATCCCAGATCCTAG
ATGTGGCTGTTTCGCTTGCAAAAGTGGCAGATGTGGACAGTGGTCTAGGAAATGAGGATGTGGCGGTTAATGGATTTGAGGAAGGCATAAAGTTGTTGGAATCATTAACA
CTGAATTCTGAGAATTCTGGTCTTGAACAGCGACGTCAATCCGTGCTGAAGTTCCTTGAAGGTCAACTTGCCGAGCGACAAAAGCCAACTCAATCATACAGAAGCATACA
TAGTCGATGTCGCTTTATCTGCATAAGCTTGAATTGGCCTGGCAGTTGTGAAGCTGTGATCATTCTAATCAGCTGGCGGGATTGTACCAACTTAGATGTTATTGAAGTTG
AGTCTCAGCCTTCCATCTGGAGATGGGAGACACCCTACCCTAATCCATTGAACTCAGTTGGAACTTCCCAGCCGGAGATGCAAGAACATCTTAATCAGCTGAATAAAGTT
GGACCTTCCAATCAGAGAAAGATGGAAATCAGATACCATTGGAGTTTCAATGATAAGAGCTTTTGGTTTGACGCACTTAAGTTCAAGGCTCAAAAGGTTTGGTGCACATA
TGCTCAGAGAAACTGGAGCATTCGACACTGTCCATTGACAGAAGCGGAGTGCACCTATGCTACTCATTCTGCTATGGTAGCACCCACCACTATATTACGAACACATCAGA
AGCAATCGTACTTAAGAACTGGATTGCAATGGAGGGCGTATCGTCACAGTACACACACTTCCAAAGTTGACATCCACGACGGCCCATATTTCCGTGCAAGATTGAACCCA
AAAATCGATGATCGACAAGGACGTCAAATCTCTACCAATCGCCGGCAGCCACTCCCTAGCGAAGCCAACATCCGTCAGCGATCCAATGAGCAGAAGAGGTTGAGTGACCT
GACTTCGCTGGAGAGAGCATTGAGAGCTTTGGAAGCCACTCGGCACCGAGCCAAATCGGCGGAATCGGCCAATCGATTGAGGATTTCGAGAACTAACGACGGCGGTAGAT
CATCCATGGAGGAAAAGAAGAACGGTTATGGTGGCTTCTCTCGGAATTTCAGGCGATGGATGAATCCATCTCTCCTTTGA
Protein sequenceShow/hide protein sequence
MLLKLLYETHNDRKTLETTCSMLVDCYIKERMVTAALTLMCQMKHLKIFPSIWVYKSLIQALLETKGNLGRGWKVLLELQNFGSKPDAVDYTIVIDSLCKISLLKEATSL
LFKMTSFGVSPDSVTMSSVIDGYCKMGKLDVACKILKYFRLPLNIFICNSFITKLCTEGNMVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGDINKAFLYLGKMLKSG
IQPSVITYTLFIDNLCKFGDVEMAEVLFQKMITEGLKPDVVTYNILMDGYGKKGYLHKAFGLLDIMRSANVTPDVVTYNTLINGLVMRGFLREAKDILDELIRRGFCIDV
VTYTNFIYGYSKRGNFEEAFLVWYHMTENCVKPDVVTCSALLSGYCQEHRIEEANALFYKMLEIGLNPDLILYNTLIHGFCSIGNVDEACNLVKKMIESSIIPNNVTHQA
LVLGFQKKRVINPKESATSKLQEILLSYDTGGRGPVANQKAAEIDGVLGILNSSELLRALIFQGDRSRIEMTPVCPFVKSARPDDYSSRKHQPESACPFAKSGRSDDASS
RKEQPEAACPFAKSARPDDASLRKNQAEAESNEAEKDVADAARTGGKCPFGYDSQTFKIGPLSCMICQALLFECSRCVPCSHVYCKACISRFKDCPLCGADIEKIEADAD
LQGMVDRFIEGHARIKRSQVNSDKEQEEVSESKPVIYEDVSLERGAFLIQQAMRAFRAQNIESAKSRLTICVEDIRDQLERMGNSSELCSQLGAVLGILGDCCRAAGDAS
SAIKHFEESVEFLSKLPTKNHEITHTLSVSLNKIGDLKYYEGDLQAARSYYFRSLNVRQDASKQHPDDPSQILDVAVSLAKVADVDSGLGNEDVAVNGFEEGIKLLESLT
LNSENSGLEQRRQSVLKFLEGQLAERQKPTQSYRSIHSRCRFICISLNWPGSCEAVIILISWRDCTNLDVIEVESQPSIWRWETPYPNPLNSVGTSQPEMQEHLNQLNKV
GPSNQRKMEIRYHWSFNDKSFWFDALKFKAQKVWCTYAQRNWSIRHCPLTEAECTYATHSAMVAPTTILRTHQKQSYLRTGLQWRAYRHSTHTSKVDIHDGPYFRARLNP
KIDDRQGRQISTNRRQPLPSEANIRQRSNEQKRLSDLTSLERALRALEATRHRAKSAESANRLRISRTNDGGRSSMEEKKNGYGGFSRNFRRWMNPSLL