CuGenDBv2

Clc01G00650 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G00650
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: ClcChr01:448316..454827
RNA-Seq Expression: Clc01G00650
Synteny: Clc01G00650
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6605770.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 84.68
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        M+RVSS FGTWKHNR KACL+LFP+ CK LHTENS I+STNICISRHVRNGHLDLA+TLF+EMPVRSVVSWNIMISGYSK+G+Y+EAL LASGMHC+NVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNE TFSTLLSICAHSGCT EGKQ+HCLVLKSG QIFELVGSALLY YAN +DI+GAK VFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLF KIPTRDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTTLISGYARSEHNC+RALELFCSM  N EVEPNEFTFDC+VRACGR+  LSQGKV+HGILTKYGFHFDHSIC ALI FY QCEA+D AK VYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLNASN+LLEGL+L GRINDAEEIF KLREK+PVSYNLMLKGYA+SGRIEESK+LFE+MTHKT+ISSNTMI+VYSRNGEI+KA KLFES K EGNPVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH EAL+LY TMC+TSVERSRSTFS L QACTCLG+I  G+SLH HAIKTAFDSNVYVGTSLIDMYSKCGS+  A+ SFASVY PNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        +TALINGYV HGLG+EAF VFE MLK+K++PN ATLLGILSACS  GMVNEGM +FHSME CYGVIPTLEHYACVVDLLGRSG LYEA+EFIR+MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKKML LDP AISAYV LSNIYA LGKWVEKI+VRR+LRSLKVKK+RGCSWIDVNN+IHVFSV DRSHPNCNAIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY
        LEH+LA+VNSI Q + VP+SV EVSF + I+
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY

XP_022158004.1 pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia] | e value: 0.0e+00 | % identity: 86.36
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        MLRVSSSFGTWKHNR KA L LFPT+ K LHTENS+IISTNICISRHVRNG LDLA+TLFNEMPVRS+VSWN+MISGYSKLG+Y EALNLAS MHCNNVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
         NE TFSTLLS CAHS CT EGKQ HCLVLKSG QIFELVGSALLYLYANI DI+GAKQVFDELH+KN LLWSLMLVGYVKCN MDDAFDLFTKIP RDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTTLISGYARSE+NC+RALELFC M MN EVEPNEFTFDC+VRACGR+ DLSQGKV+HGILTKYG HFDHSIC ALI FY QCEAIDNAKAVYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLNASNSLLEGLIL+GR NDAEEIF KLREK+PVSYNLMLKGYA+S RIEESKRLFERMTHKT IS+NTMISVYSRNGEI+KA +LFESMK EGNPVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH +ALKLY TMCRTSVERSRSTFSAL QACTCLGSIQ G+SLH HAIKTAFDSNVYVGTSLIDMYSKCGSI  A+TSF S+Y PNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        FTALINGYV HGLGIEAF VFE+MLK KV+PN ATLLGILSACS AGMVNEG+ +F SME CYGVIP LEHYACVVDLLGRSGRL EA+EFIR+MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKK+LSLDP AISAYV LSNIYA LGKWVEKINVRR+LRSLKVKK+RGCSWIDVNN+ HVFSVEDRSHPNCNAIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIYSL
        LEHLLANV SIAQ + VPKS+ E SFS+SI SL
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIYSL

XP_022958508.1 putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic [Cucurbita moschata] | e value: 0.0e+00 | % identity: 84.54
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        M+RVSS FGTWKHNR KACL+LFP+ CK LHTENS I+STNICISRHVRNGHLDLA+TLF+EMPVRSVVSWNIMISGYSK+G+Y+EAL LASGMHC+NVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNE TFSTLLSICAHSGCT EGKQ+HCLVLKSG QIFELVGSALLY YAN +DI+GAK VFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLF KIPTRDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTTLISGYARSEHNC+RALELFCSM  N EVEPNEFTFDC+VRACGR+  LSQGKV+HGILTKYGFHFDHSIC ALI FY QCEA+D AK VYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLNASN+LLEGL+L GRINDAEEIF KLREK+PVSYNLMLKGYA+SGRIEESK+LFE+MTHKT+ISSNTMI+VYSRNGEI+KA KLFES K EGNPVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH EAL+LY TMC+TSVERSRSTFS L QACTCLG+I  G+SLH HAIKT+FDSNVYVGTSLIDMYSKCGS+  A+ SFASVY PNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        +TALINGYV HGLG+EAF VFE MLK K++PN ATLLGILSACS AGMVNEGM +FHSME CYGVIPTLEHYACVVDLLGRSG LYEA+EFIR+MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKKML LDP AISAYV LSNIYA LGKWVEKI+VRR+LRSLKVKK+RGCSWIDVNN+IHVFSV DRSHPNC+AIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY
        LEH+LA+VNSI Q + VP+SV EVSF + I+
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY

XP_023534728.1 putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 84.68
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        M+RVSS FGTWKHNR KACL+LFP+ CK LHTENS I+STNICISRHVRNGHLDLA+TLF+EMPVRSVVSWNIMISGYSK+G+YSEAL LASGMHC+NVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNE TFSTLLSICAHSGCT EGKQ+HCLVL+SG QIFELVGSALLY YAN +DI+GAK VFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLF KIPTRDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTTLISGYARSEHNC+RALELFCSM+ N EVEPNEFTFDC+VRACGR+  LSQGKV+HGILTKYGFHFDHSIC ALI FY QCEA+D AK VYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLNASN+LLEGL+L GRINDAEEIF KLREK+PVSYNLMLKGYA+SGRIEESK+LFE+MTHKT+ISSNTMI+VYSRNGEI+KA KLFES K EGNPVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH EAL+LY TMC+TSVERSRSTFS L QACTCLG+I  G+SLH HAIKTAFDSNVYVGTSLIDMYSKCGS+  A+ SFASVY PNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        +TALINGYV HGLG+EAF VFE MLK+K++PN ATLLGILSACS  GMVNEGM +FHSME CYGVIPTLEHYACVVDLLGRSG LYEA+EFIR+MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKKML LDP AISAYV LSNIYA LGKWVEKI+VRR+LRSLKVKK+RGCSWIDVNN+IHVFSV DRSHPNCNAIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY
        LEH+LA+VNSI Q + VP+SV EVSF + I+
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY

XP_038874558.1 pentatricopeptide repeat-containing protein At2g13600-like [Benincasa hispida] | e value: 0.0e+00 | % identity: 92.22
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        MLRVSSSFGTWKHNR KACL+L PTLCKGLHTENSNIISTNICISRHV NGHLDLA TLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNETTFSTLLSICA SGCT EGKQFHCL+LKSG QIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAF LF K PTRDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTTLISGYARSEHNC+RALELFCSM MN EVEPNEFTFDC+VRACGRM DLS+GKV+HGILTKYGFHFDHSICSALI FY QCEAID AKAVYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLN SNSLLEGLI +GR+ DAEEIFCKLREK+PVSYNLMLKGYAMSGR+EESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEG+PVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH EALKLYL MCRTSVERSRSTFSALFQACTCLGSIQ GQSLH HAIKTAFDSNVYVGTSLIDMYSK GSIS AQT+FASVYFPNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACS AGMVNEGM VFHSME CYGVIPTLEHYACVVDLLG+SGRLYEA+EFIRSMPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKKMLSLDP AISAYV LSNIYAKLG WVEKINVRRQLRSLKVKKNRGCSWI+VNNK HVFSVEDRSHPNCNAIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIYSL
        LEHLLANVNSIAQ N VPKS+ +V F +SIYSL
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIYSL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KQ79 Uncharacterized protein | e value: 0.0e+00 | % identity: 88.28
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        MLR SSS GTWKHNR KACLELF TLC+GLHTENSNIISTNI ISRHVR+GHLDLA+TLFNEMPVRSVVSWNIMISGYSK GKYSEALNLAS MHCNNVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNETTFS+LLSICAHSGC+ EGKQFHCLVLKSG QIFE VGSAL+Y YANINDISGAKQVFDELHDKNDLLW L+LVGYVKCNLMDDA DLF KIPTRDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTT+IS YARSEHNC+R LELFCSM MN EVEPNEFTFD +VRACGRM  LS GKV+HGILTKYGFHFDHS+CSALI FY QCEAIDNAKAVYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCL ASNSLLEGLI +GRINDAEEIFCKLREK+PVSYNLMLKGYA SGRIE SKRLFERMTHKT  S NTMISVYSRNGEIDKAFKLFES+KSEG+PVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISG IQNHQH  ALKLY+TMCRTSVERSRSTFSALFQACTCL  IQ GQ+LH HAI+ AFDSNVYVGTSLIDMY+KCGSI  AQTSFASV FPNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        FTALINGYVHHGLGIEAFSVF+EMLKHKV PNGATLLGILSACSCAGMV EGM VFHSME CYGVIPTLEHYACVVDLLGRSGRLYEA+ FIR MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKK+LSLDP AISAY+ LSNIYAKLGKWVEKINVRRQL SLKVKK RGCSWIDVNNK  VFS  DRSHPNCNAIY+T
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANV
        LEHLLANV
Subjt:  LEHLLANV

A0A1S4DSA9 pentatricopeptide repeat-containing protein At4g02750-like | e value: 0.0e+00 | % identity: 87.99
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        MLR SSS GTWKHNR KACLELF TLC+GLHTENSNIISTN  ISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSK GKYSEALNLASGMHCNNVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNETTFS+LLSICAHSGC+ EGKQFHCLVLKSG QIFE VGSALLYLYANINDISGAKQVFDELHDKNDLLW LMLVGYVKCNLMDDAFDLFTKIPT DV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTT+ISGYARSEHNC+R LELFCSM MN  VEPNEFTFD +VRACGRM DLS GKV+HGILTKYGFHFDHS+CSALI FY QCEAID+AKAVYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCL ASNSLLEGLIL+GRINDAEEIFCKLREK+P SYNLMLKGYAMSGRIE SKRLFERMTHKT  S NTMISVYSRNGEIDKAFKLFES+KSEG+PVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISG IQNHQH  ALKLY+TMCR SVERSRSTFS LFQAC CL SIQ G++LH HAI+ AFDSNVYVGTSLIDMY+KCGSI  AQTSFASV FPNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        FTALINGYVHHGLGIEAFSVF+EMLK KV PNGATLLGILSACSCAGMV EGM VFHSME CYGVIPT EHYACVVDLLGRSGRLYEA+ FIR MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALL+ACWFWMDL+LGE VAKK+LSLDP  ISAY+ LSNIYAKLGKWVEKINVRRQL SLKVKK RGCSWIDVNNK +VFS  DRSHPNCNAIY+T
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANV
        LEHLLANV
Subjt:  LEHLLANV

A0A6J1DY59 pentatricopeptide repeat-containing protein At2g13600-like | e value: 0.0e+00 | % identity: 86.36
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        MLRVSSSFGTWKHNR KA L LFPT+ K LHTENS+IISTNICISRHVRNG LDLA+TLFNEMPVRS+VSWN+MISGYSKLG+Y EALNLAS MHCNNVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
         NE TFSTLLS CAHS CT EGKQ HCLVLKSG QIFELVGSALLYLYANI DI+GAKQVFDELH+KN LLWSLMLVGYVKCN MDDAFDLFTKIP RDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTTLISGYARSE+NC+RALELFC M MN EVEPNEFTFDC+VRACGR+ DLSQGKV+HGILTKYG HFDHSIC ALI FY QCEAIDNAKAVYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLNASNSLLEGLIL+GR NDAEEIF KLREK+PVSYNLMLKGYA+S RIEESKRLFERMTHKT IS+NTMISVYSRNGEI+KA +LFESMK EGNPVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH +ALKLY TMCRTSVERSRSTFSAL QACTCLGSIQ G+SLH HAIKTAFDSNVYVGTSLIDMYSKCGSI  A+TSF S+Y PNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        FTALINGYV HGLGIEAF VFE+MLK KV+PN ATLLGILSACS AGMVNEG+ +F SME CYGVIP LEHYACVVDLLGRSGRL EA+EFIR+MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKK+LSLDP AISAYV LSNIYA LGKWVEKINVRR+LRSLKVKK+RGCSWIDVNN+ HVFSVEDRSHPNCNAIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIYSL
        LEHLLANV SIAQ + VPKS+ E SFS+SI SL
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIYSL

A0A6J1H393 putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic | e value: 0.0e+00 | % identity: 84.54
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        M+RVSS FGTWKHNR KACL+LFP+ CK LHTENS I+STNICISRHVRNGHLDLA+TLF+EMPVRSVVSWNIMISGYSK+G+Y+EAL LASGMHC+NVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNE TFSTLLSICAHSGCT EGKQ+HCLVLKSG QIFELVGSALLY YAN +DI+GAK VFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLF KIPTRDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        V WTTLISGYARSEHNC+RALELFCSM  N EVEPNEFTFDC+VRACGR+  LSQGKV+HGILTKYGFHFDHSIC ALI FY QCEA+D AK VYDSMER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLNASN+LLEGL+L GRINDAEEIF KLREK+PVSYNLMLKGYA+SGRIEESK+LFE+MTHKT+ISSNTMI+VYSRNGEI+KA KLFES K EGNPVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH EAL+LY TMC+TSVERSRSTFS L QACTCLG+I  G+SLH HAIKT+FDSNVYVGTSLIDMYSKCGS+  A+ SFASVY PNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        +TALINGYV HGLG+EAF VFE MLK K++PN ATLLGILSACS AGMVNEGM +FHSME CYGVIPTLEHYACVVDLLGRSG LYEA+EFIR+MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKKML LDP AISAYV LSNIYA LGKWVEKI+VRR+LRSLKVKK+RGCSWIDVNN+IHVFSV DRSHPNC+AIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY
        LEH+LA+VNSI Q + VP+SV EVSF + I+
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY

A0A6J1K5J2 pentatricopeptide repeat-containing protein At2g13600-like | e value: 0.0e+00 | % identity: 84.68
Query:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK
        MLRV S FGTWKHNR KACL+LFP+ CK LHTENS I+STNICISRHVRNGHLDLA+TLF+EMP+RSVVSWNIMISGYSK+G+YSEAL LASGMHC+NVK
Subjt:  MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVK

Query:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        LNE TFSTLLSICAHSGCT EGKQ+HCLVLKSG QIFELVGSALLY YAN NDI+GAK VFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLF KIPTRDV
Subjt:  LNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
        VVWTTLISGYARSEHNC+RALELFCSM  N EVEPNEFTFDC+VRACGR+  L QGKV+HGILTKYGFHFDHSIC ALI FY QCEA+D AK VYD MER
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW
        PCLNASN+LLEGL+L GRINDAEEIF KLREK+PVSYNLMLKGYA+SGRIEESK+LFE+MTHKT+ISSNTMI+VYSRNGEI+KA KLFES K EGNPVTW
Subjt:  PCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTW

Query:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA
        NSMISGYIQNHQH EAL+LY TMC+TSVERSRSTFS L QACTCLG+I  G+SLH HAIKTAFDSNVYVGTSLIDMYSKCGS+  A+ SFASVY PNVAA
Subjt:  NSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAA

Query:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD
        +TALINGYV HGLG EAF VFE MLK+K++PN ATLLGILSACS AGMVNEGM +FHSME CYGVIPTLEHYACVVDLLGRSG LYEA+E IR+MPIEAD
Subjt:  FTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEAD

Query:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT
         VIWGALLNACWFWMDLELGESVAKKML LDP AISAYV LSNIYA LGKWVEKI+VRR+LRSLKVKK+RGCSWIDVNN+IHVFSV DRSHPNCNAIYAT
Subjt:  TVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYAT

Query:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY
        LEH+LA+VNSI Q + VP+SV EVSF + I+
Subjt:  LEHLLANVNSIAQLNCVPKSVPEVSFSHSIY

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 | e value: 1.2e-103 | % identity: 31.69
Query:  STNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFE
        S N  +S + + G +D     F+++P R  VSW  MI GY  +G+Y +A+ +   M    ++  + T + +L+  A + C   GK+ H  ++K G +   
Subjt:  STNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFE

Query:  LVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEF
         V ++LL +YA   D   AK VFD +  ++   W+ M+  +++   MD A   F ++  RD+V W ++ISG+ +  ++  RAL++F  M  +S + P+ F
Subjt:  LVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEF

Query:  TFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNAS--NSLLEGLILSGRINDAEEIFCKLREKSPVS
        T   ++ AC  +  L  GK IH  +   GF     + +ALIS YS+C  ++ A+ + +      L      +LL+G I  G +N A+ IF  L+++    
Subjt:  TFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNAS--NSLLEGLILSGRINDAEEIFCKLREKSPVS

Query:  YNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFS
                                                                   + V W +MI GY Q+  +GEA+ L+ +M       +  T +
Subjt:  YNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFS

Query:  ALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP-NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGAT
        A+    + L S+  G+ +H  A+K+    +V V  +LI MY+K G+I+ A  +F  +    +  ++T++I     HG   EA  +FE ML   + P+  T
Subjt:  ALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP-NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGAT

Query:  LLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAI
         +G+ SAC+ AG+VN+G   F  M++   +IPTL HYAC+VDL GR+G L EA+EFI  MPIE D V WG+LL+AC    +++LG+  A+++L L+P   
Subjt:  LLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAI

Query:  SAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL
         AY  L+N+Y+  GKW E   +R+ ++  +VKK +G SWI+V +K+HVF VED +HP  N IY T++ +
Subjt:  SAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 | e value: 2.2e-105 | % identity: 33.74
Query:  NETTFSTLLSICAHSGCTPEGKQF-HCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        + + F+ LL  C  S  +    ++ H  V+KSGF     + + L+  Y+    +   +QVFD++  +N   W+ ++ G  K   +D+A  LF  +P RD 
Subjt:  NETTFSTLLSICAHSGCTPEGKQF-HCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
          W +++SG+A+ +  C+ AL  F  M     V  NE++F  ++ AC  + D+++G  +H ++ K  F  D  I SAL+  YS+C  +++A+ V+D M  
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIF---------------------C--------------------KLREKSPVSYNLMLKGYAMSGRIEESKRLFER
          + + NSL+     +G   +A ++F                     C                    KLR    +S N  +  YA   RI+E++ +F+ 
Subjt:  PCLNASNSLLEGLILSGRINDAEEIF---------------------C--------------------KLREKSPVSYNLMLKGYAMSGRIEESKRLFER

Query:  MTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAI
        M  + +I+  +MIS Y+      KA +L  +  +E N V+WN++I+GY QN ++ EAL L+  + R SV  +  +F+ + +AC  L  +  G   H H +
Subjt:  MTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAI

Query:  KTAF------DSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGM
        K  F      + +++VG SLIDMY KCG +      F  +   +  ++ A+I G+  +G G EA  +F EML+    P+  T++G+LSAC  AG V EG 
Subjt:  KTAF------DSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGM

Query:  AVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVE
          F SM   +GV P  +HY C+VDLLGR+G L EAK  I  MP++ D+VIWG+LL AC    ++ LG+ VA+K+L ++P+    YV LSN+YA+LGKW +
Subjt:  AVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVE

Query:  KINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANV
         +NVR+ +R   V K  GCSWI +    HVF V+D+SHP    I++ L+ L+A +
Subjt:  KINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANV

Q9SS60 Pentatricopeptide repeat-containing protein At3g03580 | e value: 7.5e-98 | % identity: 30.65
Query:  KACLELFPTLCKGLHTE-------NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTL
        KAC  LF      L  E        S++   N  +  + R G L  AR +F+EMPVR +VSWN +ISGYS  G Y EAL +   +  + +  +  T S++
Subjt:  KACLELFPTLCKGLHTE-------NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTL

Query:  LSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISG
        L    +     +G+  H   LKSG     +V + L+ +Y      + A++VFDE+  ++ + ++ M+ GY+K  +++++  +F +               
Subjt:  LSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISG

Query:  YARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSL
                   L+ F         +P+  T   ++RACG + DLS  K I+  + K GF  + ++ + LI  Y++C  +  A+ V++SME     + NS+
Subjt:  YARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSL

Query:  LEGLILSGRINDAEEIFCKL----REKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS---------SNTMISVYSRNGEIDKAFKLFESMKSEGN
        + G I SG + +A ++F  +     +   ++Y +++   ++S R+ + K  F +  H   I          SN +I +Y++ GE+  + K+F SM   G+
Subjt:  LEGLILSGRINDAEEIFCKL----REKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS---------SNTMISVYSRNGEIDKAFKLFESMKSEGN

Query:  PVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP
         VTWN++IS  ++       L++   M ++ V    +TF      C  L + + G+ +H   ++  ++S + +G +LI+MYSKCG +  +   F  +   
Subjt:  PVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP

Query:  NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMP
        +V  +T +I  Y  +G G +A   F +M K  ++P+    + I+ ACS +G+V+EG+A F  M+  Y + P +EHYACVVDLL RS ++ +A+EFI++MP
Subjt:  NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMP

Query:  IEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNA
        I+ D  IW ++L AC    D+E  E V+++++ L+P+     +  SN YA L KW +   +R+ L+   + KN G SWI+V   +HVFS  D S P   A
Subjt:  IEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNA

Query:  IYATLEHL
        IY +LE L
Subjt:  IYATLEHL

Q9SVP7 Pentatricopeptide repeat-containing protein At4g13650    e value: 1.5e-101    %identity: 31.34
Query:  NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSG
        +S+    N  +S +   G+L  A  +F+ M  R  V++N +I+G S+ G   +A+ L   MH + ++ +  T ++L+  C+  G    G+Q H    K G
Subjt:  NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSG

Query:  FQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEV
        F     +  ALL LYA   DI  A   F E   +N +LW++MLV Y   + + ++F +F ++   ++V                                
Subjt:  FQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEV

Query:  EPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREK-
         PN++T+  I++ C R+GDL  G+ IH  + K  F  +  +CS LI  Y++   +D A  +        + +  +++ G       + A   F ++ ++ 
Subjt:  EPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREK-

Query:  ---SPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS----SNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCR
             V     +   A    ++E +++  +       S     N ++++YSR G+I++++  FE  ++ G+ + WN+++SG+ Q+  + EAL++++ M R
Subjt:  ---SPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS----SNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCR

Query:  TSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEML
          ++ +  TF +  +A +   +++ G+ +HA   KT +DS   V  +LI MY+KCGSIS A+  F  V   N  ++ A+IN Y  HG G EA   F++M+
Subjt:  TSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEML

Query:  KHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAK
           V PN  TL+G+LSACS  G+V++G+A F SM + YG+ P  EHY CVVD+L R+G L  AKEFI+ MPI+ D ++W  LL+AC    ++E+GE  A 
Subjt:  KHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAK

Query:  KMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANVNSI
         +L L+P   + YV LSN+YA   KW  +   R++++   VKK  G SWI+V N IH F V D++HP  + I+   + L    + I
Subjt:  KMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANVNSI

Q9SY02 Pentatricopeptide repeat-containing protein At4g02750    e value: 3.8e-102    %identity: 32.98
Query:  ISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIF
        +S N  IS ++RNG  +LAR LF+EMP R +VSWN+MI GY +     +A  L   M   +V     +++T+LS  A +GC                   
Subjt:  ISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIF

Query:  ELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNE
                        +  A+ VFD + +KND+ W+ +L  YV+ + M++A  LF       +V W  L+ G+ + +     A + F SM +   V  N 
Subjt:  ELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNE

Query:  FTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSY
                                                +I+ Y+Q   ID A+ ++D      +    +++ G I +  + +A E+F K+ E++ VS+
Subjt:  FTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSY

Query:  NLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSA
        N ML GY    R+E +K LF+ M  + + + NTMI+ Y++ G+I +A  LF+ M    +PV+W +MI+GY Q+    EAL+L++ M R     +RS+FS+
Subjt:  NLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSA

Query:  LFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLL
            C  + +++ G+ LH   +K  +++  +VG +L+ MY KCGSI  A   F  +   ++ ++  +I GY  HG G  A   FE M +  + P+ AT++
Subjt:  LFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLL

Query:  GILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISA
         +LSACS  G+V++G   F++M   YGV+P  +HYAC+VDLLGR+G L +A   +++MP E D  IWG LL A     + EL E+ A K+ +++P     
Subjt:  GILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISA

Query:  YVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL
        YV LSN+YA  G+W +   +R ++R   VKK  G SWI++ NK H FSV D  HP  + I+A LE L
Subjt:  YVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL

Arabidopsis top hits    e value    %identity    Alignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein    e value: 1.5e-106    %identity: 33.74
Query:  NETTFSTLLSICAHSGCTPEGKQF-HCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV
        + + F+ LL  C  S  +    ++ H  V+KSGF     + + L+  Y+    +   +QVFD++  +N   W+ ++ G  K   +D+A  LF  +P RD 
Subjt:  NETTFSTLLSICAHSGCTPEGKQF-HCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDV

Query:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER
          W +++SG+A+ +  C+ AL  F  M     V  NE++F  ++ AC  + D+++G  +H ++ K  F  D  I SAL+  YS+C  +++A+ V+D M  
Subjt:  VVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMER

Query:  PCLNASNSLLEGLILSGRINDAEEIF---------------------C--------------------KLREKSPVSYNLMLKGYAMSGRIEESKRLFER
          + + NSL+     +G   +A ++F                     C                    KLR    +S N  +  YA   RI+E++ +F+ 
Subjt:  PCLNASNSLLEGLILSGRINDAEEIF---------------------C--------------------KLREKSPVSYNLMLKGYAMSGRIEESKRLFER

Query:  MTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAI
        M  + +I+  +MIS Y+      KA +L  +  +E N V+WN++I+GY QN ++ EAL L+  + R SV  +  +F+ + +AC  L  +  G   H H +
Subjt:  MTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAI

Query:  KTAF------DSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGM
        K  F      + +++VG SLIDMY KCG +      F  +   +  ++ A+I G+  +G G EA  +F EML+    P+  T++G+LSAC  AG V EG 
Subjt:  KTAF------DSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGM

Query:  AVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVE
          F SM   +GV P  +HY C+VDLLGR+G L EAK  I  MP++ D+VIWG+LL AC    ++ LG+ VA+K+L ++P+    YV LSN+YA+LGKW +
Subjt:  AVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVE

Query:  KINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANV
         +NVR+ +R   V K  GCSWI +    HVF V+D+SHP    I++ L+ L+A +
Subjt:  KINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANV

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein    e value: 8.5e-105    %identity: 31.69
Query:  STNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFE
        S N  +S + + G +D     F+++P R  VSW  MI GY  +G+Y +A+ +   M    ++  + T + +L+  A + C   GK+ H  ++K G +   
Subjt:  STNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIFE

Query:  LVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEF
         V ++LL +YA   D   AK VFD +  ++   W+ M+  +++   MD A   F ++  RD+V W ++ISG+ +  ++  RAL++F  M  +S + P+ F
Subjt:  LVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNEF

Query:  TFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNAS--NSLLEGLILSGRINDAEEIFCKLREKSPVS
        T   ++ AC  +  L  GK IH  +   GF     + +ALIS YS+C  ++ A+ + +      L      +LL+G I  G +N A+ IF  L+++    
Subjt:  TFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNAS--NSLLEGLILSGRINDAEEIFCKLREKSPVS

Query:  YNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFS
                                                                   + V W +MI GY Q+  +GEA+ L+ +M       +  T +
Subjt:  YNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFS

Query:  ALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP-NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGAT
        A+    + L S+  G+ +H  A+K+    +V V  +LI MY+K G+I+ A  +F  +    +  ++T++I     HG   EA  +FE ML   + P+  T
Subjt:  ALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP-NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGAT

Query:  LLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAI
         +G+ SAC+ AG+VN+G   F  M++   +IPTL HYAC+VDL GR+G L EA+EFI  MPIE D V WG+LL+AC    +++LG+  A+++L L+P   
Subjt:  LLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAI

Query:  SAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL
         AY  L+N+Y+  GKW E   +R+ ++  +VKK +G SWI+V +K+HVF VED +HP  N IY T++ +
Subjt:  SAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 5.3e-99    %identity: 30.65
Query:  KACLELFPTLCKGLHTE-------NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTL
        KAC  LF      L  E        S++   N  +  + R G L  AR +F+EMPVR +VSWN +ISGYS  G Y EAL +   +  + +  +  T S++
Subjt:  KACLELFPTLCKGLHTE-------NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTL

Query:  LSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISG
        L    +     +G+  H   LKSG     +V + L+ +Y      + A++VFDE+  ++ + ++ M+ GY+K  +++++  +F +               
Subjt:  LSICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISG

Query:  YARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSL
                   L+ F         +P+  T   ++RACG + DLS  K I+  + K GF  + ++ + LI  Y++C  +  A+ V++SME     + NS+
Subjt:  YARSEHNCQRALELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSL

Query:  LEGLILSGRINDAEEIFCKL----REKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS---------SNTMISVYSRNGEIDKAFKLFESMKSEGN
        + G I SG + +A ++F  +     +   ++Y +++   ++S R+ + K  F +  H   I          SN +I +Y++ GE+  + K+F SM   G+
Subjt:  LEGLILSGRINDAEEIFCKL----REKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS---------SNTMISVYSRNGEIDKAFKLFESMKSEGN

Query:  PVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP
         VTWN++IS  ++       L++   M ++ V    +TF      C  L + + G+ +H   ++  ++S + +G +LI+MYSKCG +  +   F  +   
Subjt:  PVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFP

Query:  NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMP
        +V  +T +I  Y  +G G +A   F +M K  ++P+    + I+ ACS +G+V+EG+A F  M+  Y + P +EHYACVVDLL RS ++ +A+EFI++MP
Subjt:  NVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMP

Query:  IEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNA
        I+ D  IW ++L AC    D+E  E V+++++ L+P+     +  SN YA L KW +   +R+ L+   + KN G SWI+V   +HVFS  D S P   A
Subjt:  IEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNA

Query:  IYATLEHL
        IY +LE L
Subjt:  IYATLEHL

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 2.7e-103    %identity: 32.98
Query:  ISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIF
        +S N  IS ++RNG  +LAR LF+EMP R +VSWN+MI GY +     +A  L   M   +V     +++T+LS  A +GC                   
Subjt:  ISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSGFQIF

Query:  ELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNE
                        +  A+ VFD + +KND+ W+ +L  YV+ + M++A  LF       +V W  L+ G+ + +     A + F SM +   V  N 
Subjt:  ELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEVEPNE

Query:  FTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSY
                                                +I+ Y+Q   ID A+ ++D      +    +++ G I +  + +A E+F K+ E++ VS+
Subjt:  FTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREKSPVSY

Query:  NLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSA
        N ML GY    R+E +K LF+ M  + + + NTMI+ Y++ G+I +A  LF+ M    +PV+W +MI+GY Q+    EAL+L++ M R     +RS+FS+
Subjt:  NLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSA

Query:  LFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLL
            C  + +++ G+ LH   +K  +++  +VG +L+ MY KCGSI  A   F  +   ++ ++  +I GY  HG G  A   FE M +  + P+ AT++
Subjt:  LFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLL

Query:  GILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISA
         +LSACS  G+V++G   F++M   YGV+P  +HYAC+VDLLGR+G L +A   +++MP E D  IWG LL A     + EL E+ A K+ +++P     
Subjt:  GILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISA

Query:  YVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL
        YV LSN+YA  G+W +   +R ++R   VKK  G SWI++ NK H FSV D  HP  + I+A LE L
Subjt:  YVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHL

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein    e value: 1.0e-102    %identity: 31.34
Query:  NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSG
        +S+    N  +S +   G+L  A  +F+ M  R  V++N +I+G S+ G   +A+ L   MH + ++ +  T ++L+  C+  G    G+Q H    K G
Subjt:  NSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLLSICAHSGCTPEGKQFHCLVLKSG

Query:  FQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEV
        F     +  ALL LYA   DI  A   F E   +N +LW++MLV Y   + + ++F +F ++   ++V                                
Subjt:  FQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRALELFCSMWMNSEV

Query:  EPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREK-
         PN++T+  I++ C R+GDL  G+ IH  + K  F  +  +CS LI  Y++   +D A  +        + +  +++ G       + A   F ++ ++ 
Subjt:  EPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLREK-

Query:  ---SPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS----SNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCR
             V     +   A    ++E +++  +       S     N ++++YSR G+I++++  FE  ++ G+ + WN+++SG+ Q+  + EAL++++ M R
Subjt:  ---SPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIIS----SNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCR

Query:  TSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEML
          ++ +  TF +  +A +   +++ G+ +HA   KT +DS   V  +LI MY+KCGSIS A+  F  V   N  ++ A+IN Y  HG G EA   F++M+
Subjt:  TSVERSRSTFSALFQACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEML

Query:  KHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAK
           V PN  TL+G+LSACS  G+V++G+A F SM + YG+ P  EHY CVVD+L R+G L  AKEFI+ MPI+ D ++W  LL+AC    ++E+GE  A 
Subjt:  KHKVLPNGATLLGILSACSCAGMVNEGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAK

Query:  KMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANVNSI
         +L L+P   + YV LSN+YA   KW  +   R++++   VKK  G SWI+V N IH F V D++HP  + I+   + L    + I
Subjt:  KMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQLRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANVNSI


Sequences
CDS sequence
ATGTTGAGGGTGTCATCATCTTTTGGAACATGGAAGCACAATCGTTCGAAAGCTTGCTTGGAACTATTCCCTACTCTATGTAAAGGTTTACATACTGAAAACTCCAATAT
TATTTCTACCAATATTTGTATAAGTCGTCATGTAAGAAATGGTCATCTTGATCTTGCTCGAACTCTGTTCAATGAAATGCCAGTAAGAAGTGTTGTCTCATGGAATATCA
TGATTTCTGGATACTCCAAATTAGGAAAGTATAGTGAAGCTCTCAATCTGGCTTCAGGGATGCATTGCAATAATGTAAAATTAAATGAGACGACCTTTTCTACTTTGTTG
AGTATTTGTGCACACTCAGGGTGTACACCTGAGGGAAAACAATTTCATTGTTTGGTCTTGAAATCTGGGTTTCAGATATTTGAGCTTGTCGGAAGTGCATTGTTGTATTT
GTATGCAAACATCAATGACATTAGTGGAGCTAAGCAAGTCTTTGATGAATTGCATGATAAGAATGATTTATTATGGAGCTTGATGCTTGTGGGGTATGTTAAATGCAATT
TGATGGATGATGCTTTTGATCTATTTACGAAAATTCCAACACGGGATGTGGTGGTGTGGACTACTTTGATATCAGGGTATGCAAGAAGTGAGCATAACTGCCAGAGGGCA
TTGGAATTATTCTGCTCCATGTGGATGAATAGTGAGGTTGAGCCTAACGAGTTCACTTTTGATTGTATTGTGAGAGCTTGTGGGAGAATGGGAGATTTGAGTCAGGGGAA
GGTTATTCATGGGATTTTGACTAAATATGGATTCCACTTCGATCATTCAATTTGTAGTGCACTGATTTCATTTTACTCTCAGTGTGAAGCTATTGACAATGCCAAGGCAG
TTTATGATAGTATGGAAAGACCGTGTCTAAATGCTTCAAATTCTCTTTTGGAGGGGCTAATATTATCAGGAAGAATTAATGATGCTGAAGAAATTTTTTGTAAGTTAAGA
GAAAAAAGTCCAGTTTCATATAATTTGATGCTTAAAGGGTATGCAATGAGTGGTAGAATTGAAGAATCAAAGAGATTATTTGAAAGAATGACTCATAAAACTATCATTTC
ATCAAACACTATGATATCTGTGTATTCCAGGAATGGCGAAATTGATAAAGCTTTCAAATTATTTGAGTCGATGAAGAGTGAAGGAAATCCTGTGACATGGAACTCAATGA
TATCGGGCTATATTCAAAATCATCAGCATGGAGAAGCTTTGAAACTCTATCTAACCATGTGCAGAACATCCGTTGAACGCTCGAGATCAACATTCTCTGCTCTGTTTCAA
GCATGCACATGCTTAGGATCTATTCAATTTGGTCAATCACTCCATGCGCATGCAATCAAGACGGCCTTTGACTCGAATGTTTATGTTGGAACATCACTCATAGATATGTA
CTCAAAATGTGGGAGCATCTCTGGTGCTCAAACTTCGTTTGCCAGTGTTTATTTCCCTAATGTGGCAGCTTTTACCGCTCTAATTAATGGATATGTGCATCATGGACTTG
GGATTGAAGCATTCTCAGTCTTTGAGGAGATGTTAAAGCACAAAGTTCTGCCAAATGGAGCTACTCTTTTGGGAATTCTTTCTGCATGTAGTTGTGCTGGTATGGTAAAT
GAAGGAATGGCAGTTTTCCATTCAATGGAAAACTGTTATGGTGTGATTCCAACTTTAGAACACTATGCTTGTGTGGTGGATCTTCTTGGTCGGTCAGGACGTCTGTATGA
AGCTAAAGAATTTATTAGAAGCATGCCAATTGAAGCTGATACAGTTATTTGGGGAGCTCTGCTAAATGCTTGTTGGTTTTGGATGGACTTGGAATTGGGTGAGAGTGTGG
CTAAGAAGATGCTTAGTTTGGACCCCAACGCAATATCTGCTTATGTTACTCTGTCTAATATATATGCTAAATTAGGGAAGTGGGTAGAGAAGATCAATGTGAGGAGGCAA
TTGAGGAGCTTAAAAGTGAAAAAGAATCGTGGTTGTAGCTGGATCGATGTAAATAATAAAATTCATGTTTTCTCTGTAGAAGATAGGTCCCATCCGAACTGTAATGCAAT
TTATGCAACTTTAGAGCATCTATTAGCAAATGTGAACTCTATAGCTCAACTTAACTGTGTTCCCAAATCTGTTCCGGAGGTTTCCTTTTCGCATTCAATATACTCCCTTT
GA
mRNA sequence
AACAACAAAAACAAAAAATGGGGCCTAAATTATTAGTTTTGTAGATAAGTCTACAGCCAAATGCTTTTCAATATTAATCAGGAGAAAGCCATTCCAATTCCAACCTCTTA
AAACCTCAAGAAAAGCCCTCGCGACTTCTTCTTCTTCCCAGCTATGGCAGCACAAAACAGATTGTATCCAAAGTTAACTTAACCACCATTCATACTCTTCTATTACACGG
CCGAATCGCCCCCCGAATTCCGAAAACCAGTGGCCGTTTCATCCTCTTCTTAGCTTTTGCAACCATCAACAAACGATTCTTGTTTTCATGCCCGCCATTGACATTGCGAA
GCAGGAGGCTTCAAACTGGGTGGACGAGGTCAGTTTTCTCGTAATTAATACACAATCATCCACTCGTACCTGAAACAATAATCGAGAAGCCTTAGAAGAAGGAAGAAACT
GGAATACGCTAAATGGTACTACTTGTTCTACCCTCATTTTCTGGTTCCATAGTTTTGAGTTTTCACTCACCCGCAGGCTAAACTATAAAAGAACATCCCCAAAAGAAGAA
GAAGAAATATTTCTCTCATAAATGTTTCCACAGTAACCCTCTCCTAGAGTGAAATCCTTTAGAACTTGGACGAAAATTGAAACTTGGAACAACGTGCCCGATTCAGACAA
AAATTAGTACTTGAAACGGGGTGGCGTGCCATTGGTGTGGTGATTCAAATTTTTGTTTTAGGCTATTGCTATGCGATTTCAGGCGCATGACCGCAATCTATGATTATAAC
TCATATGTGTTCTTTCAACCCAATTTGAAATGTTGAGGGTGTCATCATCTTTTGGAACATGGAAGCACAATCGTTCGAAAGCTTGCTTGGAACTATTCCCTACTCTATGT
AAAGGTTTACATACTGAAAACTCCAATATTATTTCTACCAATATTTGTATAAGTCGTCATGTAAGAAATGGTCATCTTGATCTTGCTCGAACTCTGTTCAATGAAATGCC
AGTAAGAAGTGTTGTCTCATGGAATATCATGATTTCTGGATACTCCAAATTAGGAAAGTATAGTGAAGCTCTCAATCTGGCTTCAGGGATGCATTGCAATAATGTAAAAT
TAAATGAGACGACCTTTTCTACTTTGTTGAGTATTTGTGCACACTCAGGGTGTACACCTGAGGGAAAACAATTTCATTGTTTGGTCTTGAAATCTGGGTTTCAGATATTT
GAGCTTGTCGGAAGTGCATTGTTGTATTTGTATGCAAACATCAATGACATTAGTGGAGCTAAGCAAGTCTTTGATGAATTGCATGATAAGAATGATTTATTATGGAGCTT
GATGCTTGTGGGGTATGTTAAATGCAATTTGATGGATGATGCTTTTGATCTATTTACGAAAATTCCAACACGGGATGTGGTGGTGTGGACTACTTTGATATCAGGGTATG
CAAGAAGTGAGCATAACTGCCAGAGGGCATTGGAATTATTCTGCTCCATGTGGATGAATAGTGAGGTTGAGCCTAACGAGTTCACTTTTGATTGTATTGTGAGAGCTTGT
GGGAGAATGGGAGATTTGAGTCAGGGGAAGGTTATTCATGGGATTTTGACTAAATATGGATTCCACTTCGATCATTCAATTTGTAGTGCACTGATTTCATTTTACTCTCA
GTGTGAAGCTATTGACAATGCCAAGGCAGTTTATGATAGTATGGAAAGACCGTGTCTAAATGCTTCAAATTCTCTTTTGGAGGGGCTAATATTATCAGGAAGAATTAATG
ATGCTGAAGAAATTTTTTGTAAGTTAAGAGAAAAAAGTCCAGTTTCATATAATTTGATGCTTAAAGGGTATGCAATGAGTGGTAGAATTGAAGAATCAAAGAGATTATTT
GAAAGAATGACTCATAAAACTATCATTTCATCAAACACTATGATATCTGTGTATTCCAGGAATGGCGAAATTGATAAAGCTTTCAAATTATTTGAGTCGATGAAGAGTGA
AGGAAATCCTGTGACATGGAACTCAATGATATCGGGCTATATTCAAAATCATCAGCATGGAGAAGCTTTGAAACTCTATCTAACCATGTGCAGAACATCCGTTGAACGCT
CGAGATCAACATTCTCTGCTCTGTTTCAAGCATGCACATGCTTAGGATCTATTCAATTTGGTCAATCACTCCATGCGCATGCAATCAAGACGGCCTTTGACTCGAATGTT
TATGTTGGAACATCACTCATAGATATGTACTCAAAATGTGGGAGCATCTCTGGTGCTCAAACTTCGTTTGCCAGTGTTTATTTCCCTAATGTGGCAGCTTTTACCGCTCT
AATTAATGGATATGTGCATCATGGACTTGGGATTGAAGCATTCTCAGTCTTTGAGGAGATGTTAAAGCACAAAGTTCTGCCAAATGGAGCTACTCTTTTGGGAATTCTTT
CTGCATGTAGTTGTGCTGGTATGGTAAATGAAGGAATGGCAGTTTTCCATTCAATGGAAAACTGTTATGGTGTGATTCCAACTTTAGAACACTATGCTTGTGTGGTGGAT
CTTCTTGGTCGGTCAGGACGTCTGTATGAAGCTAAAGAATTTATTAGAAGCATGCCAATTGAAGCTGATACAGTTATTTGGGGAGCTCTGCTAAATGCTTGTTGGTTTTG
GATGGACTTGGAATTGGGTGAGAGTGTGGCTAAGAAGATGCTTAGTTTGGACCCCAACGCAATATCTGCTTATGTTACTCTGTCTAATATATATGCTAAATTAGGGAAGT
GGGTAGAGAAGATCAATGTGAGGAGGCAATTGAGGAGCTTAAAAGTGAAAAAGAATCGTGGTTGTAGCTGGATCGATGTAAATAATAAAATTCATGTTTTCTCTGTAGAA
GATAGGTCCCATCCGAACTGTAATGCAATTTATGCAACTTTAGAGCATCTATTAGCAAATGTGAACTCTATAGCTCAACTTAACTGTGTTCCCAAATCTGTTCCGGAGGT
TTCCTTTTCGCATTCAATATACTCCCTTTGACTGGCTCTCTCTATATATTTTTCTTCATCTATGAGAGCATACCTCCCATGTTCTTTCACCTCCAAAAGTTACTTCCCCT
TTTGGAGCGCTCCGTCAATTCAATTGTACTAACACAGTTCAAAGAGTCTACGTTTTCTGTTATATATTTATTCACTAGTCCCAACAAGAAAACTCTCATGCTGGAAATGA
CAGCTGTAACTTGATTCTCGATGATGTTATTACTAAATAGTTCTGGAAAAAGTGCAGAGTTTCATCTATGACTTGACTGTAAAAACTCTTTTCCTTCAAGAAATTAAAGA
TTCAGCACAGTCAGTCTATAAAACTGTCATATAATCTGGCTAGATCCAATTTTGGGATTCTACCAGTGGAAGTTATTTTTGTGTATATATATATTTCGATTGGAAAATGT
TTTGGATGTTGACCAAAACTTGTTCTTTCAACTCACATTCCAAAATCACTAGAGTCTTCGGTTTCCAAGTTAGATGCTTTTCTCTCGTCTCCTCCAACTGTCCTGTTCAA
CTAATTTCTTCCTCCAGGCAGCTACTGTGAATAAAATGGTGAGTAGGCTGATTGATTTTAACACATCAATTTTAAAAGCTTGTCGGCATTTAACCTCCCAGACATTATGA
GACAGTGATGACAGAGTATATTTGTGTACTTTGGGATTCTTTATCTGTTGGCCAAAATCAAGGGTATTGTGCCTTGTCTTACTCAAAGGTATGGTTTTGTGGCTTTGTAC
TTGGGACAGTGGAATCTCTTAGTCTTATCGAGAAATTCATCGATGATAACCACCATACTTGGGACAAAAATGAGATCCAACTTTTCTCTACTTACCTGAAATTTTTGAGG
GTCTGGTTATATGTGGCTGCTTAGCACTAAGTTGACTCAGACCCAGTTCTTAAATGCTTCATTAAGGCCCACGTTTATACCTTTGGTTCATATTCCTTTCTCTTGTAAGG
ATCCTTTAAATTATGTTTGTTCTCTCTTCCTTATTGAAGTATCTTTTAGATAACCTGCAGAATGAGAGCTCCATCAGTATATTTCAGTTTCTGTTGGTTAATTGCCTTTC
CCCTTTTTCCTCGCTTGAAACGTACAGAAATTGGGTGATTAGGTGCAAGTAAAAACATCAAACAAGAATTAGGTTTGAAACTATTATATCATATGTATATATTGGATATA
TTGTAAAGTTACCCAAAAAAAAGTTCAGGAAACTTTTTTTTTATTGACGCAGCTATATATGTGTCATCTCTTCAGGATCATAGCTACTGCCTACTGGTATTGCTTCCAGA
AAGAGCAGCAATCGTATGGCCATGTTTAATGATTGTCATATACAGTATGTACATAATGGTTTCATGCGCATAAAAGTCATTTTATATGCAAATTTTTTGATGCATCATTG
CACATACAAATATGTGCATGATACGACCATGAATATACACCATTCGTACATGTGATACATACAATATACACATATACAAACACACGTCACCTGTCTGTGTACATCCAGAT
GTGGGTGGAAATTTTGAGTTTTGCAGGTATGTATTTCAATTGGCTTCTTGGTGAAGGTACGGAGGGGAAGAGAGAACATCTTAAAGGTTAGCGAACAGGCAACCTTTAGA
ATAAACTATCTTCAGCTAATAGCAGTGGATGAATTTCATAATAGCTCTGTGGCAAAATATCTTAATGGCTGTGACTGCATGTGCTGTACTGCCTATTTTTCTTTATATTA
GTTAAATACCAAAATCTTCCTTTGTTGCAAATTATGACACTTAATTAGTCTTTGTTGGGCACTTTACACCAATAATCACATGGATGAAGATTTGCAGAGCTTATAAAAAT
TAGTACTGATCTCACTAGTTTATATCTGATTGGTGCTGTGAATTGGAGAATTGATACTTAACCTTTGTGTGTTCTAGTGATTGTCCAGTTAGGTACAGGCTTGAGAATCA
TAGGATGCATTGATGATTTTTTTGCTTGACATTTTCTATTAGATTGCAGAGACAAATAACCAATTTAAATCTAGTTTATAGATTTCTTTTGAGAGAATCAGCACTCTCTT
TCTTCAGGTTGTAATTTGTATATCCATGTATTGGCCACTGTGTCGATGGCATTTTATTTGCTCTATTCAATAGTGGCTCGTTTCCATCTGTGCTCTTAGTTTAATATGCC
TAACCCAATTAACCCTTTCCCTCAGCAGCCACACCCTCACGTACTAATCCTTTTCCTTTATTTTCTTTTCTACTATAAGTAAGCAATATAGGTGGCTTAACATTACCCTC
CTCTTCAAGAACCACCTTGTTCTCAATGTGATAATCTAGAAATCGTTGGTTAAAATCATCAAATATTTCCCGAGTCGCTTCAAAAGCTGGCAAACCCTTCCACCCCACTA
ACACTTCCCAAA
Protein sequence
MLRVSSSFGTWKHNRSKACLELFPTLCKGLHTENSNIISTNICISRHVRNGHLDLARTLFNEMPVRSVVSWNIMISGYSKLGKYSEALNLASGMHCNNVKLNETTFSTLL
SICAHSGCTPEGKQFHCLVLKSGFQIFELVGSALLYLYANINDISGAKQVFDELHDKNDLLWSLMLVGYVKCNLMDDAFDLFTKIPTRDVVVWTTLISGYARSEHNCQRA
LELFCSMWMNSEVEPNEFTFDCIVRACGRMGDLSQGKVIHGILTKYGFHFDHSICSALISFYSQCEAIDNAKAVYDSMERPCLNASNSLLEGLILSGRINDAEEIFCKLR
EKSPVSYNLMLKGYAMSGRIEESKRLFERMTHKTIISSNTMISVYSRNGEIDKAFKLFESMKSEGNPVTWNSMISGYIQNHQHGEALKLYLTMCRTSVERSRSTFSALFQ
ACTCLGSIQFGQSLHAHAIKTAFDSNVYVGTSLIDMYSKCGSISGAQTSFASVYFPNVAAFTALINGYVHHGLGIEAFSVFEEMLKHKVLPNGATLLGILSACSCAGMVN
EGMAVFHSMENCYGVIPTLEHYACVVDLLGRSGRLYEAKEFIRSMPIEADTVIWGALLNACWFWMDLELGESVAKKMLSLDPNAISAYVTLSNIYAKLGKWVEKINVRRQ
LRSLKVKKNRGCSWIDVNNKIHVFSVEDRSHPNCNAIYATLEHLLANVNSIAQLNCVPKSVPEVSFSHSIYSL