; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031981 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031981
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein, putative
Genome locationscaffold11:44523555..44528907
RNA-Seq ExpressionSpg031981
SyntenySpg031981
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0016210 - naringenin-chalcone synthase activity (molecular function)
GO:0102128 - chalcone synthase activity (molecular function)
InterPro domainsIPR001099 - Chalcone/stilbene synthase, N-terminal
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012328 - Chalcone/stilbene synthase, C-terminal
IPR016039 - Thiolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144015.1 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like [Momordica charantia]0.0e+0085.05Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MAAIS  K RA  L     L     LPALRCYSS+LTKTE+KSSTKTKKAR MARMIN KPWSNDL+SSLASLSPSLSKTTVLQTLGFLRD SKALQFFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQE GY HTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRG VKLEARFFNSLMRNFSRAGLFQESIK+FTTMKSHGVSPSVVTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEML TYGVTPDT+TFNILIRGFCMNGMV+EGF IFKD+SRFGCEPD+ITYNTLVDGLCRAGKVTIAYNVV+GMGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREID+AL VFEEM+ LGLKANNITYNTLIKGL E  KFEKIKEILE TA DG FSPDTCTFNTL+HCHCDAG LDEA RVFE+MTELK+QPDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVLVRSLCQGGHYEKAE LLDKLLE+EILLSG+GCKPLVAAYNPIFKYL ENGKT KAE+VFRQLM+RGTQDPPS+KT+IMGHC EGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LR+DFLPD EIYESLIDGLL+KDK LLAL TLEKMLRSSHLPKSSTFHSILAKLLE GSASESA+LIQ+MLDKNIRQNL+FSTGCIRLLFGAGMNDKAF+
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IV LLYENGYSV+MEELIHYLCHCKKVI+ASKMLLF LE+HQSV+MDV N VI  LCEINKL+EAFSLYYKLVE+GVHQQLSCQ+QLK SLEAGGKLEEA
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRMEPPPKFK-NSNPQNILQSSLQGLSQSERWRGIWT--LKREDEIWNH---EGYVLKDLEHGNFKDGITAPEREPKEICRR-------SLQTKTI
         F+SKRME     K   +  +IL  S + +S     RG+     +REDEIW+    + + L  L            E      C         SLQT T 
Subjt:  AFVSKRMEPPPKFK-NSNPQNILQSSLQGLSQSERWRGIWT--LKREDEIWNH---EGYVLKDLEHGNFKDGITAPEREPKEICRR-------SLQTKTI

Query:  NILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELV
         IL+KLP  SKMSCEG A  +HA+ARRVPTPGKATILA+GKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELV
Subjt:  NILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELV

Query:  TEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGS
        TEGSPTIRQRLEIANPAVVEMATEASKACI+EWGRSVEDITHIVYVSSSEIRLPGGDLYIA+RLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGS
Subjt:  TEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGS

Query:  RILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCR
        RILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAA IIGA+PVPGQESPFMELNYAIQQFLP TH VIDGRLSEEGINFKLGRDLP+RIDDNIEEFCR
Subjt:  RILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCR

Query:  KLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        KLM +  LVDFND+FWAVHPGGPAILNKLESTL+LKS KLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
Subjt:  KLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

XP_022950146.1 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like [Cucurbita moschata]0.0e+0084.67Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MA ISS+KD A  +F+ QS N+HRRLP LRCYSS+LT+ ETKSS +TKKARDMARMIN KPWSNDL+SSLAS SPSLSKTTVLQTLGFLRDPSKAL+FFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQEMG+AHT+QSYFS+LEILGRNRHLN ARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESI +FTTMKSHGVSPS+VTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPD+ITYNTLVDGLCR GKVTIAYNVVK MGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREI+ AL VFEEMV+LGLKANNITYNTLIKGLCEA KFEK+KEILEATA DGTFSPDTCTFNTLMHCHCDAG LDEALRVFERMT+LKI+PDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVL+RSLC+G +YEKAE LLDKLLE+ ILLS +GCKPLVAAYNPIFKYLCENGK KKAE VFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LRKDFLPD+E+YE+LI+G LHKDKPLLALQTLEKMLRSSHLP+SSTFHSIL KLLE G+ASESASLIQ+MLDKNIRQNL FSTGCIRLLF AG+NDKAFQ
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IVR+LY NGYSV+MEELI +LCHCKK I+ASKMLLFSLESHQ+VD+DVC+ VIFHLC+INKL+EAF LYYKLVEMGVHQQLSC NQLK SLEAGGK EE 
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRMEPPPKFKNSNPQN-------ILQSSLQGLSQSERWRG--IWTLKREDEIW-----NHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKT
         FVSKRMEP  K+K+ +          +L   L   S +    G  I+  +  + IW     N   + L++L  G     +         I    + T  
Subjt:  AFVSKRMEPPPKFKNSNPQN-------ILQSSLQGLSQSERWRG--IWTLKREDEIW-----NHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKT

Query:  --INILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYP
           +I  KLPN SKMSCEG AKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYP
Subjt:  --INILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYP

Query:  ELVTEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENN
        ELVTEGS TIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENN
Subjt:  ELVTEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENN

Query:  PGSRILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEE
        PGSRILLTTSETTILGFRPPNN RPYDLVGAALFGDGAA  IIGADPV GQESPFMELNYAIQQFLPDTH VIDGRLSE+GINF LGRDLPQRID+NIEE
Subjt:  PGSRILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEE

Query:  FCRKLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        FCRKLMGKGKLV+FN++FWAVHPGGPAILNKLESTLRLKS+KLECSRKALMDYGNVSSNTIFYVIE MRE LKREDGEEWGLALAFGPGITFEGILIRSL
Subjt:  FCRKLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

XP_022974264.1 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like [Cucurbita maxima]0.0e+0085.43Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MA ISS+KD A  +F+ QS N+HRRLP+LRCYSS+LT+ ETKSS KTKKARDMARMIN KPWSNDL+SSLAS SPSLSKTTVLQTLGFLRDPSKAL+FFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQEMGYAHTEQS+FSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESI +FTTMKSHGVSPS+VTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPD+ITYNTLVDGLCR GK+TIAYNVVK MGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREI+ AL VFEEMV+LGLKANNITYNTLIKGLCEA KFEK+KEILEATA DGTFSPDTCTFNTLMHCHCDAG LDEALRVFERMT+LK++PDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVL+RSLC+G +YEKAE LLDKLLE+ ILLS +GCKPLVAAYNPI KYLCENGK KKAE VFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LRKDFLPD+E+YESLI+G LH DKPLLALQTLEKMLRSSHLP+SSTFHSIL KLLE G+ASESASLIQ+MLD+NIRQNL FSTGCIRLLF AG+NDKAFQ
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IVR+LY NGYSV+MEELI +LC+CKKVI+ASKMLLFSLESHQ+VD+DVC+ VIFHLC INKL+EAF LYYKLVEMGVHQQLSCQNQLKVSLE GGK EEA
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRMEPPPKFKNSNPQ---NILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTS
         FVSKRMEP  K+K +  +   + L+   + +   +R  G W   R   I    G +   + H N  + +           +        +I  KLPN S
Subjt:  AFVSKRMEPPPKFKNSNPQ---NILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTS

Query:  KMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR
        KMSCEG AKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR
Subjt:  KMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR

Query:  LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT
        LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT
Subjt:  LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT

Query:  ILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVD
        ILGFRPPNN RPYDLVGAALFGDGAA  IIGADPV GQESPFMELNYAIQQFLPDTH VIDGRLSE+GINF LGRDLPQRID+NIEEFCRKLMGKGKLV+
Subjt:  ILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVD

Query:  FNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        FN++FWAVHPGGPAILNKLESTLRLKS+KLECSRKALMDYGNVSSNTIFYVIE MRE LKRE+GEEWGLALAFGPGITFEGILIRSL
Subjt:  FNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

XP_023539432.1 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0085.76Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MA ISS+KD A  +F+ QS N+HRRLP+LRCYSS+LT+ ETKS  KTKKARDMARMIN KPWSNDL+SSLAS SPSLSKTTVLQTLGFLRDPSKAL+FFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESI +FTTMKSHGVSPS+VTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPD+ITYNTLVDGLCR G+VTIAYNVVK MGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREI+ AL VFEEMV+LGLKANNITYNTLIKGLCEA KFEK+KEILEATA DGTFSPDTCTFN LMHCHCDAG LDEALRVFERMT+LKIQPDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVL+RSLC+G +YEKAE LLDKLLE+ ILLS +GCKPLVAAYNPI KYLCENGK KKAE VFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LRKDFLPD+E+YESLI+G LHKDKPLLALQTLEKMLRSSHLP+SSTFHSIL KLLE G+ASESASLIQ+MLDKNIRQNL FSTGCIRLLF AG+NDKAFQ
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IVR+LY NGYSV+MEELI +LCHCKKVI+ASKMLLFSLESHQ+VD+DVC+ VIFHLC+INKL+EAF LYYKLVEMGVHQ+LSCQNQLKVSLE GGK EEA
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRMEPPPKFKNSNPQNILQSSLQGLSQSERWRGIWTLKREDEIWN---HEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTS
         FVSKRMEP  K K +  +                  I+  +  + IW+     G VL+ L    F    T    +              +I  KLPN S
Subjt:  AFVSKRMEPPPKFKNSNPQNILQSSLQGLSQSERWRGIWTLKREDEIWN---HEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTS

Query:  KMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR
        KMSCEG AKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR
Subjt:  KMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR

Query:  LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT
        LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT
Subjt:  LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT

Query:  ILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVD
        ILGFRPPNN RPYDLVGAALFGDGAA  IIGADPV GQESPFMELNYAIQQFLPDTH VIDGRLSE+GINF LGRDLPQRID+NIEEFCRKLMGKGKLV+
Subjt:  ILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVD

Query:  FNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        FN++FWAVHPGGPAILNKLESTLRLKS+KLECSRKALMDYGNVSSNTIFYVIE MRE LKREDGEEWGLALAFGPGITFEGILIRSL
Subjt:  FNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

XP_038896237.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like [Benincasa hispida]0.0e+0087.29Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MAAISS KD+A  LF  QSLN +RRLP LRCYSS+LTKTETKSSTKT+KAR MAR+IN KPWS +L+SSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQE GYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRG VKLEARFFNSLMRNFSRAGLFQESIK+FTTMKSHGVSPSVVTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEML TYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCE D+ITYNTLVDGLCRAGKVT+AYNVVKGMGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREID+AL VFEEMV+ GLKANNITYNTLIKGLCEA +FEK+KEIL+ATAGDGTFSPDTCTFNTLMH HC +  LD+ALRVFERMTELKI PDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVLVRSLCQGGHYEKAE+LLDKLLER+ILLS +GCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHC+EGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LRKDFLPDLEIYESLI+GLLHKDKPLLALQ+LEKMLRSSHLPKSSTFHSILAKLLE GSASESASLIQ+MLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IV LLYENGYSV+MEELIHYLC C KVI+ASKMLLFSLESHQSVD+DVCN VIF LCEINKL+EAFSLYYKLVEMGVHQQLSCQNQLKVSLEAG KLEEA
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRME----PPPKFKNSNPQNILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNT
          VSKRME     P   K S P  IL+        +++W G+   +    +   E +++  L   N   G             +    K INI  KLPN 
Subjt:  AFVSKRME----PPPKFKNSNPQNILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNT

Query:  SKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQ
        S+MSCEG A LDH RARRVPTP  ATILA+GKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQ
Subjt:  SKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQ

Query:  RLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSET
        RLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILL TSET
Subjt:  RLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSET

Query:  TILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLV
        TILGFRPPNNARPYDLVGAALFGDGAAA IIGADPVPGQESPFMELNYA+QQFLPDTH VIDGRLSEEGINFKLGRDLPQRIDDNIE+FCRKLMGKG+LV
Subjt:  TILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLV

Query:  DFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        DFND+FW VHPGGPAILNKLE+TLRLKS KLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
Subjt:  DFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

TrEMBL top hitse value%identityAlignment
A0A6J1CQG4 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like0.0e+0085.05Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MAAIS  K RA  L     L     LPALRCYSS+LTKTE+KSSTKTKKAR MARMIN KPWSNDL+SSLASLSPSLSKTTVLQTLGFLRD SKALQFFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQE GY HTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRG VKLEARFFNSLMRNFSRAGLFQESIK+FTTMKSHGVSPSVVTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEML TYGVTPDT+TFNILIRGFCMNGMV+EGF IFKD+SRFGCEPD+ITYNTLVDGLCRAGKVTIAYNVV+GMGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREID+AL VFEEM+ LGLKANNITYNTLIKGL E  KFEKIKEILE TA DG FSPDTCTFNTL+HCHCDAG LDEA RVFE+MTELK+QPDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVLVRSLCQGGHYEKAE LLDKLLE+EILLSG+GCKPLVAAYNPIFKYL ENGKT KAE+VFRQLM+RGTQDPPS+KT+IMGHC EGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LR+DFLPD EIYESLIDGLL+KDK LLAL TLEKMLRSSHLPKSSTFHSILAKLLE GSASESA+LIQ+MLDKNIRQNL+FSTGCIRLLFGAGMNDKAF+
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IV LLYENGYSV+MEELIHYLCHCKKVI+ASKMLLF LE+HQSV+MDV N VI  LCEINKL+EAFSLYYKLVE+GVHQQLSCQ+QLK SLEAGGKLEEA
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRMEPPPKFK-NSNPQNILQSSLQGLSQSERWRGIWT--LKREDEIWNH---EGYVLKDLEHGNFKDGITAPEREPKEICRR-------SLQTKTI
         F+SKRME     K   +  +IL  S + +S     RG+     +REDEIW+    + + L  L            E      C         SLQT T 
Subjt:  AFVSKRMEPPPKFK-NSNPQNILQSSLQGLSQSERWRGIWT--LKREDEIWNH---EGYVLKDLEHGNFKDGITAPEREPKEICRR-------SLQTKTI

Query:  NILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELV
         IL+KLP  SKMSCEG A  +HA+ARRVPTPGKATILA+GKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELV
Subjt:  NILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELV

Query:  TEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGS
        TEGSPTIRQRLEIANPAVVEMATEASKACI+EWGRSVEDITHIVYVSSSEIRLPGGDLYIA+RLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGS
Subjt:  TEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGS

Query:  RILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCR
        RILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAA IIGA+PVPGQESPFMELNYAIQQFLP TH VIDGRLSEEGINFKLGRDLP+RIDDNIEEFCR
Subjt:  RILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCR

Query:  KLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        KLM +  LVDFND+FWAVHPGGPAILNKLESTL+LKS KLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
Subjt:  KLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

A0A6J1GE00 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like0.0e+0084.67Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MA ISS+KD A  +F+ QS N+HRRLP LRCYSS+LT+ ETKSS +TKKARDMARMIN KPWSNDL+SSLAS SPSLSKTTVLQTLGFLRDPSKAL+FFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQEMG+AHT+QSYFS+LEILGRNRHLN ARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESI +FTTMKSHGVSPS+VTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPD+ITYNTLVDGLCR GKVTIAYNVVK MGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREI+ AL VFEEMV+LGLKANNITYNTLIKGLCEA KFEK+KEILEATA DGTFSPDTCTFNTLMHCHCDAG LDEALRVFERMT+LKI+PDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVL+RSLC+G +YEKAE LLDKLLE+ ILLS +GCKPLVAAYNPIFKYLCENGK KKAE VFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LRKDFLPD+E+YE+LI+G LHKDKPLLALQTLEKMLRSSHLP+SSTFHSIL KLLE G+ASESASLIQ+MLDKNIRQNL FSTGCIRLLF AG+NDKAFQ
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IVR+LY NGYSV+MEELI +LCHCKK I+ASKMLLFSLESHQ+VD+DVC+ VIFHLC+INKL+EAF LYYKLVEMGVHQQLSC NQLK SLEAGGK EE 
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRMEPPPKFKNSNPQN-------ILQSSLQGLSQSERWRG--IWTLKREDEIW-----NHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKT
         FVSKRMEP  K+K+ +          +L   L   S +    G  I+  +  + IW     N   + L++L  G     +         I    + T  
Subjt:  AFVSKRMEPPPKFKNSNPQN-------ILQSSLQGLSQSERWRG--IWTLKREDEIW-----NHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKT

Query:  --INILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYP
           +I  KLPN SKMSCEG AKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYP
Subjt:  --INILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYP

Query:  ELVTEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENN
        ELVTEGS TIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENN
Subjt:  ELVTEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENN

Query:  PGSRILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEE
        PGSRILLTTSETTILGFRPPNN RPYDLVGAALFGDGAA  IIGADPV GQESPFMELNYAIQQFLPDTH VIDGRLSE+GINF LGRDLPQRID+NIEE
Subjt:  PGSRILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEE

Query:  FCRKLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        FCRKLMGKGKLV+FN++FWAVHPGGPAILNKLESTLRLKS+KLECSRKALMDYGNVSSNTIFYVIE MRE LKREDGEEWGLALAFGPGITFEGILIRSL
Subjt:  FCRKLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

A0A6J1IH36 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like0.0e+0085.43Show/hide
Query:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN
        MA ISS+KD A  +F+ QS N+HRRLP+LRCYSS+LT+ ETKSS KTKKARDMARMIN KPWSNDL+SSLAS SPSLSKTTVLQTLGFLRDPSKAL+FFN
Subjt:  MAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFN

Query:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT
        WAQEMGYAHTEQS+FSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESI +FTTMKSHGVSPS+VTFNSLL+ILLKRGRT
Subjt:  WAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRT

Query:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR
        NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPD+ITYNTLVDGLCR GK+TIAYNVVK MGKKSVDLNPNVVTYTTLIR
Subjt:  NMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIR

Query:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT
        GYCAKREI+ AL VFEEMV+LGLKANNITYNTLIKGLCEA KFEK+KEILEATA DGTFSPDTCTFNTLMHCHCDAG LDEALRVFERMT+LK++PDSAT
Subjt:  GYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSAT

Query:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
        YSVL+RSLC+G +YEKAE LLDKLLE+ ILLS +GCKPLVAAYNPI KYLCENGK KKAE VFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM
Subjt:  YSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLM

Query:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ
        LRKDFLPD+E+YESLI+G LH DKPLLALQTLEKMLRSSHLP+SSTFHSIL KLLE G+ASESASLIQ+MLD+NIRQNL FSTGCIRLLF AG+NDKAFQ
Subjt:  LRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQ

Query:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA
        IVR+LY NGYSV+MEELI +LC+CKKVI+ASKMLLFSLESHQ+VD+DVC+ VIFHLC INKL+EAF LYYKLVEMGVHQQLSCQNQLKVSLE GGK EEA
Subjt:  IVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEA

Query:  AFVSKRMEPPPKFKNSNPQ---NILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTS
         FVSKRMEP  K+K +  +   + L+   + +   +R  G W   R   I    G +   + H N  + +           +        +I  KLPN S
Subjt:  AFVSKRMEPPPKFKNSNPQ---NILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTS

Query:  KMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR
        KMSCEG AKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKC DATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR
Subjt:  KMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQR

Query:  LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT
        LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT
Subjt:  LEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETT

Query:  ILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVD
        ILGFRPPNN RPYDLVGAALFGDGAA  IIGADPV GQESPFMELNYAIQQFLPDTH VIDGRLSE+GINF LGRDLPQRID+NIEEFCRKLMGKGKLV+
Subjt:  ILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVD

Query:  FNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL
        FN++FWAVHPGGPAILNKLESTLRLKS+KLECSRKALMDYGNVSSNTIFYVIE MRE LKRE+GEEWGLALAFGPGITFEGILIRSL
Subjt:  FNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGITFEGILIRSL

A0A6P5U031 pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like0.0e+0070.2Show/hide
Query:  LRCYSSQLTKT------------ETKSST-KTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFNWAQEMGYAHTEQSYF
        LRCYSSQ T+             + KSST KTK A+DMAR++N  PWS++L+SSL+++S SLSKTTV QTL  ++ P KALQFF W +  G++H +QSYF
Subjt:  LRCYSSQLTKT------------ETKSST-KTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFNWAQEMGYAHTEQSYF

Query:  SMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYG
         MLEILGR R+LN ARN LFSIEK S GAVKLE RFFNSL+RN+ RAGLFQESIK+FTTMKS GVSPSVV+FNSLLSILLK+GRTNMAKNVYDEMLS YG
Subjt:  SMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYG

Query:  VTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVF
        VTPDT+TFNILIRGFCMN MVDEG+R FKD+S F C+PD+ITYNTLVDGLCRAGKV IA+NVVKGM K+S DL PNVVTYTTLIRGYCAK+EID+AL + 
Subjt:  VTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVF

Query:  EEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVLVRSLCQGGHYE
        EEM + GLK N  TYNTLIKGLCEA K +KIKEILE T   G F+PDTCTFNTLMH HC+AG LDEAL+VF +M+ELK+ PDSATYSVL+RSLCQ G Y 
Subjt:  EEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVLVRSLCQGGHYE

Query:  KAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKDFLPDLEIYESL
        +AEEL D+L ++EILL  +GCKPLVA+YNPIF YL  NGKT+KAE+VFRQLMRRGTQDP SYKTLIMG+C EGT+E+GYELLV MLR+DF+PD EIY SL
Subjt:  KAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKDFLPDLEIYESL

Query:  IDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQIVRLLYENGYSVRME
        IDGLL K KPLLA QTLEKML+SSHLP++STFHS+LA+LL+   A ESAS + +ML+K IRQN++ ST  IRLLF +G+ DKAF+I  +LYENGYS++ME
Subjt:  IDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQIVRLLYENGYSVRME

Query:  ELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKN
        EL+ +LC  +K+++A +ML FSL+ HQSVD+D  N VI  LC+INKL+EAF LYY+LVE   +QQL C + LK +LE  G+  EA F+SKR+   P+   
Subjt:  ELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKN

Query:  SNPQNILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTSKMSCEGLAKLDHARARRV
           Q +L  S +   Q   +  ++ +  E         V   +   NF+   T             L    I I+LK+         G +K  HA +R  
Subjt:  SNPQNILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTSKMSCEGLAKLDHARARRV

Query:  PTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKA
        PTPGKAT+LA+GKAFPSQLVPQ+CLVEGYIRDTKC D  IKEKLERLCKTTTVKTRYTVM KEILDKYPEL TEGS TIRQRLEIANPAVV+MA EAS +
Subjt:  PTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKA

Query:  CIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGA
        CIKEWGR VEDITHIVYVSSSEIRLPGGDLY+A++LGL+NDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSR+LLTTSETTILGFRPPN ARPYDLVGA
Subjt:  CIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGA

Query:  ALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVDFNDMFWAVHPGGPAILNK
        ALFGDGAAA I+G++P PGQE+PFMELNYA+QQFLPDTH VIDGRLSEEGINFKLGRDLPQ+ID+NIEEFC+KLM K  L DFN++FWAVHPGGPAILNK
Subjt:  ALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVDFNDMFWAVHPGGPAILNK

Query:  LESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENL--KREDGEEWGLALAFGPGITFEGILIRSL
        LESTL+L S+KLECSR+ALMDYGNVSSNTIFYV+ENMRE L  K+E+ EEWGLALAFGPGITFEGIL+RSL
Subjt:  LESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENL--KREDGEEWGLALAFGPGITFEGILIRSL

A0A6P9E129 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like0.0e+0067.82Show/hide
Query:  MAAISSYKDRAMFLFK---GQSLNLHRRLPALRCYSSQLTKT----------------ETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTT
        MA ISS + R   L +       + HR  P  RCY SQ + T                ++K+S KTK+A+ MAR+INYKPWS++L+SSL+SLSPSLSKT 
Subjt:  MAAISSYKDRAMFLFK---GQSLNLHRRLPALRCYSSQLTKT----------------ETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTT

Query:  VLQTLGFLRDPSKALQFFNWAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVS
        VLQTL  +R PSKA QFFNW  EMG++HT+QSYF MLEILGR+R+LN ARN LFS+EKRS G VKLE RFFNSL+RN+ RAGLFQESIK+FTTMKS GVS
Subjt:  VLQTLGFLRDPSKALQFFNWAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVS

Query:  PSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGM
        PSVVTFNS+L+ILL+RGRTNMAK++YDEML TYGVTPDT+TFNILIRGFC N MVDEGF+ FK +S F CEPD++TYNTLVDGLCR GK  IA+NVVKGM
Subjt:  PSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGM

Query:  GKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDE
         KKS+DLNPNVVTYT LIRGYC K+EID ALV+ EEM S GLK N ITYNTLIKGLCEA   EKIKEILE     G F+PDTCTFNT++H HC AG LDE
Subjt:  GKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDE

Query:  ALRVFERMTELKIQPDSATYSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLI
        AL VFE+M+EL++ PDSATYSVL+RSLCQ G Y +AE+L  +L E++ILL   GCKPLVAAYNP+F+YLC NGKT KAE+VFRQLM+RG QDPPSYKTLI
Subjt:  ALRVFERMTELKIQPDSATYSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLI

Query:  MGHCNEGTFESGYELLVLMLRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSF
        MGHC EGT+E+GYEL+V MLR+DF+PD EIYESLI G L K KPLLA QTLEKML+SSHLPK+STFHSILA+L+E G A ES+S I +ML++ IRQN++ 
Subjt:  MGHCNEGTFESGYELLVLMLRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSF

Query:  STGCIRLLFGAGMNDKAFQIVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQL
        ST CIRLL   G+ DKAF+IV ++YENGYSV++EEL+ +LC  +K+++A  M LFSL+ HQ++D + CN +I  LC++ KL+EAF LYY+LVE GVHQQL
Subjt:  STGCIRLLFGAGMNDKAFQIVRLLYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQL

Query:  SCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKNSNPQNILQSSLQGLSQSERWRGIWTLK------REDEIWNHEGYVLKDLEHGNFKDGITAPEREPK
        +C   LK +L+AGG+ ++A FVSKRM           ++  +  L G     R   +  +       R  +        +  +    F     +P     
Subjt:  SCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKNSNPQNILQSSLQGLSQSERWRGIWTLK------REDEIWNHEGYVLKDLEHGNFKDGITAPEREPK

Query:  EICRRSLQTKTINILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVM
              +QT+   +  +L   S+ +  G      A+ RR+PTPGKAT+LA+GKAFPSQL+PQECLVEGYIRDTKC D +IKEKLERLCKTTTVKTRYTVM
Subjt:  EICRRSLQTKTINILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVM

Query:  CKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLR
         KEILDKYPELVTEGSPTI+QRLEIANPAV+EMA EAS ACIKEWGR  EDITHIVYVSSSEIRLPGGDLY+A++LGL+NDVGRVMLYFLGCYGGVTGLR
Subjt:  CKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLR

Query:  VAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLP
        VAKDIAENNPGSR+LLTTSETTILGFRPPN  RPYDLVGAALFGDGAAA IIG +P+ GQESPFMELNYA+QQFLP TH VIDG+LSEEGINF+LGRDLP
Subjt:  VAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLP

Query:  QRIDDNIEEFCRKLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGIT
        Q+I+DNIEEFC+KLM K  L++FND+FWAVHPGGPAILN+LESTL L SEKLECSR+ALMDYGNVSSNTIFYVI+ MRE LKR   EEWGLALAFGPGIT
Subjt:  QRIDDNIEEFCRKLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKREDGEEWGLALAFGPGIT

Query:  FEGILIRSL
        FEGILIR L
Subjt:  FEGILIRSL

SwissProt top hitse value%identityAlignment
O23674 Type III polyketide synthase A1.6e-17277.04Show/hide
Query:  SKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQ
        S     G+ KL     RRV   GKAT+LA+GKAFPSQ+VPQE LVEG++RDTKC DA IKEKLE LCKTTTVKTRYTV+ +EIL KYPEL TEGSPTI+Q
Subjt:  SKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQ

Query:  RLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSET
        RLEIAN AVVEMA EAS  CIKEWGR VEDITHIVYVSSSEIRLPGGDLY++ +LGL+NDV RVMLYFLGCYGGVTGLRVAKDIAENNPGSR+LLTTSET
Subjt:  RLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSET

Query:  TILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKG--K
        TILGFRPPN ARPYDLVGAALFGDGAAA IIGADP    E+PFMEL+YA+QQFLP T  VI+GRL+EEGINFKLGRDLPQ+I++NIEEFC+KLMGK   +
Subjt:  TILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKG--K

Query:  LVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR--EDGEEWGLALAFGPGITFEGILIRSL
         ++FNDMFWAVHPGGPAILN+LE+ L+L+ EKLE SR+AL+DYGNVSSNTI YV+E MR+ LK+  +  +EWGL LAFGPGITFEG+LIRSL
Subjt:  LVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR--EDGEEWGLALAFGPGITFEGILIRSL

O81305 Type III polyketide synthase C1.1e-16073.02Show/hide
Query:  RARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMAT
        + +RV   GKAT+LA+GKA PS +V QE LVE Y+R+ KC + +IK+KL+ LCK+TTVKTRYTVM +E L KYPEL TEGSPTI+QRLEIAN AVV+MA 
Subjt:  RARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMAT

Query:  EASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPY
        EAS  CIKEWGR+VEDITH+VYVSSSE RLPGGDLY++ +LGL N+V RVMLYFLGCYGG++GLRVAKDIAENNPGSR+LLTTSETT+LGFRPPN ARPY
Subjt:  EASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPY

Query:  DLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGK--GKLVDFNDMFWAVHPG
        +LVGAALFGDGAAA IIGADP    ESPFMEL+ A+QQFLP T GVIDGRLSEEGI FKLGRDLPQ+I+DN+EEFC+KL+ K     ++ ND+FWAVHPG
Subjt:  DLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGK--GKLVDFNDMFWAVHPG

Query:  GPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKRE--DGEEWGLALAFGPGITFEGILIRSL
        GPAIL+ LE+ L+LK EKLECSR+ALMDYGNVSSNTIFY+++ +R+ L+++  +GEEWGL LAFGPGITFEG L+R+L
Subjt:  GPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKRE--DGEEWGLALAFGPGITFEGILIRSL

O81908 Pentatricopeptide repeat-containing protein At1g02060, chloroplastic1.1e-23959.46Show/hide
Query:  KDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPS--LSKTTVLQTLGFLRDPSKALQFFNWAQEM
        K  A+F+ K Q        P LR  ++++T  E   STK+K AR +AR +N  PWS++L+SSL+SL PS  +S+TTVLQTL  ++ P+  L+FF+W    
Subjt:  KDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPS--LSKTTVLQTLGFLRDPSKALQFFNWAQEM

Query:  GYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKN
        G++H EQS+F MLE LGR R+LN ARNFLFSIE+RS G VKL+ R+FNSL+R++  AGLFQES+K+F TMK  G+SPSV+TFNSLLSILLKRGRT MA +
Subjt:  GYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKN

Query:  VYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAK
        ++DEM  TYGVTPD++TFN LI GFC N MVDE FRIFKD+  + C PD++TYNT++DGLCRAGKV IA+NV+ GM KK+ D++PNVV+YTTL+RGYC K
Subjt:  VYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAK

Query:  REIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEIL-EATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVL
        +EID A++VF +M+S GLK N +TYNTLIKGL EAH++++IK+IL        TF+PD CTFN L+  HCDAG LD A++VF+ M  +K+ PDSA+YSVL
Subjt:  REIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEIL-EATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVL

Query:  VRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKD
        +R+LC    +++AE L ++L E+E+LL  + CKPL AAYNP+F+YLC NGKTK+AEKVFRQLM+RG QDPPSYKTLI GHC EG F+  YELLVLMLR++
Subjt:  VRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKD

Query:  FLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQIVRL
        F+PDLE YE LIDGLL   + LLA  TL++MLRSS+LP ++TFHS+LA+L +   A+ES  L+ +ML+K IRQN+  ST  +RLLF +   +KAF IVRL
Subjt:  FLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQIVRL

Query:  LYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVS
        LY+NGY V+MEEL+ YLC  +K++DA  ++LF LE  Q VD+D CN VI  LC+  + +EAFSLY +LVE+G HQQLSC   L+ +LEA GK EE  FVS
Subjt:  LYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVS

Query:  KRM
        KRM
Subjt:  KRM

Q8LDM2 Type III polyketide synthase B2.1e-14567.91Show/hide
Query:  PGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKACI
        PGKATILA+GKAFP QLV QE LV+GY + TKC D  +K+KL RLCKTTTVKTRY VM +EIL KYPEL  EG  T+ QRL+I N AV EMA EAS+ACI
Subjt:  PGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKACI

Query:  KEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGAAL
        K WGRS+ DITH+VYVSSSE RLPGGDLY+A  LGL  D  RV+LYF+GC GGV GLRVAKDIAENNPGSR+LL TSETTI+GF+PP+  RPYDLVG AL
Subjt:  KEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGAAL

Query:  FGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLV--DFNDMFWAVHPGGPAILNK
        FGDGA A IIG+DP P  E P  EL+ AIQ FLP+T   IDGRL+E+GINFKL R+LPQ I+DN+E FC+KL+GK  L   ++N MFWAVHPGGPAILN+
Subjt:  FGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLV--DFNDMFWAVHPGGPAILNK

Query:  LESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR-----EDGEEWGLALAFGPGITFEGILIRSL
        +E  L L  EKL  SR+ALMDYGN SSN+I YV+E M E  K+     E+  EWGL LAFGPG+TFEGI+ R+L
Subjt:  LESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR-----EDGEEWGLALAFGPGITFEGILIRSL

Q9ZUU3 Pentatricopeptide repeat-containing protein At2g372302.5e-10133.05Show/hide
Query:  LTKTET----------KSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFNWAQEMG-YAHTEQSYFSMLEILGRN
        LT TET          K     K    + RM++ + W+  LQ+S+  L P    + V   L   +    ALQFF W +  G   H   ++  M+++LG  
Subjt:  LTKTET----------KSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFNWAQEMG-YAHTEQSYFSMLEILGRN

Query:  RHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYGVTPDTFTFN
          LN AR  L  + ++    V  +   F  L+ ++ +AG+ QES+KIF  MK  GV  ++ ++NSL  ++L+RGR  MAK  +++M+S  GV P   T+N
Subjt:  RHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYGVTPDTFTFN

Query:  ILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVFEEMVSLGLK
        +++ GF ++  ++   R F+D+   G  PD  T+NT+++G CR  K+  A  +   M  K   + P+VV+YTT+I+GY A   +D  L +FEEM S G++
Subjt:  ILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVFEEMVSLGLK

Query:  ANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVLVRSLCQGGHYEKAEELLDKL
         N  TY+TL+ GLC+A K  + K IL+          D   F  L+     AG +  A  V + M  L +  ++  Y VL+ + C+   Y +A +LLD L
Subjt:  ANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVLVRSLCQGGHYEKAEELLDKL

Query:  LEREILLSGNGCKPL-VAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKDFLPDLEIYESLIDGLLHKD
        +E+EI+L       +  +AYNPI +YLC NG+T KAE +FRQLM+RG QD  +   LI GH  EG  +S YE+L +M R+    +   YE LI   + K 
Subjt:  LEREILLSGNGCKPL-VAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKDFLPDLEIYESLIDGLLHKD

Query:  KPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKN--IRQNLSFSTGCIRLLFGAGMNDKAFQIVRLLYENGYSVRMEELIHYL
        +P  A   L+ M+   H+P SS F S++  L E G    ++ ++ +M+DKN  I  N+      +  L   G  ++A   + LL +NG++  ++ L+  L
Subjt:  KPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKN--IRQNLSFSTGCIRLLFGAGMNDKAFQIVRLLYENGYSVRMEELIHYL

Query:  CHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKNSN
            K I A K+L F LE   S++    + V+  L    K   A+S+  K++E G        ++L  SL   G  ++A  +S+ ++     K  N
Subjt:  CHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKNSN

Arabidopsis top hitse value%identityAlignment
AT1G02050.1 Chalcone and stilbene synthase family protein1.1e-17377.04Show/hide
Query:  SKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQ
        S     G+ KL     RRV   GKAT+LA+GKAFPSQ+VPQE LVEG++RDTKC DA IKEKLE LCKTTTVKTRYTV+ +EIL KYPEL TEGSPTI+Q
Subjt:  SKMSCEGLAKLDHARARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQ

Query:  RLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSET
        RLEIAN AVVEMA EAS  CIKEWGR VEDITHIVYVSSSEIRLPGGDLY++ +LGL+NDV RVMLYFLGCYGGVTGLRVAKDIAENNPGSR+LLTTSET
Subjt:  RLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSET

Query:  TILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKG--K
        TILGFRPPN ARPYDLVGAALFGDGAAA IIGADP    E+PFMEL+YA+QQFLP T  VI+GRL+EEGINFKLGRDLPQ+I++NIEEFC+KLMGK   +
Subjt:  TILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKG--K

Query:  LVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR--EDGEEWGLALAFGPGITFEGILIRSL
         ++FNDMFWAVHPGGPAILN+LE+ L+L+ EKLE SR+AL+DYGNVSSNTI YV+E MR+ LK+  +  +EWGL LAFGPGITFEG+LIRSL
Subjt:  LVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR--EDGEEWGLALAFGPGITFEGILIRSL

AT1G02060.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-24159.46Show/hide
Query:  KDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPS--LSKTTVLQTLGFLRDPSKALQFFNWAQEM
        K  A+F+ K Q        P LR  ++++T  E   STK+K AR +AR +N  PWS++L+SSL+SL PS  +S+TTVLQTL  ++ P+  L+FF+W    
Subjt:  KDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMARMINYKPWSNDLQSSLASLSPS--LSKTTVLQTLGFLRDPSKALQFFNWAQEM

Query:  GYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKN
        G++H EQS+F MLE LGR R+LN ARNFLFSIE+RS G VKL+ R+FNSL+R++  AGLFQES+K+F TMK  G+SPSV+TFNSLLSILLKRGRT MA +
Subjt:  GYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKN

Query:  VYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAK
        ++DEM  TYGVTPD++TFN LI GFC N MVDE FRIFKD+  + C PD++TYNT++DGLCRAGKV IA+NV+ GM KK+ D++PNVV+YTTL+RGYC K
Subjt:  VYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAK

Query:  REIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEIL-EATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVL
        +EID A++VF +M+S GLK N +TYNTLIKGL EAH++++IK+IL        TF+PD CTFN L+  HCDAG LD A++VF+ M  +K+ PDSA+YSVL
Subjt:  REIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEIL-EATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVL

Query:  VRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKD
        +R+LC    +++AE L ++L E+E+LL  + CKPL AAYNP+F+YLC NGKTK+AEKVFRQLM+RG QDPPSYKTLI GHC EG F+  YELLVLMLR++
Subjt:  VRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKD

Query:  FLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQIVRL
        F+PDLE YE LIDGLL   + LLA  TL++MLRSS+LP ++TFHS+LA+L +   A+ES  L+ +ML+K IRQN+  ST  +RLLF +   +KAF IVRL
Subjt:  FLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQIVRL

Query:  LYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVS
        LY+NGY V+MEEL+ YLC  +K++DA  ++LF LE  Q VD+D CN VI  LC+  + +EAFSLY +LVE+G HQQLSC   L+ +LEA GK EE  FVS
Subjt:  LYENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVS

Query:  KRM
        KRM
Subjt:  KRM

AT2G37230.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-10233.05Show/hide
Query:  LTKTET----------KSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFNWAQEMG-YAHTEQSYFSMLEILGRN
        LT TET          K     K    + RM++ + W+  LQ+S+  L P    + V   L   +    ALQFF W +  G   H   ++  M+++LG  
Subjt:  LTKTET----------KSSTKTKKARDMARMINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFNWAQEMG-YAHTEQSYFSMLEILGRN

Query:  RHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYGVTPDTFTFN
          LN AR  L  + ++    V  +   F  L+ ++ +AG+ QES+KIF  MK  GV  ++ ++NSL  ++L+RGR  MAK  +++M+S  GV P   T+N
Subjt:  RHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLFQESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYGVTPDTFTFN

Query:  ILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVFEEMVSLGLK
        +++ GF ++  ++   R F+D+   G  PD  T+NT+++G CR  K+  A  +   M  K   + P+VV+YTT+I+GY A   +D  L +FEEM S G++
Subjt:  ILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAYNVVKGMGKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVFEEMVSLGLK

Query:  ANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVLVRSLCQGGHYEKAEELLDKL
         N  TY+TL+ GLC+A K  + K IL+          D   F  L+     AG +  A  V + M  L +  ++  Y VL+ + C+   Y +A +LLD L
Subjt:  ANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRVFERMTELKIQPDSATYSVLVRSLCQGGHYEKAEELLDKL

Query:  LEREILLSGNGCKPL-VAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKDFLPDLEIYESLIDGLLHKD
        +E+EI+L       +  +AYNPI +YLC NG+T KAE +FRQLM+RG QD  +   LI GH  EG  +S YE+L +M R+    +   YE LI   + K 
Subjt:  LEREILLSGNGCKPL-VAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYELLVLMLRKDFLPDLEIYESLIDGLLHKD

Query:  KPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKN--IRQNLSFSTGCIRLLFGAGMNDKAFQIVRLLYENGYSVRMEELIHYL
        +P  A   L+ M+   H+P SS F S++  L E G    ++ ++ +M+DKN  I  N+      +  L   G  ++A   + LL +NG++  ++ L+  L
Subjt:  KPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKN--IRQNLSFSTGCIRLLFGAGMNDKAFQIVRLLYENGYSVRMEELIHYL

Query:  CHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKNSN
            K I A K+L F LE   S++    + V+  L    K   A+S+  K++E G        ++L  SL   G  ++A  +S+ ++     K  N
Subjt:  CHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKNSN

AT4G00040.1 Chalcone and stilbene synthase family protein7.5e-16273.02Show/hide
Query:  RARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMAT
        + +RV   GKAT+LA+GKA PS +V QE LVE Y+R+ KC + +IK+KL+ LCK+TTVKTRYTVM +E L KYPEL TEGSPTI+QRLEIAN AVV+MA 
Subjt:  RARRVPTPGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMAT

Query:  EASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPY
        EAS  CIKEWGR+VEDITH+VYVSSSE RLPGGDLY++ +LGL N+V RVMLYFLGCYGG++GLRVAKDIAENNPGSR+LLTTSETT+LGFRPPN ARPY
Subjt:  EASKACIKEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPY

Query:  DLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGK--GKLVDFNDMFWAVHPG
        +LVGAALFGDGAAA IIGADP    ESPFMEL+ A+QQFLP T GVIDGRLSEEGI FKLGRDLPQ+I+DN+EEFC+KL+ K     ++ ND+FWAVHPG
Subjt:  DLVGAALFGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGK--GKLVDFNDMFWAVHPG

Query:  GPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKRE--DGEEWGLALAFGPGITFEGILIRSL
        GPAIL+ LE+ L+LK EKLECSR+ALMDYGNVSSNTIFY+++ +R+ L+++  +GEEWGL LAFGPGITFEG L+R+L
Subjt:  GPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKRE--DGEEWGLALAFGPGITFEGILIRSL

AT4G34850.1 Chalcone and stilbene synthase family protein1.5e-14667.91Show/hide
Query:  PGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKACI
        PGKATILA+GKAFP QLV QE LV+GY + TKC D  +K+KL RLCKTTTVKTRY VM +EIL KYPEL  EG  T+ QRL+I N AV EMA EAS+ACI
Subjt:  PGKATILAIGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKACI

Query:  KEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGAAL
        K WGRS+ DITH+VYVSSSE RLPGGDLY+A  LGL  D  RV+LYF+GC GGV GLRVAKDIAENNPGSR+LL TSETTI+GF+PP+  RPYDLVG AL
Subjt:  KEWGRSVEDITHIVYVSSSEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGAAL

Query:  FGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLV--DFNDMFWAVHPGGPAILNK
        FGDGA A IIG+DP P  E P  EL+ AIQ FLP+T   IDGRL+E+GINFKL R+LPQ I+DN+E FC+KL+GK  L   ++N MFWAVHPGGPAILN+
Subjt:  FGDGAAAAIIGADPVPGQESPFMELNYAIQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLV--DFNDMFWAVHPGGPAILNK

Query:  LESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR-----EDGEEWGLALAFGPGITFEGILIRSL
        +E  L L  EKL  SR+ALMDYGN SSN+I YV+E M E  K+     E+  EWGL LAFGPG+TFEGI+ R+L
Subjt:  LESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMRENLKR-----EDGEEWGLALAFGPGITFEGILIRSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCTAAAGCACTTGCGGTCTACCATACAATTCATCTGCCCGTAGTAGCATTCTCGGCAGCAATAAACCCCACTGGCGACTGGTTCATTCTCTCGGAAAACCTCCCCTTCGC
TCTCGCTCTGCGGACATCTACTGCATGCGAGATAAAGAGTATCGGCTACATTCCAATGGCGGCAATTTCCAGTTACAAGGACAGGGCGATGTTTCTTTTCAAAGGGCAAT
CGTTAAACTTGCATCGGCGTCTTCCAGCTCTCAGGTGCTATTCTTCTCAACTAACCAAGACCGAAACCAAATCATCGACCAAAACCAAAAAGGCCAGAGACATGGCAAGG
ATGATCAACTATAAACCTTGGTCGAATGACCTCCAATCATCTCTCGCTTCACTCTCACCCTCTCTCTCTAAAACCACAGTTCTTCAAACTCTGGGTTTCCTGAGAGACCC
ATCCAAAGCCCTTCAATTCTTCAATTGGGCACAAGAAATGGGTTACGCTCACACTGAGCAGTCCTACTTCTCGATGCTAGAAATTTTGGGTCGCAACCGGCATCTTAATA
CGGCTAGGAATTTTCTGTTTTCGATCGAAAAACGGTCTCGTGGAGCAGTCAAACTCGAAGCTCGCTTCTTCAATAGCTTAATGAGGAACTTTAGTCGAGCGGGACTCTTT
CAAGAGTCTATAAAGATTTTTACGACGATGAAATCACACGGGGTTTCCCCATCGGTTGTTACATTCAACAGTCTTTTGTCCATTTTGCTTAAAAGGGGTAGGACCAATAT
GGCCAAGAATGTGTATGATGAAATGCTTAGTACTTACGGAGTGACTCCGGATACATTTACATTCAATATTTTGATTAGAGGATTTTGTATGAATGGCATGGTTGATGAAG
GTTTTAGGATTTTCAAGGACTTGTCTCGCTTTGGATGTGAACCGGATATTATCACGTATAACACACTTGTCGATGGATTGTGTAGGGCAGGTAAGGTTACCATAGCATAT
AATGTGGTGAAGGGAATGGGAAAGAAAAGCGTGGATCTGAATCCCAATGTTGTTACATACACTACTTTGATTAGAGGTTACTGTGCAAAGCGAGAGATCGATAGAGCCTT
GGTTGTTTTTGAAGAAATGGTTAGTCTAGGCTTGAAAGCAAACAACATAACCTACAATACTCTAATTAAGGGCCTTTGTGAAGCCCACAAATTTGAGAAAATAAAAGAGA
TATTGGAGGCAACAGCAGGAGATGGAACATTTTCTCCTGACACATGCACATTCAACACTTTGATGCATTGCCATTGTGATGCTGGAAAGTTGGATGAAGCCCTGAGAGTG
TTCGAGAGGATGACAGAATTAAAGATTCAACCAGATTCGGCTACATACAGTGTATTGGTTAGAAGTTTGTGTCAAGGAGGGCATTATGAGAAGGCAGAAGAGTTGTTAGA
TAAACTATTAGAGAGAGAAATCTTGTTAAGTGGCAATGGGTGTAAGCCTCTCGTCGCTGCATATAACCCCATTTTTAAGTATTTATGTGAAAATGGAAAGACTAAGAAAG
CTGAAAAAGTATTTAGACAGCTAATGAGAAGAGGAACACAAGACCCTCCATCTTACAAGACTTTGATCATGGGGCATTGTAACGAAGGTACATTTGAATCTGGGTATGAG
CTACTAGTCTTGATGTTGAGGAAAGATTTTTTACCAGATTTGGAGATATATGAATCCTTAATTGATGGGCTTTTGCACAAGGATAAGCCACTTCTTGCCCTTCAGACACT
GGAAAAGATGCTGAGGAGCTCCCATCTTCCTAAATCATCTACTTTTCACTCTATACTTGCAAAACTTCTAGAACACGGAAGTGCATCCGAATCTGCTAGTCTTATACAGG
TAATGCTAGACAAAAATATTAGACAAAACCTCAGTTTTTCAACTGGTTGTATAAGACTACTTTTTGGAGCTGGCATGAACGACAAAGCATTCCAAATTGTTCGCCTGCTT
TATGAAAATGGCTATTCAGTTAGAATGGAAGAACTAATTCATTATCTTTGCCACTGTAAAAAGGTTATAGATGCATCTAAAATGTTGCTATTTAGTTTGGAGAGTCATCA
ATCTGTCGACATGGATGTTTGCAATCTAGTAATTTTTCACCTTTGTGAAATTAATAAGTTGGCTGAAGCATTTAGTCTGTACTATAAACTGGTGGAGATGGGAGTCCATC
AACAGCTAAGCTGTCAAAACCAATTGAAAGTTTCCCTTGAGGCTGGGGGGAAATTGGAAGAGGCTGCGTTTGTATCGAAAAGGATGGAACCACCGCCAAAATTTAAGAAT
TCCAATCCACAAAATATTCTGCAGTCCTCTTTACAAGGTCTAAGCCAATCAGAGCGTTGGCGTGGGATATGGACTTTGAAGAGAGAGGATGAAATTTGGAATCATGAAGG
CTATGTCTTGAAAGACTTGGAGCATGGCAATTTCAAAGATGGTATAACGGCTCCTGAAAGAGAACCGAAAGAGATTTGTCGCAGGTCTTTACAGACAAAGACTATAAACA
TCTTGTTGAAGCTCCCAAATACTTCAAAGATGAGTTGTGAGGGTTTGGCTAAGCTCGATCATGCACGAGCAAGGCGTGTTCCAACACCCGGGAAGGCAACAATCCTTGCA
ATAGGCAAGGCATTTCCGAGCCAACTCGTTCCTCAAGAATGCTTGGTTGAGGGTTACATTCGTGATACAAAATGCTTCGATGCAACTATCAAGGAAAAATTGGAACGTTT
ATGCAAAACGACCACTGTGAAGACAAGATACACTGTCATGTGCAAAGAGATCTTGGATAAGTATCCTGAGCTTGTAACCGAGGGCTCACCCACCATCCGACAGAGGTTGG
AAATTGCTAACCCTGCAGTTGTTGAGATGGCCACTGAAGCAAGCAAAGCCTGTATCAAAGAATGGGGGAGGTCTGTCGAAGATATCACCCACATTGTCTATGTTTCTTCT
AGCGAAATCCGCTTACCTGGTGGGGATCTTTACATTGCTAATCGGCTCGGTTTGAAGAACGATGTCGGTCGGGTGATGCTATACTTTCTCGGTTGTTATGGTGGTGTCAC
TGGACTCAGGGTTGCCAAAGACATAGCAGAAAACAACCCAGGAAGTCGCATTTTGTTAACGACTTCCGAGACTACAATCCTTGGATTTCGCCCCCCAAACAACGCACGCC
CATACGACCTCGTCGGGGCTGCACTTTTCGGTGATGGAGCTGCAGCTGCGATCATTGGAGCGGACCCTGTGCCGGGGCAAGAATCTCCTTTCATGGAGCTGAACTATGCC
ATCCAGCAATTCTTGCCCGACACCCACGGCGTGATCGATGGGAGGCTTTCTGAGGAGGGAATAAATTTCAAACTTGGTAGAGACCTTCCACAGAGAATAGATGACAACAT
AGAAGAGTTCTGCAGAAAGCTGATGGGAAAGGGGAAGCTGGTGGATTTTAATGACATGTTCTGGGCAGTTCATCCTGGTGGGCCAGCAATTCTGAACAAACTAGAGAGCA
CTCTGAGGTTGAAAAGTGAGAAGCTTGAATGCAGCAGAAAGGCATTGATGGACTATGGAAATGTTAGCAGCAACACAATCTTCTATGTCATTGAGAACATGAGAGAGAAT
CTGAAGAGAGAGGATGGGGAAGAATGGGGACTGGCTTTGGCATTTGGACCAGGCATTACTTTTGAAGGCATTCTCATTCGTAGCCTCTGA
mRNA sequenceShow/hide mRNA sequence
TCTAAAGCACTTGCGGTCTACCATACAATTCATCTGCCCGTAGTAGCATTCTCGGCAGCAATAAACCCCACTGGCGACTGGTTCATTCTCTCGGAAAACCTCCCCTTCGC
TCTCGCTCTGCGGACATCTACTGCATGCGAGATAAAGAGTATCGGCTACATTCCAATGGCGGCAATTTCCAGTTACAAGGACAGGGCGATGTTTCTTTTCAAAGGGCAAT
CGTTAAACTTGCATCGGCGTCTTCCAGCTCTCAGGTGCTATTCTTCTCAACTAACCAAGACCGAAACCAAATCATCGACCAAAACCAAAAAGGCCAGAGACATGGCAAGG
ATGATCAACTATAAACCTTGGTCGAATGACCTCCAATCATCTCTCGCTTCACTCTCACCCTCTCTCTCTAAAACCACAGTTCTTCAAACTCTGGGTTTCCTGAGAGACCC
ATCCAAAGCCCTTCAATTCTTCAATTGGGCACAAGAAATGGGTTACGCTCACACTGAGCAGTCCTACTTCTCGATGCTAGAAATTTTGGGTCGCAACCGGCATCTTAATA
CGGCTAGGAATTTTCTGTTTTCGATCGAAAAACGGTCTCGTGGAGCAGTCAAACTCGAAGCTCGCTTCTTCAATAGCTTAATGAGGAACTTTAGTCGAGCGGGACTCTTT
CAAGAGTCTATAAAGATTTTTACGACGATGAAATCACACGGGGTTTCCCCATCGGTTGTTACATTCAACAGTCTTTTGTCCATTTTGCTTAAAAGGGGTAGGACCAATAT
GGCCAAGAATGTGTATGATGAAATGCTTAGTACTTACGGAGTGACTCCGGATACATTTACATTCAATATTTTGATTAGAGGATTTTGTATGAATGGCATGGTTGATGAAG
GTTTTAGGATTTTCAAGGACTTGTCTCGCTTTGGATGTGAACCGGATATTATCACGTATAACACACTTGTCGATGGATTGTGTAGGGCAGGTAAGGTTACCATAGCATAT
AATGTGGTGAAGGGAATGGGAAAGAAAAGCGTGGATCTGAATCCCAATGTTGTTACATACACTACTTTGATTAGAGGTTACTGTGCAAAGCGAGAGATCGATAGAGCCTT
GGTTGTTTTTGAAGAAATGGTTAGTCTAGGCTTGAAAGCAAACAACATAACCTACAATACTCTAATTAAGGGCCTTTGTGAAGCCCACAAATTTGAGAAAATAAAAGAGA
TATTGGAGGCAACAGCAGGAGATGGAACATTTTCTCCTGACACATGCACATTCAACACTTTGATGCATTGCCATTGTGATGCTGGAAAGTTGGATGAAGCCCTGAGAGTG
TTCGAGAGGATGACAGAATTAAAGATTCAACCAGATTCGGCTACATACAGTGTATTGGTTAGAAGTTTGTGTCAAGGAGGGCATTATGAGAAGGCAGAAGAGTTGTTAGA
TAAACTATTAGAGAGAGAAATCTTGTTAAGTGGCAATGGGTGTAAGCCTCTCGTCGCTGCATATAACCCCATTTTTAAGTATTTATGTGAAAATGGAAAGACTAAGAAAG
CTGAAAAAGTATTTAGACAGCTAATGAGAAGAGGAACACAAGACCCTCCATCTTACAAGACTTTGATCATGGGGCATTGTAACGAAGGTACATTTGAATCTGGGTATGAG
CTACTAGTCTTGATGTTGAGGAAAGATTTTTTACCAGATTTGGAGATATATGAATCCTTAATTGATGGGCTTTTGCACAAGGATAAGCCACTTCTTGCCCTTCAGACACT
GGAAAAGATGCTGAGGAGCTCCCATCTTCCTAAATCATCTACTTTTCACTCTATACTTGCAAAACTTCTAGAACACGGAAGTGCATCCGAATCTGCTAGTCTTATACAGG
TAATGCTAGACAAAAATATTAGACAAAACCTCAGTTTTTCAACTGGTTGTATAAGACTACTTTTTGGAGCTGGCATGAACGACAAAGCATTCCAAATTGTTCGCCTGCTT
TATGAAAATGGCTATTCAGTTAGAATGGAAGAACTAATTCATTATCTTTGCCACTGTAAAAAGGTTATAGATGCATCTAAAATGTTGCTATTTAGTTTGGAGAGTCATCA
ATCTGTCGACATGGATGTTTGCAATCTAGTAATTTTTCACCTTTGTGAAATTAATAAGTTGGCTGAAGCATTTAGTCTGTACTATAAACTGGTGGAGATGGGAGTCCATC
AACAGCTAAGCTGTCAAAACCAATTGAAAGTTTCCCTTGAGGCTGGGGGGAAATTGGAAGAGGCTGCGTTTGTATCGAAAAGGATGGAACCACCGCCAAAATTTAAGAAT
TCCAATCCACAAAATATTCTGCAGTCCTCTTTACAAGGTCTAAGCCAATCAGAGCGTTGGCGTGGGATATGGACTTTGAAGAGAGAGGATGAAATTTGGAATCATGAAGG
CTATGTCTTGAAAGACTTGGAGCATGGCAATTTCAAAGATGGTATAACGGCTCCTGAAAGAGAACCGAAAGAGATTTGTCGCAGGTCTTTACAGACAAAGACTATAAACA
TCTTGTTGAAGCTCCCAAATACTTCAAAGATGAGTTGTGAGGGTTTGGCTAAGCTCGATCATGCACGAGCAAGGCGTGTTCCAACACCCGGGAAGGCAACAATCCTTGCA
ATAGGCAAGGCATTTCCGAGCCAACTCGTTCCTCAAGAATGCTTGGTTGAGGGTTACATTCGTGATACAAAATGCTTCGATGCAACTATCAAGGAAAAATTGGAACGTTT
ATGCAAAACGACCACTGTGAAGACAAGATACACTGTCATGTGCAAAGAGATCTTGGATAAGTATCCTGAGCTTGTAACCGAGGGCTCACCCACCATCCGACAGAGGTTGG
AAATTGCTAACCCTGCAGTTGTTGAGATGGCCACTGAAGCAAGCAAAGCCTGTATCAAAGAATGGGGGAGGTCTGTCGAAGATATCACCCACATTGTCTATGTTTCTTCT
AGCGAAATCCGCTTACCTGGTGGGGATCTTTACATTGCTAATCGGCTCGGTTTGAAGAACGATGTCGGTCGGGTGATGCTATACTTTCTCGGTTGTTATGGTGGTGTCAC
TGGACTCAGGGTTGCCAAAGACATAGCAGAAAACAACCCAGGAAGTCGCATTTTGTTAACGACTTCCGAGACTACAATCCTTGGATTTCGCCCCCCAAACAACGCACGCC
CATACGACCTCGTCGGGGCTGCACTTTTCGGTGATGGAGCTGCAGCTGCGATCATTGGAGCGGACCCTGTGCCGGGGCAAGAATCTCCTTTCATGGAGCTGAACTATGCC
ATCCAGCAATTCTTGCCCGACACCCACGGCGTGATCGATGGGAGGCTTTCTGAGGAGGGAATAAATTTCAAACTTGGTAGAGACCTTCCACAGAGAATAGATGACAACAT
AGAAGAGTTCTGCAGAAAGCTGATGGGAAAGGGGAAGCTGGTGGATTTTAATGACATGTTCTGGGCAGTTCATCCTGGTGGGCCAGCAATTCTGAACAAACTAGAGAGCA
CTCTGAGGTTGAAAAGTGAGAAGCTTGAATGCAGCAGAAAGGCATTGATGGACTATGGAAATGTTAGCAGCAACACAATCTTCTATGTCATTGAGAACATGAGAGAGAAT
CTGAAGAGAGAGGATGGGGAAGAATGGGGACTGGCTTTGGCATTTGGACCAGGCATTACTTTTGAAGGCATTCTCATTCGTAGCCTCTGA
Protein sequenceShow/hide protein sequence
SKALAVYHTIHLPVVAFSAAINPTGDWFILSENLPFALALRTSTACEIKSIGYIPMAAISSYKDRAMFLFKGQSLNLHRRLPALRCYSSQLTKTETKSSTKTKKARDMAR
MINYKPWSNDLQSSLASLSPSLSKTTVLQTLGFLRDPSKALQFFNWAQEMGYAHTEQSYFSMLEILGRNRHLNTARNFLFSIEKRSRGAVKLEARFFNSLMRNFSRAGLF
QESIKIFTTMKSHGVSPSVVTFNSLLSILLKRGRTNMAKNVYDEMLSTYGVTPDTFTFNILIRGFCMNGMVDEGFRIFKDLSRFGCEPDIITYNTLVDGLCRAGKVTIAY
NVVKGMGKKSVDLNPNVVTYTTLIRGYCAKREIDRALVVFEEMVSLGLKANNITYNTLIKGLCEAHKFEKIKEILEATAGDGTFSPDTCTFNTLMHCHCDAGKLDEALRV
FERMTELKIQPDSATYSVLVRSLCQGGHYEKAEELLDKLLEREILLSGNGCKPLVAAYNPIFKYLCENGKTKKAEKVFRQLMRRGTQDPPSYKTLIMGHCNEGTFESGYE
LLVLMLRKDFLPDLEIYESLIDGLLHKDKPLLALQTLEKMLRSSHLPKSSTFHSILAKLLEHGSASESASLIQVMLDKNIRQNLSFSTGCIRLLFGAGMNDKAFQIVRLL
YENGYSVRMEELIHYLCHCKKVIDASKMLLFSLESHQSVDMDVCNLVIFHLCEINKLAEAFSLYYKLVEMGVHQQLSCQNQLKVSLEAGGKLEEAAFVSKRMEPPPKFKN
SNPQNILQSSLQGLSQSERWRGIWTLKREDEIWNHEGYVLKDLEHGNFKDGITAPEREPKEICRRSLQTKTINILLKLPNTSKMSCEGLAKLDHARARRVPTPGKATILA
IGKAFPSQLVPQECLVEGYIRDTKCFDATIKEKLERLCKTTTVKTRYTVMCKEILDKYPELVTEGSPTIRQRLEIANPAVVEMATEASKACIKEWGRSVEDITHIVYVSS
SEIRLPGGDLYIANRLGLKNDVGRVMLYFLGCYGGVTGLRVAKDIAENNPGSRILLTTSETTILGFRPPNNARPYDLVGAALFGDGAAAAIIGADPVPGQESPFMELNYA
IQQFLPDTHGVIDGRLSEEGINFKLGRDLPQRIDDNIEEFCRKLMGKGKLVDFNDMFWAVHPGGPAILNKLESTLRLKSEKLECSRKALMDYGNVSSNTIFYVIENMREN
LKREDGEEWGLALAFGPGITFEGILIRSL