CuGenDBv2

Tan0017653 (gene) of Snake gourd v1 genome

Gene ID: Tan0017653
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG08:71668424..71670427
RNA-Seq Expression: Tan0017653
Synteny: Tan0017653
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
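The genome location in this record has the form `sequence:start..end`. As a minimal sketch, the snippet below parses that string and reports the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state. The names `Location` and `parse_location` are hypothetical helpers for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive interval length (assumption: both endpoints are counted).
        return self.end - self.start + 1


# Matches strings such as "LG08:71668424..71670427".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        # Normalise so that start <= end regardless of input order.
        start, end = end, start
    return Location(match["seqid"], start, end)


loc = parse_location("LG08:71668424..71670427")
print(loc.seqid, loc.start, loc.end, loc.span)  # LG08 71668424 71670427 2004
```

Under that assumption the gene spans 2004 bp on LG08. This is the genomic interval only and says nothing about coding length or intron structure.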


Homology
GenBank top hits (e value, %identity, alignment)
KAG6580575.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia]; e value: 0.0e+00; %identity: 89.63
Query:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK
        ALQ IRR+DGMNYGAYGRLIQHCTD  F RLGKQLHARLVLS+VAPDNFLGSKLIA YSKSGSLRDAYN+FD+ISHKNIFSWNALFISYTLH+MH DMLK
Subjt:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK

Query:  LFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCY
        LFSSLVN NS DVKPDKFTVTCVLKALASLF+NS LAKEVHCF+LRR LESD+FVVNAL+TFYSRCDELVLAR++FDR PERDIVSWNAMVAGYSQGG Y
Subjt:  LFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCY

Query:  EECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHG
        E+CKELFK ML   E KPNALTAVSVLQACAQSNDLIFGMEVH+FVN+S IEMDVSL NAVIGLYAKCGSLDYARELF+ MPEKDE+TYGSMISGYMVHG
Subjt:  EECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHG

Query:  FVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKS
        FVN+AMDLFREL+RPALSTWNAVISGLVQNNQQDGVV+IFRAMQ  GCRPNTVTLAS+LPIFSHFSTLKGG EIHAYAVRN YDGN+YVATAIIDSYAKS
Subjt:  FVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKS

Query:  GYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLS
        GYLHGAR+VF+  K RSLIIWTAIISAYAAHGDAN  LSLFYEMLTNGIRPDPVTFTSV+VACAH+GELDEAWK+FNVLLPE+GIQPLVEHYACMVGVLS
Subjt:  GYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLS

Query:  RAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGG
        RAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVF RLLDIEPENTGNYIIMANLYSQFGRWKEAD+VR+LMK+VGLKKIPGNSWIETRGG
Subjt:  RAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGG

Query:  LQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG
        LQ+FVARDTSNDRTPEIYG LEGL+ LMKEEG I QHEID DCGSG
Subjt:  LQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG

KAG7017327.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0e+00; %identity: 89.47
Query:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK
        ALQ IRR+DGMNYGAYGRLIQHCTD  F RLGKQLHARLVLS+VAPDNFLGSKLIA YSKSGSLRDAYN+FD+ISHKNIFSWNALFISYTLH+MH DMLK
Subjt:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK

Query:  LFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCY
        LFSSLVN NS DVKPDKFTVTCVLKALASLF+NS LAKEVHCF+LRR LESD+FVVNAL+TFYSRCDELVLAR++FDR PERDIVSWNAMVAGYSQGG Y
Subjt:  LFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCY

Query:  EECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHG
        E+CKELFK ML   E KPNALTAVSVLQACAQSNDLIFGMEVH+FVN+S IEMDVSL NAVIGLYAKCGSLDYARELF+ MPEKDE+TYGSMISGYMVHG
Subjt:  EECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHG

Query:  FVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKS
        FVN+AMDLFREL+RPALSTWNAVISGLVQNNQQDGVV+IFRAMQ  GCRPNTVTLAS+LPIFSHFSTLKGG EIHAYAVRN YDGN+YVATAIIDSYAKS
Subjt:  FVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKS

Query:  GYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLS
        GYL GAR+VF+  K RSLIIWTAIISAYAAHGDAN  LSLFYEMLTNGIRPDPVTFTSV+VACAH+GELDEAWK+FNVLLPE+GIQPLVEHYACMVGVLS
Subjt:  GYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLS

Query:  RAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGG
        RAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVF RLLDIEPENTGNYIIMANLYSQFGRWKEAD+VR+LMK+VGLKKIPGNSWIETRGG
Subjt:  RAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGG

Query:  LQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG
        LQ+FVARDTSNDRTPEIYG LEGL+ LMKEEG I QHEID DCGSG
Subjt:  LQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG

XP_022145703.1 pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia]; e value: 0.0e+00; %identity: 87.99
Query:  QISVPAGALLPWALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFIS
        QIS+PAGA++PWALQAIRR DGMNY AYGRLIQHC D  F+RLGKQLHARLVL +V PDNFLGSKLIAFYSKSGSLRDAYN+F NISHKNIFSWNALFIS
Subjt:  QISVPAGALLPWALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFIS

Query:  YTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWN
        YTLH+MH+DMLKLFSSLVNSN+MDVKPDKFT+TCVLKALAS F++S LAKEVHCF+LRR LESD+FVVNALVT+YSRC+E+VLAR+VF RMPERDIVSWN
Subjt:  YTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWN

Query:  AMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEIT
        AMVAG+SQGG YEECKELFKEMLS VELKPNALTAVSVLQACAQSNDLIFGMEVHRFVN+SQIEMDVSLCNAVIGLYAKCGSLDYARELF+EMPEKDE+T
Subjt:  AMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEIT

Query:  YGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVY
        YGSMISGYMVHG VN+AMDLF+ELK+PALSTWNAVISGLVQNNQQDGV++IFRAMQS GCRPN VTLAS+LP+FSHFSTLKGG EIHAYAVRNGY+GN+Y
Subjt:  YGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVY

Query:  VATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPL
        VATAIIDSYAKSGYLHGA +VF+  KGRSLIIWTAIISAYAAHGDANVALSLFYEML NGI+PDPVTFTSV+VACAH+GELDEAWK+FN++LPEYGIQPL
Subjt:  VATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPL

Query:  VEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKK
        VEHYACMVGVLSRAGKLSDAV+FISKMPIEP+AKVWGALLNGASVAGDVELGKYVF RLL+IEPENTG YIIMANLYSQ GRWKEADKVR+LMK+VGL+K
Subjt:  VEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKK

Query:  IPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG
        IPG+SWIET GGL +FVARDTSND TPEIY MLEGLLGLMKEEG+ILQ+EID DCGSG
Subjt:  IPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG

XP_022934145.1 pentatricopeptide repeat-containing protein At2g37310 [Cucurbita moschata]; e value: 0.0e+00; %identity: 89.29
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS
        MNYGAYGRLIQHCTD  F RLGKQLHARLVLS+VAPDNFLGSKLIA YSKSGSLRDAYN+FD+ISHKNIFSWNALFISYTLH+MH DMLKLFSSLVN NS
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS

Query:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM
         DVKPDKFTVTCVLKALASLF+NS LAKEVHCF+LRR LESD+FVVNAL+TFYSRCDEL LAR++FDR PERDIVSWNAMVAGYSQGG YE+CKELFK M
Subjt:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM

Query:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR
        L   E KPNALTAVSVLQACA SNDLIFGMEVH+FVN+S IEMDVSL NAVIGLYAKCGSLDYARELF+ MPEKDE+TYGSMISGYMVHGFVN+AMDLFR
Subjt:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR

Query:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF
        EL+RPALSTWNAVISGLVQNNQQDGVV+IFRAMQ  GCRPNTVTLAS+LPIFSHFSTLKGG EIHAYAVRN YDGN+YVATAIIDSYAKSGYL GAR+VF
Subjt:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF

Query:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
        +  K RSLIIWTAIISAYAAHGDAN  LSLFYEMLTNGIRPDPVTFTSV+VACAH+GELDEAWK+FNVLLPE+GIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTS
        FISKMPIEPTAKVWGALLNGASVAGDVELGKYVF RLLDIEPENTGNYIIMANLYSQFGRWKEAD VR+LMK+VGLKKIPGNSWIETR GLQ+FVARDTS
Subjt:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTS

Query:  NDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGS
        NDRTPEIYG LEGL+GLMKEEG I QHEID DCGS
Subjt:  NDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGS

XP_038905794.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida]; e value: 0.0e+00; %identity: 88.07
Query:  PTSLQISVPAGALLPWALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNA
        P + +  VPA   L WALQA+RR D MNYGAYGRLIQHCTD LFVRLGKQLHARLVLS+VAPDNFLGSKLIAFYSKSGSLRDAYN+F NISHKNIF+WNA
Subjt:  PTSLQISVPAGALLPWALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNA

Query:  LFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDI
        LFISYTLH+MH DML+LFSSLVNSNS DVKPDKFT+TCVLKALASLFSNS LAKEVHCFILRR LE D+FVVNAL+TFYSRCDELVLAR+VFDRMPE+DI
Subjt:  LFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDI

Query:  VSWNAMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEK
        VSWNAMVAGYSQGG YEECKELFK MLS VELKPNALT VSVLQACAQSNDLIFGMEVHRFV++SQIEMDVSLCNAVIGLYAKCGSLDYARELF+EMP+K
Subjt:  VSWNAMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEK

Query:  DEITYGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYD
        DE+TYGSMISGYMV+GFVN+AMDLFREL+RP LSTWNAVISGLVQNNQQD V++IFRAMQS GCRPNTVTLAS+LPIFSHFST+KGG EIHAYA+R  YD
Subjt:  DEITYGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYD

Query:  GNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYG
        GN+YVAT II+SYAKSGYLHGAR+VF+  KGRSLIIWTAIISAYAAHGDANVALSLFYEML NGI+PDPVTFTSV+VACAH+GELDEAWK+FNVLLP+YG
Subjt:  GNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYG

Query:  IQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKV
        IQP VEHYACMVGVLSRAGKLSDAVEFISKMP EPTAKVWGALLNGASVAGDVELGKYVF RL +IEPENTGNY+IMANLYSQFGRWKEADKVR+LMK+V
Subjt:  IQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKV

Query:  GLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG
        GLKKIPGNSWIETRGGLQ+F+ARDTSN+RTPEIYGMLEGLLGLMKEEG ILQHEID DCGSG
Subjt:  GLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG

TrEMBL top hits (e value, %identity, alignment)
A0A1S4DUQ6 pentatricopeptide repeat-containing protein At2g37310; e value: 4.8e-304; %identity: 86.2
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS
        MNYGAYGRLIQHCTDHLF R+GKQLHARLVLS+VAPDNFLGSKLI+FYSKSGSLRDAYN+F  I  KNIFSWNALFISYTLH+MHTD+LKLF SLVNSNS
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS

Query:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM
         DVKPD+FTVTCVLKALASLFSNS LAKEVHCFILRR LESD+FVVNAL+TFYSRCDELVLAR++FDRMPERDIVSWNAM+AGYSQGG YE+CKELF+ M
Subjt:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM

Query:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR
         S +E+KPNALTAVSVLQACAQSNDLIFGMEVHRFVN+SQI+MDVSL NAVIGLYAKCGSLDYARELF+EMPEKD ITY SMISGYMVHGFVN+AMDLFR
Subjt:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR

Query:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF
        EL+RP L TWNAVISGLVQNN+QDG ++IFRAMQS GCRPNTVTLASILPIFSHFSTLKGG EIH YA+RN YDGN++VATAIIDSYAK GYL GAR+VF
Subjt:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF

Query:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
        +  KGRSLI WT+IISAYA HGDANVALSLFYEMLT GI+PD VTFTSV+ ACAH+GELDEAWK+FN+LLP+YGIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNF
        FISKMP+EP AKVWGALLNGASVAGDVELGKYVF RL +IEP NTGNY+IMANLYSQ GRWKEAD +R+LMK+V LKKIPGNSWIETRGGLQ+F
Subjt:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNF

A0A5A7TRM4 Pentatricopeptide repeat-containing protein; e value: 4.8e-304; %identity: 86.2
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS
        MNYGAYGRLIQHCTDHLF R+GKQLHARLVLS+VAPDNFLGSKLI+FYSKSGSLRDAYN+F  I  KNIFSWNALFISYTLH+MHTD+LKLF SLVNSNS
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS

Query:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM
         DVKPD+FTVTCVLKALASLFSNS LAKEVHCFILRR LESD+FVVNAL+TFYSRCDELVLAR++FDRMPERDIVSWNAM+AGYSQGG YE+CKELF+ M
Subjt:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM

Query:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR
         S +E+KPNALTAVSVLQACAQSNDLIFGMEVHRFVN+SQI+MDVSL NAVIGLYAKCGSLDYARELF+EMPEKD ITY SMISGYMVHGFVN+AMDLFR
Subjt:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR

Query:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF
        EL+RP L TWNAVISGLVQNN+QDG ++IFRAMQS GCRPNTVTLASILPIFSHFSTLKGG EIH YA+RN YDGN++VATAIIDSYAK GYL GAR+VF
Subjt:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF

Query:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
        +  KGRSLI WT+IISAYA HGDANVALSLFYEMLT GI+PD VTFTSV+ ACAH+GELDEAWK+FN+LLP+YGIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNF
        FISKMP+EP AKVWGALLNGASVAGDVELGKYVF RL +IEP NTGNY+IMANLYSQ GRWKEAD +R+LMK+V LKKIPGNSWIETRGGLQ+F
Subjt:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNF

A0A6J1CWN9 pentatricopeptide repeat-containing protein At2g37310; e value: 0.0e+00; %identity: 87.99
Query:  QISVPAGALLPWALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFIS
        QIS+PAGA++PWALQAIRR DGMNY AYGRLIQHC D  F+RLGKQLHARLVL +V PDNFLGSKLIAFYSKSGSLRDAYN+F NISHKNIFSWNALFIS
Subjt:  QISVPAGALLPWALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFIS

Query:  YTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWN
        YTLH+MH+DMLKLFSSLVNSN+MDVKPDKFT+TCVLKALAS F++S LAKEVHCF+LRR LESD+FVVNALVT+YSRC+E+VLAR+VF RMPERDIVSWN
Subjt:  YTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWN

Query:  AMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEIT
        AMVAG+SQGG YEECKELFKEMLS VELKPNALTAVSVLQACAQSNDLIFGMEVHRFVN+SQIEMDVSLCNAVIGLYAKCGSLDYARELF+EMPEKDE+T
Subjt:  AMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEIT

Query:  YGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVY
        YGSMISGYMVHG VN+AMDLF+ELK+PALSTWNAVISGLVQNNQQDGV++IFRAMQS GCRPN VTLAS+LP+FSHFSTLKGG EIHAYAVRNGY+GN+Y
Subjt:  YGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVY

Query:  VATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPL
        VATAIIDSYAKSGYLHGA +VF+  KGRSLIIWTAIISAYAAHGDANVALSLFYEML NGI+PDPVTFTSV+VACAH+GELDEAWK+FN++LPEYGIQPL
Subjt:  VATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPL

Query:  VEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKK
        VEHYACMVGVLSRAGKLSDAV+FISKMPIEP+AKVWGALLNGASVAGDVELGKYVF RLL+IEPENTG YIIMANLYSQ GRWKEADKVR+LMK+VGL+K
Subjt:  VEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKK

Query:  IPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG
        IPG+SWIET GGL +FVARDTSND TPEIY MLEGLLGLMKEEG+ILQ+EID DCGSG
Subjt:  IPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG

A0A6J1F110 pentatricopeptide repeat-containing protein At2g37310; e value: 0.0e+00; %identity: 89.29
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS
        MNYGAYGRLIQHCTD  F RLGKQLHARLVLS+VAPDNFLGSKLIA YSKSGSLRDAYN+FD+ISHKNIFSWNALFISYTLH+MH DMLKLFSSLVN NS
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS

Query:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM
         DVKPDKFTVTCVLKALASLF+NS LAKEVHCF+LRR LESD+FVVNAL+TFYSRCDEL LAR++FDR PERDIVSWNAMVAGYSQGG YE+CKELFK M
Subjt:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM

Query:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR
        L   E KPNALTAVSVLQACA SNDLIFGMEVH+FVN+S IEMDVSL NAVIGLYAKCGSLDYARELF+ MPEKDE+TYGSMISGYMVHGFVN+AMDLFR
Subjt:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR

Query:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF
        EL+RPALSTWNAVISGLVQNNQQDGVV+IFRAMQ  GCRPNTVTLAS+LPIFSHFSTLKGG EIHAYAVRN YDGN+YVATAIIDSYAKSGYL GAR+VF
Subjt:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF

Query:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
        +  K RSLIIWTAIISAYAAHGDAN  LSLFYEMLTNGIRPDPVTFTSV+VACAH+GELDEAWK+FNVLLPE+GIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTS
        FISKMPIEPTAKVWGALLNGASVAGDVELGKYVF RLLDIEPENTGNYIIMANLYSQFGRWKEAD VR+LMK+VGLKKIPGNSWIETR GLQ+FVARDTS
Subjt:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTS

Query:  NDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGS
        NDRTPEIYG LEGL+GLMKEEG I QHEID DCGS
Subjt:  NDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGS

A0A6J1J0S5 pentatricopeptide repeat-containing protein At2g37310; e value: 0.0e+00; %identity: 88.99
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS
        MNYGAYGRLIQHCTD  F RLGKQLHARLVLS+VAPDNFLGSKLIA YSKSGSLRDAYN+FD+ISHKNIFSWNALFISYTLH+MH DMLKLFSSLVN NS
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNS

Query:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM
         DVKPDKFTVTCVLKALASLF+NS LAKEVHCF+LRR LESD+FVVNAL+TFYSRCDELVLAR++F R PERDIVSWNAMVAGYSQGG YE+CKELFK M
Subjt:  MDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEM

Query:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR
        L   E KPNALTAVSVLQACAQSNDLIFGMEVH+FVN+S IEMDVSL NAVIGLYAKCGSLDYARELF+ MPEKDE+TYGSMISGYMVHGFVN+AMDLFR
Subjt:  LSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFR

Query:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF
        EL+RPALSTWNAVISGLVQNNQQDGVV+IFRAMQ  GCRPNTVTLAS+LPIFSHFSTLKGG EIHAYAVRN YDGN+YVATAIIDSYAKSGYL GAR+VF
Subjt:  ELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVF

Query:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
        +  K RSLIIWTAIISAYAAHGDAN  LSLFYEMLTNGIRPDPVTFTSV+VACAH+GEL+EAWK+FNVLLPE+GIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  NHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTS
        FISKMPIEPTAKVWGALLNGASVAGDVELGKYVF RLLDIEPENTGNYIIMANLYSQFG WKEAD VR+LMK+VGLKKIPGNSWIETRGGLQ+FVARDTS
Subjt:  FISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTS

Query:  NDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG
        NDRTPEIYG LEGL+GLMK EG I QHEID +CGSG
Subjt:  NDRTPEIYGMLEGLLGLMKEEGHILQHEIDGDCGSG

SwissProt top hits (e value, %identity, alignment)
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic; e value: 3.1e-114; %identity: 35.28
Query:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAF--YSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPD
        LI+ C     +R  KQ H  ++ +    D +  SKL A    S   SL  A  +FD I   N F+WN L  +Y        +L +++ L   +     P+
Subjt:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAF--YSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPD

Query:  KFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVEL
        K+T   ++KA A + S S L + +H   ++ A+ SDVFV N+L+  Y  C +L  A  VF  + E+D+VSWN+M+ G+ Q G  ++  ELFK+M S  ++
Subjt:  KFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVEL

Query:  KPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPA
        K + +T V VL ACA+  +L FG +V  ++ ++++ ++++L NA++ +Y KCGS++ A+ LFD M EKD +T+ +M+ GY +      A ++   + +  
Subjt:  KPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPA

Query:  LSTWNAVISGLVQNNQQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKG
        +  WNA+IS   QN + +  + +F  +Q Q   + N +TL S L   +    L+ G  IH+Y  ++G   N +V +A+I  Y+K G L  +R VFN  + 
Subjt:  LSTWNAVISGLVQNNQQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKG

Query:  RSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKM
        R + +W+A+I   A HG  N A+ +FY+M    ++P+ VTFT+V  AC+H G +DEA  +F+ +   YGI P  +HYAC+V VL R+G L  AV+FI  M
Subjt:  RSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKM

Query:  PIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTP
        PI P+  VWGALL    +  ++ L +    RLL++EP N G +++++N+Y++ G+W+   ++R+ M+  GLKK PG S IE  G +  F++ D ++  + 
Subjt:  PIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTP

Query:  EIYGMLEGLLGLMKEEGH
        ++YG L  ++  +K  G+
Subjt:  EIYGMLEGLLGLMKEEGH

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690; e value: 2.2e-104; %identity: 32.78
Query:  GKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLF
        G Q+H  +V    A D F+ + L+ FY++ G L  A  +FD +S +N+ SW ++   Y       D + LF  +V     +V P+  T+ CV+ A A L 
Subjt:  GKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLF

Query:  SNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACA
         +    ++V+ FI    +E +  +V+ALV  Y +C+ + +A+ +FD     ++   NAM + Y + G   E   +F  M+    ++P+ ++ +S + +C+
Subjt:  SNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACA

Query:  QSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNN
        Q  ++++G   H +V  +  E   ++CNA+I +Y KC   D A  +FD M  K  +T+ S+++GY+ +G V+ A + F  +    + +WN +ISGLVQ +
Subjt:  QSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNN

Query:  QQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAA
          +  +E+F +MQSQ G   + VT+ SI     H   L     I+ Y  +NG   +V + T ++D +++ G    A  +FN    R +  WTA I A A 
Subjt:  QQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAA

Query:  HGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNG
         G+A  A+ LF +M+  G++PD V F   + AC+H G + +  ++F  +L  +G+ P   HY CMV +L RAG L +AV+ I  MP+EP   +W +LL  
Subjt:  HGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNG

Query:  ASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKE
          V G+VE+  Y   ++  + PE TG+Y++++N+Y+  GRW +  KVR  MK+ GL+K PG S I+ RG    F + D S+   P I  ML+ +      
Subjt:  ASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKE

Query:  EGHI
         GH+
Subjt:  EGHI

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070; e value: 1.3e-104; %identity: 33.39
Query:  HLFVRLGKQLHARLVLSAV-APDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVL
        +++ + G  LHAR +   +     F  + +++ YSK G +      FD +  ++  SW  + + Y     +   +++   +V      ++P +FT+T VL
Subjt:  HLFVRLGKQLHARLVLSAV-APDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVL

Query:  KALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDR-------------------------------MPERDIVSWNAMVAG
         ++A+        K+VH FI++  L  +V V N+L+  Y++C + ++A+ VFDR                               M ERDIV+WN+M++G
Subjt:  KALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDR-------------------------------MPERDIVSWNAMVAG

Query:  YSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYG--S
        ++Q G      ++F +ML    L P+  T  SVL ACA    L  G ++H  +  +  ++   + NA+I +Y++CG ++ AR L ++   KD    G  +
Subjt:  YSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYG--S

Query:  MISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVAT
        ++ GY+  G +N+A ++F  LK   +  W A+I G  Q+      + +FR+M   G RPN+ TLA++L + S  ++L  G +IH  AV++G   +V V+ 
Subjt:  MISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVAT

Query:  AIIDSYAKSGYLHGARRVFNHFK-GRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVE
        A+I  YAK+G +  A R F+  +  R  + WT++I A A HG A  AL LF  ML  G+RPD +T+  V  AC HAG +++  + F+++     I P + 
Subjt:  AIIDSYAKSGYLHGARRVFNHFK-GRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVE

Query:  HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIP
        HYACMV +  RAG L +A EFI KMPIEP    WG+LL+   V  +++LGK    RLL +EPEN+G Y  +ANLYS  G+W+EA K+R+ MK   +KK  
Subjt:  HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIP

Query:  GNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHI
        G SWIE +  +  F   D ++    EIY  ++ +   +K+ G++
Subjt:  GNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHI

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic; e value: 1.4e-106; %identity: 31.92
Query:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKF
        ++Q C D   ++ GK++   +  +    D+ LGSKL   Y+  G L++A  +FD +  +    WN L          +  + LF  +++S    V+ D +
Subjt:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKF

Query:  TVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKP
        T +CV K+ +SL S     +++H FIL+        V N+LV FY +   +  AR VFD M ERD++SWN+++ GY   G  E+   +F +ML +  ++ 
Subjt:  TVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKP

Query:  NALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALS
        +  T VSV   CA S  +  G  VH     +    +   CN ++ +Y+KCG LD A+ +F EM ++  ++Y SMI+GY   G    A+ LF E++   +S
Subjt:  NALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALS

Query:  ----------------------------------------------------------------------TWNAVISGLVQNNQQDGVVEIFR-AMQSQG
                                                                              +WN +I G  +N   +  + +F   ++ + 
Subjt:  ----------------------------------------------------------------------TWNAVISGLVQNNQQDGVVEIFR-AMQSQG

Query:  CRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTN
          P+  T+A +LP  +  S    G EIH Y +RNGY  + +VA +++D YAK G L  A  +F+    + L+ WT +I+ Y  HG    A++LF +M   
Subjt:  CRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTN

Query:  GIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRL
        GI  D ++F S++ AC+H+G +DE W+ FN++  E  I+P VEHYAC+V +L+R G L  A  FI  MPI P A +WGALL G  +  DV+L + V  ++
Subjt:  GIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRL

Query:  LDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGH
         ++EPENTG Y++MAN+Y++  +W++  ++R+ + + GL+K PG SWIE +G +  FVA D+SN  T  I   L  +   M EEG+
Subjt:  LDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGH

Q9ZUT5 Pentatricopeptide repeat-containing protein At2g37310; e value: 1.2e-214; %identity: 56.12
Query:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK
        ALQ +     ++ GAYG LIQH T H       QLHAR+V+ ++ PDNFL SKLI+FY++    R A ++FD I+ +N FS+NAL I+YT   M+ D   
Subjt:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK

Query:  LFSSLVNS---NSMDVKPDKFTVTCVLKALASL--FSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYS
        LF S + S   +S   +PD  +++CVLKAL+    F    LA++VH F++R   +SDVFV N ++T+Y++CD +  AR VFD M ERD+VSWN+M++GYS
Subjt:  LFSSLVNS---NSMDVKPDKFTVTCVLKALASL--FSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYS

Query:  QGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISG
        Q G +E+CK+++K ML+  + KPN +T +SV QAC QS+DLIFG+EVH+ + ++ I+MD+SLCNAVIG YAKCGSLDYAR LFDEM EKD +TYG++ISG
Subjt:  QGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISG

Query:  YMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIID
        YM HG V  AM LF E++   LSTWNA+ISGL+QNN  + V+  FR M   G RPNTVTL+S+LP  ++ S LKGG EIHA+A+RNG D N+YV T+IID
Subjt:  YMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIID

Query:  SYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACM
        +YAK G+L GA+RVF++ K RSLI WTAII+AYA HGD++ A SLF +M   G +PD VT T+V+ A AH+G+ D A  +F+ +L +Y I+P VEHYACM
Subjt:  SYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACM

Query:  VGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWI
        V VLSRAGKLSDA+EFISKMPI+P AKVWGALLNGASV GD+E+ ++   RL ++EPENTGNY IMANLY+Q GRW+EA+ VR  MK++GLKKIPG SWI
Subjt:  VGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWI

Query:  ETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEID
        ET  GL++F+A+D+S +R+ E+Y ++EGL+  M ++ +I + E+D
Subjt:  ETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEID

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein; e value 9.1e-106; identity 33.39%
Query:  HLFVRLGKQLHARLVLSAV-APDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVL
        +++ + G  LHAR +   +     F  + +++ YSK G +      FD +  ++  SW  + + Y     +   +++   +V      ++P +FT+T VL
Subjt:  HLFVRLGKQLHARLVLSAV-APDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVL

Query:  KALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDR-------------------------------MPERDIVSWNAMVAG
         ++A+        K+VH FI++  L  +V V N+L+  Y++C + ++A+ VFDR                               M ERDIV+WN+M++G
Subjt:  KALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDR-------------------------------MPERDIVSWNAMVAG

Query:  YSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYG--S
        ++Q G      ++F +ML    L P+  T  SVL ACA    L  G ++H  +  +  ++   + NA+I +Y++CG ++ AR L ++   KD    G  +
Subjt:  YSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYG--S

Query:  MISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVAT
        ++ GY+  G +N+A ++F  LK   +  W A+I G  Q+      + +FR+M   G RPN+ TLA++L + S  ++L  G +IH  AV++G   +V V+ 
Subjt:  MISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVAT

Query:  AIIDSYAKSGYLHGARRVFNHFK-GRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVE
        A+I  YAK+G +  A R F+  +  R  + WT++I A A HG A  AL LF  ML  G+RPD +T+  V  AC HAG +++  + F+++     I P + 
Subjt:  AIIDSYAKSGYLHGARRVFNHFK-GRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVE

Query:  HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIP
        HYACMV +  RAG L +A EFI KMPIEP    WG+LL+   V  +++LGK    RLL +EPEN+G Y  +ANLYS  G+W+EA K+R+ MK   +KK  
Subjt:  HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIP

Query:  GNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHI
        G SWIE +  +  F   D ++    EIY  ++ +   +K+ G++
Subjt:  GNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHI

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 2.2e-115; identity 35.28%
Query:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAF--YSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPD
        LI+ C     +R  KQ H  ++ +    D +  SKL A    S   SL  A  +FD I   N F+WN L  +Y        +L +++ L   +     P+
Subjt:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAF--YSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPD

Query:  KFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVEL
        K+T   ++KA A + S S L + +H   ++ A+ SDVFV N+L+  Y  C +L  A  VF  + E+D+VSWN+M+ G+ Q G  ++  ELFK+M S  ++
Subjt:  KFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVEL

Query:  KPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPA
        K + +T V VL ACA+  +L FG +V  ++ ++++ ++++L NA++ +Y KCGS++ A+ LFD M EKD +T+ +M+ GY +      A ++   + +  
Subjt:  KPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPA

Query:  LSTWNAVISGLVQNNQQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKG
        +  WNA+IS   QN + +  + +F  +Q Q   + N +TL S L   +    L+ G  IH+Y  ++G   N +V +A+I  Y+K G L  +R VFN  + 
Subjt:  LSTWNAVISGLVQNNQQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKG

Query:  RSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKM
        R + +W+A+I   A HG  N A+ +FY+M    ++P+ VTFT+V  AC+H G +DEA  +F+ +   YGI P  +HYAC+V VL R+G L  AV+FI  M
Subjt:  RSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKM

Query:  PIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTP
        PI P+  VWGALL    +  ++ L +    RLL++EP N G +++++N+Y++ G+W+   ++R+ M+  GLKK PG S IE  G +  F++ D ++  + 
Subjt:  PIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTP

Query:  EIYGMLEGLLGLMKEEGH
        ++YG L  ++  +K  G+
Subjt:  EIYGMLEGLLGLMKEEGH

AT2G37310.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 8.3e-216; identity 56.12%
Query:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK
        ALQ +     ++ GAYG LIQH T H       QLHAR+V+ ++ PDNFL SKLI+FY++    R A ++FD I+ +N FS+NAL I+YT   M+ D   
Subjt:  ALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLK

Query:  LFSSLVNS---NSMDVKPDKFTVTCVLKALASL--FSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYS
        LF S + S   +S   +PD  +++CVLKAL+    F    LA++VH F++R   +SDVFV N ++T+Y++CD +  AR VFD M ERD+VSWN+M++GYS
Subjt:  LFSSLVNS---NSMDVKPDKFTVTCVLKALASL--FSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYS

Query:  QGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISG
        Q G +E+CK+++K ML+  + KPN +T +SV QAC QS+DLIFG+EVH+ + ++ I+MD+SLCNAVIG YAKCGSLDYAR LFDEM EKD +TYG++ISG
Subjt:  QGGCYEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISG

Query:  YMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIID
        YM HG V  AM LF E++   LSTWNA+ISGL+QNN  + V+  FR M   G RPNTVTL+S+LP  ++ S LKGG EIHA+A+RNG D N+YV T+IID
Subjt:  YMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIID

Query:  SYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACM
        +YAK G+L GA+RVF++ K RSLI WTAII+AYA HGD++ A SLF +M   G +PD VT T+V+ A AH+G+ D A  +F+ +L +Y I+P VEHYACM
Subjt:  SYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACM

Query:  VGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWI
        V VLSRAGKLSDA+EFISKMPI+P AKVWGALLNGASV GD+E+ ++   RL ++EPENTGNY IMANLY+Q GRW+EA+ VR  MK++GLKKIPG SWI
Subjt:  VGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWI

Query:  ETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEID
        ET  GL++F+A+D+S +R+ E+Y ++EGL+  M ++ +I + E+D
Subjt:  ETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEID

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); e value 1.6e-105; identity 32.78%
Query:  GKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLF
        G Q+H  +V    A D F+ + L+ FY++ G L  A  +FD +S +N+ SW ++   Y       D + LF  +V     +V P+  T+ CV+ A A L 
Subjt:  GKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLF

Query:  SNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACA
         +    ++V+ FI    +E +  +V+ALV  Y +C+ + +A+ +FD     ++   NAM + Y + G   E   +F  M+    ++P+ ++ +S + +C+
Subjt:  SNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKPNALTAVSVLQACA

Query:  QSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNN
        Q  ++++G   H +V  +  E   ++CNA+I +Y KC   D A  +FD M  K  +T+ S+++GY+ +G V+ A + F  +    + +WN +ISGLVQ +
Subjt:  QSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALSTWNAVISGLVQNN

Query:  QQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAA
          +  +E+F +MQSQ G   + VT+ SI     H   L     I+ Y  +NG   +V + T ++D +++ G    A  +FN    R +  WTA I A A 
Subjt:  QQDGVVEIFRAMQSQ-GCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAA

Query:  HGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNG
         G+A  A+ LF +M+  G++PD V F   + AC+H G + +  ++F  +L  +G+ P   HY CMV +L RAG L +AV+ I  MP+EP   +W +LL  
Subjt:  HGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNG

Query:  ASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKE
          V G+VE+  Y   ++  + PE TG+Y++++N+Y+  GRW +  KVR  MK+ GL+K PG S I+ RG    F + D S+   P I  ML+ +      
Subjt:  ASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKE

Query:  EGHI
         GH+
Subjt:  EGHI

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 9.8e-108; identity 31.92%
Query:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKF
        ++Q C D   ++ GK++   +  +    D+ LGSKL   Y+  G L++A  +FD +  +    WN L          +  + LF  +++S    V+ D +
Subjt:  LIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISYTLHHMHTDMLKLFSSLVNSNSMDVKPDKF

Query:  TVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKP
        T +CV K+ +SL S     +++H FIL+        V N+LV FY +   +  AR VFD M ERD++SWN+++ GY   G  E+   +F +ML +  ++ 
Subjt:  TVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGCYEECKELFKEMLSLVELKP

Query:  NALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALS
        +  T VSV   CA S  +  G  VH     +    +   CN ++ +Y+KCG LD A+ +F EM ++  ++Y SMI+GY   G    A+ LF E++   +S
Subjt:  NALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLFRELKRPALS

Query:  ----------------------------------------------------------------------TWNAVISGLVQNNQQDGVVEIFR-AMQSQG
                                                                              +WN +I G  +N   +  + +F   ++ + 
Subjt:  ----------------------------------------------------------------------TWNAVISGLVQNNQQDGVVEIFR-AMQSQG

Query:  CRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTN
          P+  T+A +LP  +  S    G EIH Y +RNGY  + +VA +++D YAK G L  A  +F+    + L+ WT +I+ Y  HG    A++LF +M   
Subjt:  CRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTN

Query:  GIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRL
        GI  D ++F S++ AC+H+G +DE W+ FN++  E  I+P VEHYAC+V +L+R G L  A  FI  MPI P A +WGALL G  +  DV+L + V  ++
Subjt:  GIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFYRL

Query:  LDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGH
         ++EPENTG Y++MAN+Y++  +W++  ++R+ + + GL+K PG SWIE +G +  FVA D+SN  T  I   L  +   M EEG+
Subjt:  LDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGH


Sequences
CDS sequence
ATGAGGAATGCGAAGCCTACGAGCCTTCAAATCTCAGTTCCCGCCGGAGCCCTTCTTCCATGGGCTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGCGCCTA
TGGCCGCCTTATCCAGCACTGCACCGACCACCTCTTCGTCCGTCTCGGCAAGCAGCTTCACGCTCGTCTTGTTCTATCTGCGGTCGCTCCTGATAACTTCCTCGGATCGA
AGCTCATCGCCTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATCTGTTCGATAACATTTCTCACAAGAACATTTTCTCTTGGAATGCTTTGTTTATCAGCTAC
ACTCTTCACCACATGCACACTGATATGCTGAAGCTGTTTTCATCTTTGGTTAATTCGAATTCGATGGATGTGAAGCCTGATAAGTTTACGGTTACTTGTGTTTTGAAAGC
GTTGGCGTCGTTGTTTTCTAATTCTTTCTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGAGCGCTTGAGTCTGATGTTTTTGTTGTCAATGCTCTGGTTACTTTTT
ACTCTAGGTGTGATGAGCTAGTTTTAGCGAGAATGGTGTTTGATAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGGTGGCTGGGTACTCTCAGGGTGGGTGC
TATGAGGAATGCAAGGAACTATTTAAAGAGATGTTGAGTTTAGTGGAGTTGAAGCCTAATGCATTAACTGCAGTCAGTGTTTTGCAAGCTTGTGCTCAGTCAAATGATCT
TATTTTTGGAATGGAAGTTCATAGATTCGTCAATGACAGTCAGATTGAAATGGATGTTTCATTATGCAATGCTGTTATTGGATTATATGCGAAGTGTGGCAGCTTGGACT
ATGCTCGGGAGTTGTTCGATGAAATGCCCGAGAAGGATGAGATCACCTATGGCTCGATGATATCAGGCTACATGGTCCATGGTTTTGTTAACCGAGCAATGGATCTTTTT
CGAGAACTGAAAAGGCCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAACAACCAACAGGATGGAGTTGTGGAAATATTTCGAGCAATGCAGTCACA
GGGTTGCAGACCAAATACTGTGACACTTGCGAGCATTCTTCCAATTTTCTCACATTTTTCAACACTAAAAGGTGGGAATGAAATTCATGCTTATGCTGTTAGAAACGGTT
ATGATGGGAATGTTTATGTTGCCACTGCCATCATTGATTCTTATGCTAAGTCTGGTTACCTCCATGGGGCACGACGGGTTTTTAATCATTTTAAAGGTAGGAGTCTAATT
ATTTGGACAGCAATAATTTCAGCATATGCTGCACATGGAGATGCTAATGTGGCTCTTAGTCTCTTCTATGAGATGCTGACAAATGGGATTCGGCCTGACCCGGTAACGTT
TACATCAGTAATGGTTGCCTGTGCCCATGCAGGAGAGTTAGATGAAGCCTGGAAGATGTTTAATGTCTTATTACCAGAATATGGGATTCAACCATTAGTCGAGCATTATG
CTTGCATGGTAGGAGTTCTTAGTCGAGCAGGAAAGCTCTCCGATGCTGTTGAATTTATTTCTAAAATGCCAATTGAACCCACTGCAAAAGTTTGGGGTGCTTTGCTCAAT
GGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTACGTTTTTTATCGTCTACTTGACATTGAGCCTGAAAATACAGGTAACTACATCATCATGGCTAATTTATATTC
ACAATTTGGAAGGTGGAAAGAAGCTGACAAGGTTAGGGAGTTGATGAAGAAAGTTGGATTGAAGAAGATCCCAGGAAATAGCTGGATAGAAACAAGGGGCGGGTTACAGA
ATTTCGTAGCTAGAGACACTTCAAATGACAGGACTCCAGAGATTTATGGAATGTTGGAAGGATTACTTGGGTTGATGAAAGAAGAAGGACACATTCTGCAACATGAGATA
GATGGGGACTGTGGCAGTGGTTAG
mRNA sequence
ATGAGGAATGCGAAGCCTACGAGCCTTCAAATCTCAGTTCCCGCCGGAGCCCTTCTTCCATGGGCTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGCGCCTA
TGGCCGCCTTATCCAGCACTGCACCGACCACCTCTTCGTCCGTCTCGGCAAGCAGCTTCACGCTCGTCTTGTTCTATCTGCGGTCGCTCCTGATAACTTCCTCGGATCGA
AGCTCATCGCCTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATCTGTTCGATAACATTTCTCACAAGAACATTTTCTCTTGGAATGCTTTGTTTATCAGCTAC
ACTCTTCACCACATGCACACTGATATGCTGAAGCTGTTTTCATCTTTGGTTAATTCGAATTCGATGGATGTGAAGCCTGATAAGTTTACGGTTACTTGTGTTTTGAAAGC
GTTGGCGTCGTTGTTTTCTAATTCTTTCTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGAGCGCTTGAGTCTGATGTTTTTGTTGTCAATGCTCTGGTTACTTTTT
ACTCTAGGTGTGATGAGCTAGTTTTAGCGAGAATGGTGTTTGATAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGGTGGCTGGGTACTCTCAGGGTGGGTGC
TATGAGGAATGCAAGGAACTATTTAAAGAGATGTTGAGTTTAGTGGAGTTGAAGCCTAATGCATTAACTGCAGTCAGTGTTTTGCAAGCTTGTGCTCAGTCAAATGATCT
TATTTTTGGAATGGAAGTTCATAGATTCGTCAATGACAGTCAGATTGAAATGGATGTTTCATTATGCAATGCTGTTATTGGATTATATGCGAAGTGTGGCAGCTTGGACT
ATGCTCGGGAGTTGTTCGATGAAATGCCCGAGAAGGATGAGATCACCTATGGCTCGATGATATCAGGCTACATGGTCCATGGTTTTGTTAACCGAGCAATGGATCTTTTT
CGAGAACTGAAAAGGCCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAACAACCAACAGGATGGAGTTGTGGAAATATTTCGAGCAATGCAGTCACA
GGGTTGCAGACCAAATACTGTGACACTTGCGAGCATTCTTCCAATTTTCTCACATTTTTCAACACTAAAAGGTGGGAATGAAATTCATGCTTATGCTGTTAGAAACGGTT
ATGATGGGAATGTTTATGTTGCCACTGCCATCATTGATTCTTATGCTAAGTCTGGTTACCTCCATGGGGCACGACGGGTTTTTAATCATTTTAAAGGTAGGAGTCTAATT
ATTTGGACAGCAATAATTTCAGCATATGCTGCACATGGAGATGCTAATGTGGCTCTTAGTCTCTTCTATGAGATGCTGACAAATGGGATTCGGCCTGACCCGGTAACGTT
TACATCAGTAATGGTTGCCTGTGCCCATGCAGGAGAGTTAGATGAAGCCTGGAAGATGTTTAATGTCTTATTACCAGAATATGGGATTCAACCATTAGTCGAGCATTATG
CTTGCATGGTAGGAGTTCTTAGTCGAGCAGGAAAGCTCTCCGATGCTGTTGAATTTATTTCTAAAATGCCAATTGAACCCACTGCAAAAGTTTGGGGTGCTTTGCTCAAT
GGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTACGTTTTTTATCGTCTACTTGACATTGAGCCTGAAAATACAGGTAACTACATCATCATGGCTAATTTATATTC
ACAATTTGGAAGGTGGAAAGAAGCTGACAAGGTTAGGGAGTTGATGAAGAAAGTTGGATTGAAGAAGATCCCAGGAAATAGCTGGATAGAAACAAGGGGCGGGTTACAGA
ATTTCGTAGCTAGAGACACTTCAAATGACAGGACTCCAGAGATTTATGGAATGTTGGAAGGATTACTTGGGTTGATGAAAGAAGAAGGACACATTCTGCAACATGAGATA
GATGGGGACTGTGGCAGTGGTTAG
Protein sequence
MRNAKPTSLQISVPAGALLPWALQAIRRNDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLVLSAVAPDNFLGSKLIAFYSKSGSLRDAYNLFDNISHKNIFSWNALFISY
TLHHMHTDMLKLFSSLVNSNSMDVKPDKFTVTCVLKALASLFSNSFLAKEVHCFILRRALESDVFVVNALVTFYSRCDELVLARMVFDRMPERDIVSWNAMVAGYSQGGC
YEECKELFKEMLSLVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNDSQIEMDVSLCNAVIGLYAKCGSLDYARELFDEMPEKDEITYGSMISGYMVHGFVNRAMDLF
RELKRPALSTWNAVISGLVQNNQQDGVVEIFRAMQSQGCRPNTVTLASILPIFSHFSTLKGGNEIHAYAVRNGYDGNVYVATAIIDSYAKSGYLHGARRVFNHFKGRSLI
IWTAIISAYAAHGDANVALSLFYEMLTNGIRPDPVTFTSVMVACAHAGELDEAWKMFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLN
GASVAGDVELGKYVFYRLLDIEPENTGNYIIMANLYSQFGRWKEADKVRELMKKVGLKKIPGNSWIETRGGLQNFVARDTSNDRTPEIYGMLEGLLGLMKEEGHILQHEI
DGDCGSG
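
The CDS and protein sequences above can be cross-checked by translating the CDS in frame 0 with the standard genetic code. The sketch below is only an illustration, not part of the database record: the names `BASES`, `AMINO_ACIDS`, `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard nuclear code and an unambiguous A/C/G/T sequence.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order
# for the first, second and third codon position (assumed, not from the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; a codon containing anything
    other than A, C, G or T raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS shown above.
print(translate("ATGAGGAATGCGAAGCCTACG"))      # MRNAKPT
# Final line of the CDS shown above, ending in the TAG stop codon.
print(translate("GATGGGGACTGTGGCAGTGGTTAG"))   # DGDCGSG
```

Both fragments agree with the start (MRNAKPT) and end (DGDCGSG) of the protein sequence listed for Tan0017653. Passing the full CDS to `translate` would allow the same comparison over the whole sequence.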