; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022710 (gene) of Snake gourd v1 genome

Gene IDTan0022710
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG08:73904074..73906026
RNA-Seq ExpressionTan0022710
SyntenyTan0022710
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036256.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0086.73Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SL PHLNRF+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFLAKACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GK EDA NLFDKMPVR+ ASWNAMI+GFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAVLSAKSLRFLKAVHA+GIETGLD D SV+NTWI+ YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK+ RSSVSWNSLIACYA+FGK +DAVN YKGLLCDGFKPDA TIISLL SC QP+ALIYGSLIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+GLKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DAREVF SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSRHLFELQP VA SFVEMANIYA VGRWD V+ MRK MRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QRI+EL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

XP_004143574.1 pentatricopeptide repeat-containing protein At4g19191, mitochondrial [Cucumis sativus]0.0e+0086.42Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M +L  HLN F+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFL+KACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GKV+DA NLFDKMPVR+ ASWNAMIIGFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAV+SAKSLRFLKAVHA+GIETGLD D SV+NTWIA YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK  RSSVSWNSLIACYA+FGKY+DAV  YKGLLCDGFKPDASTIISLLSSC QP+ALIYG LIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+ LKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DARE+F SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSR+LFELQP VAVSFVEMANIYASVGRWD V+ MRKTMRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QR VEL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

XP_008440672.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19191, mitochondrial isoform X1 [Cucumis melo]0.0e+0086.73Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SL PHLNRF+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFLAKACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GK EDA NLFDKMPVR+ ASWNAMI+GFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAVLSAKSLRFLKAVHA+GIETGLD D SV+NTWI+ YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK+ RSSVSWNSLIACYA+FGK +DAVN YKGLLCDGFKPDA TIISLL SC QP+ALIYGSLIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+GLKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DAREVF SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSRHLFELQP VA SFVEMANIYA VGRWD V+ MRK MRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QRI+EL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

XP_008440673.1 PREDICTED: pentatricopeptide repeat-containing protein At4g19191, mitochondrial isoform X2 [Cucumis melo]0.0e+0086.73Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SL PHLNRF+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFLAKACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GK EDA NLFDKMPVR+ ASWNAMI+GFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAVLSAKSLRFLKAVHA+GIETGLD D SV+NTWI+ YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK+ RSSVSWNSLIACYA+FGK +DAVN YKGLLCDGFKPDA TIISLL SC QP+ALIYGSLIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+GLKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DAREVF SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSRHLFELQP VA SFVEMANIYA VGRWD V+ MRK MRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QRI+EL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

XP_038881350.1 pentatricopeptide repeat-containing protein At4g19191, mitochondrial [Benincasa hispida]0.0e+0089.01Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SLPP LNRF+KLSTV WWNS+IRDAVNQG+AYKALVLFRQ+KLNGLQPNNFTFPFLAKA AKLSHLT+SQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GKVEDA NLFDKMPVRD ASWNAMIIGFS IGSLDRVFSLF+GMRLVGIRPDAAT+IGL+RAVLSAKSLRFLKAVHA+ IETGLD D SV+NTWIAGYSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK+VRSSVSWNSLIACYANFGKY+DAVN YKGLL DGF PDASTIISLLS C QP+ALIYGSLIH HGFQLGCDSDISLIN LISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDA+VLFNAMEE GEKPD+VTVLSLISGCGK GALELGHWIDNYAS+YGLKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DAR+VF SLPNRTVVSWT MIAACALNGEFREAL LFY MSE GI+PN+ITF+A+LQACSHGG+LE+G+E FM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGR+GKLIEALEVI DMPMKPDEGIWGALL ACKIHNN+QIGEYVS HLFELQP VAVSFVEMANIYASVGRWD V+ MRKTMRSNKMRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE
        QVNGKSHVFFVEDRSHHDSL IYAVLGNL MQM QKE
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE

TrEMBL top hitse value%identityAlignment
A0A0A0KK94 Uncharacterized protein0.0e+0086.42Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M +L  HLN F+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFL+KACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GKV+DA NLFDKMPVR+ ASWNAMIIGFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAV+SAKSLRFLKAVHA+GIETGLD D SV+NTWIA YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK  RSSVSWNSLIACYA+FGKY+DAV  YKGLLCDGFKPDASTIISLLSSC QP+ALIYG LIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+ LKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DARE+F SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSR+LFELQP VAVSFVEMANIYASVGRWD V+ MRKTMRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QR VEL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

A0A1S3B296 pentatricopeptide repeat-containing protein At4g19191, mitochondrial isoform X10.0e+0086.73Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SL PHLNRF+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFLAKACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GK EDA NLFDKMPVR+ ASWNAMI+GFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAVLSAKSLRFLKAVHA+GIETGLD D SV+NTWI+ YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK+ RSSVSWNSLIACYA+FGK +DAVN YKGLLCDGFKPDA TIISLL SC QP+ALIYGSLIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+GLKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DAREVF SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSRHLFELQP VA SFVEMANIYA VGRWD V+ MRK MRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QRI+EL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

A0A1S3B2E4 pentatricopeptide repeat-containing protein At4g19191, mitochondrial isoform X20.0e+0086.73Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SL PHLNRF+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFLAKACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GK EDA NLFDKMPVR+ ASWNAMI+GFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAVLSAKSLRFLKAVHA+GIETGLD D SV+NTWI+ YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK+ RSSVSWNSLIACYA+FGK +DAVN YKGLLCDGFKPDA TIISLL SC QP+ALIYGSLIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+GLKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DAREVF SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSRHLFELQP VA SFVEMANIYA VGRWD V+ MRK MRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QRI+EL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

A0A5D3CMI9 Pentatricopeptide repeat-containing protein0.0e+0086.73Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SL PHLNRF+KLST+TWWNS+IR AVNQG+A KAL LF QLKLNGLQPNNFTFPFLAKACAKLSHLT+SQIIHTHVVKSPF+SDI+VQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
        GK EDA NLFDKMPVR+ ASWNAMI+GFSQIGSLDRVF+LF+GMRLVG RPDAAT+IGL+RAVLSAKSLRFLKAVHA+GIETGLD D SV+NTWI+ YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQLAKMVF GIQK+ RSSVSWNSLIACYA+FGK +DAVN YKGLLCDGFKPDA TIISLL SC QP+ALIYGSLIHGHGFQLGCDSDISLINTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI+SATILFDGMS RTC+SWTAMISGYSEVGR DDALVLFNAMEE GEKPD+VTVLSLISGCGK GAL LGHWIDNYASL+GLKKDVVVCNALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MYAKCGSL+DAREVF SLPNRTVVSWT MIAACALNGEFREAL LF  +SE GI+PN+ITF+A+LQAC HGGYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNN++IGEYVSRHLFELQP VA SFVEMANIYA VGRWD V+ MRK MRSN+MRKSPGKS+V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL
        QVNG SHVFFVEDRSHHDSLLIY  LGNL MQM QKE S + QRI+EL
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKE-SLYVQRIVEL

A0A6J1BUQ5 pentatricopeptide repeat-containing protein At4g19191, mitochondrial isoform X10.0e+0083.72Show/hide
Query:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC
        M SLPPH NRF+K STVTWWNSNIRDAVN GH +KALVLFRQ+KLNGLQPNNFTFPF+AKACAKLS+L ++QIIHTHVVKSPFHSDIFVQTAMVDMYVKC
Subjt:  MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKC

Query:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK
         KVEDA NLF+KMPVRD ASWNAMIIGF+Q GS DRVF LFVGM LVGIRPD AT+IGL+RAVLSAKSLRFLKAVHA+GIETGLD DVSVANTWIA YSK
Subjt:  GKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSK

Query:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM
        CGELQ+A+MVF+GIQKH RSSVSWNSLI+CYA F +Y+DAVN YKGL+CDGFKPDASTIISLLSSCMQP+AL+YGSLIHGHG QLGCDSDISL NTLISM
Subjt:  CGELQLAKMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISM

Query:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID
        YSRCGDI SA  LFDGMS RTC+SWTAMISGYSEVGR D+ALVLFNAMEEAGEKPD+VTVLSLISGCGK GALELGHWI+ Y+ L+GLK+DVVV NALID
Subjt:  YSRCGDITSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALID

Query:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM
        MY KCGS+SDAR+VF  LPNRT+VSWT MIAACALNGEFREAL +F+ MSE GI PNH+TF+A+LQACSH GYLE+GRECFM MT+RYGINPGLDHYSCM
Subjt:  MYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCM

Query:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV
        IDLLGRKGKLIEALEVIQDMPM+PD GIWGALLGACKIH N+QIGEYVSR+LFELQP VAVSFVEMANIYASVGRWDRV+ MRK MRSNK+RKSPGKS V
Subjt:  IDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIV

Query:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKESL-YVQRIVELGME
        QVNGKSHVF VEDR HHDSLLIYAVL NLT+Q+ ++ES  + Q  VE  ME
Subjt:  QVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKESL-YVQRIVELGME

SwissProt top hitse value%identityAlignment
O04659 Pentatricopeptide repeat-containing protein At5g271101.3e-10935.58Show/hide
Query:  STVTWWNSNIRDAVNQGHAYKALVLF-RQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKM
        S V  WNS +         +  L +F R L  +   P++FTFP + KA   L      ++IHT VVKS +  D+ V +++V MY K    E++  +FD+M
Subjt:  STVTWWNSNIRDAVNQGHAYKALVLF-RQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKM

Query:  PVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFL---KAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMV
        P RD ASWN +I  F Q G  ++   LF  M   G  P++   + L+ A+ +   L +L   K +H   ++ G + D  V +  +  Y KC  L++A+ V
Subjt:  PVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFL---KAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMV

Query:  FRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSA
        F+ + +  +S V+WNS+I  Y   G     V I   ++ +G +P  +T+ S+L +C + + L++G  IHG+  +   ++DI +  +LI +Y +CG+   A
Subjt:  FRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSA

Query:  TILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSD
          +F         SW  MIS Y  VG +  A+ +++ M   G KPDVVT  S++  C ++ ALE G  I    S   L+ D ++ +AL+DMY+KCG+  +
Subjt:  TILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSD

Query:  AREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKL
        A  +F+S+P + VVSWT MI+A   +G+ REAL+ F +M + G+KP+ +T +A+L AC H G ++EG + F  M  +YGI P ++HYSCMID+LGR G+L
Subjt:  AREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKL

Query:  IEALEVIQDMPMKPDEG-IWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVF
        +EA E+IQ  P   D   +   L  AC +H    +G+ ++R L E  P  A +++ + N+YAS   WD    +R  M+   +RK PG S ++++ K   F
Subjt:  IEALEVIQDMPMKPDEG-IWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVF

Query:  FVEDRSHHDSLLIYAVLGNLTMQM
        F EDRSH  +  +Y  L  L+  M
Subjt:  FVEDRSHHDSLLIYAVLGNLTMQM

P0C8Q2 Pentatricopeptide repeat-containing protein At4g19191, mitochondrial8.7e-20754.42Show/hide
Query:  LNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDAS
        L R + LS+V  WN  IR+AVN+    ++L+LFR++K  G +PNNFTFPF+AKACA+L+ +   +++H H++KSPF SD+FV TA VDM+VKC  V+ A+
Subjt:  LNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDAS

Query:  NLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLA
         +F++MP RD  +WNAM+ GF Q G  D+ FSLF  MRL  I PD+ T++ L ++    KSL+ L+A+HAVGI  G+D  V+VANTWI+ Y KCG+L  A
Subjt:  NLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLA

Query:  KMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDI
        K+VF  I +  R+ VSWNS+   Y+ FG+  DA  +Y  +L + FKPD ST I+L +SC  P+ L  G LIH H   LG D DI  INT ISMYS+  D 
Subjt:  KMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDI

Query:  TSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKD-VVVCNALIDMYAKCG
         SA +LFD M+ RTC+SWT MISGY+E G  D+AL LF+AM ++GEKPD+VT+LSLISGCGK G+LE G WID  A +YG K+D V++CNALIDMY+KCG
Subjt:  TSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKD-VVVCNALIDMYAKCG

Query:  SLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGR
        S+ +AR++F + P +TVV+WTTMIA  ALNG F EAL LF +M +L  KPNHITF+A+LQAC+H G LE+G E F  M + Y I+PGLDHYSCM+DLLGR
Subjt:  SLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGR

Query:  KGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKS
        KGKL EALE+I++M  KPD GIWGALL ACKIH N++I E  +  LF L+P +A  +VEMANIYA+ G WD  + +R  M+   ++K PG+S++QVNGK+
Subjt:  KGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKS

Query:  HVFFVEDRSHHDSLLIYAVLGNLTMQMNQKESLY
        H F V +  H ++ +IY  L  L++    K  LY
Subjt:  HVFFVEDRSHHDSLLIYAVLGNLTMQMNQKESLY

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.6e-11836.41Show/hide
Query:  KALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSL
        KAL  F +++ + ++P  + F +L K C   + L   + IH  +VKS F  D+F  T + +MY KC +V +A  +FD+MP RD  SWN ++ G+SQ G  
Subjt:  KALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSL

Query:  DRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGIQKHVRSSVSWNSLIACYANF
             +   M    ++P   TI+ +  AV + + +   K +H   + +G D+ V+++   +  Y+KCG L+ A+ +F G+ +  R+ VSWNS+I  Y   
Subjt:  DRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGIQKHVRSSVSWNSLIACYANF

Query:  GKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATILFDGMSFRTCISWTAMISGYSE
            +A+ I++ +L +G KP   +++  L +C     L  G  IH    +LG D ++S++N+LISMY +C ++ +A  +F  +  RT +SW AMI G+++
Subjt:  GKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATILFDGMSFRTCISWTAMISGYSE

Query:  VGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACA
         GR  DAL  F+ M     KPD  T +S+I+   ++       WI        L K+V V  AL+DMYAKCG++  AR +F  +  R V +W  MI    
Subjt:  VGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACA

Query:  LNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLG
         +G  + AL LF +M +  IKPN +TF++++ ACSH G +E G +CF  M + Y I   +DHY  M+DLLGR G+L EA + I  MP+KP   ++GA+LG
Subjt:  LNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLG

Query:  ACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMN
        AC+IH N+   E  +  LFEL P      V +ANIY +   W++V  +R +M    +RK+PG S+V++  + H FF    +H DS  IYA L  L   + 
Subjt:  ACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMN

Query:  QKESLYV
         KE+ YV
Subjt:  QKESLYV

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226901.8e-11131.95Show/hide
Query:  TVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPV
        T   +NS IR   + G   +A++LF ++  +G+ P+ +TFPF   ACAK     +   IH  +VK  +  D+FVQ ++V  Y +CG+++ A  +FD+M  
Subjt:  TVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPV

Query:  RDTASWNAMIIGFSQIGSLDRVFSLFVGM-RLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGI
        R+  SW +MI G+++         LF  M R   + P++ T++ +  A    + L   + V+A    +G++ +  + +  +  Y KC  + +AK +F   
Subjt:  RDTASWNAMIIGFSQIGSLDRVFSLFVGM-RLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGI

Query:  QKHVRSSVS-WNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATIL
         ++  S++   N++ + Y   G   +A+ ++  ++  G +PD  +++S +SSC Q + +++G   HG+  + G +S  ++ N LI MY +C    +A  +
Subjt:  QKHVRSSVS-WNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATIL

Query:  FDGMSFRTCISWTAMISGYSEVGR-------------------------------FDDALVLFNAME-EAGEKPDVVTVLSLISGCGKIGALELGHWIDN
        FD MS +T ++W ++++GY E G                                F++A+ +F +M+ + G   D VT++S+ S CG +GAL+L  WI  
Subjt:  FDGMSFRTCISWTAMISGYSEVGR-------------------------------FDDALVLFNAME-EAGEKPDVVTVLSLISGCGKIGALELGHWIDN

Query:  YASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECF
        Y    G++ DV +   L+DM+++CG    A  +F+SL NR V +WT  I A A+ G    A+ LF  M E G+KP+ + F+  L ACSHGG +++G+E F
Subjt:  YASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECF

Query:  MAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVST
         +M K +G++P   HY CM+DLLGR G L EA+++I+DMPM+P++ IW +LL AC++  N+++  Y +  +  L P    S+V ++N+YAS GRW+ ++ 
Subjt:  MAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVST

Query:  MRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAV----------------LGNLTMQMNQKESLYV
        +R +M+   +RK PG S +Q+ GK+H F   D SH +   I A+                L N+ M +++KE +++
Subjt:  MRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAV----------------LGNLTMQMNQKESLYV

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic2.4e-11134.24Show/hide
Query:  KLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDK
        K+    +WN  + +    G    ++ LF+++  +G++ +++TF  ++K+ + L  +   + +H  ++KS F     V  ++V  Y+K  +V+ A  +FD+
Subjt:  KLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDK

Query:  MPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFR
        M  RD  SWN++I G+   G  ++  S+FV M + GI  D ATI+ +      ++ +   +AVH++G++     +    NT +  YSKCG+L  AK VFR
Subjt:  MPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFR

Query:  GIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATI
         +    RS VS+ S+IA YA  G   +AV +++ +  +G  PD  T+ ++L+ C + + L  G  +H    +     DI + N L+ MY++CG +  A +
Subjt:  GIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATI

Query:  LFDGMSFRTCISWTAMISGYSEVGRFDDALVLFN-AMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDA
        +F  M  +  ISW  +I GYS+    ++AL LFN  +EE    PD  TV  ++  C  + A + G  I  Y    G   D  V N+L+DMYAKCG+L  A
Subjt:  LFDGMSFRTCISWTAMISGYSEVGRFDDALVLFN-AMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDA

Query:  REVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLI
          +F  + ++ +VSWT MIA   ++G  +EA+ LF QM + GI+ + I+F+++L ACSH G ++EG   F  M     I P ++HY+C++D+L R G LI
Subjt:  REVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLI

Query:  EALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFV
        +A   I++MP+ PD  IWGALL  C+IH+++++ E V+  +FEL+P     +V MANIYA   +W++V  +RK +    +RK+PG S +++ G+ ++F  
Subjt:  EALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFV

Query:  EDRSHHDSLLIYAVLGNLTMQM
         D S+ ++  I A L  +  +M
Subjt:  EDRSHHDSLLIYAVLGNLTMQM

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-11936.41Show/hide
Query:  KALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSL
        KAL  F +++ + ++P  + F +L K C   + L   + IH  +VKS F  D+F  T + +MY KC +V +A  +FD+MP RD  SWN ++ G+SQ G  
Subjt:  KALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPVRDTASWNAMIIGFSQIGSL

Query:  DRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGIQKHVRSSVSWNSLIACYANF
             +   M    ++P   TI+ +  AV + + +   K +H   + +G D+ V+++   +  Y+KCG L+ A+ +F G+ +  R+ VSWNS+I  Y   
Subjt:  DRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGIQKHVRSSVSWNSLIACYANF

Query:  GKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATILFDGMSFRTCISWTAMISGYSE
            +A+ I++ +L +G KP   +++  L +C     L  G  IH    +LG D ++S++N+LISMY +C ++ +A  +F  +  RT +SW AMI G+++
Subjt:  GKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATILFDGMSFRTCISWTAMISGYSE

Query:  VGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACA
         GR  DAL  F+ M     KPD  T +S+I+   ++       WI        L K+V V  AL+DMYAKCG++  AR +F  +  R V +W  MI    
Subjt:  VGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACA

Query:  LNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLG
         +G  + AL LF +M +  IKPN +TF++++ ACSH G +E G +CF  M + Y I   +DHY  M+DLLGR G+L EA + I  MP+KP   ++GA+LG
Subjt:  LNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLG

Query:  ACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMN
        AC+IH N+   E  +  LFEL P      V +ANIY +   W++V  +R +M    +RK+PG S+V++  + H FF    +H DS  IYA L  L   + 
Subjt:  ACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMN

Query:  QKESLYV
         KE+ YV
Subjt:  QKESLYV

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.3e-11231.95Show/hide
Query:  TVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPV
        T   +NS IR   + G   +A++LF ++  +G+ P+ +TFPF   ACAK     +   IH  +VK  +  D+FVQ ++V  Y +CG+++ A  +FD+M  
Subjt:  TVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPV

Query:  RDTASWNAMIIGFSQIGSLDRVFSLFVGM-RLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGI
        R+  SW +MI G+++         LF  M R   + P++ T++ +  A    + L   + V+A    +G++ +  + +  +  Y KC  + +AK +F   
Subjt:  RDTASWNAMIIGFSQIGSLDRVFSLFVGM-RLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGI

Query:  QKHVRSSVS-WNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATIL
         ++  S++   N++ + Y   G   +A+ ++  ++  G +PD  +++S +SSC Q + +++G   HG+  + G +S  ++ N LI MY +C    +A  +
Subjt:  QKHVRSSVS-WNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATIL

Query:  FDGMSFRTCISWTAMISGYSEVGR-------------------------------FDDALVLFNAME-EAGEKPDVVTVLSLISGCGKIGALELGHWIDN
        FD MS +T ++W ++++GY E G                                F++A+ +F +M+ + G   D VT++S+ S CG +GAL+L  WI  
Subjt:  FDGMSFRTCISWTAMISGYSEVGR-------------------------------FDDALVLFNAME-EAGEKPDVVTVLSLISGCGKIGALELGHWIDN

Query:  YASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECF
        Y    G++ DV +   L+DM+++CG    A  +F+SL NR V +WT  I A A+ G    A+ LF  M E G+KP+ + F+  L ACSHGG +++G+E F
Subjt:  YASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECF

Query:  MAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVST
         +M K +G++P   HY CM+DLLGR G L EA+++I+DMPM+P++ IW +LL AC++  N+++  Y +  +  L P    S+V ++N+YAS GRW+ ++ 
Subjt:  MAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVST

Query:  MRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAV----------------LGNLTMQMNQKESLYV
        +R +M+   +RK PG S +Q+ GK+H F   D SH +   I A+                L N+ M +++KE +++
Subjt:  MRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAV----------------LGNLTMQMNQKESLYV

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.3e-11231.95Show/hide
Query:  TVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPV
        T   +NS IR   + G   +A++LF ++  +G+ P+ +TFPF   ACAK     +   IH  +VK  +  D+FVQ ++V  Y +CG+++ A  +FD+M  
Subjt:  TVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDKMPV

Query:  RDTASWNAMIIGFSQIGSLDRVFSLFVGM-RLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGI
        R+  SW +MI G+++         LF  M R   + P++ T++ +  A    + L   + V+A    +G++ +  + +  +  Y KC  + +AK +F   
Subjt:  RDTASWNAMIIGFSQIGSLDRVFSLFVGM-RLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGI

Query:  QKHVRSSVS-WNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATIL
         ++  S++   N++ + Y   G   +A+ ++  ++  G +PD  +++S +SSC Q + +++G   HG+  + G +S  ++ N LI MY +C    +A  +
Subjt:  QKHVRSSVS-WNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATIL

Query:  FDGMSFRTCISWTAMISGYSEVGR-------------------------------FDDALVLFNAME-EAGEKPDVVTVLSLISGCGKIGALELGHWIDN
        FD MS +T ++W ++++GY E G                                F++A+ +F +M+ + G   D VT++S+ S CG +GAL+L  WI  
Subjt:  FDGMSFRTCISWTAMISGYSEVGR-------------------------------FDDALVLFNAME-EAGEKPDVVTVLSLISGCGKIGALELGHWIDN

Query:  YASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECF
        Y    G++ DV +   L+DM+++CG    A  +F+SL NR V +WT  I A A+ G    A+ LF  M E G+KP+ + F+  L ACSHGG +++G+E F
Subjt:  YASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECF

Query:  MAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVST
         +M K +G++P   HY CM+DLLGR G L EA+++I+DMPM+P++ IW +LL AC++  N+++  Y +  +  L P    S+V ++N+YAS GRW+ ++ 
Subjt:  MAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVST

Query:  MRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAV----------------LGNLTMQMNQKESLYV
        +R +M+   +RK PG S +Q+ GK+H F   D SH +   I A+                L N+ M +++KE +++
Subjt:  MRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAV----------------LGNLTMQMNQKESLYV

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-11234.24Show/hide
Query:  KLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDK
        K+    +WN  + +    G    ++ LF+++  +G++ +++TF  ++K+ + L  +   + +H  ++KS F     V  ++V  Y+K  +V+ A  +FD+
Subjt:  KLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLFDK

Query:  MPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFR
        M  RD  SWN++I G+   G  ++  S+FV M + GI  D ATI+ +      ++ +   +AVH++G++     +    NT +  YSKCG+L  AK VFR
Subjt:  MPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFR

Query:  GIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATI
         +    RS VS+ S+IA YA  G   +AV +++ +  +G  PD  T+ ++L+ C + + L  G  +H    +     DI + N L+ MY++CG +  A +
Subjt:  GIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATI

Query:  LFDGMSFRTCISWTAMISGYSEVGRFDDALVLFN-AMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDA
        +F  M  +  ISW  +I GYS+    ++AL LFN  +EE    PD  TV  ++  C  + A + G  I  Y    G   D  V N+L+DMYAKCG+L  A
Subjt:  LFDGMSFRTCISWTAMISGYSEVGRFDDALVLFN-AMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDA

Query:  REVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLI
          +F  + ++ +VSWT MIA   ++G  +EA+ LF QM + GI+ + I+F+++L ACSH G ++EG   F  M     I P ++HY+C++D+L R G LI
Subjt:  REVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLI

Query:  EALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFV
        +A   I++MP+ PD  IWGALL  C+IH+++++ E V+  +FEL+P     +V MANIYA   +W++V  +RK +    +RK+PG S +++ G+ ++F  
Subjt:  EALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFV

Query:  EDRSHHDSLLIYAVLGNLTMQM
         D S+ ++  I A L  +  +M
Subjt:  EDRSHHDSLLIYAVLGNLTMQM

AT4G19191.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.2e-20854.42Show/hide
Query:  LNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDAS
        L R + LS+V  WN  IR+AVN+    ++L+LFR++K  G +PNNFTFPF+AKACA+L+ +   +++H H++KSPF SD+FV TA VDM+VKC  V+ A+
Subjt:  LNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDAS

Query:  NLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLA
         +F++MP RD  +WNAM+ GF Q G  D+ FSLF  MRL  I PD+ T++ L ++    KSL+ L+A+HAVGI  G+D  V+VANTWI+ Y KCG+L  A
Subjt:  NLFDKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLA

Query:  KMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDI
        K+VF  I +  R+ VSWNS+   Y+ FG+  DA  +Y  +L + FKPD ST I+L +SC  P+ L  G LIH H   LG D DI  INT ISMYS+  D 
Subjt:  KMVFRGIQKHVRSSVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDI

Query:  TSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKD-VVVCNALIDMYAKCG
         SA +LFD M+ RTC+SWT MISGY+E G  D+AL LF+AM ++GEKPD+VT+LSLISGCGK G+LE G WID  A +YG K+D V++CNALIDMY+KCG
Subjt:  TSATILFDGMSFRTCISWTAMISGYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKD-VVVCNALIDMYAKCG

Query:  SLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGR
        S+ +AR++F + P +TVV+WTTMIA  ALNG F EAL LF +M +L  KPNHITF+A+LQAC+H G LE+G E F  M + Y I+PGLDHYSCM+DLLGR
Subjt:  SLSDAREVFSSLPNRTVVSWTTMIAACALNGEFREALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGR

Query:  KGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKS
        KGKL EALE+I++M  KPD GIWGALL ACKIH N++I E  +  LF L+P +A  +VEMANIYA+ G WD  + +R  M+   ++K PG+S++QVNGK+
Subjt:  KGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSRHLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKS

Query:  HVFFVEDRSHHDSLLIYAVLGNLTMQMNQKESLY
        H F V +  H ++ +IY  L  L++    K  LY
Subjt:  HVFFVEDRSHHDSLLIYAVLGNLTMQMNQKESLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCGTTGCCTCCACATCTCAACCGTTTTGCGAAGCTATCAACTGTCACTTGGTGGAACTCCAACATAAGAGACGCTGTAAATCAAGGTCATGCGTACAAAGCCCT
TGTCCTCTTCCGCCAACTGAAGCTAAATGGTCTACAACCTAACAATTTTACATTCCCCTTCTTAGCAAAAGCTTGCGCCAAGCTCTCCCATCTTACTGACTCCCAAATTA
TCCATACCCATGTCGTCAAATCCCCATTCCACTCTGATATCTTCGTTCAGACAGCCATGGTCGATATGTATGTCAAGTGTGGTAAAGTGGAGGATGCATCTAACTTGTTT
GATAAAATGCCTGTTAGGGATACTGCATCATGGAACGCTATGATTATTGGTTTTTCTCAAATTGGCTCCCTTGATAGAGTTTTCAGTCTGTTTGTAGGGATGAGATTAGT
GGGAATTCGGCCAGATGCAGCTACTATCATTGGATTGTCTCGAGCAGTTTTATCTGCCAAAAGTTTAAGATTCTTGAAAGCTGTTCATGCTGTTGGGATTGAAACAGGAT
TGGACACTGATGTCTCAGTTGCTAACACTTGGATTGCTGGCTACTCCAAATGCGGTGAACTACAGCTTGCAAAGATGGTGTTCCGCGGGATTCAAAAGCATGTAAGATCT
TCTGTCTCGTGGAATTCATTAATTGCATGCTATGCAAACTTTGGGAAATATATAGATGCTGTAAATATTTACAAAGGATTGCTCTGTGATGGGTTTAAGCCTGATGCAAG
TACGATTATTAGCCTTCTTTCCTCTTGCATGCAACCTCAGGCACTAATTTATGGTTCCCTTATTCATGGTCATGGTTTCCAATTGGGTTGTGATTCAGATATTTCTTTAA
TTAACACCCTTATATCAATGTATTCTAGGTGTGGAGATATTACCTCTGCTACAATTTTGTTTGATGGTATGTCCTTTAGAACATGTATTTCGTGGACTGCCATGATTAGT
GGGTATAGTGAGGTGGGGCGATTTGACGATGCCTTGGTATTATTTAATGCTATGGAAGAAGCTGGTGAAAAACCTGATGTTGTTACAGTTCTTTCCCTGATATCAGGTTG
TGGTAAAATAGGGGCACTTGAGCTAGGGCATTGGATTGACAACTATGCATCTTTATATGGATTGAAAAAAGATGTGGTAGTTTGTAATGCATTAATAGACATGTATGCAA
AATGTGGTAGTTTGAGTGATGCTCGAGAGGTATTCTCTAGTCTGCCTAATAGAACTGTTGTCTCTTGGACAACCATGATTGCAGCTTGCGCTCTGAATGGGGAATTTAGA
GAAGCTTTACATCTCTTCTATCAAATGTCAGAGTTGGGAATAAAGCCAAACCATATCACGTTTATTGCTATTCTTCAAGCTTGCTCTCATGGAGGCTACCTCGAGGAAGG
GAGGGAATGTTTTATGGCGATGACTAAAAGGTATGGTATAAATCCTGGTTTGGATCATTATTCCTGCATGATTGATCTTCTTGGACGGAAAGGGAAGCTAATTGAAGCCT
TGGAGGTTATTCAAGATATGCCTATGAAACCTGATGAGGGCATATGGGGTGCATTGCTTGGTGCTTGTAAGATTCACAACAACATACAAATTGGTGAGTATGTGTCTCGT
CATCTCTTTGAATTGCAGCCGCACGTTGCAGTTTCGTTTGTCGAGATGGCTAATATATATGCATCGGTTGGAAGATGGGATAGAGTTTCAACTATGAGAAAAACGATGAG
ATCAAACAAAATGAGGAAATCTCCTGGGAAAAGCATCGTTCAAGTGAATGGAAAGTCGCATGTGTTTTTTGTTGAGGACAGAAGCCATCATGACAGTTTGCTTATTTATG
CCGTGTTGGGGAATTTAACAATGCAGATGAATCAAAAAGAATCCTTATATGTCCAGAGAATTGTTGAACTTGGTATGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCGTTGCCTCCACATCTCAACCGTTTTGCGAAGCTATCAACTGTCACTTGGTGGAACTCCAACATAAGAGACGCTGTAAATCAAGGTCATGCGTACAAAGCCCT
TGTCCTCTTCCGCCAACTGAAGCTAAATGGTCTACAACCTAACAATTTTACATTCCCCTTCTTAGCAAAAGCTTGCGCCAAGCTCTCCCATCTTACTGACTCCCAAATTA
TCCATACCCATGTCGTCAAATCCCCATTCCACTCTGATATCTTCGTTCAGACAGCCATGGTCGATATGTATGTCAAGTGTGGTAAAGTGGAGGATGCATCTAACTTGTTT
GATAAAATGCCTGTTAGGGATACTGCATCATGGAACGCTATGATTATTGGTTTTTCTCAAATTGGCTCCCTTGATAGAGTTTTCAGTCTGTTTGTAGGGATGAGATTAGT
GGGAATTCGGCCAGATGCAGCTACTATCATTGGATTGTCTCGAGCAGTTTTATCTGCCAAAAGTTTAAGATTCTTGAAAGCTGTTCATGCTGTTGGGATTGAAACAGGAT
TGGACACTGATGTCTCAGTTGCTAACACTTGGATTGCTGGCTACTCCAAATGCGGTGAACTACAGCTTGCAAAGATGGTGTTCCGCGGGATTCAAAAGCATGTAAGATCT
TCTGTCTCGTGGAATTCATTAATTGCATGCTATGCAAACTTTGGGAAATATATAGATGCTGTAAATATTTACAAAGGATTGCTCTGTGATGGGTTTAAGCCTGATGCAAG
TACGATTATTAGCCTTCTTTCCTCTTGCATGCAACCTCAGGCACTAATTTATGGTTCCCTTATTCATGGTCATGGTTTCCAATTGGGTTGTGATTCAGATATTTCTTTAA
TTAACACCCTTATATCAATGTATTCTAGGTGTGGAGATATTACCTCTGCTACAATTTTGTTTGATGGTATGTCCTTTAGAACATGTATTTCGTGGACTGCCATGATTAGT
GGGTATAGTGAGGTGGGGCGATTTGACGATGCCTTGGTATTATTTAATGCTATGGAAGAAGCTGGTGAAAAACCTGATGTTGTTACAGTTCTTTCCCTGATATCAGGTTG
TGGTAAAATAGGGGCACTTGAGCTAGGGCATTGGATTGACAACTATGCATCTTTATATGGATTGAAAAAAGATGTGGTAGTTTGTAATGCATTAATAGACATGTATGCAA
AATGTGGTAGTTTGAGTGATGCTCGAGAGGTATTCTCTAGTCTGCCTAATAGAACTGTTGTCTCTTGGACAACCATGATTGCAGCTTGCGCTCTGAATGGGGAATTTAGA
GAAGCTTTACATCTCTTCTATCAAATGTCAGAGTTGGGAATAAAGCCAAACCATATCACGTTTATTGCTATTCTTCAAGCTTGCTCTCATGGAGGCTACCTCGAGGAAGG
GAGGGAATGTTTTATGGCGATGACTAAAAGGTATGGTATAAATCCTGGTTTGGATCATTATTCCTGCATGATTGATCTTCTTGGACGGAAAGGGAAGCTAATTGAAGCCT
TGGAGGTTATTCAAGATATGCCTATGAAACCTGATGAGGGCATATGGGGTGCATTGCTTGGTGCTTGTAAGATTCACAACAACATACAAATTGGTGAGTATGTGTCTCGT
CATCTCTTTGAATTGCAGCCGCACGTTGCAGTTTCGTTTGTCGAGATGGCTAATATATATGCATCGGTTGGAAGATGGGATAGAGTTTCAACTATGAGAAAAACGATGAG
ATCAAACAAAATGAGGAAATCTCCTGGGAAAAGCATCGTTCAAGTGAATGGAAAGTCGCATGTGTTTTTTGTTGAGGACAGAAGCCATCATGACAGTTTGCTTATTTATG
CCGTGTTGGGGAATTTAACAATGCAGATGAATCAAAAAGAATCCTTATATGTCCAGAGAATTGTTGAACTTGGTATGGAATGA
Protein sequenceShow/hide protein sequence
MPSLPPHLNRFAKLSTVTWWNSNIRDAVNQGHAYKALVLFRQLKLNGLQPNNFTFPFLAKACAKLSHLTDSQIIHTHVVKSPFHSDIFVQTAMVDMYVKCGKVEDASNLF
DKMPVRDTASWNAMIIGFSQIGSLDRVFSLFVGMRLVGIRPDAATIIGLSRAVLSAKSLRFLKAVHAVGIETGLDTDVSVANTWIAGYSKCGELQLAKMVFRGIQKHVRS
SVSWNSLIACYANFGKYIDAVNIYKGLLCDGFKPDASTIISLLSSCMQPQALIYGSLIHGHGFQLGCDSDISLINTLISMYSRCGDITSATILFDGMSFRTCISWTAMIS
GYSEVGRFDDALVLFNAMEEAGEKPDVVTVLSLISGCGKIGALELGHWIDNYASLYGLKKDVVVCNALIDMYAKCGSLSDAREVFSSLPNRTVVSWTTMIAACALNGEFR
EALHLFYQMSELGIKPNHITFIAILQACSHGGYLEEGRECFMAMTKRYGINPGLDHYSCMIDLLGRKGKLIEALEVIQDMPMKPDEGIWGALLGACKIHNNIQIGEYVSR
HLFELQPHVAVSFVEMANIYASVGRWDRVSTMRKTMRSNKMRKSPGKSIVQVNGKSHVFFVEDRSHHDSLLIYAVLGNLTMQMNQKESLYVQRIVELGME