CuGenDBv2

Lag0028187 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028187
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr8:15065728..15067851
RNA-Seq Expression: Lag0028187
Synteny: Lag0028187
Gene Ontology terms:
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR032867 - DYW domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588362.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; identity: 89%)
Query:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV
        MLKL  PI+ LAPVKFTPFL KSN  ASPLLDP KLLKVAADAKNLKFGR IHAHLI+TN  P +CRVNQ+NSLINLYVKCD +FIARQMFD M KRNVV
Subjt:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV

Query:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYMQNG+P EVF LFKKM+VKD+IFPNEYVI+T+ISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPGYD
Subjt:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        IFCYNLVLNGLLEH+H+ EAIEVLKLMI EG +WNNAT+VTIFR+CASLKDLK GK VHA+MLK++ID D+YIGSSIIDMYGKCGNVLSGRAFFDQLQ+R
Subjt:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY
        NVVSWTAIMAAYFQNGFFEEALNLF KMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMK 
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI
        CDSITWNAIITGHSHH +GKEAL++F DMLT RECPNYVTFIGVLSACAHL LVDEG YYF+HLMKQFGI PGLEHYTCIVGLLSRSGRLDEAENFMRS 
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI

Query:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS
         INWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS
Subjt:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS

Query:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED
        +IYE VRDLL+KIRPLGY+PDIAGVLHDIEDEQKL+NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF+D
Subjt:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED

Query:  GCCSCGDYW
        G CSCGDYW
Subjt:  GCCSCGDYW

XP_022144415.1 pentatricopeptide repeat-containing protein At5g39680 [Momordica charantia] (e value: 0.0e+00; identity: 89.09%)
Query:  LKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWS
        LKLP +GL      PFLFKSNYFASP  +P KLLK+AADAKNLKFGRIIHAHLI+TNHTP +CRVNQ+NSLIN Y KCD + +ARQMFD MPKRNVVSWS
Subjt:  LKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWS

Query:  ALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFC
        ALMAGYMQNG+  EVF L KKMVV+DDI PNEYVI+TI+SSCC SQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAA+QILDTVPGYDIFC
Subjt:  ALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFC

Query:  YNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVV
        YNLVLNGLLEH+H+ EAIEVL LMI E IEWNNATYVTIFRLCASLKDL+LGKQVHAQML+N+ID D+YIGSSIIDMYGKCG VLSGR FFD+LQS+NVV
Subjt:  YNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVV

Query:  SWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDS
        SWT IMAAYFQNGFFEEALNLF KMEID IPPNEYTLAV LNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM  CDS
Subjt:  SWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDS

Query:  ITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILIN
        ITWNAIITGHSHHGLGKEALSMFQDML T ECPNYVTFIGVLSACAHL LV EGFYYF+HLMKQFGI PGLEHYTCI+GLLSRSG+LDEAENFMRS  IN
Subjt:  ITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILIN

Query:  WDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIY
        WDVVAWRTLL ACYVHRNYDKGKQIAEYLLQMDPEDVG+YILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS+IY
Subjt:  WDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIY

Query:  EKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCC
        EKVRDLLS+I+PLGYVPDIAGVLHDI+DEQKLDNLSYHSEKLAVAYGLMKTP GAPIRVIKNLRMCDDCHTAVKLISKVANR IIVRDANRFHHFEDGCC
Subjt:  EKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCC

Query:  SCGDYW
        SCGDYW
Subjt:  SCGDYW

XP_022933883.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita moschata] (e value: 0.0e+00; identity: 89%)
Query:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV
        MLKL  PI+ LAPVKFTPFL KSN  ASPLLDP KLLKVAADAKNLKFGR IHAHLI+TN  P +CRVNQ+NSLINLYVKCD +FIARQMFD M KRNVV
Subjt:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV

Query:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYMQNG+P EVF LFKKM+VKD+IFPNEYVI+T+ISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPGYD
Subjt:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        +FCYNLVLNGLLEH+H+ EAIEVLKLMI EG +WNNAT+VTIFR+CASLKDLK GK VHA+MLK++ID D+YIGSSIIDMYGKCGNVLSGRAFFDQLQ+R
Subjt:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY
        NVVSWTAIMAAYFQNGFFEEALNLF KMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNMK 
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI
        CDSITWNAIITGHSHH +GKEAL++F DMLT RECPNYVTFIGVLSACAHL LVDEG YYF+HLMKQFGI PGLEHYTCIVGLLSRSGRLDEAENFMRS 
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI

Query:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS
         INWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS
Subjt:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS

Query:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED
        +IYE VRDLL+KIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF+D
Subjt:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED

Query:  GCCSCGDYW
        G CSCGDYW
Subjt:  GCCSCGDYW

XP_023002421.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima] (e value: 0.0e+00; identity: 89.14%)
Query:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV
        MLKL  PI+ LAPVKFTPFL KSN  ASPLLDP KLLKVAADAKNLKFGR+IHAHL++TN  PR+CRVNQ+NSLINLYVKCD +FIARQMFD M KRNVV
Subjt:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV

Query:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYMQNG+P +VF LFKKM+VKD+IFPNEYVI+T+ISSC DSQMYVEGKQCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPGYD
Subjt:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        +FCYNLVLNGLLEH+H+GEAIEVLKLMIDEG +WNNAT+VTIFR+CASLKDLKLGK VHA+MLK++ID D+YIGSSIIDMYGKCGNVLSGRAFFDQLQ+R
Subjt:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY
        NVVSWTAIMAAYFQNGFFEEALNLF KMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNMK 
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI
        CDSITWNAIITGHSHH +GKEAL++F DMLT RECPNYVTFIGVLSACAHL LVDEG YYF+HLMKQ GI PGLEHYTCIVGLLSRSGRLDEAENFMRS 
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI

Query:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS
         INWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS
Subjt:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS

Query:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED
        +IYE +RDLL+KIRPLGYVPDIAGVLHDIEDEQK+DNLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISKVANR IIVRDANRFHHF+D
Subjt:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED

Query:  GCCSCGDYW
        G CSCGDYW
Subjt:  GCCSCGDYW

XP_023529544.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; identity: 88.86%)
Query:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV
        MLKL  PI+ LAPVKFTPFL KSN  ASPLLDP KLLKVAADAKNLKFGR IHAHLI+TN  P +CRVNQ+NSLINLYVKCD +FIARQMFD M KRNVV
Subjt:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV

Query:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYMQNG+P  VF LFKKM+VKD+IFPNEYVI+T+ISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPGYD
Subjt:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        +FCYNLVLNGLLEH+H+ EAIEVLKLMIDEG +WNNAT+VTIFR+CASLKDL+ GK VHA+MLK++ID D+YIGSSIIDMYGKCGNVLSGRAFFDQLQ+R
Subjt:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY
        NVVSWTAIMAAYFQNGFFEEALNLF KMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM  
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI
        CDSITWNAIITGHSHH +GKEAL++F DMLT RECPNYVTFIGVLSACAHL LVDEG YYF+HLMKQFGI PGLEHYTCIVGLLSRSGRLDEAENFMRS 
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI

Query:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS
         INWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS
Subjt:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS

Query:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED
        +IYE VRDLL+KIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF+D
Subjt:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED

Query:  GCCSCGDYW
        G CSCGDYW
Subjt:  GCCSCGDYW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KR26 DYW_deaminase domain-containing protein (e value: 0.0e+00; identity: 84.72%)
Query:  MLKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSW
        +LKLPI  + PVKFTPFL +SN+ ASP  DP KLLKVAADAKNLKFGR IHAHL +TNH  R+ +VNQ+NSLINLYVKCD V IAR++FDSMP+RNVVSW
Subjt:  MLKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSW

Query:  SALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIF
        SALMAGYMQNGNP EVF LFKKMVVKD+IFPNEYVI+T ISS CDSQMYVEGKQCHGYALKSGLE HQYVKNALIQ+YSKCSDV AA+QIL TVPG DIF
Subjt:  SALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIF

Query:  CYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNV
        CYNLV+NGLL+HTHM EA++VLKL+I EGIEWNNATYVTIFRLCASLKD+ LGKQVHAQMLK++ID D+YIGSSIIDMYGKCGNVLSGR FFD+LQSRNV
Subjt:  CYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNV

Query:  VSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCD
        VSWT+I+AAYFQN FFEEALNLF KMEID IPPNEYT+AVL NSAAGLSAL  GDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM  C+
Subjt:  VSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCD

Query:  SITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILI
         ITWNAIITGHSHHGLGKEALSMFQDM+ T E PNYVTFIGV+ ACAHL LVDEGFYYF+HLMKQF I PGLEHYTCIVGLLSRSGRLDEAENFMRS  I
Subjt:  SITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILI

Query:  NWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRI
        NWDVV+WRTLLNACYVH++YDKG++IAEYLLQ++P DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+ HVFTSED KHPE++ I
Subjt:  NWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRI

Query:  YEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGC
        YE V+DLLSKIRPLGYVPDI  VLHDIEDEQK+DNLSYHSEKLAVAYGLMKTPSGAPI VIKNLRMCDDCHTA+KLISKVANR I+VRDANRFHHF++GC
Subjt:  YEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGC

Query:  CSCGDYW
        CSCGDYW
Subjt:  CSCGDYW

A0A1S4E243 pentatricopeptide repeat-containing protein At5g39680 isoform X2 (e value: 0.0e+00; identity: 83.73%)
Query:  MLKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSW
        +LKLPI+ + PVKFTPFL +S++FASP  DP KLLKVAADAKNL FGR I AHL +TNH  R+ +VNQ+NSLINLYVKC  V IAR++FDSMP+RNVVSW
Subjt:  MLKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSW

Query:  SALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIF
        S LMAGYMQNGNPSEVF LFKKMV+KD+I PN+YVI+T+ISS C+SQMYVEGKQCHGYALKSGLE HQYVKNALIQ+YSKCSDV AA+QIL TVPG DIF
Subjt:  SALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIF

Query:  CYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNV
        CYNLV+NGLL+HTHM EA++VLKL+I +GIEWN+ATYVTIFRLCASLKD+ LGKQVHAQMLK++ID D+YIGSSIIDMYGKCGNVLSGR FFD+LQSRNV
Subjt:  CYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNV

Query:  VSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCD
        VSWT+IMAAYFQN FFEEAL+LF KMEID IPPNEYT+AVL NSAAGLSAL  GDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQRVFSNM  CD
Subjt:  VSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCD

Query:  SITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILI
         ITWNAIITGHSHHGLGKEALSMFQDM+TT E PNYVTFIGV+SACAHL LVDEGFYYF+HLMKQFGI PGLEHYTCIVGLLSRSGRLDEAENFMRS  I
Subjt:  SITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILI

Query:  NWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRI
        NWDVV+WRTLLNACYVH++YDKGKQIAEYLLQ++P DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+ HVFTSED KHP+++ I
Subjt:  NWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRI

Query:  YEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGC
        YE V++LLSKIRPLGYVPDI  VLHDIEDEQK++NLSYHSEKLAVAYGLMKT SG PIRVIKNLRMCDDCHTA+KLIS+VANR IIVRD NRFHHF++GC
Subjt:  YEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGC

Query:  CSCGDYW
        CSCGDYW
Subjt:  CSCGDYW

A0A6J1CRJ5 pentatricopeptide repeat-containing protein At5g39680 (e value: 0.0e+00; identity: 89.09%)
Query:  LKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWS
        LKLP +GL      PFLFKSNYFASP  +P KLLK+AADAKNLKFGRIIHAHLI+TNHTP +CRVNQ+NSLIN Y KCD + +ARQMFD MPKRNVVSWS
Subjt:  LKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWS

Query:  ALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFC
        ALMAGYMQNG+  EVF L KKMVV+DDI PNEYVI+TI+SSCC SQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAA+QILDTVPGYDIFC
Subjt:  ALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFC

Query:  YNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVV
        YNLVLNGLLEH+H+ EAIEVL LMI E IEWNNATYVTIFRLCASLKDL+LGKQVHAQML+N+ID D+YIGSSIIDMYGKCG VLSGR FFD+LQS+NVV
Subjt:  YNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVV

Query:  SWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDS
        SWT IMAAYFQNGFFEEALNLF KMEID IPPNEYTLAV LNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM  CDS
Subjt:  SWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDS

Query:  ITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILIN
        ITWNAIITGHSHHGLGKEALSMFQDML T ECPNYVTFIGVLSACAHL LV EGFYYF+HLMKQFGI PGLEHYTCI+GLLSRSG+LDEAENFMRS  IN
Subjt:  ITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILIN

Query:  WDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIY
        WDVVAWRTLL ACYVHRNYDKGKQIAEYLLQMDPEDVG+YILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS+IY
Subjt:  WDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIY

Query:  EKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCC
        EKVRDLLS+I+PLGYVPDIAGVLHDI+DEQKLDNLSYHSEKLAVAYGLMKTP GAPIRVIKNLRMCDDCHTAVKLISKVANR IIVRDANRFHHFEDGCC
Subjt:  EKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCC

Query:  SCGDYW
        SCGDYW
Subjt:  SCGDYW

A0A6J1F637 pentatricopeptide repeat-containing protein At5g39680 (e value: 0.0e+00; identity: 89%)
Query:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV
        MLKL  PI+ LAPVKFTPFL KSN  ASPLLDP KLLKVAADAKNLKFGR IHAHLI+TN  P +CRVNQ+NSLINLYVKCD +FIARQMFD M KRNVV
Subjt:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV

Query:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYMQNG+P EVF LFKKM+VKD+IFPNEYVI+T+ISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPGYD
Subjt:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        +FCYNLVLNGLLEH+H+ EAIEVLKLMI EG +WNNAT+VTIFR+CASLKDLK GK VHA+MLK++ID D+YIGSSIIDMYGKCGNVLSGRAFFDQLQ+R
Subjt:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY
        NVVSWTAIMAAYFQNGFFEEALNLF KMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNMK 
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI
        CDSITWNAIITGHSHH +GKEAL++F DMLT RECPNYVTFIGVLSACAHL LVDEG YYF+HLMKQFGI PGLEHYTCIVGLLSRSGRLDEAENFMRS 
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI

Query:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS
         INWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS
Subjt:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS

Query:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED
        +IYE VRDLL+KIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF+D
Subjt:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED

Query:  GCCSCGDYW
        G CSCGDYW
Subjt:  GCCSCGDYW

A0A6J1KL92 pentatricopeptide repeat-containing protein At5g39680 (e value: 0.0e+00; identity: 89.14%)
Query:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV
        MLKL  PI+ LAPVKFTPFL KSN  ASPLLDP KLLKVAADAKNLKFGR+IHAHL++TN  PR+CRVNQ+NSLINLYVKCD +FIARQMFD M KRNVV
Subjt:  MLKL--PINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVV

Query:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYMQNG+P +VF LFKKM+VKD+IFPNEYVI+T+ISSC DSQMYVEGKQCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPGYD
Subjt:  SWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        +FCYNLVLNGLLEH+H+GEAIEVLKLMIDEG +WNNAT+VTIFR+CASLKDLKLGK VHA+MLK++ID D+YIGSSIIDMYGKCGNVLSGRAFFDQLQ+R
Subjt:  IFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY
        NVVSWTAIMAAYFQNGFFEEALNLF KMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNMK 
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKY

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI
        CDSITWNAIITGHSHH +GKEAL++F DMLT RECPNYVTFIGVLSACAHL LVDEG YYF+HLMKQ GI PGLEHYTCIVGLLSRSGRLDEAENFMRS 
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSI

Query:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS
         INWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNI HVFTSED KHPESS
Subjt:  LINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESS

Query:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED
        +IYE +RDLL+KIRPLGYVPDIAGVLHDIEDEQK+DNLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISKVANR IIVRDANRFHHF+D
Subjt:  RIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFED

Query:  GCCSCGDYW
        G CSCGDYW
Subjt:  GCCSCGDYW

SwissProt top hits (e value, % identity, alignment)
Q9FK93 Pentatricopeptide repeat-containing protein At5g39680 (e value: 2.7e-222; identity: 52.66%)
Query:  KFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGN
        K    + KS     P+    +LLKV A++  L+ G  IHAHLI+TN + R     Q+NSLINLYVKC     AR++FD MP+RNVVSW A+M GY  +G 
Subjt:  KFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGN

Query:  PSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEH
          EV  LFK M    +  PNE+V + +  SC +S    EGKQ HG  LK GL  H++V+N L+ MYS CS    A+++LD +P  D+  ++  L+G LE 
Subjt:  PSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEH

Query:  THMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQ
            E ++VL+   +E   WNN TY++  RL ++L+DL L  QVH++M++   + ++    ++I+MYGKCG VL  +  FD   ++N+   T IM AYFQ
Subjt:  THMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQ

Query:  NGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHS
        +  FEEALNLF KM+   +PPNEYT A+LLNS A LS L  GD LH    KSG + +V+VGNAL+ MY+KSG I  A++ FS M + D +TWN +I+G S
Subjt:  NGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHS

Query:  HHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLN
        HHGLG+EAL  F  M+ T E PN +TFIGVL AC+H+G V++G +YF+ LMK+F + P ++HYTCIVGLLS++G   +AE+FMR+  I WDVVAWRTLLN
Subjt:  HHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLN

Query:  ACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIR
        ACYV RNY  GK++AEY ++  P D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE + IY KV++++SKI+
Subjt:  ACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIR

Query:  PLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
        PLGY PD+AG  HD+++EQ+ DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  PLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 (e value: 1.7e-139; identity: 36.74%)
Query:  NSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQY
        N++++ Y K   +    + FD +P+R+ VSW+ ++ GY   G   +   +   M VK+ I P ++ ++ +++S   ++    GK+ H + +K GL  +  
Subjt:  NSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQY

Query:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLNGLLEHTHMGEAIEVL-KLMIDEGIEWNNATY
        V N+L+ MY+KC D   A  + D                                +   DI  +N +++G  +  +   A+++  K++ D  +  +  T 
Subjt:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLNGLLEHTHMGEAIEVL-KLMIDEGIEWNNATY

Query:  VTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT
         ++   CA+L+ L +GKQ+H+ ++    D    + +++I MY +CG V + R   +Q                                 L+ R+VV+WT
Subjt:  VTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT

Query:  AIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYC--DSI
        A++  Y Q+G + EA+NLF  M      PN YTLA +L+ A+ L++LSHG Q+H  A KSG   +V V NALI MY+K+G+I +A R F  ++ C  D++
Subjt:  AIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYC--DSI

Query:  TWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINW
        +W ++I   + HG  +EAL +F+ ML     P+++T++GV SAC H GLV++G  YF  +     I P L HY C+V L  R+G L EA+ F+  + I  
Subjt:  TWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINW

Query:  DVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYE
        DVV W +LL+AC VH+N D GK  AE LL ++PE+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++ VHVF  ED  HPE + IY 
Subjt:  DVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYE

Query:  KVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCS
         ++ +  +I+ +GYVPD A VLHD+E+E K   L +HSEKLA+A+GL+ TP    +R++KNLR+C+DCHTA+K ISK+  R IIVRD  RFHHF+DG CS
Subjt:  KVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCS

Query:  CGDYW
        C DYW
Subjt:  CGDYW

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170 (e value: 6.3e-142; identity: 37.48%)
Query:  LLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNE
        +L  A    +L  G+ +H   +       +  +   NSLIN+Y K  +   AR +FD+M +R+++SW++++AG  QNG   E   LF ++ ++  + P++
Subjt:  LLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNE

Query:  YVISTIISSCCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEW
        Y +++++ +       +   KQ H +A+K       +V  ALI  YS+   ++ A +IL     +D+  +N ++ G  +     + +++  LM  +G   
Subjt:  YVISTIISSCCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEW

Query:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIP
        ++ T  T+F+ C  L  +  GKQVHA  +K+  D D+++ S I+DMY KCG++ + +  FD +   + V+WT +++   +NG  E A ++F +M +  + 
Subjt:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIP

Query:  PNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRE
        P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  ++  +   WNA++ G + HG GKE L +F+ M +   
Subjt:  PNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRE

Query:  CPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ
         P+ VTFIGVLSAC+H GLV E + +   +   +GI P +EHY+C+   L R+G + +AEN + S+ +      +RTLL AC V  + + GK++A  LL+
Subjt:  CPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  MDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK
        ++P D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N +H+F  +D  + ++  IY KV+D++  I+  GYVP+    L D+E+E+K
Subjt:  MDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK

Query:  LDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
           L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG CSCGDYW
Subjt:  LDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW

Q9SVP7 Pentatricopeptide repeat-containing protein At4g13650 | e-value: 1.3e-142 | % identity: 37.83
Query:  SLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYV
        +L+NLY KC  +  A   F      NVV W+ ++  Y    +    F +F++M + ++I PN+Y   +I+ +C        G+Q H   +K+  +L+ YV
Subjt:  SLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYV

Query:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIY
         + LI MY+K   +  A  IL    G D+  +  ++ G  ++    +A+   + M+D GI  +          CA L+ LK G+Q+HAQ   +    D+ 
Subjt:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIY

Query:  IGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNV
          ++++ +Y +CG +      F+Q ++ + ++W A+++ + Q+G  EEAL +F +M  + I  N +T    + +A+  + +  G Q+HA   K+G     
Subjt:  IGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNV

Query:  IVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAP
         V NALI MY+K G I  A++ F  +   + ++WNAII  +S HG G EAL  F  M+ +   PN+VT +GVLSAC+H+GLVD+G  YF  +  ++G++P
Subjt:  IVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAP

Query:  GLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK
          EHY C+V +L+R+G L  A+ F++ + I  D + WRTLL+AC VH+N + G+  A +LL+++PED  TY+LLSN++A  ++WD     R+ M+E+ VK
Subjt:  GLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK

Query:  KEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDC
        KEPG SW+E++N +H F   D  HP +  I+E  +DL  +   +GYV D   +L++++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DC
Subjt:  KEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDC

Query:  HTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
        H  +K +SKV+NR IIVRDA RFHHFE G CSC DYW
Subjt:  HTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g27610 | e-value: 8.4e-139 | % identity: 38.2
Query:  GRIIHAH-LILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCD
        GR +  H +++ N   +   V+  NSLINLY+KC  V  AR +FD    ++VV+W+++++GY  NG   E  G+F  M + + +  +E   +++I  C +
Subjt:  GRIIHAH-LILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCD

Query:  SQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLC
         +     +Q H   +K G    Q ++ AL+  YSKC+ +  AL++   +    ++  +  +++G L++    EA+++   M  +G+  N  TY  I    
Subjt:  SQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLC

Query:  ASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNS
          +       +VHAQ++K N +R   +G++++D Y K G V      F  +  +++V+W+A++A Y Q G  E A+ +F ++    I PNE+T + +LN 
Subjt:  ASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNS

Query:  AAGLSA-LSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVL
         A  +A +  G Q H  A KS L  ++ V +AL+ MY+K G+I +A+ VF   +  D ++WN++I+G++ HG   +AL +F++M   +   + VTFIGV 
Subjt:  AAGLSA-LSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVL

Query:  SACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYIL
        +AC H GLV+EG  YF  +++   IAP  EH +C+V L SR+G+L++A   + ++        WRT+L AC VH+  + G+  AE ++ M PED   Y+L
Subjt:  SACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYIL

Query:  LSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKL
        LSNM+A    W    K+RKLM ERNVKKEPG SW+E++N  + F + D  HP   +IY K+ DL ++++ LGY PD + VL DI+DE K   L+ HSE+L
Subjt:  LSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKL

Query:  AVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-EDGCCSCGDYW
        A+A+GL+ TP G+P+ +IKNLR+C DCH  +KLI+K+  R I+VRD+NRFHHF  DG CSCGD+W
Subjt:  AVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-EDGCCSCGDYW

Arabidopsis top hits | e-value | % identity | Alignment
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein | e-value: 1.2e-140 | % identity: 36.74
Query:  NSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQY
        N++++ Y K   +    + FD +P+R+ VSW+ ++ GY   G   +   +   M VK+ I P ++ ++ +++S   ++    GK+ H + +K GL  +  
Subjt:  NSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQY

Query:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLNGLLEHTHMGEAIEVL-KLMIDEGIEWNNATY
        V N+L+ MY+KC D   A  + D                                +   DI  +N +++G  +  +   A+++  K++ D  +  +  T 
Subjt:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLNGLLEHTHMGEAIEVL-KLMIDEGIEWNNATY

Query:  VTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT
         ++   CA+L+ L +GKQ+H+ ++    D    + +++I MY +CG V + R   +Q                                 L+ R+VV+WT
Subjt:  VTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT

Query:  AIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYC--DSI
        A++  Y Q+G + EA+NLF  M      PN YTLA +L+ A+ L++LSHG Q+H  A KSG   +V V NALI MY+K+G+I +A R F  ++ C  D++
Subjt:  AIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYC--DSI

Query:  TWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINW
        +W ++I   + HG  +EAL +F+ ML     P+++T++GV SAC H GLV++G  YF  +     I P L HY C+V L  R+G L EA+ F+  + I  
Subjt:  TWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINW

Query:  DVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYE
        DVV W +LL+AC VH+N D GK  AE LL ++PE+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++ VHVF  ED  HPE + IY 
Subjt:  DVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYE

Query:  KVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCS
         ++ +  +I+ +GYVPD A VLHD+E+E K   L +HSEKLA+A+GL+ TP    +R++KNLR+C+DCHTA+K ISK+  R IIVRD  RFHHF+DG CS
Subjt:  KVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCS

Query:  CGDYW
        C DYW
Subjt:  CGDYW

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 6.0e-140 | % identity: 38.2
Query:  GRIIHAH-LILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCD
        GR +  H +++ N   +   V+  NSLINLY+KC  V  AR +FD    ++VV+W+++++GY  NG   E  G+F  M + + +  +E   +++I  C +
Subjt:  GRIIHAH-LILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCD

Query:  SQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLC
         +     +Q H   +K G    Q ++ AL+  YSKC+ +  AL++   +    ++  +  +++G L++    EA+++   M  +G+  N  TY  I    
Subjt:  SQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLC

Query:  ASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNS
          +       +VHAQ++K N +R   +G++++D Y K G V      F  +  +++V+W+A++A Y Q G  E A+ +F ++    I PNE+T + +LN 
Subjt:  ASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNS

Query:  AAGLSA-LSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVL
         A  +A +  G Q H  A KS L  ++ V +AL+ MY+K G+I +A+ VF   +  D ++WN++I+G++ HG   +AL +F++M   +   + VTFIGV 
Subjt:  AAGLSA-LSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVL

Query:  SACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYIL
        +AC H GLV+EG  YF  +++   IAP  EH +C+V L SR+G+L++A   + ++        WRT+L AC VH+  + G+  AE ++ M PED   Y+L
Subjt:  SACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYIL

Query:  LSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKL
        LSNM+A    W    K+RKLM ERNVKKEPG SW+E++N  + F + D  HP   +IY K+ DL ++++ LGY PD + VL DI+DE K   L+ HSE+L
Subjt:  LSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKL

Query:  AVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-EDGCCSCGDYW
        A+A+GL+ TP G+P+ +IKNLR+C DCH  +KLI+K+  R I+VRD+NRFHHF  DG CSCGD+W
Subjt:  AVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-EDGCCSCGDYW

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 8.9e-144 | % identity: 37.83
Query:  SLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYV
        +L+NLY KC  +  A   F      NVV W+ ++  Y    +    F +F++M + ++I PN+Y   +I+ +C        G+Q H   +K+  +L+ YV
Subjt:  SLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYV

Query:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIY
         + LI MY+K   +  A  IL    G D+  +  ++ G  ++    +A+   + M+D GI  +          CA L+ LK G+Q+HAQ   +    D+ 
Subjt:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIY

Query:  IGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNV
          ++++ +Y +CG +      F+Q ++ + ++W A+++ + Q+G  EEAL +F +M  + I  N +T    + +A+  + +  G Q+HA   K+G     
Subjt:  IGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNV

Query:  IVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAP
         V NALI MY+K G I  A++ F  +   + ++WNAII  +S HG G EAL  F  M+ +   PN+VT +GVLSAC+H+GLVD+G  YF  +  ++G++P
Subjt:  IVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAP

Query:  GLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK
          EHY C+V +L+R+G L  A+ F++ + I  D + WRTLL+AC VH+N + G+  A +LL+++PED  TY+LLSN++A  ++WD     R+ M+E+ VK
Subjt:  GLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK

Query:  KEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDC
        KEPG SW+E++N +H F   D  HP +  I+E  +DL  +   +GYV D   +L++++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DC
Subjt:  KEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDC

Query:  HTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
        H  +K +SKV+NR IIVRDA RFHHFE G CSC DYW
Subjt:  HTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 4.4e-143 | % identity: 37.48
Query:  LLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNE
        +L  A    +L  G+ +H   +       +  +   NSLIN+Y K  +   AR +FD+M +R+++SW++++AG  QNG   E   LF ++ ++  + P++
Subjt:  LLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGNPSEVFGLFKKMVVKDDIFPNE

Query:  YVISTIISSCCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEW
        Y +++++ +       +   KQ H +A+K       +V  ALI  YS+   ++ A +IL     +D+  +N ++ G  +     + +++  LM  +G   
Subjt:  YVISTIISSCCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIEVLKLMIDEGIEW

Query:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIP
        ++ T  T+F+ C  L  +  GKQVHA  +K+  D D+++ S I+DMY KCG++ + +  FD +   + V+WT +++   +NG  E A ++F +M +  + 
Subjt:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDHIP

Query:  PNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRE
        P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  ++  +   WNA++ G + HG GKE L +F+ M +   
Subjt:  PNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRE

Query:  CPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ
         P+ VTFIGVLSAC+H GLV E + +   +   +GI P +EHY+C+   L R+G + +AEN + S+ +      +RTLL AC V  + + GK++A  LL+
Subjt:  CPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  MDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK
        ++P D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N +H+F  +D  + ++  IY KV+D++  I+  GYVP+    L D+E+E+K
Subjt:  MDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK

Query:  LDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
           L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG CSCGDYW
Subjt:  LDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW

AT5G39680.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 1.9e-223 | % identity: 52.66
Query:  KFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGN
        K    + KS     P+    +LLKV A++  L+ G  IHAHLI+TN + R     Q+NSLINLYVKC     AR++FD MP+RNVVSW A+M GY  +G 
Subjt:  KFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQNGN

Query:  PSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEH
          EV  LFK M    +  PNE+V + +  SC +S    EGKQ HG  LK GL  H++V+N L+ MYS CS    A+++LD +P  D+  ++  L+G LE 
Subjt:  PSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEH

Query:  THMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQ
            E ++VL+   +E   WNN TY++  RL ++L+DL L  QVH++M++   + ++    ++I+MYGKCG VL  +  FD   ++N+   T IM AYFQ
Subjt:  THMGEAIEVLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQ

Query:  NGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHS
        +  FEEALNLF KM+   +PPNEYT A+LLNS A LS L  GD LH    KSG + +V+VGNAL+ MY+KSG I  A++ FS M + D +TWN +I+G S
Subjt:  NGFFEEALNLFPKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHS

Query:  HHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLN
        HHGLG+EAL  F  M+ T E PN +TFIGVL AC+H+G V++G +YF+ LMK+F + P ++HYTCIVGLLS++G   +AE+FMR+  I WDVVAWRTLLN
Subjt:  HHGLGKEALSMFQDMLTTRECPNYVTFIGVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLN

Query:  ACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIR
        ACYV RNY  GK++AEY ++  P D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE + IY KV++++SKI+
Subjt:  ACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIR

Query:  PLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
        PLGY PD+AG  HD+++EQ+ DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  PLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
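The % identity values in the hit tables follow the usual BLAST convention: identical positions divided by alignment length. A minimal Python sketch of that calculation is given here, assuming the unlabelled middle row of each alignment block is the BLAST match line (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch). The function name `percent_identity` is hypothetical, and the two strings are only the last 15 columns of the AT5G39680.1 alignment, so the result describes that fragment rather than the whole hit.

```python
def percent_identity(query: str, match_line: str) -> float:
    """Percent identity over one aligned fragment.

    A column counts as identical when the match line repeats the
    query residue; '+' and spaces do not count.
    """
    if len(query) != len(match_line):
        raise ValueError("query and match line must have the same length")
    if not query:
        raise ValueError("empty alignment fragment")
    identical = sum(
        1 for q, m in zip(query, match_line) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Last 15 alignment columns of the AT5G39680.1 hit (query row, match row).
query_tail = "FHHFEDGCCSCGDYW"
match_tail = "FHHF DG CSC DYW"

print(f"{percent_identity(query_tail, match_tail):.1f}% identity over {len(query_tail)} columns")
```

Applying the same function to every block of a hit, weighted by block length, would approximate the tabulated value; gap columns, if present, would need to be counted in the alignment length as well.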


Sequences
CDS sequence
ATGTTAAAACTACCAATTAATGGCCTTGCTCCTGTGAAGTTCACGCCATTTCTATTCAAGTCCAATTACTTTGCTTCTCCTCTCCTGGACCCAACAAAGCTCTTGAAAGT
AGCTGCAGACGCCAAGAACTTAAAATTTGGTAGAATAATCCATGCCCATTTGATCCTTACCAATCACACCCCTAGAGAATGCAGAGTTAACCAAGTTAATTCCCTTATCA
ATTTGTACGTGAAATGTGATAGAGTATTTATTGCTCGGCAGATGTTTGATAGTATGCCTAAAAGAAATGTGGTATCTTGGAGTGCTTTAATGGCTGGGTACATGCAAAAT
GGGAATCCGTCGGAAGTTTTTGGGTTGTTCAAAAAGATGGTTGTGAAGGATGATATTTTCCCCAATGAATATGTGATTTCCACTATTATATCTTCTTGTTGTGATAGTCA
AATGTATGTAGAGGGAAAGCAATGTCATGGGTATGCGTTAAAGTCTGGATTGGAGCTTCATCAATATGTTAAGAATGCACTTATTCAGATGTACTCTAAATGTTCAGATG
TAAGAGCAGCATTGCAGATATTAGATACTGTGCCAGGTTATGACATATTTTGTTATAATTTGGTTTTAAATGGGCTTTTAGAGCACACACATATGGGAGAAGCTATAGAA
GTTCTGAAGTTAATGATTGATGAAGGGATAGAATGGAATAATGCCACTTACGTTACAATTTTTCGCCTTTGTGCCAGCCTTAAAGATTTAAAATTAGGTAAGCAAGTTCA
TGCTCAAATGTTGAAAAACAACATTGACCGTGACATCTATATTGGAAGTTCTATCATTGACATGTATGGGAAATGCGGTAATGTGTTGAGTGGAAGAGCCTTTTTTGATC
AGTTGCAAAGCCGAAACGTTGTTTCTTGGACGGCAATCATGGCAGCTTATTTTCAGAATGGATTCTTTGAAGAAGCATTGAATCTATTTCCAAAGATGGAAATTGATCAT
ATTCCTCCAAATGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCAGGTTTGTCTGCACTAAGCCATGGAGATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAA
AGGCAATGTTATAGTAGGGAATGCCTTGATCATTATGTATTCCAAGAGTGGGGACATTTTAGCGGCACAGCGTGTGTTCTCAAATATGAAGTATTGTGATTCCATTACCT
GGAATGCAATAATAACTGGCCACTCCCACCATGGTCTGGGCAAGGAAGCTTTAAGCATGTTCCAGGACATGTTGACTACTAGAGAGTGTCCAAATTATGTTACCTTTATC
GGTGTTCTCTCTGCTTGTGCCCATTTAGGTCTGGTAGATGAAGGATTCTACTATTTTAGTCATCTGATGAAACAGTTTGGTATTGCTCCTGGATTGGAGCACTATACCTG
TATTGTTGGACTCTTAAGTAGATCTGGACGACTAGATGAAGCTGAGAATTTTATGAGGTCAATTCTAATTAATTGGGATGTTGTTGCCTGGCGCACCCTTCTCAATGCTT
GTTATGTTCATAGAAATTATGATAAAGGAAAGCAAATAGCAGAGTACTTGCTGCAGATGGATCCTGAGGATGTAGGGACTTATATTCTATTATCAAACATGCATGCACGA
GTTAGGAGGTGGGATGGTGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGTCCATGTTTT
TACATCTGAAGATACTAAACACCCTGAGTCCAGTCGGATTTATGAAAAGGTAAGGGACTTGTTATCTAAGATTCGACCATTGGGGTATGTTCCTGATATTGCTGGTGTAT
TGCATGATATCGAGGATGAGCAAAAGCTAGACAATCTTAGCTATCATAGTGAGAAGCTTGCTGTAGCATATGGCCTGATGAAGACACCATCAGGTGCACCAATCCGGGTG
ATTAAGAACCTTAGGATGTGCGATGATTGTCACACTGCTGTGAAACTTATTTCCAAAGTTGCAAATAGGGCTATAATTGTTAGAGATGCCAACCGTTTCCATCATTTTGA
AGACGGTTGTTGCTCATGTGGAGATTATTGGTGA
mRNA sequence
ATGTTAAAACTACCAATTAATGGCCTTGCTCCTGTGAAGTTCACGCCATTTCTATTCAAGTCCAATTACTTTGCTTCTCCTCTCCTGGACCCAACAAAGCTCTTGAAAGT
AGCTGCAGACGCCAAGAACTTAAAATTTGGTAGAATAATCCATGCCCATTTGATCCTTACCAATCACACCCCTAGAGAATGCAGAGTTAACCAAGTTAATTCCCTTATCA
ATTTGTACGTGAAATGTGATAGAGTATTTATTGCTCGGCAGATGTTTGATAGTATGCCTAAAAGAAATGTGGTATCTTGGAGTGCTTTAATGGCTGGGTACATGCAAAAT
GGGAATCCGTCGGAAGTTTTTGGGTTGTTCAAAAAGATGGTTGTGAAGGATGATATTTTCCCCAATGAATATGTGATTTCCACTATTATATCTTCTTGTTGTGATAGTCA
AATGTATGTAGAGGGAAAGCAATGTCATGGGTATGCGTTAAAGTCTGGATTGGAGCTTCATCAATATGTTAAGAATGCACTTATTCAGATGTACTCTAAATGTTCAGATG
TAAGAGCAGCATTGCAGATATTAGATACTGTGCCAGGTTATGACATATTTTGTTATAATTTGGTTTTAAATGGGCTTTTAGAGCACACACATATGGGAGAAGCTATAGAA
GTTCTGAAGTTAATGATTGATGAAGGGATAGAATGGAATAATGCCACTTACGTTACAATTTTTCGCCTTTGTGCCAGCCTTAAAGATTTAAAATTAGGTAAGCAAGTTCA
TGCTCAAATGTTGAAAAACAACATTGACCGTGACATCTATATTGGAAGTTCTATCATTGACATGTATGGGAAATGCGGTAATGTGTTGAGTGGAAGAGCCTTTTTTGATC
AGTTGCAAAGCCGAAACGTTGTTTCTTGGACGGCAATCATGGCAGCTTATTTTCAGAATGGATTCTTTGAAGAAGCATTGAATCTATTTCCAAAGATGGAAATTGATCAT
ATTCCTCCAAATGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCAGGTTTGTCTGCACTAAGCCATGGAGATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAA
AGGCAATGTTATAGTAGGGAATGCCTTGATCATTATGTATTCCAAGAGTGGGGACATTTTAGCGGCACAGCGTGTGTTCTCAAATATGAAGTATTGTGATTCCATTACCT
GGAATGCAATAATAACTGGCCACTCCCACCATGGTCTGGGCAAGGAAGCTTTAAGCATGTTCCAGGACATGTTGACTACTAGAGAGTGTCCAAATTATGTTACCTTTATC
GGTGTTCTCTCTGCTTGTGCCCATTTAGGTCTGGTAGATGAAGGATTCTACTATTTTAGTCATCTGATGAAACAGTTTGGTATTGCTCCTGGATTGGAGCACTATACCTG
TATTGTTGGACTCTTAAGTAGATCTGGACGACTAGATGAAGCTGAGAATTTTATGAGGTCAATTCTAATTAATTGGGATGTTGTTGCCTGGCGCACCCTTCTCAATGCTT
GTTATGTTCATAGAAATTATGATAAAGGAAAGCAAATAGCAGAGTACTTGCTGCAGATGGATCCTGAGGATGTAGGGACTTATATTCTATTATCAAACATGCATGCACGA
GTTAGGAGGTGGGATGGTGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGTCCATGTTTT
TACATCTGAAGATACTAAACACCCTGAGTCCAGTCGGATTTATGAAAAGGTAAGGGACTTGTTATCTAAGATTCGACCATTGGGGTATGTTCCTGATATTGCTGGTGTAT
TGCATGATATCGAGGATGAGCAAAAGCTAGACAATCTTAGCTATCATAGTGAGAAGCTTGCTGTAGCATATGGCCTGATGAAGACACCATCAGGTGCACCAATCCGGGTG
ATTAAGAACCTTAGGATGTGCGATGATTGTCACACTGCTGTGAAACTTATTTCCAAAGTTGCAAATAGGGCTATAATTGTTAGAGATGCCAACCGTTTCCATCATTTTGA
AGACGGTTGTTGCTCATGTGGAGATTATTGGTGA
Protein sequence
MLKLPINGLAPVKFTPFLFKSNYFASPLLDPTKLLKVAADAKNLKFGRIIHAHLILTNHTPRECRVNQVNSLINLYVKCDRVFIARQMFDSMPKRNVVSWSALMAGYMQN
GNPSEVFGLFKKMVVKDDIFPNEYVISTIISSCCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLNGLLEHTHMGEAIE
VLKLMIDEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKNNIDRDIYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFPKMEIDH
IPPNEYTLAVLLNSAAGLSALSHGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMKYCDSITWNAIITGHSHHGLGKEALSMFQDMLTTRECPNYVTFI
GVLSACAHLGLVDEGFYYFSHLMKQFGIAPGLEHYTCIVGLLSRSGRLDEAENFMRSILINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDPEDVGTYILLSNMHAR
VRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIVHVFTSEDTKHPESSRIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPSGAPIRV
IKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFEDGCCSCGDYW
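
The CDS and protein records above can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below does this for two fragments copied from the CDS record (the first 30 nt and the last 33 nt); the names `CODON_TABLE` and `translate` are hypothetical helpers, and the choice of the standard nuclear code is an assumption that fits a nuclear-encoded plant gene.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Fragments copied from the CDS record above (both in frame).
first_codons = "ATGTTAAAACTACCAATTAATGGCCTTGCT"
last_codons = "GACGGTTGTTGCTCATGTGGAGATTATTGGTGA"

print(translate(first_codons))  # start of the protein record: MLKLPINGLA
print(translate(last_codons))   # end of the protein record plus stop: DGCCSCGDYW*
```

The C-terminal translation ends in the residues D, Y, W, in line with the DYW domain annotation (IPR032867) listed for this gene.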