; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016106 (gene) of Snake gourd v1 genome

Gene IDTan0016106
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG07:41462146..41483549
RNA-Seq ExpressionTan0016106
SyntenyTan0016106
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588362.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.87Show/hide
Query:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN
        M MLKL  PIS LAPVK TPFL K N L SPLLDP+KLLKVAADAKNLKFGR IHAHLIITN +P DCRVNQINSLINLYVKCDELFIARQMFD M KRN
Subjt:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN

Query:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG
        VVSW ALMAGYM+N  PLEVF L K M VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG +LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG

Query:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
        YDIFCYNLVLNGLLEH+H+REAIEVLKLMIGE  +WNNAT+VTIFR+CA LKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
Subjt:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ

Query:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM
        +RNVVSWTAIMAAYFQNGFFEEALNLFS MEIDHI PNEYTLAVLLNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM

Query:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR
        KCCDSITWNAIITGHSHH +GKEAL++F DMLT RE PNYVTFIGVLSACAHLSLVDEG YYFNHLMK+FGIVPGLEHYTCIVGLLSRSGRLD+AENFMR
Subjt:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR

Query:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF
        S+ IYE VRDLL+KIRPLGY+PDIAGVLHDIEDEQKL+NLSYHSEKLAVAYGLMK+P GAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

XP_022144415.1 pentatricopeptide repeat-containing protein At5g39680 [Momordica charantia]0.0e+0089Show/hide
Query:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV
        MP LKLP SGL      PFLFK NY  SP  +PIKLLK+AADAKNLKFGRIIHAHLIITNH P DCRVNQINSLIN Y KCDEL +ARQMFD MPKRNVV
Subjt:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV

Query:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD
        SWSALMAGYM+N   LEVF LLK M V+D+I PNEYVIAT++SSCC SQMYVEGKQCHG ALKSGLELHQYVKNALIQMYSKCSDVRAA+QIL TVPGYD
Subjt:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD

Query:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        IFCYNLVLNGLLEH+H+REAIEVL LMIGE+IEWNNATYVTIFRLCA LKDL+LGKQVHAQML++DIDYDVYIGSSIIDMYGKCG VLSGR FFD+LQS+
Subjt:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC
        NVVSWT IMAAYFQNGFFEEALNLFS MEID I PNEYTLAV LNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQHVFSNM C
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN
        CDSITWNAIITGHSHHGLGKEALSMFQDML   E PNYVTFIGVLSACAHLSLV EGFYYFNHLMK+FGIVPGLEHYTCI+GLLSRSG+LD+AENFMRSN
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN

Query:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN
        PINWDVVAWRTLL ACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+
Subjt:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN

Query:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD
         IYEKVRDLLS+I+PLGYVPDIAGVLHDI+DEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANR IIVRDANRFHHF+D
Subjt:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD

Query:  GCCSCGDYW
        GCCSCGDYW
Subjt:  GCCSCGDYW

XP_022933883.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita moschata]0.0e+0090.01Show/hide
Query:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN
        M MLKL  PIS LAPVK TPFL K N L SPLLDP+KLLKVAADAKNLKFGR IHAHLIITN +P DCRVNQINSLINLYVKCDELFIARQMFD M KRN
Subjt:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN

Query:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG
        VVSW ALMAGYM+N  PLEVF L K M VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG +LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG

Query:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
        YD+FCYNLVLNGLLEH+H+REAIEVLKLMIGE  +WNNAT+VTIFR+CA LKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
Subjt:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ

Query:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM
        +RNVVSWTAIMAAYFQNGFFEEALNLFS MEIDHI PNEYTLAVLLNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM

Query:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR
        KCCDSITWNAIITGHSHH +GKEAL++F DMLT RE PNYVTFIGVLSACAHLSLVDEG YYFNHLMK+FGIVPGLEHYTCIVGLLSRSGRLD+AENFMR
Subjt:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR

Query:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF
        S+ IYE VRDLL+KIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMK P GAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

XP_023002421.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima]0.0e+0089.59Show/hide
Query:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN
        M MLKL  PIS LAPVK TPFL K N L SPLLDP+KLLKVAADAKNLKFGR+IHAHL+ITN IPRDCRVNQINSLINLYVKCDELFIARQMFD M KRN
Subjt:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN

Query:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG
        VVSW ALMAGYM+N  PL+VF L K M VKDNIFPNEYVIATVISSC DSQMYVEGKQCHG +LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG

Query:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
        YD+FCYNLVLNGLLEH+H+ EAIEVLKLMI E  +WNNAT+VTIFR+CA LKDLKLGK VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
Subjt:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ

Query:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM
        +RNVVSWTAIMAAYFQNGFFEEALNLFS MEIDHI PNEYTLAVLLNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM

Query:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR
        KCCDSITWNAIITGHSHH +GKEAL++F DMLT RE PNYVTFIGVLSACAHLSLVDEG YYFNHLMK+ GIVPGLEHYTCIVGLLSRSGRLD+AENFMR
Subjt:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR

Query:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF
        S+ IYE +RDLL+KIRPLGYVPDIAGVLHDIEDEQK+DNLSYHSEKLAVAYGLMK+P GAPIRVIKNLRMCDDCHTA+KLISKVANR IIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

XP_023529544.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita pepo subsp. pepo]0.0e+0089.45Show/hide
Query:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN
        M MLKL  PIS LAPVK TPFL K N L SPLLDP+KLLKVAADAKNLKFGR IHAHLIITN +P DCRVNQINSLINLYVKCDELFIARQMFD M KRN
Subjt:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN

Query:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG
        VVSW ALMAGYM+N  PL VF L K M VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG +LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG

Query:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
        YD+FCYNLVLNGLLEH+H+REAIEVLKLMI E  +WNNAT+VTIFR+CA LKDL+ GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
Subjt:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ

Query:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM
        +RNVVSWTAIMAAYFQNGFFEEALNLFS MEIDHI PNEYTLAVLLNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM

Query:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR
         CCDSITWNAIITGHSHH +GKEAL++F DMLT RE PNYVTFIGVLSACAHLSLVDEG YYFNHLMK+FGIVPGLEHYTCIVGLLSRSGRLD+AENFMR
Subjt:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR

Query:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF
        S+ IYE VRDLL+KIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMK+P GAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KR26 DYW_deaminase domain-containing protein0.0e+0084.49Show/hide
Query:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV
        M +LKLPI+ + PVK TPFL + N+L SP  DPIKLLKVAADAKNLKFGR IHAHL ITNH  RD +VNQ+NSLINLYVKCDE+ IAR++FDSMP+RNVV
Subjt:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV

Query:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD
        SWSALMAGYM+N  PLEVF L K M VKDNIFPNEYVIAT ISS CDSQMYVEGKQCHG ALKSGLE HQYVKNALIQ+YSKCSDV AA+QIL TVPG D
Subjt:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD

Query:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        IFCYNLV+NGLL+HTHM EA++VLKL+I E IEWNNATYVTIFRLCA LKD+ LGKQVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSGR FFD+LQSR
Subjt:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC
        NVVSWT+I+AAYFQN FFEEALNLFS MEID I PNEYT+AVL NSAAGLSAL  G+QLHARAEKSGLKGNVMVGNALIIMY KSGDILAAQ VFSNM C
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN
        C+ ITWNAIITGHSHHGLGKEALSMFQDM+   E PNYVTFIGV+ ACAHL LVDEGFYYFNHLMK+F IVPGLEHYTCIVGLLSRSGRLD+AENFMRS+
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN

Query:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN
         INWDVV+WRTLLNACYVH++YDKG++IAEYLLQ++  DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE+N
Subjt:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN

Query:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD
        LIYE V+DLLSKIRPLGYVPDI  VLHDIEDEQK+DNLSYHSEKLAVAYGLMKTP GAPI VIKNLRMCDDCHTA+KLISKVANR I+VRDANRFHHFQ+
Subjt:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD

Query:  GCCSCGDYW
        GCCSCGDYW
Subjt:  GCCSCGDYW

A0A1S4E243 pentatricopeptide repeat-containing protein At5g39680 isoform X20.0e+0083.22Show/hide
Query:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV
        M +LKLPIS + PVK TPFL + ++  SP  DPIKLLKVAADAKNL FGR I AHL ITNH  RD +VNQ+NSLINLYVKC E+ IAR++FDSMP+RNVV
Subjt:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV

Query:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD
        SWS LMAGYM+N  P EVF L K M +KDNI PN+YVIATVISS C+SQMYVEGKQCHG ALKSGLE HQYVKNALIQ+YSKCSDV AA+QIL TVPG D
Subjt:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD

Query:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        IFCYNLV+NGLL+HTHMREA++VLKL+I + IEWN+ATYVTIFRLCA LKD+ LGKQVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSGR FFD+LQSR
Subjt:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC
        NVVSWT+IMAAYFQN FFEEAL+LFS MEID I PNEYT+AVL NSAAGLSAL  G+QLHARAEKSGLKGNVMVGNALIIMY KSGDILAAQ VFSNM C
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN
        CD ITWNAIITGHSHHGLGKEALSMFQDM+T  E PNYVTFIGV+SACAHL LVDEGFYYFNHLMK+FGIVPGLEHYTCIVGLLSRSGRLD+AENFMRS+
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN

Query:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN
         INWDVV+WRTLLNACYVH++YDKGKQIAEYLLQ++  DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHP++N
Subjt:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN

Query:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD
        LIYE V++LLSKIRPLGYVPDI  VLHDIEDEQK++NLSYHSEKLAVAYGLMKT  G PIRVIKNLRMCDDCHTA+KLIS+VANR IIVRD NRFHHFQ+
Subjt:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD

Query:  GCCSCGDYW
        GCCSCGDYW
Subjt:  GCCSCGDYW

A0A6J1CRJ5 pentatricopeptide repeat-containing protein At5g396800.0e+0089Show/hide
Query:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV
        MP LKLP SGL      PFLFK NY  SP  +PIKLLK+AADAKNLKFGRIIHAHLIITNH P DCRVNQINSLIN Y KCDEL +ARQMFD MPKRNVV
Subjt:  MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVV

Query:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD
        SWSALMAGYM+N   LEVF LLK M V+D+I PNEYVIAT++SSCC SQMYVEGKQCHG ALKSGLELHQYVKNALIQMYSKCSDVRAA+QIL TVPGYD
Subjt:  SWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYD

Query:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR
        IFCYNLVLNGLLEH+H+REAIEVL LMIGE+IEWNNATYVTIFRLCA LKDL+LGKQVHAQML++DIDYDVYIGSSIIDMYGKCG VLSGR FFD+LQS+
Subjt:  IFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSR

Query:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC
        NVVSWT IMAAYFQNGFFEEALNLFS MEID I PNEYTLAV LNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQHVFSNM C
Subjt:  NVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC

Query:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN
        CDSITWNAIITGHSHHGLGKEALSMFQDML   E PNYVTFIGVLSACAHLSLV EGFYYFNHLMK+FGIVPGLEHYTCI+GLLSRSG+LD+AENFMRSN
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSN

Query:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN
        PINWDVVAWRTLL ACYVHRNYDKGKQIAEYLLQMD EDVG+YILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+
Subjt:  PINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN

Query:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD
         IYEKVRDLLS+I+PLGYVPDIAGVLHDI+DEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANR IIVRDANRFHHF+D
Subjt:  LIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQD

Query:  GCCSCGDYW
        GCCSCGDYW
Subjt:  GCCSCGDYW

A0A6J1F637 pentatricopeptide repeat-containing protein At5g396800.0e+0090.01Show/hide
Query:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN
        M MLKL  PIS LAPVK TPFL K N L SPLLDP+KLLKVAADAKNLKFGR IHAHLIITN +P DCRVNQINSLINLYVKCDELFIARQMFD M KRN
Subjt:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN

Query:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG
        VVSW ALMAGYM+N  PLEVF L K M VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG +LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG

Query:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
        YD+FCYNLVLNGLLEH+H+REAIEVLKLMIGE  +WNNAT+VTIFR+CA LKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
Subjt:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ

Query:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM
        +RNVVSWTAIMAAYFQNGFFEEALNLFS MEIDHI PNEYTLAVLLNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM

Query:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR
        KCCDSITWNAIITGHSHH +GKEAL++F DMLT RE PNYVTFIGVLSACAHLSLVDEG YYFNHLMK+FGIVPGLEHYTCIVGLLSRSGRLD+AENFMR
Subjt:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR

Query:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF
        S+ IYE VRDLL+KIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMK P GAPIRVIKNLRMCDDCHTA+KLISK+ANR IIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

A0A6J1KL92 pentatricopeptide repeat-containing protein At5g396800.0e+0089.59Show/hide
Query:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN
        M MLKL  PIS LAPVK TPFL K N L SPLLDP+KLLKVAADAKNLKFGR+IHAHL+ITN IPRDCRVNQINSLINLYVKCDELFIARQMFD M KRN
Subjt:  MPMLKL--PISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRN

Query:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG
        VVSW ALMAGYM+N  PL+VF L K M VKDNIFPNEYVIATVISSC DSQMYVEGKQCHG +LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPG

Query:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
        YD+FCYNLVLNGLLEH+H+ EAIEVLKLMI E  +WNNAT+VTIFR+CA LKDLKLGK VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSGRAFFDQLQ
Subjt:  YDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ

Query:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM
        +RNVVSWTAIMAAYFQNGFFEEALNLFS MEIDHI PNEYTLAVLLNSAAGLSALSHG+QLHARAEKSGLKGNV+VGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNM

Query:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR
        KCCDSITWNAIITGHSHH +GKEAL++F DMLT RE PNYVTFIGVLSACAHLSLVDEG YYFNHLMK+ GIVPGLEHYTCIVGLLSRSGRLD+AENFMR
Subjt:  KCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMR

Query:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF
        S+ IYE +RDLL+KIRPLGYVPDIAGVLHDIEDEQK+DNLSYHSEKLAVAYGLMK+P GAPIRVIKNLRMCDDCHTA+KLISKVANR IIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

SwissProt top hitse value%identityAlignment
Q9FK93 Pentatricopeptide repeat-containing protein At5g396809.2e-21853.93Show/hide
Query:  KLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPN
        +LLKV A++  L+ G  IHAHLI+TN   R     QINSLINLYVKC E   AR++FD MP+RNVVSW A+M GY  +    EV  L K+M       PN
Subjt:  KLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPN

Query:  EYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEW
        E+V   V  SC +S    EGKQ HG  LK GL  H++V+N L+ MYS CS    A+++L  +P  D+  ++  L+G LE    +E ++VL+    ED  W
Subjt:  EYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEW

Query:  NNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHIS
        NN TY++  RL + L+DL L  QVH++M++   + +V    ++I+MYGKCG VL  +  FD   ++N+   T IM AYFQ+  FEEALNLFS M+   + 
Subjt:  NNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHIS

Query:  PNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVRE
        PNEYT A+LLNS A LS L  G+ LH    KSG + +VMVGNAL+ MY+KSG I  A+  FS M   D +TWN +I+G SHHGLG+EAL  F  M+   E
Subjt:  PNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVRE

Query:  HPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ
         PN +TFIGVL AC+H+  V++G +YFN LMK+F + P ++HYTCIVGLLS++G   DAE+FMR+ PI WDVVAWRTLLNACYV RNY  GK++AEY ++
Subjt:  HPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  MDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK
            D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE  LIY KV++++SKI+PLGY PD+AG  HD+++EQ+
Subjt:  MDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK

Query:  LDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW
         DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  LDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220707.2e-13836.08Show/hide
Query:  NSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQY
        N++++ Y K  ++    + FD +P+R+ VSW+ ++ GY    +  +   ++ +M VK+ I P ++ +  V++S   ++    GK+ H   +K GL  +  
Subjt:  NSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQY

Query:  VKNALIQMYSKCSD-------------------------------VRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGED-IEWNNATY
        V N+L+ MY+KC D                               +  A+     +   DI  +N +++G  +  +   A+++   M+ +  +  +  T 
Subjt:  VKNALIQMYSKCSD-------------------------------VRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGED-IEWNNATY

Query:  VTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT
         ++   CA L+ L +GKQ+H+ ++ +  D    + +++I MY +CG V + R   +Q                                 L+ R+VV+WT
Subjt:  VTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT

Query:  AIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC-CDSIT
        A++  Y Q+G + EA+NLF +M      PN YTLA +L+ A+ L++LSHG Q+H  A KSG   +V V NALI MY+K+G+I +A   F  ++C  D+++
Subjt:  AIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC-CDSIT

Query:  WNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWD
        W ++I   + HG  +EAL +F+ ML     P+++T++GV SAC H  LV++G  YF+ +     I+P L HY C+V L  R+G L +A+ F+   PI  D
Subjt:  WNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWD

Query:  VVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK
        VV W +LL+AC VH+N D GK  AE LL ++ E+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++  HVF  ED  HPE N IY  
Subjt:  VVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK

Query:  VRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSC
        ++ +  +I+ +GYVPD A VLHD+E+E K   L +HSEKLA+A+GL+ TP    +R++KNLR+C+DCHTA+K ISK+  R IIVRD  RFHHF+DG CSC
Subjt:  VRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSC

Query:  GDYW
         DYW
Subjt:  GDYW

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g331701.9e-13837.37Show/hide
Query:  IKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFP
        I +L  A    +L  G+ +H   +    +  D  +   NSLIN+Y K  +   AR +FD+M +R+++SW++++AG  +N   +E   L   + ++  + P
Subjt:  IKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFP

Query:  NEYVIATVISSCCDSQMYVE-GKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDI
        ++Y + +V+ +       +   KQ H  A+K       +V  ALI  YS+   ++ A +IL     +D+  +N ++ G  +     + +++  LM  +  
Subjt:  NEYVIATVISSCCDSQMYVE-GKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDI

Query:  EWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDH
          ++ T  T+F+ C  L  +  GKQVHA  +KS  D D+++ S I+DMY KCG++ + +  FD +   + V+WT +++   +NG  E A ++FS M +  
Subjt:  EWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDH

Query:  ISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTV
        + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  ++  +   WNA++ G + HG GKE L +F+ M ++
Subjt:  ISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTV

Query:  REHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYL
           P+ VTFIGVLSAC+H  LV E + +   +   +GI P +EHY+C+   L R+G +  AEN + S  +      +RTLL AC V  + + GK++A  L
Subjt:  REHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYL

Query:  LQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDE
        L+++  D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++ LIY KV+D++  I+  GYVP+    L D+E+E
Subjt:  LQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDE

Query:  QKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW
        +K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG CSCGDYW
Subjt:  QKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136507.2e-13836.22Show/hide
Query:  LFKPNYL--TSPLLDPIKLLKVAADAKNLKF-GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPL
        LFK  +L    P  + +  L VA  A    F G+ +HA+   T  +          +L+NLY KC ++  A   F      NVV W+ ++  Y   +   
Subjt:  LFKPNYL--TSPLLDPIKLLKVAADAKNLKF-GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPL

Query:  EVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTH
          F + + M +++ I PN+Y   +++ +C        G+Q H   +K+  +L+ YV + LI MY+K   +  A  IL+   G D+  +  ++ G  ++  
Subjt:  EVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTH

Query:  MREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNG
          +A+   + M+   I  +          CAGL+ LK G+Q+HAQ   S    D+   ++++ +Y +CG +      F+Q ++ + ++W A+++ + Q+G
Subjt:  MREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNG

Query:  FFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHH
          EEAL +F  M  + I  N +T    + +A+  + +  G Q+HA   K+G      V NALI MY+K G I  A+  F  +   + ++WNAII  +S H
Subjt:  FFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHH

Query:  GLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNAC
        G G EAL  F  M+     PN+VT +GVLSAC+H+ LVD+G  YF  +   +G+ P  EHY C+V +L+R+G L  A+ F++  PI  D + WRTLL+AC
Subjt:  GLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNAC

Query:  YVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPL
         VH+N + G+  A +LL+++ ED  TY+LLSN++A  ++WD     R+ M+E+ VKKEPG SW+E++N  H F   D  HP ++ I+E  +DL  +   +
Subjt:  YVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPL

Query:  GYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW
        GYV D   +L++++ EQK   +  HSEKLA+++GL+  P   PI V+KNLR+C+DCH  +K +SKV+NR IIVRDA RFHHF+ G CSC DYW
Subjt:  GYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276103.7e-13437.35Show/hide
Query:  GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDS
        GR +  H ++  +   D  +   NSLINLY+KC  +  AR +FD    ++VV+W+++++GY  N   LE  G+  +M + + +  +E   A+VI  C + 
Subjt:  GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDS

Query:  QMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGY-DIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCA
        +     +Q H   +K G    Q ++ AL+  YSKC+ +  AL++   +    ++  +  +++G L++    EA+++   M  + +  N  TY  I     
Subjt:  QMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGY-DIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCA

Query:  GLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSA
         +       +VHAQ++K++ +    +G++++D Y K G V      F  +  +++V+W+A++A Y Q G  E A+ +F  +    I PNE+T + +LN  
Subjt:  GLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSA

Query:  AGLSA-LSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLS
        A  +A +  G Q H  A KS L  ++ V +AL+ MY+K G+I +A+ VF   +  D ++WN++I+G++ HG   +AL +F++M   +   + VTFIGV +
Subjt:  AGLSA-LSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLS

Query:  ACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILL
        AC H  LV+EG  YF+ +++   I P  EH +C+V L SR+G+L+ A   + + P       WRT+L AC VH+  + G+  AE ++ M  ED   Y+LL
Subjt:  ACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILL

Query:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLA
        SNM+A    W    K+RKLM ERNVKKEPG SW+E++N  + F + D  HP  + IY K+ DL ++++ LGY PD + VL DI+DE K   L+ HSE+LA
Subjt:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLA

Query:  VAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-QDGCCSCGDYW
        +A+GL+ TP G+P+ +IKNLR+C DCH  +KLI+K+  R I+VRD+NRFHHF  DG CSCGD+W
Subjt:  VAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-QDGCCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein5.1e-13936.08Show/hide
Query:  NSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQY
        N++++ Y K  ++    + FD +P+R+ VSW+ ++ GY    +  +   ++ +M VK+ I P ++ +  V++S   ++    GK+ H   +K GL  +  
Subjt:  NSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQY

Query:  VKNALIQMYSKCSD-------------------------------VRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGED-IEWNNATY
        V N+L+ MY+KC D                               +  A+     +   DI  +N +++G  +  +   A+++   M+ +  +  +  T 
Subjt:  VKNALIQMYSKCSD-------------------------------VRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGED-IEWNNATY

Query:  VTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT
         ++   CA L+ L +GKQ+H+ ++ +  D    + +++I MY +CG V + R   +Q                                 L+ R+VV+WT
Subjt:  VTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQ---------------------------------LQSRNVVSWT

Query:  AIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC-CDSIT
        A++  Y Q+G + EA+NLF +M      PN YTLA +L+ A+ L++LSHG Q+H  A KSG   +V V NALI MY+K+G+I +A   F  ++C  D+++
Subjt:  AIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKC-CDSIT

Query:  WNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWD
        W ++I   + HG  +EAL +F+ ML     P+++T++GV SAC H  LV++G  YF+ +     I+P L HY C+V L  R+G L +A+ F+   PI  D
Subjt:  WNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWD

Query:  VVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK
        VV W +LL+AC VH+N D GK  AE LL ++ E+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++  HVF  ED  HPE N IY  
Subjt:  VVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK

Query:  VRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSC
        ++ +  +I+ +GYVPD A VLHD+E+E K   L +HSEKLA+A+GL+ TP    +R++KNLR+C+DCHTA+K ISK+  R IIVRD  RFHHF+DG CSC
Subjt:  VRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSC

Query:  GDYW
         DYW
Subjt:  GDYW

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-13537.35Show/hide
Query:  GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDS
        GR +  H ++  +   D  +   NSLINLY+KC  +  AR +FD    ++VV+W+++++GY  N   LE  G+  +M + + +  +E   A+VI  C + 
Subjt:  GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDS

Query:  QMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGY-DIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCA
        +     +Q H   +K G    Q ++ AL+  YSKC+ +  AL++   +    ++  +  +++G L++    EA+++   M  + +  N  TY  I     
Subjt:  QMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGY-DIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEWNNATYVTIFRLCA

Query:  GLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSA
         +       +VHAQ++K++ +    +G++++D Y K G V      F  +  +++V+W+A++A Y Q G  E A+ +F  +    I PNE+T + +LN  
Subjt:  GLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHISPNEYTLAVLLNSA

Query:  AGLSA-LSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLS
        A  +A +  G Q H  A KS L  ++ V +AL+ MY+K G+I +A+ VF   +  D ++WN++I+G++ HG   +AL +F++M   +   + VTFIGV +
Subjt:  AGLSA-LSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVTFIGVLS

Query:  ACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILL
        AC H  LV+EG  YF+ +++   I P  EH +C+V L SR+G+L+ A   + + P       WRT+L AC VH+  + G+  AE ++ M  ED   Y+LL
Subjt:  ACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILL

Query:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLA
        SNM+A    W    K+RKLM ERNVKKEPG SW+E++N  + F + D  HP  + IY K+ DL ++++ LGY PD + VL DI+DE K   L+ HSE+LA
Subjt:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLA

Query:  VAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-QDGCCSCGDYW
        +A+GL+ TP G+P+ +IKNLR+C DCH  +KLI+K+  R I+VRD+NRFHHF  DG CSCGD+W
Subjt:  VAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHF-QDGCCSCGDYW

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein5.1e-13936.22Show/hide
Query:  LFKPNYL--TSPLLDPIKLLKVAADAKNLKF-GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPL
        LFK  +L    P  + +  L VA  A    F G+ +HA+   T  +          +L+NLY KC ++  A   F      NVV W+ ++  Y   +   
Subjt:  LFKPNYL--TSPLLDPIKLLKVAADAKNLKF-GRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPL

Query:  EVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTH
          F + + M +++ I PN+Y   +++ +C        G+Q H   +K+  +L+ YV + LI MY+K   +  A  IL+   G D+  +  ++ G  ++  
Subjt:  EVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTH

Query:  MREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNG
          +A+   + M+   I  +          CAGL+ LK G+Q+HAQ   S    D+   ++++ +Y +CG +      F+Q ++ + ++W A+++ + Q+G
Subjt:  MREAIEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNG

Query:  FFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHH
          EEAL +F  M  + I  N +T    + +A+  + +  G Q+HA   K+G      V NALI MY+K G I  A+  F  +   + ++WNAII  +S H
Subjt:  FFEEALNLFSNMEIDHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHH

Query:  GLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNAC
        G G EAL  F  M+     PN+VT +GVLSAC+H+ LVD+G  YF  +   +G+ P  EHY C+V +L+R+G L  A+ F++  PI  D + WRTLL+AC
Subjt:  GLGKEALSMFQDMLTVREHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNAC

Query:  YVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPL
         VH+N + G+  A +LL+++ ED  TY+LLSN++A  ++WD     R+ M+E+ VKKEPG SW+E++N  H F   D  HP ++ I+E  +DL  +   +
Subjt:  YVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPL

Query:  GYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW
        GYV D   +L++++ EQK   +  HSEKLA+++GL+  P   PI V+KNLR+C+DCH  +K +SKV+NR IIVRDA RFHHF+ G CSC DYW
Subjt:  GYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-13937.37Show/hide
Query:  IKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFP
        I +L  A    +L  G+ +H   +    +  D  +   NSLIN+Y K  +   AR +FD+M +R+++SW++++AG  +N   +E   L   + ++  + P
Subjt:  IKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFP

Query:  NEYVIATVISSCCDSQMYVE-GKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDI
        ++Y + +V+ +       +   KQ H  A+K       +V  ALI  YS+   ++ A +IL     +D+  +N ++ G  +     + +++  LM  +  
Subjt:  NEYVIATVISSCCDSQMYVE-GKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDI

Query:  EWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDH
          ++ T  T+F+ C  L  +  GKQVHA  +KS  D D+++ S I+DMY KCG++ + +  FD +   + V+WT +++   +NG  E A ++FS M +  
Subjt:  EWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDH

Query:  ISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTV
        + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  ++  +   WNA++ G + HG GKE L +F+ M ++
Subjt:  ISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTV

Query:  REHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYL
           P+ VTFIGVLSAC+H  LV E + +   +   +GI P +EHY+C+   L R+G +  AEN + S  +      +RTLL AC V  + + GK++A  L
Subjt:  REHPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYL

Query:  LQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDE
        L+++  D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++ LIY KV+D++  I+  GYVP+    L D+E+E
Subjt:  LQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDE

Query:  QKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW
        +K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG CSCGDYW
Subjt:  QKLDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW

AT5G39680.1 Pentatricopeptide repeat (PPR) superfamily protein6.5e-21953.93Show/hide
Query:  KLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPN
        +LLKV A++  L+ G  IHAHLI+TN   R     QINSLINLYVKC E   AR++FD MP+RNVVSW A+M GY  +    EV  L K+M       PN
Subjt:  KLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYMKNERPLEVFGLLKNMPVKDNIFPN

Query:  EYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEW
        E+V   V  SC +S    EGKQ HG  LK GL  H++V+N L+ MYS CS    A+++L  +P  D+  ++  L+G LE    +E ++VL+    ED  W
Subjt:  EYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREAIEVLKLMIGEDIEW

Query:  NNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHIS
        NN TY++  RL + L+DL L  QVH++M++   + +V    ++I+MYGKCG VL  +  FD   ++N+   T IM AYFQ+  FEEALNLFS M+   + 
Subjt:  NNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEIDHIS

Query:  PNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVRE
        PNEYT A+LLNS A LS L  G+ LH    KSG + +VMVGNAL+ MY+KSG I  A+  FS M   D +TWN +I+G SHHGLG+EAL  F  M+   E
Subjt:  PNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVRE

Query:  HPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ
         PN +TFIGVL AC+H+  V++G +YFN LMK+F + P ++HYTCIVGLLS++G   DAE+FMR+ PI WDVVAWRTLLNACYV RNY  GK++AEY ++
Subjt:  HPNYVTFIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  MDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK
            D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE  LIY KV++++SKI+PLGY PD+AG  HD+++EQ+
Subjt:  MDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQK

Query:  LDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW
         DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  LDNLSYHSEKLAVAYGLMKTPLGAPIRVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTATGTTAAAACTACCAATTAGTGGCCTTGCTCCTGTGAAATCCACGCCATTTCTATTCAAGCCCAATTACTTGACTTCTCCTCTCCTTGACCCAATAAAGCTCTT
GAAAGTAGCTGCTGACGCCAAGAACTTAAAATTTGGTAGAATAATCCACGCCCATTTGATCATTACCAATCACATCCCCAGAGACTGCAGAGTAAACCAAATTAATTCCC
TTATTAATTTGTACGTGAAATGTGATGAACTATTCATTGCTCGGCAGATGTTTGATAGTATGCCTAAAAGGAATGTGGTATCTTGGAGCGCTTTAATGGCTGGGTACATG
AAAAATGAGAGGCCCTTAGAAGTTTTTGGGTTGTTAAAAAATATGCCTGTGAAGGATAATATTTTCCCCAATGAATATGTGATTGCCACTGTTATATCTTCTTGTTGTGA
TAGTCAAATGTATGTAGAGGGAAAGCAGTGTCATGGGCTTGCGTTAAAGTCTGGATTGGAGCTTCATCAATATGTCAAGAATGCACTTATTCAGATGTACTCTAAATGTT
CAGATGTAAGAGCAGCATTGCAGATACTAGTTACTGTGCCAGGTTATGACATATTTTGTTATAATTTGGTTCTAAATGGGCTTCTAGAGCACACACATATGAGAGAAGCT
ATAGAAGTTCTGAAGTTAATGATTGGTGAAGATATAGAATGGAATAATGCCACTTATGTTACAATTTTTCGCCTTTGTGCTGGTCTTAAAGATTTAAAATTAGGCAAGCA
AGTTCATGCTCAAATGTTGAAAAGCGATATTGACTATGATGTCTATATTGGAAGTTCTATCATTGATATGTATGGGAAATGCGGTAATGTGTTGAGTGGAAGAGCCTTTT
TTGATCAGTTGCAAAGCCGAAATGTTGTTTCTTGGACTGCAATCATGGCAGCTTATTTTCAGAATGGATTCTTTGAAGAAGCATTGAATCTGTTTTCAAACATGGAAATT
GATCATATTTCTCCCAATGAATATACGCTGGCAGTGTTGTTAAACTCTGCTGCAGGTTTGTCTGCACTAAGCCATGGAAATCAGTTACATGCACGTGCTGAGAAATCAGG
TCTCAAAGGCAACGTTATGGTAGGGAATGCCTTGATCATTATGTATTCCAAGAGTGGGGACATTTTAGCGGCACAACATGTGTTCTCAAATATGAAGTGTTGTGATTCGA
TTACCTGGAATGCAATAATAACCGGCCACTCCCACCATGGTCTTGGCAAGGAAGCTTTAAGCATGTTCCAGGACATGTTGACTGTTAGAGAGCATCCAAATTATGTAACC
TTTATTGGTGTTCTTTCTGCCTGTGCCCATTTAAGCCTGGTGGATGAAGGATTCTACTATTTTAATCATTTGATGAAACGGTTTGGTATTGTTCCTGGGTTGGAGCACTA
TACCTGTATTGTTGGACTCCTAAGTAGATCTGGACGACTTGATGATGCTGAGAATTTTATGAGGTCAAATCCAATCAATTGGGATGTTGTTGCCTGGCGCACCCTTCTCA
ATGCTTGTTATGTTCATAGGAATTATGATAAAGGGAAGCAAATAGCAGAGTACTTGCTACAGATGGACCATGAGGATGTAGGAACTTATATTCTATTATCAAACATGCAT
GCAAGAGTTCGGAGGTGGGATGGTGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGCCCA
TGTTTTTACATCTGAAGATACTAAACACCCCGAGTCCAATCTGATTTATGAAAAGGTAAGGGACTTATTATCTAAGATTCGACCATTAGGGTATGTTCCTGATATTGCTG
GTGTATTGCACGATATTGAGGATGAGCAAAAGCTAGATAATCTTAGCTATCATAGTGAAAAGCTTGCCGTAGCATATGGCCTGATGAAGACACCATTAGGTGCACCAATT
CGAGTGATCAAGAACCTTAGGATGTGCGATGATTGTCACACTGCTGTCAAACTTATTTCAAAGGTTGCAAATAGGGCTATAATTGTTAGAGATGCCAATCGTTTCCATCA
TTTTCAAGATGGTTGTTGCTCGTGCGGAGATTATTGGTGA
mRNA sequenceShow/hide mRNA sequence
CCTCAGGGTACCACTGTCGGAACTTTGTGTACAACTCTTTTTGTTGTCGTGCGAGTTTTGTTTTGCGATTGTAAATTTTACCATTATATTTTGCTTGCTAAATTGGCGAT
TTGGAATTACTGCTTTCCTCTGGTGTAGTTAATAGGATTGAGCTTGTTTTGAGGATGAGCGGATGGATTCTGTGAAAAATTGTTACATAAGCTTAAGAAGAGGAAATAGT
TCTGGTTCTTCCATTTTCTTCTCCTAGTATTCTTTGTTGGGTTTGCGTTCTAATTTGGAAATTCAAAGCATTGTAATGCCTATGTTAAAACTACCAATTAGTGGCCTTGC
TCCTGTGAAATCCACGCCATTTCTATTCAAGCCCAATTACTTGACTTCTCCTCTCCTTGACCCAATAAAGCTCTTGAAAGTAGCTGCTGACGCCAAGAACTTAAAATTTG
GTAGAATAATCCACGCCCATTTGATCATTACCAATCACATCCCCAGAGACTGCAGAGTAAACCAAATTAATTCCCTTATTAATTTGTACGTGAAATGTGATGAACTATTC
ATTGCTCGGCAGATGTTTGATAGTATGCCTAAAAGGAATGTGGTATCTTGGAGCGCTTTAATGGCTGGGTACATGAAAAATGAGAGGCCCTTAGAAGTTTTTGGGTTGTT
AAAAAATATGCCTGTGAAGGATAATATTTTCCCCAATGAATATGTGATTGCCACTGTTATATCTTCTTGTTGTGATAGTCAAATGTATGTAGAGGGAAAGCAGTGTCATG
GGCTTGCGTTAAAGTCTGGATTGGAGCTTCATCAATATGTCAAGAATGCACTTATTCAGATGTACTCTAAATGTTCAGATGTAAGAGCAGCATTGCAGATACTAGTTACT
GTGCCAGGTTATGACATATTTTGTTATAATTTGGTTCTAAATGGGCTTCTAGAGCACACACATATGAGAGAAGCTATAGAAGTTCTGAAGTTAATGATTGGTGAAGATAT
AGAATGGAATAATGCCACTTATGTTACAATTTTTCGCCTTTGTGCTGGTCTTAAAGATTTAAAATTAGGCAAGCAAGTTCATGCTCAAATGTTGAAAAGCGATATTGACT
ATGATGTCTATATTGGAAGTTCTATCATTGATATGTATGGGAAATGCGGTAATGTGTTGAGTGGAAGAGCCTTTTTTGATCAGTTGCAAAGCCGAAATGTTGTTTCTTGG
ACTGCAATCATGGCAGCTTATTTTCAGAATGGATTCTTTGAAGAAGCATTGAATCTGTTTTCAAACATGGAAATTGATCATATTTCTCCCAATGAATATACGCTGGCAGT
GTTGTTAAACTCTGCTGCAGGTTTGTCTGCACTAAGCCATGGAAATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAAAGGCAACGTTATGGTAGGGAATGCCTTGA
TCATTATGTATTCCAAGAGTGGGGACATTTTAGCGGCACAACATGTGTTCTCAAATATGAAGTGTTGTGATTCGATTACCTGGAATGCAATAATAACCGGCCACTCCCAC
CATGGTCTTGGCAAGGAAGCTTTAAGCATGTTCCAGGACATGTTGACTGTTAGAGAGCATCCAAATTATGTAACCTTTATTGGTGTTCTTTCTGCCTGTGCCCATTTAAG
CCTGGTGGATGAAGGATTCTACTATTTTAATCATTTGATGAAACGGTTTGGTATTGTTCCTGGGTTGGAGCACTATACCTGTATTGTTGGACTCCTAAGTAGATCTGGAC
GACTTGATGATGCTGAGAATTTTATGAGGTCAAATCCAATCAATTGGGATGTTGTTGCCTGGCGCACCCTTCTCAATGCTTGTTATGTTCATAGGAATTATGATAAAGGG
AAGCAAATAGCAGAGTACTTGCTACAGATGGACCATGAGGATGTAGGAACTTATATTCTATTATCAAACATGCATGCAAGAGTTCGGAGGTGGGATGGTGTTGTTAAGAT
TCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGCCCATGTTTTTACATCTGAAGATACTAAACACCCCGAGT
CCAATCTGATTTATGAAAAGGTAAGGGACTTATTATCTAAGATTCGACCATTAGGGTATGTTCCTGATATTGCTGGTGTATTGCACGATATTGAGGATGAGCAAAAGCTA
GATAATCTTAGCTATCATAGTGAAAAGCTTGCCGTAGCATATGGCCTGATGAAGACACCATTAGGTGCACCAATTCGAGTGATCAAGAACCTTAGGATGTGCGATGATTG
TCACACTGCTGTCAAACTTATTTCAAAGGTTGCAAATAGGGCTATAATTGTTAGAGATGCCAATCGTTTCCATCATTTTCAAGATGGTTGTTGCTCGTGCGGAGATTATT
GGTGACAATTTGGACGAGTTTCTCAATGGCTTGGATGTTTTGATGATAAAGAATTTCGATCTATGATTCTGTGGTGTAGCGGAGATATCAATATACACTTCTATTTCAGT
CTGGTATTAGGCACGCATGAAACAAGTTGATCCAACTAGGTTCACATGTTGGGCGTGGGAAGGGAAAAGAAGCTTGGAGCAAGTTAAGGGGAGCTTTTCTTGTAGTTCCT
AATTTCGTGAACTTCCTACAGCCACAACAGGCTTTGAGGAAGTAAACTTTTGTGGAGAGGGTGATTTTGGATGAATCTACAAAGGAGGTCCGGAAACAGGCTCCTACAAG
TCGGGTTCTTGTCTAGAGAATAACAGGGAAGATCAAGTGGAGATGTCCAAATGATCGTGTTGTGAAGATCAACGTCGTTTTACAAAAGTTTCAACGTCTCTAATTGGACA
ATCAAACTCAATGTCCAAATGACCCTCTCTAGGCTGATCTTCTCTAATGACTCGTATGAGGGGATGGTTGTGCTCCCACTCTGTTTCCTATAGGGTTTTGGAGCGAGATC
TGTTCAAGTCCCCTAAGATTGCCGGATTGTTGGAGTGGTCGAATTGATTAGGTGGTTGGATTGTCAAGGTGGACGCTGGATTTTTGGTTGTCGGTGGTCATCAGAGTTGT
TGTTGAAAAGGTCGTTGAGGCAGCCGTTAGTGGTGGCGAGTCGCCGGAGTTGTCGTTGTTTGTGGTCGCCGGAGTTGTCACTGGAGTTGTTGT
Protein sequenceShow/hide protein sequence
MPMLKLPISGLAPVKSTPFLFKPNYLTSPLLDPIKLLKVAADAKNLKFGRIIHAHLIITNHIPRDCRVNQINSLINLYVKCDELFIARQMFDSMPKRNVVSWSALMAGYM
KNERPLEVFGLLKNMPVKDNIFPNEYVIATVISSCCDSQMYVEGKQCHGLALKSGLELHQYVKNALIQMYSKCSDVRAALQILVTVPGYDIFCYNLVLNGLLEHTHMREA
IEVLKLMIGEDIEWNNATYVTIFRLCAGLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQSRNVVSWTAIMAAYFQNGFFEEALNLFSNMEI
DHISPNEYTLAVLLNSAAGLSALSHGNQLHARAEKSGLKGNVMVGNALIIMYSKSGDILAAQHVFSNMKCCDSITWNAIITGHSHHGLGKEALSMFQDMLTVREHPNYVT
FIGVLSACAHLSLVDEGFYYFNHLMKRFGIVPGLEHYTCIVGLLSRSGRLDDAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHEDVGTYILLSNMH
ARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKTPLGAPI
RVIKNLRMCDDCHTAVKLISKVANRAIIVRDANRFHHFQDGCCSCGDYW