CuGenDBv2

CaUC11G217560 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC11G217560
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Ciama_Chr11:31017814..31019940
RNA-Seq Expression: CaUC11G217560
Synteny: CaUC11G217560
Gene Ontology terms:
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR032867 - DYW domain


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6588362.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e-value 0.0e+00 | identity 87.34%
Query:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN
        M MLKL  PIS +APVKFTPFL + N  ASP  DP+KLLKVAADAKNLKFGR IHAHLII N    DC+VNQ+NSLINLYVKCDE+ IARQMFD M +RN
Subjt:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN

Query:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG
        VVSW ALMAGY QNGSPLEVFELFKKM+VKDNIFPNEYVIATVISSC DSQMYVEGRQCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG

Query:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ
        YDIFCYNLV++GLL+H+H+ EA+E+LKLMI EG +WNNAT+VTIFR+CASLKDLK G+ VHA+MLKSDID+DVYIGSSIIDMYGKCGNVLSG A FDQLQ
Subjt:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ

Query:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM
        +RNVVSWT++MAAYFQNG+FEEAL+LFSKMEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM
Subjt:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM

Query:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK
         CCDSITWNAIITGHSHH +GKEAL++F  ML  RECPNYVTFIGVLSACAHLSLVDEGLYY NHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+
Subjt:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK

Query:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SN INWDVV+WRTLLNACYVHRNYDKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF
        S+ IYE VRDLL+KIRPLGY+PDIA VLHDIEDEQK++NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISKLANRTIIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

XP_011655117.1 pentatricopeptide repeat-containing protein At5g39680 isoform X1 [Cucumis sativus] | e-value 0.0e+00 | identity 87.99%
Query:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV
        M +LKLPI+DI PVKFTPFL R NF ASPHQDPIKLLKVAADAKNLKFGR IHAHL I NH  RD KVNQLNSLINLYVKCDEV IAR++FDSMPRRNVV
Subjt:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV

Query:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI
        SWSALMAGY QNG+PLEVFELFKKMVVKDNIFPNEYVIAT ISSCDSQMYVEG+QCHGYALKSGLE HQYVKNALIQ+YSKCSDV AA+QILYTVPG DI
Subjt:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI

Query:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN
        FCYNLV++GLLQHTHM EAV++LKL+ISEGIEWNNATYVTIFRLCASLKD+ LG+QVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSG   FD+LQSRN
Subjt:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN

Query:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC
        VVSWTS++AAYFQN +FEEAL+LFSKMEID IPPNEYT+AVL NSAAGLSAL LGDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM CC
Subjt:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC

Query:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ
        + ITWNAIITGHSHHGLGKEALSMFQ M+AT E PNYVTFIGV+ ACAHL LVDEG YY NHLMKQF IVPGLEHYTCIVGLLSRSGRLDEAENFM+S+Q
Subjt:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ

Query:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL
        INWDVVSWRTLLNACYVH++YDKG++IAEYLLQLEP DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE+NL
Subjt:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL

Query:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG
        IYE V+DLLSKIRPLGYVPDI NVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPI VIKNLRMCDDCHTA+KLISK+ANR I+VRDANRFHHFQ+G
Subjt:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG

Query:  CCSCGDYW
        CCSCGDYW
Subjt:  CCSCGDYW

XP_022933883.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita moschata] | e-value 0.0e+00 | identity 87.34%
Query:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN
        M MLKL  PIS +APVKFTPFL + N  ASP  DP+KLLKVAADAKNLKFGR IHAHLII N    DC+VNQ+NSLINLYVKCDE+ IARQMFD M +RN
Subjt:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN

Query:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG
        VVSW ALMAGY QNGSPLEVFELFKKM+VKDNIFPNEYVIATVISSC DSQMYVEGRQCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG

Query:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ
        YD+FCYNLV++GLL+H+H+ EA+E+LKLMI EG +WNNAT+VTIFR+CASLKDLK G+ VHA+MLKSDID+DVYIGSSIIDMYGKCGNVLSG A FDQLQ
Subjt:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ

Query:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM
        +RNVVSWT++MAAYFQNG+FEEAL+LFSKMEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM

Query:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK
         CCDSITWNAIITGHSHH +GKEAL++F  ML  RECPNYVTFIGVLSACAHLSLVDEGLYY NHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+
Subjt:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK

Query:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SN INWDVV+WRTLLNACYVHRNYDKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF
        S+ IYE VRDLL+KIRPLGYVPDIA VLHDIEDEQK+DNLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMCDDCHTA+KLISKLANRTIIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

XP_023002421.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima] | e-value 0.0e+00 | identity 86.92%
Query:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN
        M MLKL  PIS +APVKFTPFL + N  ASP  DP+KLLKVAADAKNLKFGR+IHAHL+I N   RDC+VNQ+NSLINLYVKCDE+ IARQMFD M +RN
Subjt:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN

Query:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG
        VVSW ALMAGY QNGSPL+VFELFKKM+VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG

Query:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ
        YD+FCYNLV++GLL+H+H+GEA+E+LKLMI EG +WNNAT+VTIFR+CASLKDLKLG+ VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSG A FDQLQ
Subjt:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ

Query:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM
        +RNVVSWT++MAAYFQNG+FEEAL+LFSKMEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM

Query:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK
         CCDSITWNAIITGHSHH +GKEAL++F  ML  RECPNYVTFIGVLSACAHLSLVDEGLYY NHLMKQ GIVPGLEHYTCIVGLLSRSGRLDEAENFM+
Subjt:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK

Query:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SN INWDVV+WRTLLNACYVHRNYDKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF
        S+ IYE +RDLL+KIRPLGYVPDIA VLHDIEDEQK+DNLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISK+ANRTIIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

XP_038892172.1 pentatricopeptide repeat-containing protein At5g39680 [Benincasa hispida] | e-value 0.0e+00 | identity 91.24%
Query:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV
        MP+LKLPI DI PVKFTPFLFR NFFASPHQ+PIKLLKVAADAKNLKFGRI+HAHLII NH+  DCKVNQLNSLINLYVKCDEV IARQMFDSMPRRNV+
Subjt:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV

Query:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI
        SWS LMAGY QNGSPLEVFEL KKMVVKDNI PNEYVIATVISSCDSQMY+EG+QCHGY  KSGLELHQYVKN LIQMYSKCSDVRAA+QILYTVPGYDI
Subjt:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI

Query:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN
        FCYNLVMS LLQHTH GEAVE+LKLMISEGIEWNNAT+VTIFRLCASLKD+KLG+QVHAQMLKSDID+DVYIGSS+IDMYGKCGNV SG A FDQLQSRN
Subjt:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN

Query:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC
        VVSWTS+MAAYFQNG+FEEALDLFSKME+DHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLK NVIVGNALIIMYSKSGDILAA+ VFSNM CC
Subjt:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC

Query:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ
        D+ITWNA+ITGHSHHGLG  ALSMFQ MLATRE PNYVTFIGVLSACAHLS VDEG YY NHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAE FM SNQ
Subjt:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ

Query:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL
        INWDVVSWRTLLNACYVHRNYDKGKQIAE LLQLEP+DVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPESNL
Subjt:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL

Query:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG
        IY+KV+DLLSKIRPLGYVPDIA+VLHDIEDEQKVDNLSYHSEKLA+AYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISK+ANR IIVRDANRFHHFQDG
Subjt:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG

Query:  CCSCGDYW
         CSCGDYW
Subjt:  CCSCGDYW

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KR26 DYW_deaminase domain-containing protein | e-value 0.0e+00 | identity 87.99%
Query:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV
        M +LKLPI+DI PVKFTPFL R NF ASPHQDPIKLLKVAADAKNLKFGR IHAHL I NH  RD KVNQLNSLINLYVKCDEV IAR++FDSMPRRNVV
Subjt:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV

Query:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI
        SWSALMAGY QNG+PLEVFELFKKMVVKDNIFPNEYVIAT ISSCDSQMYVEG+QCHGYALKSGLE HQYVKNALIQ+YSKCSDV AA+QILYTVPG DI
Subjt:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI

Query:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN
        FCYNLV++GLLQHTHM EAV++LKL+ISEGIEWNNATYVTIFRLCASLKD+ LG+QVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSG   FD+LQSRN
Subjt:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN

Query:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC
        VVSWTS++AAYFQN +FEEAL+LFSKMEID IPPNEYT+AVL NSAAGLSAL LGDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM CC
Subjt:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC

Query:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ
        + ITWNAIITGHSHHGLGKEALSMFQ M+AT E PNYVTFIGV+ ACAHL LVDEG YY NHLMKQF IVPGLEHYTCIVGLLSRSGRLDEAENFM+S+Q
Subjt:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ

Query:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL
        INWDVVSWRTLLNACYVH++YDKG++IAEYLLQLEP DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE+NL
Subjt:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL

Query:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG
        IYE V+DLLSKIRPLGYVPDI NVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPI VIKNLRMCDDCHTA+KLISK+ANR I+VRDANRFHHFQ+G
Subjt:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG

Query:  CCSCGDYW
        CCSCGDYW
Subjt:  CCSCGDYW

A0A1S4E243 pentatricopeptide repeat-containing protein At5g39680 isoform X2 | e-value 0.0e+00 | identity 87.01%
Query:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV
        M +LKLPISDI PVKFTPFL R +FFASPHQDPIKLLKVAADAKNL FGR I AHL I NH  RD KVNQLNSLINLYVKC EV IAR++FDSMPRRNVV
Subjt:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV

Query:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI
        SWS LMAGY QNG+P EVFELFKKMV+KDNI PN+YVIATVISSC+SQMYVEG+QCHGYALKSGLE HQYVKNALIQ+YSKCSDV AA+QILYTVPG DI
Subjt:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDI

Query:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN
        FCYNLV++GLLQHTHM EAV++LKL+IS+GIEWN+ATYVTIFRLCASLKD+ LG+QVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSG   FD+LQSRN
Subjt:  FCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRN

Query:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC
        VVSWTS+MAAYFQN +FEEALDLFSKMEID IPPNEYT+AVL NSAAGLSAL LGDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQRVFSNM CC
Subjt:  VVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCC

Query:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ
        D ITWNAIITGHSHHGLGKEALSMFQ M+ T E PNYVTFIGV+SACAHL LVDEG YY NHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+S+Q
Subjt:  DSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQ

Query:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL
        INWDVVSWRTLLNACYVH++YDKGKQIAEYLLQLEP DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHP++NL
Subjt:  INWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNL

Query:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG
        IYE V++LLSKIRPLGYVPDI NVLHDIEDEQKV+NLSYHSEKLAVAYGLMKT SG PIRVIKNLRMCDDCHTA+KLIS++ANR IIVRD NRFHHFQ+G
Subjt:  IYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDG

Query:  CCSCGDYW
        CCSCGDYW
Subjt:  CCSCGDYW

A0A6J1CRJ5 pentatricopeptide repeat-containing protein At5g39680 | e-value 0.0e+00 | identity 86.04%
Query:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV
        MP LKLP S +      PFLF+ N+FASP Q+PIKLLK+AADAKNLKFGRIIHAHLII NH   DC+VNQ+NSLIN Y KCDE+L+ARQMFD MP+RNVV
Subjt:  MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVV

Query:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISS-CDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYD
        SWSALMAGY QNGS LEVF L KKMVV+D+I PNEYVIAT++SS C SQMYVEG+QCHGYALKSGLELHQYVKNALIQMYSKCSDVRAA+QIL TVPGYD
Subjt:  SWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISS-CDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYD

Query:  IFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSR
        IFCYNLV++GLL+H+H+ EA+E+L LMI E IEWNNATYVTIFRLCASLKDL+LG+QVHAQML++DID+DVYIGSSIIDMYGKCG VLSG   FD+LQS+
Subjt:  IFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSR

Query:  NVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPC
        NVVSWT++MAAYFQNG+FEEAL+LFSKMEID IPPNEYTLAV LNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM C
Subjt:  NVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPC

Query:  CDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSN
        CDSITWNAIITGHSHHGLGKEALSMFQ MLAT ECPNYVTFIGVLSACAHLSLV EG YY NHLMKQFGIVPGLEHYTCI+GLLSRSG+LDEAENFM+SN
Subjt:  CDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSN

Query:  QINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN
         INWDVV+WRTLL ACYVHRNYDKGKQIAEYLLQ++PEDVG+YILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+
Subjt:  QINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESN

Query:  LIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQD
         IYEKVRDLLS+I+PLGYVPDIA VLHDI+DEQK+DNLSYHSEKLAVAYGLMKTP GAPIRVIKNLRMCDDCHTAVKLISK+ANR IIVRDANRFHHF+D
Subjt:  LIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQD

Query:  GCCSCGDYW
        GCCSCGDYW
Subjt:  GCCSCGDYW

A0A6J1F637 pentatricopeptide repeat-containing protein At5g39680 | e-value 0.0e+00 | identity 87.34%
Query:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN
        M MLKL  PIS +APVKFTPFL + N  ASP  DP+KLLKVAADAKNLKFGR IHAHLII N    DC+VNQ+NSLINLYVKCDE+ IARQMFD M +RN
Subjt:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN

Query:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG
        VVSW ALMAGY QNGSPLEVFELFKKM+VKDNIFPNEYVIATVISSC DSQMYVEGRQCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG

Query:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ
        YD+FCYNLV++GLL+H+H+ EA+E+LKLMI EG +WNNAT+VTIFR+CASLKDLK G+ VHA+MLKSDID+DVYIGSSIIDMYGKCGNVLSG A FDQLQ
Subjt:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ

Query:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM
        +RNVVSWT++MAAYFQNG+FEEAL+LFSKMEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM

Query:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK
         CCDSITWNAIITGHSHH +GKEAL++F  ML  RECPNYVTFIGVLSACAHLSLVDEGLYY NHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+
Subjt:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK

Query:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SN INWDVV+WRTLLNACYVHRNYDKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF
        S+ IYE VRDLL+KIRPLGYVPDIA VLHDIEDEQK+DNLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMCDDCHTA+KLISKLANRTIIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

A0A6J1KL92 pentatricopeptide repeat-containing protein At5g39680 | e-value 0.0e+00 | identity 86.92%
Query:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN
        M MLKL  PIS +APVKFTPFL + N  ASP  DP+KLLKVAADAKNLKFGR+IHAHL+I N   RDC+VNQ+NSLINLYVKCDE+ IARQMFD M +RN
Subjt:  MPMLKL--PISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRN

Query:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG
        VVSW ALMAGY QNGSPL+VFELFKKM+VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQMYSKCSDVRAAL+IL TVPG
Subjt:  VVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPG

Query:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ
        YD+FCYNLV++GLL+H+H+GEA+E+LKLMI EG +WNNAT+VTIFR+CASLKDLKLG+ VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSG A FDQLQ
Subjt:  YDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQ

Query:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM
        +RNVVSWT++MAAYFQNG+FEEAL+LFSKMEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM
Subjt:  SRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM

Query:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK
         CCDSITWNAIITGHSHH +GKEAL++F  ML  RECPNYVTFIGVLSACAHLSLVDEGLYY NHLMKQ GIVPGLEHYTCIVGLLSRSGRLDEAENFM+
Subjt:  PCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMK

Query:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE
        SN INWDVV+WRTLLNACYVHRNYDKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPE
Subjt:  SNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPE

Query:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF
        S+ IYE +RDLL+KIRPLGYVPDIA VLHDIEDEQK+DNLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISK+ANRTIIVRDANRFHHF
Subjt:  SNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHF

Query:  QDGCCSCGDYW
        QDG CSCGDYW
Subjt:  QDGCCSCGDYW

SwissProt top hits (e-value, % identity, alignment)
Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g09950 | e-value 3.6e-137 | identity 38.22%
Query:  ADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIAT
        A+   LK GR +H H+I       D  V   N L+N+Y KC  +  AR++F  M  ++ VSW++++ G  QNG  +E  E +K M  + +I P  + + +
Subjt:  ADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIAT

Query:  VISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQ-HTHMGEAVEILKLMISEGIEWNNATY
         +SSC S  + + G+Q HG +LK G++L+  V NAL+ +Y++   +    +I  ++P +D   +N ++  L +    + EAV         G + N  T+
Subjt:  VISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQ-HTHMGEAVEILKLMISEGIEWNNATY

Query:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQL-QSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEY
         ++    +SL   +LG+Q+H   LK++I  +    +++I  YGKCG +      F ++ + R+ V+W SM++ Y  N    +ALDL   M       + +
Subjt:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQL-QSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEY

Query:  TLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGM-LATRECPN
          A +L++ A ++ L  G ++HA + ++ L+ +V+VG+AL+ MYSK G +  A R F+ MP  +S +WN++I+G++ HG G+EAL +F+ M L  +  P+
Subjt:  TLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGM-LATRECPN

Query:  YVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNAC--YVHRNYDKGKQIAEYLLQL
        +VTF+GVLSAC+H  L++EG  +   +   +G+ P +EH++C+  +L R+G LD+ E+F++   +  +V+ WRT+L AC     R  + GK+ AE L QL
Subjt:  YVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNAC--YVHRNYDKGKQIAEYLLQL

Query:  EPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKV
        EPE+   Y+LL NM+A   RW+ +VK RK M++ +VKKE G SW+ +++  H+F + D  HP++++IY+K+++L  K+R  GYVP     L+D+E E K 
Subjt:  EPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKV

Query:  DNLSYHSEKLAVAYGL-MKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
        + LSYHSEKLAVA+ L  +  S  PIR++KNLR+C DCH+A K ISK+  R II+RD+NRFHHFQDG CSC D+W
Subjt:  DNLSYHSEKLAVAYGL-MKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW

Q9FK93 Pentatricopeptide repeat-containing protein At5g39680 | e-value 2.9e-216 | identity 52.89%
Query:  KLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPN
        +LLKV A++  L+ G  IHAHLI+ N + R     Q+NSLINLYVKC E + AR++FD MP RNVVSW A+M GY+ +G   EV +LFK M       PN
Subjt:  KLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPN

Query:  EYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEW
        E+V   V  SC +S    EG+Q HG  LK GL  H++V+N L+ MYS CS    A+++L  +P  D+  ++  +SG L+     E +++L+   +E   W
Subjt:  EYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEW

Query:  NNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIP
        NN TY++  RL ++L+DL L  QVH++M++   + +V    ++I+MYGKCG VL     FD   ++N+   T++M AYFQ+  FEEAL+LFSKM+   +P
Subjt:  NNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIP

Query:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRE
        PNEYT A+LLNS A LS L  GD LH    KSG + +V+VGNAL+ MY+KSG I  A++ FS M   D +TWN +I+G SHHGLG+EAL  F  M+ T E
Subjt:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRE

Query:  CPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ
         PN +TFIGVL AC+H+  V++GL+Y N LMK+F + P ++HYTCIVGLLS++G   +AE+FM++  I WDVV+WRTLLNACYV RNY  GK++AEY ++
Subjt:  CPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQK
          P D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE  LIY KV++++SKI+PLGY PD+A   HD+++EQ+
Subjt:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQK

Query:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
         DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 | e-value 5.1e-136 | identity 36.79%
Query:  NSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQY
        N++++ Y K  ++    + FD +P+R+ VSW+ ++ GY+  G   +   +   M VK+ I P ++ +  V++S  +   +E G++ H + +K GL  +  
Subjt:  NSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQY

Query:  VKNALIQMYSKCSD-------------------------------VRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEG-IEWNNATY
        V N+L+ MY+KC D                               +  A+     +   DI  +N ++SG  Q  +   A++I   M+ +  +  +  T 
Subjt:  VKNALIQMYSKCSD-------------------------------VRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEG-IEWNNATY

Query:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNV----------------LSGIAC-----------------FDQLQSRNVVSWT
         ++   CA+L+ L +G+Q+H+ ++ +  D    + +++I MY +CG V                + G                    F  L+ R+VV+WT
Subjt:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNV----------------LSGIAC-----------------FDQLQSRNVVSWT

Query:  SMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPC-CDSIT
        +M+  Y Q+G + EA++LF  M      PN YTLA +L+ A+ L++LS G Q+H  A KSG   +V V NALI MY+K+G+I +A R F  + C  D+++
Subjt:  SMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPC-CDSIT

Query:  WNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWD
        W ++I   + HG  +EAL +F+ ML     P+++T++GV SAC H  LV++G  Y + +     I+P L HY C+V L  R+G L EA+ F++   I  D
Subjt:  WNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWD

Query:  VVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK
        VV+W +LL+AC VH+N D GK  AE LL LEPE+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++  HVF  ED  HPE N IY  
Subjt:  VVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK

Query:  VRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSC
        ++ +  +I+ +GYVPD A+VLHD+E+E K   L +HSEKLA+A+GL+ TP    +R++KNLR+C+DCHTA+K ISKL  R IIVRD  RFHHF+DG CSC
Subjt:  VRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSC

Query:  GDYW
         DYW
Subjt:  GDYW

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170 (e value: 2.5e-143; % identity: 38.55)
Query:  IKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFP
        I +L  A    +L  G+ +H   +       D  +   NSLIN+Y K  +   AR +FD+M  R+++SW++++AG  QNG  +E   LF ++ ++  + P
Subjt:  IKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFP

Query:  NEYVIATVISSCDS--QMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGI
        ++Y + +V+ +  S  +     +Q H +A+K       +V  ALI  YS+   ++ A +IL+    +D+  +N +M+G  Q     + +++  LM  +G 
Subjt:  NEYVIATVISSCDS--QMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGI

Query:  EWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDH
          ++ T  T+F+ C  L  +  G+QVHA  +KS  D D+++ S I+DMY KCG++ +    FD +   + V+WT+M++   +NG  E A  +FS+M +  
Subjt:  EWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDH

Query:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLAT
        + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  +   +   WNA++ G + HG GKE L +F+ M + 
Subjt:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLAT

Query:  RECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL
           P+ VTFIGVLSAC+H  LV E   ++  +   +GI P +EHY+C+   L R+G + +AEN ++S  +      +RTLL AC V  + + GK++A  L
Subjt:  RECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL

Query:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDE
        L+LEP D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++ LIY KV+D++  I+  GYVP+    L D+E+E
Subjt:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDE

Query:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
        +K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+K+ NR I++RDANRFH F+DG CSCGDYW
Subjt:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW

Q9SVP7 Pentatricopeptide repeat-containing protein At4g13650 (e value: 7.7e-140; % identity: 36.66)
Query:  PHQDPIKLLKVAADAKNLKF-GRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVV
        P  + +  L VA  A    F G+ +HA+      A  + K+    +L+NLY KC ++  A   F      NVV W+ ++  Y         F +F++M +
Subjt:  PHQDPIKLLKVAADAKNLKF-GRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVV

Query:  KDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLM
        ++ I PN+Y   +++ +C     +E G Q H   +K+  +L+ YV + LI MY+K   +  A  IL    G D+  +  +++G  Q+    +A+   + M
Subjt:  KDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLM

Query:  ISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSK
        +  GI  +          CA L+ LK G+Q+HAQ   S    D+   ++++ +Y +CG +      F+Q ++ + ++W ++++ + Q+G  EEAL +F +
Subjt:  ISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSK

Query:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQ
        M  + I  N +T    + +A+  + +  G Q+HA   K+G      V NALI MY+K G I  A++ F  +   + ++WNAII  +S HG G EAL  F 
Subjt:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQ

Query:  GMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ
         M+ +   PN+VT +GVLSAC+H+ LVD+G+ Y   +  ++G+ P  EHY C+V +L+R+G L  A+ F++   I  D + WRTLL+AC VH+N + G+ 
Subjt:  GMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ

Query:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLH
         A +LL+LEPED  TY+LLSN++A  ++WD     R+ M+E+ VKKEPG SW+E++N  H F   D  HP ++ I+E  +DL  +   +GYV D  ++L+
Subjt:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLH

Query:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
        +++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DCH  +K +SK++NR IIVRDA RFHHF+ G CSC DYW
Subjt:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW

Arabidopsis top hits (e value, % identity, alignment)
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein (e value: 3.6e-137; % identity: 36.79)
Query:  NSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQY
        N++++ Y K  ++    + FD +P+R+ VSW+ ++ GY+  G   +   +   M VK+ I P ++ +  V++S  +   +E G++ H + +K GL  +  
Subjt:  NSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQY

Query:  VKNALIQMYSKCSD-------------------------------VRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEG-IEWNNATY
        V N+L+ MY+KC D                               +  A+     +   DI  +N ++SG  Q  +   A++I   M+ +  +  +  T 
Subjt:  VKNALIQMYSKCSD-------------------------------VRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEG-IEWNNATY

Query:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNV----------------LSGIAC-----------------FDQLQSRNVVSWT
         ++   CA+L+ L +G+Q+H+ ++ +  D    + +++I MY +CG V                + G                    F  L+ R+VV+WT
Subjt:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNV----------------LSGIAC-----------------FDQLQSRNVVSWT

Query:  SMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPC-CDSIT
        +M+  Y Q+G + EA++LF  M      PN YTLA +L+ A+ L++LS G Q+H  A KSG   +V V NALI MY+K+G+I +A R F  + C  D+++
Subjt:  SMMAAYFQNGYFEEALDLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPC-CDSIT

Query:  WNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWD
        W ++I   + HG  +EAL +F+ ML     P+++T++GV SAC H  LV++G  Y + +     I+P L HY C+V L  R+G L EA+ F++   I  D
Subjt:  WNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWD

Query:  VVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK
        VV+W +LL+AC VH+N D GK  AE LL LEPE+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++  HVF  ED  HPE N IY  
Subjt:  VVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEK

Query:  VRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSC
        ++ +  +I+ +GYVPD A+VLHD+E+E K   L +HSEKLA+A+GL+ TP    +R++KNLR+C+DCHTA+K ISKL  R IIVRD  RFHHF+DG CSC
Subjt:  VRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSC

Query:  GDYW
         DYW
Subjt:  GDYW

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 5.4e-141; % identity: 36.66)
Query:  PHQDPIKLLKVAADAKNLKF-GRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVV
        P  + +  L VA  A    F G+ +HA+      A  + K+    +L+NLY KC ++  A   F      NVV W+ ++  Y         F +F++M +
Subjt:  PHQDPIKLLKVAADAKNLKF-GRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVV

Query:  KDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLM
        ++ I PN+Y   +++ +C     +E G Q H   +K+  +L+ YV + LI MY+K   +  A  IL    G D+  +  +++G  Q+    +A+   + M
Subjt:  KDNIFPNEYVIATVISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLM

Query:  ISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSK
        +  GI  +          CA L+ LK G+Q+HAQ   S    D+   ++++ +Y +CG +      F+Q ++ + ++W ++++ + Q+G  EEAL +F +
Subjt:  ISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSK

Query:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQ
        M  + I  N +T    + +A+  + +  G Q+HA   K+G      V NALI MY+K G I  A++ F  +   + ++WNAII  +S HG G EAL  F 
Subjt:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQ

Query:  GMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ
         M+ +   PN+VT +GVLSAC+H+ LVD+G+ Y   +  ++G+ P  EHY C+V +L+R+G L  A+ F++   I  D + WRTLL+AC VH+N + G+ 
Subjt:  GMLATRECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ

Query:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLH
         A +LL+LEPED  TY+LLSN++A  ++WD     R+ M+E+ VKKEPG SW+E++N  H F   D  HP ++ I+E  +DL  +   +GYV D  ++L+
Subjt:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLH

Query:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
        +++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DCH  +K +SK++NR IIVRDA RFHHF+ G CSC DYW
Subjt:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.8e-144; % identity: 38.55)
Query:  IKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFP
        I +L  A    +L  G+ +H   +       D  +   NSLIN+Y K  +   AR +FD+M  R+++SW++++AG  QNG  +E   LF ++ ++  + P
Subjt:  IKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFP

Query:  NEYVIATVISSCDS--QMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGI
        ++Y + +V+ +  S  +     +Q H +A+K       +V  ALI  YS+   ++ A +IL+    +D+  +N +M+G  Q     + +++  LM  +G 
Subjt:  NEYVIATVISSCDS--QMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGI

Query:  EWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDH
          ++ T  T+F+ C  L  +  G+QVHA  +KS  D D+++ S I+DMY KCG++ +    FD +   + V+WT+M++   +NG  E A  +FS+M +  
Subjt:  EWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDH

Query:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLAT
        + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  +   +   WNA++ G + HG GKE L +F+ M + 
Subjt:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLAT

Query:  RECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL
           P+ VTFIGVLSAC+H  LV E   ++  +   +GI P +EHY+C+   L R+G + +AEN ++S  +      +RTLL AC V  + + GK++A  L
Subjt:  RECPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL

Query:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDE
        L+LEP D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++ LIY KV+D++  I+  GYVP+    L D+E+E
Subjt:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDE

Query:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
        +K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+K+ NR I++RDANRFH F+DG CSCGDYW
Subjt:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.5e-138; % identity: 38.22)
Query:  ADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIAT
        A+   LK GR +H H+I       D  V   N L+N+Y KC  +  AR++F  M  ++ VSW++++ G  QNG  +E  E +K M  + +I P  + + +
Subjt:  ADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPNEYVIAT

Query:  VISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQ-HTHMGEAVEILKLMISEGIEWNNATY
         +SSC S  + + G+Q HG +LK G++L+  V NAL+ +Y++   +    +I  ++P +D   +N ++  L +    + EAV         G + N  T+
Subjt:  VISSCDSQMYVE-GRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQ-HTHMGEAVEILKLMISEGIEWNNATY

Query:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQL-QSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEY
         ++    +SL   +LG+Q+H   LK++I  +    +++I  YGKCG +      F ++ + R+ V+W SM++ Y  N    +ALDL   M       + +
Subjt:  VTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQL-QSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIPPNEY

Query:  TLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGM-LATRECPN
          A +L++ A ++ L  G ++HA + ++ L+ +V+VG+AL+ MYSK G +  A R F+ MP  +S +WN++I+G++ HG G+EAL +F+ M L  +  P+
Subjt:  TLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGM-LATRECPN

Query:  YVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNAC--YVHRNYDKGKQIAEYLLQL
        +VTF+GVLSAC+H  L++EG  +   +   +G+ P +EH++C+  +L R+G LD+ E+F++   +  +V+ WRT+L AC     R  + GK+ AE L QL
Subjt:  YVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNAC--YVHRNYDKGKQIAEYLLQL

Query:  EPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKV
        EPE+   Y+LL NM+A   RW+ +VK RK M++ +VKKE G SW+ +++  H+F + D  HP++++IY+K+++L  K+R  GYVP     L+D+E E K 
Subjt:  EPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKV

Query:  DNLSYHSEKLAVAYGL-MKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
        + LSYHSEKLAVA+ L  +  S  PIR++KNLR+C DCH+A K ISK+  R II+RD+NRFHHFQDG CSC D+W
Subjt:  DNLSYHSEKLAVAYGL-MKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW

AT5G39680.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.1e-217; % identity: 52.89)
Query:  KLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPN
        +LLKV A++  L+ G  IHAHLI+ N + R     Q+NSLINLYVKC E + AR++FD MP RNVVSW A+M GY+ +G   EV +LFK M       PN
Subjt:  KLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYRQNGSPLEVFELFKKMVVKDNIFPN

Query:  EYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEW
        E+V   V  SC +S    EG+Q HG  LK GL  H++V+N L+ MYS CS    A+++L  +P  D+  ++  +SG L+     E +++L+   +E   W
Subjt:  EYVIATVISSC-DSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAVEILKLMISEGIEW

Query:  NNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIP
        NN TY++  RL ++L+DL L  QVH++M++   + +V    ++I+MYGKCG VL     FD   ++N+   T++M AYFQ+  FEEAL+LFSKM+   +P
Subjt:  NNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEIDHIP

Query:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRE
        PNEYT A+LLNS A LS L  GD LH    KSG + +V+VGNAL+ MY+KSG I  A++ FS M   D +TWN +I+G SHHGLG+EAL  F  M+ T E
Subjt:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRE

Query:  CPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ
         PN +TFIGVL AC+H+  V++GL+Y N LMK+F + P ++HYTCIVGLLS++G   +AE+FM++  I WDVV+WRTLLNACYV RNY  GK++AEY ++
Subjt:  CPNYVTFIGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQK
          P D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE  LIY KV++++SKI+PLGY PD+A   HD+++EQ+
Subjt:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQK

Query:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
         DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW


Sequences
CDS sequence
ATGCCTATGTTAAAGCTACCAATTAGTGACATTGCTCCTGTGAAGTTCACACCGTTTCTATTCAGGCCCAATTTCTTTGCTTCTCCTCACCAGGACCCAATAAAGCTGTT
GAAAGTAGCTGCAGATGCCAAGAACTTAAAATTTGGTAGAATAATCCATGCCCATCTGATCATTAACAATCACGCCGATAGGGACTGCAAAGTAAACCAACTGAATTCCC
TTATTAATTTGTACGTGAAATGTGATGAAGTATTGATTGCTCGGCAAATGTTTGATAGTATGCCTAGAAGAAATGTGGTATCTTGGAGCGCTTTAATGGCTGGGTACAGG
CAAAATGGGAGTCCCTTGGAAGTTTTTGAGTTGTTCAAAAAGATGGTTGTGAAGGATAATATTTTCCCCAATGAATATGTGATTGCTACTGTTATATCTTCTTGTGATAG
TCAAATGTATGTAGAGGGGAGGCAATGTCATGGGTATGCATTAAAGTCTGGATTGGAGCTTCATCAATATGTTAAGAATGCACTTATTCAGATGTACTCTAAATGTTCAG
ATGTAAGAGCAGCATTGCAGATATTATATACTGTCCCAGGTTATGACATATTTTGTTATAATTTGGTAATGAGTGGGCTTCTACAGCACACACATATGGGAGAAGCTGTA
GAAATTCTGAAGTTAATGATTAGTGAAGGCATAGAATGGAATAATGCCACTTATGTTACAATTTTTCGCCTTTGTGCTAGTCTTAAAGATTTAAAATTAGGTGAGCAAGT
TCATGCTCAAATGTTAAAAAGTGATATTGACCATGATGTCTATATCGGAAGTTCTATCATTGACATGTACGGGAAGTGTGGTAATGTGTTGAGTGGAATAGCCTGTTTTG
ATCAGTTGCAAAGCCGAAACGTTGTTTCTTGGACATCAATGATGGCAGCTTATTTTCAGAATGGATACTTTGAAGAAGCATTGGATCTGTTTTCAAAGATGGAAATTGAT
CATATTCCTCCCAATGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCAGGTTTGTCTGCACTAAGCCTTGGGGATCAGTTACATGCACGTGCTGAGAAATCGGGTCT
CAAAGGCAATGTTATAGTAGGTAATGCCTTGATCATTATGTATTCCAAGAGTGGGGACATTTTAGCAGCACAACGTGTGTTTTCAAATATGCCCTGTTGTGATTCCATTA
CCTGGAATGCAATAATAACTGGCCATTCACACCATGGTCTTGGCAAGGAAGCTTTAAGCATGTTCCAGGGCATGTTAGCTACTAGAGAGTGTCCTAACTATGTAACCTTT
ATTGGTGTTCTCTCTGCTTGTGCCCATTTAAGCCTGGTGGATGAAGGATTATACTATTTGAATCATTTGATGAAACAATTTGGTATTGTTCCCGGATTGGAGCACTATAC
CTGTATTGTTGGACTTCTTAGTAGATCTGGACGATTGGATGAAGCTGAAAATTTTATGAAGTCGAATCAAATAAATTGGGATGTTGTTTCCTGGCGCACCCTTCTCAATG
CTTGTTATGTTCATAGAAATTATGATAAAGGGAAGCAAATAGCAGAGTACTTGCTACAGTTGGAGCCTGAGGATGTAGGAACTTACATTCTATTATCAAACATGCATGCG
AGAGTTCGGAGGTGGGATGGTGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGCCCATGT
TTTTACATCTGAGGATACTAAACACCCTGAGTCCAATCTGATTTATGAAAAGGTAAGGGATTTATTATCTAAGATCCGACCATTGGGGTATGTTCCTGATATTGCTAATG
TATTGCATGATATTGAGGACGAGCAAAAGGTAGATAATCTTAGCTATCACAGTGAGAAGCTCGCCGTAGCATATGGCCTGATGAAGACACCATCAGGCGCACCAATCAGG
GTGATTAAGAACCTTAGGATGTGCGATGATTGTCATACTGCTGTCAAACTTATTTCAAAGCTTGCTAATAGGACTATAATTGTTAGAGATGCCAATCGCTTCCATCATTT
TCAAGATGGTTGTTGCTCTTGTGGAGATTATTGGTGA
mRNA sequence
ATGCCTATGTTAAAGCTACCAATTAGTGACATTGCTCCTGTGAAGTTCACACCGTTTCTATTCAGGCCCAATTTCTTTGCTTCTCCTCACCAGGACCCAATAAAGCTGTT
GAAAGTAGCTGCAGATGCCAAGAACTTAAAATTTGGTAGAATAATCCATGCCCATCTGATCATTAACAATCACGCCGATAGGGACTGCAAAGTAAACCAACTGAATTCCC
TTATTAATTTGTACGTGAAATGTGATGAAGTATTGATTGCTCGGCAAATGTTTGATAGTATGCCTAGAAGAAATGTGGTATCTTGGAGCGCTTTAATGGCTGGGTACAGG
CAAAATGGGAGTCCCTTGGAAGTTTTTGAGTTGTTCAAAAAGATGGTTGTGAAGGATAATATTTTCCCCAATGAATATGTGATTGCTACTGTTATATCTTCTTGTGATAG
TCAAATGTATGTAGAGGGGAGGCAATGTCATGGGTATGCATTAAAGTCTGGATTGGAGCTTCATCAATATGTTAAGAATGCACTTATTCAGATGTACTCTAAATGTTCAG
ATGTAAGAGCAGCATTGCAGATATTATATACTGTCCCAGGTTATGACATATTTTGTTATAATTTGGTAATGAGTGGGCTTCTACAGCACACACATATGGGAGAAGCTGTA
GAAATTCTGAAGTTAATGATTAGTGAAGGCATAGAATGGAATAATGCCACTTATGTTACAATTTTTCGCCTTTGTGCTAGTCTTAAAGATTTAAAATTAGGTGAGCAAGT
TCATGCTCAAATGTTAAAAAGTGATATTGACCATGATGTCTATATCGGAAGTTCTATCATTGACATGTACGGGAAGTGTGGTAATGTGTTGAGTGGAATAGCCTGTTTTG
ATCAGTTGCAAAGCCGAAACGTTGTTTCTTGGACATCAATGATGGCAGCTTATTTTCAGAATGGATACTTTGAAGAAGCATTGGATCTGTTTTCAAAGATGGAAATTGAT
CATATTCCTCCCAATGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCAGGTTTGTCTGCACTAAGCCTTGGGGATCAGTTACATGCACGTGCTGAGAAATCGGGTCT
CAAAGGCAATGTTATAGTAGGTAATGCCTTGATCATTATGTATTCCAAGAGTGGGGACATTTTAGCAGCACAACGTGTGTTTTCAAATATGCCCTGTTGTGATTCCATTA
CCTGGAATGCAATAATAACTGGCCATTCACACCATGGTCTTGGCAAGGAAGCTTTAAGCATGTTCCAGGGCATGTTAGCTACTAGAGAGTGTCCTAACTATGTAACCTTT
ATTGGTGTTCTCTCTGCTTGTGCCCATTTAAGCCTGGTGGATGAAGGATTATACTATTTGAATCATTTGATGAAACAATTTGGTATTGTTCCCGGATTGGAGCACTATAC
CTGTATTGTTGGACTTCTTAGTAGATCTGGACGATTGGATGAAGCTGAAAATTTTATGAAGTCGAATCAAATAAATTGGGATGTTGTTTCCTGGCGCACCCTTCTCAATG
CTTGTTATGTTCATAGAAATTATGATAAAGGGAAGCAAATAGCAGAGTACTTGCTACAGTTGGAGCCTGAGGATGTAGGAACTTACATTCTATTATCAAACATGCATGCG
AGAGTTCGGAGGTGGGATGGTGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGCCCATGT
TTTTACATCTGAGGATACTAAACACCCTGAGTCCAATCTGATTTATGAAAAGGTAAGGGATTTATTATCTAAGATCCGACCATTGGGGTATGTTCCTGATATTGCTAATG
TATTGCATGATATTGAGGACGAGCAAAAGGTAGATAATCTTAGCTATCACAGTGAGAAGCTCGCCGTAGCATATGGCCTGATGAAGACACCATCAGGCGCACCAATCAGG
GTGATTAAGAACCTTAGGATGTGCGATGATTGTCATACTGCTGTCAAACTTATTTCAAAGCTTGCTAATAGGACTATAATTGTTAGAGATGCCAATCGCTTCCATCATTT
TCAAGATGGTTGTTGCTCTTGTGGAGATTATTGGTGA
Protein sequence
MPMLKLPISDIAPVKFTPFLFRPNFFASPHQDPIKLLKVAADAKNLKFGRIIHAHLIINNHADRDCKVNQLNSLINLYVKCDEVLIARQMFDSMPRRNVVSWSALMAGYR
QNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGRQCHGYALKSGLELHQYVKNALIQMYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHMGEAV
EILKLMISEGIEWNNATYVTIFRLCASLKDLKLGEQVHAQMLKSDIDHDVYIGSSIIDMYGKCGNVLSGIACFDQLQSRNVVSWTSMMAAYFQNGYFEEALDLFSKMEID
HIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKEALSMFQGMLATRECPNYVTF
IGVLSACAHLSLVDEGLYYLNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHA
RVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYEKVRDLLSKIRPLGYVPDIANVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIR
VIKNLRMCDDCHTAVKLISKLANRTIIVRDANRFHHFQDGCCSCGDYW
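
The CDS and protein sequences listed above can be cross-checked by translating the coding sequence. The following is a minimal sketch in Python, assuming the standard nuclear genetic code applies and that the sequence has been copied into a string. The names `translate_cds` and `CODON_TABLE` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted directly from a wrapped listing. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First six codons and last five codons of the CDS listed above.
print(translate_cds("ATGCCTATGTTAAAGCTA"))  # MPMLKL
print(translate_cds("GGAGATTATTGGTGA"))     # GDYW
```

Passing the full CDS to `translate_cds` and comparing the result with the protein sequence is a quick consistency check. The 2127-nucleotide CDS corresponds to 709 codons, which is 708 residues plus the stop codon.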