CuGenDBv2

Tan0011222 (gene) of Snake gourd v1 genome

Gene ID: Tan0011222
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG04:3932078..3935391
RNA-Seq Expression: Tan0011222
Synteny: Tan0011222
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology
GenBank top hits | e value | % identity | Alignment
XP_022139075.1 pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Momordica charantia] | 0.0e+00 | 87.48
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMSRD MDSFDLFVTNHLINMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEFTVASLLTSFGEHDGERG+QV
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDAWTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ DIVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAG+LTEKHASTYHSLLIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ AL +FSKM+VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNELEEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFS+MNDSNLC +GT +RIMKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

XP_022139076.1 pentatricopeptide repeat-containing protein At1g71420 isoform X2 [Momordica charantia] | 0.0e+00 | 87.48
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMSRD MDSFDLFVTNHLINMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEFTVASLLTSFGEHDGERG+QV
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDAWTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ DIVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAG+LTEKHASTYHSLLIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ AL +FSKM+VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNELEEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFS+MNDSNLC +GT +RIMKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

XP_022957425.1 pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita moschata] | 0.0e+00 | 87.09
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH  F AKRNLV YPSK+ FGS LR+WRS AEGDIV FRTED  +DYL  +  IST G L QALSLFY SRQPHS QTYA+LFHACARLRCL+EG  LH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMS DPM SFDLFVTNHLINMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ GHVDECFL+FSRMLVDH+PNEFTVASLLTSFG+HDGERG+Q+
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALK SLDA VYVANALITMYSKS+ KGGAFND+KDDAWTMFKSIENP LITWNSMIAGFCF+K GN A+HLF+QMNRQGIGFDRAT+LSTLSS+SLC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        NWDE  LGL FC ELHCQALKTAF SEVEIITAL+KTYAEL GDIADSYRLF+EAGYNRDIVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAGFLTEKHASTYHSLLIK MSE+D VLNNALIHAY RCGSITSS+KVF+QMKH DLVSWNTMMK YAVHGQAE ALQ+FSKM VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT+LFNSIANYG+ CQLDHYACMVDILGR+GRIQEAEDFISKMP+EPD+V+WSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGR HP+REVICNELEEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFSVMND+NL  + TPIRIMKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

XP_023511808.1 pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Cucurbita pepo subsp. pepo] | 0.0e+00 | 87.22
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH  F AKRNLV YPSK+AFGS LR+WRS AEGDIV FRTED  +DYL  ++ IST G L QALSLFY SRQPHS QTYA+LFHACARLRCL+EGV LH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMS DPM SFDLFVTNHLINMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ GHVDECFL+FSRMLVDH+PNEFTVASLLTSFG+HDGERG+QV
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALK SLDA VYVANALITMYSK+++KGGAFND KDDAWTMFKSIENPSLITWNSMIAGFCF+K GN+A+HLF+QMN +GIGFDRAT+LSTLSS+SLC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        NWDE  LGL FC ELHCQALKTAF SEVEIITALVKTYAEL GDI DSYRLF+EAGYNRDIVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAGFLTEKHASTYHSLLIK MSE+D VLNNALIHAY RCGSITSS+KVF+QMKH DLVSWNTMMK YAVHGQAE ALQ+FSKM VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT LFNSIANYG+ CQLDHYACMVDILGR+GRIQEAEDFISKMP+EPD+V+WSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGR HP+REVICNELEEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFSVMND+NL  + TPIRIMKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

XP_038892212.1 pentatricopeptide repeat-containing protein At1g71420 isoform X1 [Benincasa hispida] | 0.0e+00 | 87.24
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TI+CPF AKRNLVSYPSKHAFG   R WRSAAEGDIV  RTEDIDNDYLL++  IST G L QALSLFYSSRQPHS QTYA+LFHACARLRCLQEG+GLH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSR-DPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQ
        RYMMSR DPM++FDLFVTNHLINMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ G VDECFL+FSRMLVDH+PNEFTVASLLTSFGEHDGERG+Q
Subjt:  RYMMSR-DPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQ

Query:  VHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSL
        +HGF LK SLD  VYVANALI MYSKS+ K GA+ND+KDDAWTMFKSIE P+LITWNSMIAGFCF+KLG+QAI+LF+QMN QGIGFDRAT+LSTLSS SL
Subjt:  VHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSL

Query:  CNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFS
        CNWDEFG GL FCH++HCQALKTAFISEVEIITALVKT AEL GDIADSYRLFVE GYNRDIVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FS
Subjt:  CNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFS

Query:  IVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSL
        IVLKACAGFLTEKHASTYHSLLIK MSE+D VLNNALIHAY RCGSI+SS+KVFDQMKH DLVSWNTMMKAYAVHGQAE ALQ+F+ MNVPPD+TTFVSL
Subjt:  IVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSL

Query:  LSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQ
        LSACSHAGLVEEG  LFNSI +YGI CQLDHYACMVDILGR+G+IQEA DFISKMP+EPDFVVWSSFLGSCRKHGAT+LAKL+S KLKELDP NSLAYVQ
Subjt:  LSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQ

Query:  MSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL
        MSNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGG RHPQREVI NELEEL+GRLKEIGYVPETSLALHDVE EQKEEQLYHHSEKL
Subjt:  MSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL

Query:  ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        ALVFSVMND NL R  TPIRIMKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Subjt:  ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

TrEMBL top hits | e value | % identity | Alignment
A0A5D3D022 Pentatricopeptide repeat-containing protein | 0.0e+00 | 85.01
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TI+C F   RNLVS PSKHAFG   R WRSAAEGDIV FRTEDIDNDYLL+T  IS+ G L +ALSLFYSSRQPHS QTYA+LFH CARLRCLQEGVGLH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYM+S++PM SFDLFVTNHLINMYCKCGHL YA QLFN+MPRRN VSWT LISGLSQ GHVDECF +FSRMLVD +PNEFTVASLLTSFGEHDGERG+Q+
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALK SLDASVYVANALITMYSKS+ + G FND KDDAWTMFKSIENPSLITWNSMIAGFCF+KLG QAI+LF+QMNR GIGFDRAT+LSTLSS   C
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        N DEFG  L FCH++HCQALKTAF SE+EIITALVKTYAEL G+IADSY+LFVEAGYNRDIVLWTSIM AF++HDPGKTLSLF QFRQEGL PDGH FS+
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAGFLTEKHAS YHSLLIK MSE+D VLNNALIHAY RCGSI+SS+KVF+QMKH DLVSWNTMMKAYA+HGQAE ALQ+F+KMNVPPD+TTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT LFNSI NYGI CQLDHYACMVDILGR+GR+QEA DFISKMP+EPDFVVWSSFLGSCRK+GA  LAKL+S KLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYCF+GSFY+ADLIRTEM GSRVRKEPGLSWVEIENQVHEFASGGR HPQREVICNELEEL+GRLKEIGYVPET LA +DVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFSVMND NL  +  PIRIMKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

A0A6J1CBA2 pentatricopeptide repeat-containing protein At1g71420 isoform X1 | 0.0e+00 | 87.48
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMSRD MDSFDLFVTNHLINMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEFTVASLLTSFGEHDGERG+QV
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDAWTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ DIVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAG+LTEKHASTYHSLLIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ AL +FSKM+VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNELEEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFS+MNDSNLC +GT +RIMKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

A0A6J1CBK6 pentatricopeptide repeat-containing protein At1g71420 isoform X2 | 0.0e+00 | 87.48
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH PF AKRNLV YPSK AF  HLRYWRSAAE D VP RTEDIDNDYL DT  IST G L  ALSLFYS RQPHSRQTYA+LFHACARLRCL EG+GLH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMSRD MDSFDLFVTNHLINMYCKCGHL YA+QLF++MPRRNLVSWT LISGLSQ GHVDECFLLF RMLVD +PNEFTVASLLTSFGEHDGERG+QV
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALKTSLDA VYVANALITMYSKSF KGG FND+ DDAWTMFKSIENPSLITWNSMIAGFCF+KLGNQAI+LF++MNR+GIGFDRAT+LSTLSS++LC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        N DEFGLGLSFCHELHC A KTAFISE+E+ TALVKTYA+L GDIADSYRLFVEAGY+ DIVLWTSIMTA VEHDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAG+LTEKHASTYHSLLIK MSE+DIVLNNALIHAY RCGSIT S+KVF +MK+RDLVSWNTMMKAYA+HGQA+ AL +FSKM+VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT LFNSI  YGI CQLDHYACMVDILGR GR+QEAE FISKMP+EPDFVVWSSFLGSCRKHGATQLAKL+SNKLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYCFSGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQRE ICNELEEL+GRLK++GYVPETS+ALHDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFS+MNDSNLC +GT +RIMKNIRICVDCHNFMKLAS LLKKEIVIRDSNRFHHFT GLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

A0A6J1H0I1 pentatricopeptide repeat-containing protein At1g71420 isoform X1 | 0.0e+00 | 87.09
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH  F AKRNLV YPSK+ FGS LR+WRS AEGDIV FRTED  +DYL  +  IST G L QALSLFY SRQPHS QTYA+LFHACARLRCL+EG  LH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMS DPM SFDLFVTNHLINMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ GHVDECFL+FSRMLVDH+PNEFTVASLLTSFG+HDGERG+Q+
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALK SLDA VYVANALITMYSKS+ KGGAFND+KDDAWTMFKSIENP LITWNSMIAGFCF+K GN A+HLF+QMNRQGIGFDRAT+LSTLSS+SLC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        NWDE  LGL FC ELHCQALKTAF SEVEIITAL+KTYAEL GDIADSYRLF+EAGYNRDIVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAGFLTEKHASTYHSLLIK MSE+D VLNNALIHAY RCGSITSS+KVF+QMKH DLVSWNTMMK YAVHGQAE ALQ+FSKM VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT+LFNSIANYG+ CQLDHYACMVDILGR+GRIQEAEDFISKMP+EPD+V+WSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQVHEFASGGR HP+REVICNELEEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFSVMND+NL  + TPIRIMKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF AGLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

A0A6J1JQJ9 pentatricopeptide repeat-containing protein At1g71420 isoform X1 | 0.0e+00 | 86.18
Query:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH
        TIH  F AKRNLV YPSK+AFGS LR+WRS  EGDIV FRTED   DYL  ++ IST G L QALSLFY SRQPHS QTYA+LFHACARLRCL+EG  LH
Subjt:  TIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLH

Query:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV
        RYMMS DPM SFDLFVTNHLINMYCKCGHL YAYQLFN+MPRRNLVSWT LISGLSQ  HVDECFL+FSRMLVDH+PNEFTVASLLTSFG+HDGERG+QV
Subjt:  RYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQV

Query:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC
        HGFALK SLDA VYVANALITMYSKS++KGGAFND KDDAWTMFKSIENPSLITWNSMIAGFCF+K GN+A+HLF+QMN +GIGFDRAT+LSTLSS+SLC
Subjt:  HGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLC

Query:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI
        NWDE  LGL FC ELHCQALKTAF SEVEIITALVKTYAEL GDI DSYRLF+EAGYNRDIVLWTSIMTAFV+HDPGKTLSLF QFRQEGL PDGH FSI
Subjt:  NWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSI

Query:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL
        VLKACAGFLTEKHASTYHSLLIK  SE+D V+NNALIHAY RCGSITSS+KVFDQMKH DLVSWNTMMK YAVHGQAE ALQ+FSKM VPPDSTTFVSLL
Subjt:  VLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM
        SACSHAGLVEEGT+LFNSI NYG+ CQLDHYACMVDILGR+GRI+EAE F+SKMP+EPD+VVWSSFLGSC+KHGATQLAKL+S+KLKELDPSNSLAYVQM
Subjt:  SACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQM

Query:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA
        SNLYC SGSFY+ADLIR EMKGSRVRKEPGLSWVEIENQ+HEFASGGR HP+REVICNELEEL+GRLKEIGYVPETSLA+HDVEQEQKEEQLYHHSEKLA
Subjt:  SNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLA

Query:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        LVFSVMND+NL  +G PIRIMKNIRICVDCHNFMKLAS LL+KEIVIRDSNRFHHF  GLCSCNDYW
Subjt:  LVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

SwissProt top hits | e value | % identity | Alignment
Q0WSH6 Pentatricopeptide repeat-containing protein At4g14850 | 2.5e-120 | 36.98
Query:  FVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFT-------VASLLTSFGEHDGERGKQVHGFALK
        F+ N+LINMY K  H   A  +    P RN+VSWT+LISGL+QNGH     + F  M  +   PN+FT       VASL           GKQ+H  A+K
Subjt:  FVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFT-------VASLLTSFGEHDGERGKQVHGFALK

Query:  TSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFG
              V+V  +   MY K+          +DDA  +F  I   +L TWN+ I+         +AI  FI+  R     +  T  + L++ S  +W    
Subjt:  TSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFG

Query:  LGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVE-HDPGKTLSLFHQFRQEGLIPDGHNFSIVLKAC
        LG+    +LH   L++ F ++V +   L+  Y + +  I  S  +F E G  ++ V W S++ A+V+ H+  K   L+ + R++ +       S VL AC
Subjt:  LGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVE-HDPGKTLSLFHQFRQEGLIPDGHNFSIVLKAC

Query:  AGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM-----NVPPDSTTFVSLL
        AG    +   + H+  +K   E  I + +AL+  Y +CG I  SE+ FD+M  ++LV+ N+++  YA  GQ + AL +F +M        P+  TFVSLL
Subjt:  AGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM-----NVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQ
        SACS AG VE G ++F+S+ + YGI    +HY+C+VD+LGRAG ++ A +FI KMP++P   VW +   +CR HG  QL  L++  L +LDP +S  +V 
Subjt:  SACSHAGLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQ

Query:  MSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL
        +SN +  +G + +A+ +R E+KG  ++K  G SW+ ++NQVH F +  R H   + I   L +L   ++  GY P+  L+L+D+E+E+K  ++ HHSEKL
Subjt:  MSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL

Query:  ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        AL F +++      +  PIRI KN+RIC DCH+F K  S  +K+EI++RD+NRFH F  G+CSC DYW
Subjt:  ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

Q9C9H9 Pentatricopeptide repeat-containing protein At1g71420 | 4.2e-208 | 52.45
Query:  GQLWQALSLFYSSR-QPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQ
        G + +A+SLFYS+  +  S+Q YA LF ACA  R L +G+ LH +M+S     S ++ + N LINMY KCG+++YA Q+F+ MP RN+VSWTALI+G  Q
Subjt:  GQLWQALSLFYSSR-QPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQ

Query:  NGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNS
         G+  E F LFS ML    PNEFT++S+LTS      E GKQVHG ALK  L  S+YVANA+I+MY +      A+     +AWT+F++I+  +L+TWNS
Subjt:  NGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNS

Query:  MIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGY
        MIA F    LG +AI +F++M+  G+GFDRAT+L+  SS+   +          C +LH   +K+  +++ E+ TAL+K Y+E+  D  D Y+LF+E  +
Subjt:  MIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGY

Query:  NRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMK
         RDIV W  I+TAF  +DP + + LF Q RQE L PD + FS VLKACAG +T +HA + H+ +IK     D VLNN+LIHAYA+CGS+    +VFD M 
Subjt:  NRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMK

Query:  HRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIAC-QLDHYACMVDILGRAGRIQEAEDFISKMPV
         RD+VSWN+M+KAY++HGQ +  L VF KM++ PDS TF++LLSACSHAG VEEG R+F S+        QL+HYAC++D+L RA R  EAE+ I +MP+
Subjt:  HRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIAC-QLDHYACMVDILGRAGRIQEAEDFISKMPV

Query:  EPDFVVWSSFLGSCRKHGATQLAKLSSNKLKEL-DPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREV
        +PD VVW + LGSCRKHG T+L KL+++KLKEL +P+NS++Y+QMSN+Y   GSF +A+L   EM+  RVRKEP LSW EI N+VHEFASGGR  P +E 
Subjt:  EPDFVVWSSFLGSCRKHGATQLAKLSSNKLKEL-DPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREV

Query:  ICNELEELVGRLKEIGYVPETSLALHDVE-QEQKEEQLYHHSEKLALVFSVM--NDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNR
        +  EL+ L+  LKE+GYVPE   A  D+E +EQ+E+ L HHSEKLAL F+VM    S+ C +   I+IMKN RIC+DCHNFMKLAS LL KEI++RDSNR
Subjt:  ICNELEELVGRLKEIGYVPETSLALHDVE-QEQKEEQLYHHSEKLALVFSVM--NDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNR

Query:  FHHFTAGLCSCNDYW
        FHHF    CSCNDYW
Subjt:  FHHFTAGLCSCNDYW

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g09950 | 2.1e-127 | 36.77
Query:  LQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRM-LVDHQPNEFTVASLLTSFGE
        L++G  +H ++++   +D F + + N L+NMY KCG +  A ++F  M  ++ VSW ++I+GL QNG   E    +  M   D  P  FT+ S L+S   
Subjt:  LQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRM-LVDHQPNEFTVASLLTSFGE

Query:  HD-GERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFC-FQKLGNQAIHLFIQMNRQGIGFDRAT
            + G+Q+HG +LK  +D +V V+NAL+T+Y+++ Y         ++   +F S+     ++WNS+I      ++   +A+  F+   R G   +R T
Subjt:  HD-GERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFC-FQKLGNQAIHLFIQMNRQGIGFDRAT

Query:  ILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHD-PGKTLSLFHQFRQ
          S LS+VS  ++ E G       ++H  ALK     E     AL+  Y +  G++    ++F      RD V W S+++ ++ ++   K L L     Q
Subjt:  ILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHD-PGKTLSLFHQFRQ

Query:  EGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMN
         G   D   ++ VL A A   T +     H+  ++   E+D+V+ +AL+  Y++CG +  + + F+ M  R+  SWN+M+  YA HGQ E+AL++F  M 
Subjt:  EGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMN

Query:  V----PPDSTTFVSLLSACSHAGLVEEGTRLFNSIA-NYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGS-CRKHG-ATQLAKL
        +    PPD  TFV +LSACSHAGL+EEG + F S++ +YG+A +++H++CM D+LGRAG + + EDFI KMP++P+ ++W + LG+ CR +G   +L K 
Subjt:  V----PPDSTTFVSLLSACSHAGLVEEGTRLFNSIA-NYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGS-CRKHG-ATQLAKL

Query:  SSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALH
        ++  L +L+P N++ YV + N+Y   G + D    R +MK + V+KE G SWV +++ VH F +G + HP  +VI  +L+EL  ++++ GYVP+T  AL+
Subjt:  SSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALH

Query:  DVEQEQKEEQLYHHSEKLALVF--SVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        D+EQE KEE L +HSEKLA+ F  +    S L     PIRIMKN+R+C DCH+  K  S +  ++I++RDSNRFHHF  G CSC+D+W
Subjt:  DVEQEQKEEQLYHHSEKLALVF--SVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070; e value 3.6e-119; %identity 34.75
Query:  GVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFTVASLLTSF-GEHD
        G  LH   +  D M     F  N +++ Y K G +    + F+ +P+R+ VSWT +I G    G   +   +   M+ +  +P +FT+ ++L S      
Subjt:  GVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFTVASLLTSF-GEHD

Query:  GERGKQVHGFALKTSLDASVYVANALITMYSKS--------FYKGGAFND---------------NKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQ
         E GK+VH F +K  L  +V V+N+L+ MY+K          +      D                 D A   F+ +    ++TWNSMI+GF  +    +
Subjt:  GERGKQVHGFALKTSLDASVYVANALITMYSKS--------FYKGGAFND---------------NKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQ

Query:  AIHLFIQMNRQG-IGFDRATILSTLSSVS----LC------------NWDEFGLGLSFCHELH--CQALKTA--FISE-------VEIITALVKTYAELE
        A+ +F +M R   +  DR T+ S LS+ +    LC             +D  G+ L+    ++  C  ++TA   I +       +E  TAL+  Y +L 
Subjt:  AIHLFIQMNRQG-IGFDRATILSTLSSVS----LC------------NWDEFGLGLSFCHELH--CQALKTA--FISE-------VEIITALVKTYAELE

Query:  GDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDP-GKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYA
        GD+  +  +FV    +RD+V WT+++  + +H   G+ ++LF      G  P+ +  + +L   +   +  H    H   +K      + ++NALI  YA
Subjt:  GDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDP-GKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYA

Query:  RCGSITSSEKVFDQMK-HRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYG-IACQLDHYACMVD
        + G+ITS+ + FD ++  RD VSW +M+ A A HG AE+AL++F  M    + PD  T+V + SAC+HAGLV +G + F+ + +   I   L HYACMVD
Subjt:  RCGSITSSEKVFDQMK-HRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYG-IACQLDHYACMVD

Query:  ILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEI
        + GRAG +QEA++FI KMP+EPD V W S L +CR H    L K+++ +L  L+P NS AY  ++NLY   G + +A  IR  MK  RV+KE G SW+E+
Subjt:  ILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEI

Query:  ENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKL
        +++VH F      HP++  I   ++++   +K++GYVP+T+  LHD+E+E KE+ L HHSEKLA+ F +++  +     T +RIMKN+R+C DCH  +K 
Subjt:  ENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKL

Query:  ASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
         S L+ +EI++RD+ RFHHF  G CSC DYW
Subjt:  ASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170; e value 3.1e-118; %identity 34.29
Query:  LFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRML-VDHQPNEFTVASLLTSFGE--HDGERGKQVHGFALKTSLD
        L V+N LINMYCK     +A  +F++M  R+L+SW ++I+G++QNG   E   LF ++L    +P+++T+ S+L +           KQVH  A+K +  
Subjt:  LFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRML-VDHQPNEFTVASLLTSFGE--HDGERGKQVHGFALKTSLD

Query:  ASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLS
        +  +V+ ALI  YS+        N    +A  +F+   N  L+ WN+M+AG+     G++ + LF  M++QG   D  T+ +   +        F   ++
Subjt:  ASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLS

Query:  FCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEH-DPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFL
           ++H  A+K+ +  ++ + + ++  Y +  GD++ +   F       D V WT++++  +E+ +  +   +F Q R  G++PD    + + KA +   
Subjt:  FCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEH-DPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFL

Query:  TEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHA
          +     H+  +K    ND  +  +L+  YA+CGSI  +  +F +++  ++ +WN M+   A HG+ ++ LQ+F +M    + PD  TF+ +LSACSH+
Subjt:  TEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHA

Query:  GLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYC
        GLV E  +   S+  +YGI  +++HY+C+ D LGRAG +++AE+ I  M +E    ++ + L +CR  G T+  K  + KL EL+P +S AYV +SN+Y 
Subjt:  GLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYC

Query:  FSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSV
         +  + +  L RT MKG +V+K+PG SW+E++N++H F    R + Q E+I  ++++++  +K+ GYVPET   L DVE+E+KE  LY+HSEKLA+ F +
Subjt:  FSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSV

Query:  MNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        ++        TPIR++KN+R+C DCHN MK  + +  +EIV+RD+NRFH F  G+CSC DYW
Subjt:  MNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

Arabidopsis top hits (columns: hit; e value; %identity; alignment)
AT1G71420.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 3.0e-209; %identity 52.45
Query:  GQLWQALSLFYSSR-QPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQ
        G + +A+SLFYS+  +  S+Q YA LF ACA  R L +G+ LH +M+S     S ++ + N LINMY KCG+++YA Q+F+ MP RN+VSWTALI+G  Q
Subjt:  GQLWQALSLFYSSR-QPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQ

Query:  NGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNS
         G+  E F LFS ML    PNEFT++S+LTS      E GKQVHG ALK  L  S+YVANA+I+MY +      A+     +AWT+F++I+  +L+TWNS
Subjt:  NGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNS

Query:  MIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGY
        MIA F    LG +AI +F++M+  G+GFDRAT+L+  SS+   +          C +LH   +K+  +++ E+ TAL+K Y+E+  D  D Y+LF+E  +
Subjt:  MIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGY

Query:  NRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMK
         RDIV W  I+TAF  +DP + + LF Q RQE L PD + FS VLKACAG +T +HA + H+ +IK     D VLNN+LIHAYA+CGS+    +VFD M 
Subjt:  NRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMK

Query:  HRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIAC-QLDHYACMVDILGRAGRIQEAEDFISKMPV
         RD+VSWN+M+KAY++HGQ +  L VF KM++ PDS TF++LLSACSHAG VEEG R+F S+        QL+HYAC++D+L RA R  EAE+ I +MP+
Subjt:  HRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIAC-QLDHYACMVDILGRAGRIQEAEDFISKMPV

Query:  EPDFVVWSSFLGSCRKHGATQLAKLSSNKLKEL-DPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREV
        +PD VVW + LGSCRKHG T+L KL+++KLKEL +P+NS++Y+QMSN+Y   GSF +A+L   EM+  RVRKEP LSW EI N+VHEFASGGR  P +E 
Subjt:  EPDFVVWSSFLGSCRKHGATQLAKLSSNKLKEL-DPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREV

Query:  ICNELEELVGRLKEIGYVPETSLALHDVE-QEQKEEQLYHHSEKLALVFSVM--NDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNR
        +  EL+ L+  LKE+GYVPE   A  D+E +EQ+E+ L HHSEKLAL F+VM    S+ C +   I+IMKN RIC+DCHNFMKLAS LL KEI++RDSNR
Subjt:  ICNELEELVGRLKEIGYVPETSLALHDVE-QEQKEEQLYHHSEKLALVFSVM--NDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNR

Query:  FHHFTAGLCSCNDYW
        FHHF    CSCNDYW
Subjt:  FHHFTAGLCSCNDYW

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein; e value 2.6e-120; %identity 34.75
Query:  GVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFTVASLLTSF-GEHD
        G  LH   +  D M     F  N +++ Y K G +    + F+ +P+R+ VSWT +I G    G   +   +   M+ +  +P +FT+ ++L S      
Subjt:  GVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFTVASLLTSF-GEHD

Query:  GERGKQVHGFALKTSLDASVYVANALITMYSKS--------FYKGGAFND---------------NKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQ
         E GK+VH F +K  L  +V V+N+L+ MY+K          +      D                 D A   F+ +    ++TWNSMI+GF  +    +
Subjt:  GERGKQVHGFALKTSLDASVYVANALITMYSKS--------FYKGGAFND---------------NKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQ

Query:  AIHLFIQMNRQG-IGFDRATILSTLSSVS----LC------------NWDEFGLGLSFCHELH--CQALKTA--FISE-------VEIITALVKTYAELE
        A+ +F +M R   +  DR T+ S LS+ +    LC             +D  G+ L+    ++  C  ++TA   I +       +E  TAL+  Y +L 
Subjt:  AIHLFIQMNRQG-IGFDRATILSTLSSVS----LC------------NWDEFGLGLSFCHELH--CQALKTA--FISE-------VEIITALVKTYAELE

Query:  GDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDP-GKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYA
        GD+  +  +FV    +RD+V WT+++  + +H   G+ ++LF      G  P+ +  + +L   +   +  H    H   +K      + ++NALI  YA
Subjt:  GDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDP-GKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYA

Query:  RCGSITSSEKVFDQMK-HRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYG-IACQLDHYACMVD
        + G+ITS+ + FD ++  RD VSW +M+ A A HG AE+AL++F  M    + PD  T+V + SAC+HAGLV +G + F+ + +   I   L HYACMVD
Subjt:  RCGSITSSEKVFDQMK-HRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYG-IACQLDHYACMVD

Query:  ILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEI
        + GRAG +QEA++FI KMP+EPD V W S L +CR H    L K+++ +L  L+P NS AY  ++NLY   G + +A  IR  MK  RV+KE G SW+E+
Subjt:  ILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEI

Query:  ENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKL
        +++VH F      HP++  I   ++++   +K++GYVP+T+  LHD+E+E KE+ L HHSEKLA+ F +++  +     T +RIMKN+R+C DCH  +K 
Subjt:  ENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKL

Query:  ASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
         S L+ +EI++RD+ RFHHF  G CSC DYW
Subjt:  ASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

AT4G14850.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 1.8e-121; %identity 36.98
Query:  FVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFT-------VASLLTSFGEHDGERGKQVHGFALK
        F+ N+LINMY K  H   A  +    P RN+VSWT+LISGL+QNGH     + F  M  +   PN+FT       VASL           GKQ+H  A+K
Subjt:  FVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDH-QPNEFT-------VASLLTSFGEHDGERGKQVHGFALK

Query:  TSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFG
              V+V  +   MY K+          +DDA  +F  I   +L TWN+ I+         +AI  FI+  R     +  T  + L++ S  +W    
Subjt:  TSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFG

Query:  LGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVE-HDPGKTLSLFHQFRQEGLIPDGHNFSIVLKAC
        LG+    +LH   L++ F ++V +   L+  Y + +  I  S  +F E G  ++ V W S++ A+V+ H+  K   L+ + R++ +       S VL AC
Subjt:  LGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVE-HDPGKTLSLFHQFRQEGLIPDGHNFSIVLKAC

Query:  AGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM-----NVPPDSTTFVSLL
        AG    +   + H+  +K   E  I + +AL+  Y +CG I  SE+ FD+M  ++LV+ N+++  YA  GQ + AL +F +M        P+  TFVSLL
Subjt:  AGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM-----NVPPDSTTFVSLL

Query:  SACSHAGLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQ
        SACS AG VE G ++F+S+ + YGI    +HY+C+VD+LGRAG ++ A +FI KMP++P   VW +   +CR HG  QL  L++  L +LDP +S  +V 
Subjt:  SACSHAGLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQ

Query:  MSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL
        +SN +  +G + +A+ +R E+KG  ++K  G SW+ ++NQVH F +  R H   + I   L +L   ++  GY P+  L+L+D+E+E+K  ++ HHSEKL
Subjt:  MSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKL

Query:  ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        AL F +++      +  PIRI KN+RIC DCH+F K  S  +K+EI++RD+NRFH F  G+CSC DYW
Subjt:  ALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 2.2e-119; %identity 34.29
Query:  LFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRML-VDHQPNEFTVASLLTSFGE--HDGERGKQVHGFALKTSLD
        L V+N LINMYCK     +A  +F++M  R+L+SW ++I+G++QNG   E   LF ++L    +P+++T+ S+L +           KQVH  A+K +  
Subjt:  LFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRML-VDHQPNEFTVASLLTSFGE--HDGERGKQVHGFALKTSLD

Query:  ASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLS
        +  +V+ ALI  YS+        N    +A  +F+   N  L+ WN+M+AG+     G++ + LF  M++QG   D  T+ +   +        F   ++
Subjt:  ASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLS

Query:  FCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEH-DPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFL
           ++H  A+K+ +  ++ + + ++  Y +  GD++ +   F       D V WT++++  +E+ +  +   +F Q R  G++PD    + + KA +   
Subjt:  FCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEH-DPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFL

Query:  TEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHA
          +     H+  +K    ND  +  +L+  YA+CGSI  +  +F +++  ++ +WN M+   A HG+ ++ LQ+F +M    + PD  TF+ +LSACSH+
Subjt:  TEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKM---NVPPDSTTFVSLLSACSHA

Query:  GLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYC
        GLV E  +   S+  +YGI  +++HY+C+ D LGRAG +++AE+ I  M +E    ++ + L +CR  G T+  K  + KL EL+P +S AYV +SN+Y 
Subjt:  GLVEEGTRLFNSI-ANYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYC

Query:  FSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSV
         +  + +  L RT MKG +V+K+PG SW+E++N++H F    R + Q E+I  ++++++  +K+ GYVPET   L DVE+E+KE  LY+HSEKLA+ F +
Subjt:  FSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSV

Query:  MNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        ++        TPIR++KN+R+C DCHN MK  + +  +EIV+RD+NRFH F  G+CSC DYW
Subjt:  MNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 1.5e-128; %identity 36.77
Query:  LQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRM-LVDHQPNEFTVASLLTSFGE
        L++G  +H ++++   +D F + + N L+NMY KCG +  A ++F  M  ++ VSW ++I+GL QNG   E    +  M   D  P  FT+ S L+S   
Subjt:  LQEGVGLHRYMMSRDPMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRM-LVDHQPNEFTVASLLTSFGE

Query:  HD-GERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFC-FQKLGNQAIHLFIQMNRQGIGFDRAT
            + G+Q+HG +LK  +D +V V+NAL+T+Y+++ Y         ++   +F S+     ++WNS+I      ++   +A+  F+   R G   +R T
Subjt:  HD-GERGKQVHGFALKTSLDASVYVANALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFC-FQKLGNQAIHLFIQMNRQGIGFDRAT

Query:  ILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHD-PGKTLSLFHQFRQ
          S LS+VS  ++ E G       ++H  ALK     E     AL+  Y +  G++    ++F      RD V W S+++ ++ ++   K L L     Q
Subjt:  ILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISEVEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHD-PGKTLSLFHQFRQ

Query:  EGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMN
         G   D   ++ VL A A   T +     H+  ++   E+D+V+ +AL+  Y++CG +  + + F+ M  R+  SWN+M+  YA HGQ E+AL++F  M 
Subjt:  EGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALIHAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMN

Query:  V----PPDSTTFVSLLSACSHAGLVEEGTRLFNSIA-NYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGS-CRKHG-ATQLAKL
        +    PPD  TFV +LSACSHAGL+EEG + F S++ +YG+A +++H++CM D+LGRAG + + EDFI KMP++P+ ++W + LG+ CR +G   +L K 
Subjt:  V----PPDSTTFVSLLSACSHAGLVEEGTRLFNSIA-NYGIACQLDHYACMVDILGRAGRIQEAEDFISKMPVEPDFVVWSSFLGS-CRKHG-ATQLAKL

Query:  SSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALH
        ++  L +L+P N++ YV + N+Y   G + D    R +MK + V+KE G SWV +++ VH F +G + HP  +VI  +L+EL  ++++ GYVP+T  AL+
Subjt:  SSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVICNELEELVGRLKEIGYVPETSLALH

Query:  DVEQEQKEEQLYHHSEKLALVF--SVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
        D+EQE KEE L +HSEKLA+ F  +    S L     PIRIMKN+R+C DCH+  K  S +  ++I++RDSNRFHHF  G CSC+D+W
Subjt:  DVEQEQKEEQLYHHSEKLALVF--SVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW


Sequences
CDS sequence
ATGAAAGTCACGATTCATTGTCCATTTCGTGCCAAACGGAATTTGGTTTCGTACCCAAGTAAGCATGCTTTTGGTTCCCACCTTAGATACTGGCGTTCGGCTGCAGAAGG
CGATATTGTGCCTTTTAGGACAGAAGATATTGATAATGACTATCTATTGGATACACACAGGATCTCCACGCACGGCCAGCTTTGGCAGGCTCTTTCACTGTTCTATTCCT
CTAGACAGCCTCATTCCCGCCAGACCTATGCCCATCTCTTCCATGCTTGTGCACGCCTCCGCTGCCTACAAGAAGGCGTTGGACTGCACCGTTACATGATGTCACGGGAT
CCTATGGACTCATTTGATCTCTTTGTTACCAATCATCTTATTAACATGTACTGTAAATGTGGCCACTTAATCTATGCCTACCAATTATTTAATGACATGCCAAGGAGAAA
CCTTGTTTCGTGGACTGCACTTATCTCGGGACTTTCTCAGAATGGCCATGTCGATGAGTGCTTCCTTCTATTTTCGAGAATGTTGGTAGATCACCAGCCAAATGAGTTTA
CAGTTGCAAGTTTGCTTACCTCATTTGGTGAGCATGACGGTGAACGTGGCAAACAGGTACATGGGTTTGCCTTGAAAACGTCTTTAGATGCCTCTGTTTATGTTGCAAAT
GCTCTTATTACCATGTATAGCAAGAGTTTCTATAAAGGTGGTGCTTTTAATGATAATAAAGATGATGCTTGGACTATGTTCAAGAGCATAGAAAATCCCAGCCTTATAAC
ATGGAATTCAATGATTGCAGGGTTTTGTTTCCAGAAACTCGGAAACCAAGCTATCCATTTATTTATACAAATGAATCGTCAAGGAATTGGGTTTGATCGTGCAACAATTT
TAAGTACTTTGTCTTCCGTAAGTCTCTGCAATTGGGATGAATTTGGCCTGGGTCTGAGCTTTTGTCATGAGTTACACTGTCAAGCATTAAAAACTGCTTTCATCTCAGAA
GTTGAAATAATTACTGCGTTAGTGAAAACTTATGCAGAACTAGAAGGGGACATCGCGGATAGTTATAGGCTTTTTGTTGAAGCAGGATATAATCGGGATATAGTTTTATG
GACTAGCATTATGACAGCTTTTGTAGAACATGACCCTGGGAAAACCCTTTCCCTTTTTCATCAGTTCCGACAAGAAGGCTTAATTCCAGATGGACACAATTTTTCAATTG
TATTAAAGGCTTGTGCTGGATTTTTGACCGAGAAGCATGCCTCAACATATCATTCACTGCTAATTAAATATATGTCTGAGAATGACATTGTCCTTAACAATGCCTTGATT
CATGCTTATGCGAGGTGTGGTTCAATTACTTCCTCTGAGAAAGTATTTGATCAAATGAAACATCGAGATTTGGTTTCTTGGAACACAATGATGAAGGCCTATGCTGTCCA
TGGGCAAGCTGAGAAAGCTTTGCAGGTTTTTTCAAAGATGAATGTTCCACCTGATTCTACTACATTTGTCTCTCTCCTTTCAGCCTGTAGCCATGCTGGGCTCGTGGAAG
AAGGGACCAGACTTTTCAATTCAATAGCAAATTATGGTATTGCTTGTCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGAGCTGGTCGGATTCAAGAGGCT
GAAGATTTTATAAGTAAAATGCCTGTGGAACCTGATTTTGTTGTTTGGAGTTCATTCCTCGGATCATGTAGAAAGCATGGTGCAACGCAATTGGCCAAATTATCATCTAA
TAAATTGAAAGAGTTAGATCCTAGCAATTCTCTGGCTTATGTGCAAATGTCAAATCTATATTGCTTCAGTGGTAGCTTTTATGATGCAGACTTAATTAGGACGGAAATGA
AAGGGTCTAGAGTGAGAAAGGAACCTGGATTAAGTTGGGTAGAAATAGAAAATCAAGTGCATGAGTTTGCATCTGGAGGTCGCCGTCATCCACAGAGAGAGGTAATATGC
AATGAGCTTGAAGAACTCGTTGGGAGGTTAAAGGAGATTGGTTATGTGCCTGAGACAAGCTTAGCATTGCATGACGTGGAGCAAGAGCAAAAGGAGGAGCAACTATATCA
TCATAGCGAGAAGCTGGCTTTGGTTTTCTCTGTAATGAATGATAGTAACTTGTGTCGCATTGGTACTCCTATAAGGATTATGAAAAACATCCGAATTTGTGTGGATTGTC
ATAATTTCATGAAGTTAGCTTCAACGCTACTTAAGAAGGAGATTGTCATTAGAGACTCTAATCGTTTTCATCATTTCACAGCAGGTTTATGCTCTTGCAATGATTATTGG
TAA
mRNA sequence
AGCCAGTCCATCACTGTCTCCCTCAGCGCTCGCAAATCAATGCCATCTACTCTGCAGAATATCTGTTGAAGGTAGATAGGATCATTGATTTCATGAAAGTCACGATTCAT
TGTCCATTTCGTGCCAAACGGAATTTGGTTTCGTACCCAAGTAAGCATGCTTTTGGTTCCCACCTTAGATACTGGCGTTCGGCTGCAGAAGGCGATATTGTGCCTTTTAG
GACAGAAGATATTGATAATGACTATCTATTGGATACACACAGGATCTCCACGCACGGCCAGCTTTGGCAGGCTCTTTCACTGTTCTATTCCTCTAGACAGCCTCATTCCC
GCCAGACCTATGCCCATCTCTTCCATGCTTGTGCACGCCTCCGCTGCCTACAAGAAGGCGTTGGACTGCACCGTTACATGATGTCACGGGATCCTATGGACTCATTTGAT
CTCTTTGTTACCAATCATCTTATTAACATGTACTGTAAATGTGGCCACTTAATCTATGCCTACCAATTATTTAATGACATGCCAAGGAGAAACCTTGTTTCGTGGACTGC
ACTTATCTCGGGACTTTCTCAGAATGGCCATGTCGATGAGTGCTTCCTTCTATTTTCGAGAATGTTGGTAGATCACCAGCCAAATGAGTTTACAGTTGCAAGTTTGCTTA
CCTCATTTGGTGAGCATGACGGTGAACGTGGCAAACAGGTACATGGGTTTGCCTTGAAAACGTCTTTAGATGCCTCTGTTTATGTTGCAAATGCTCTTATTACCATGTAT
AGCAAGAGTTTCTATAAAGGTGGTGCTTTTAATGATAATAAAGATGATGCTTGGACTATGTTCAAGAGCATAGAAAATCCCAGCCTTATAACATGGAATTCAATGATTGC
AGGGTTTTGTTTCCAGAAACTCGGAAACCAAGCTATCCATTTATTTATACAAATGAATCGTCAAGGAATTGGGTTTGATCGTGCAACAATTTTAAGTACTTTGTCTTCCG
TAAGTCTCTGCAATTGGGATGAATTTGGCCTGGGTCTGAGCTTTTGTCATGAGTTACACTGTCAAGCATTAAAAACTGCTTTCATCTCAGAAGTTGAAATAATTACTGCG
TTAGTGAAAACTTATGCAGAACTAGAAGGGGACATCGCGGATAGTTATAGGCTTTTTGTTGAAGCAGGATATAATCGGGATATAGTTTTATGGACTAGCATTATGACAGC
TTTTGTAGAACATGACCCTGGGAAAACCCTTTCCCTTTTTCATCAGTTCCGACAAGAAGGCTTAATTCCAGATGGACACAATTTTTCAATTGTATTAAAGGCTTGTGCTG
GATTTTTGACCGAGAAGCATGCCTCAACATATCATTCACTGCTAATTAAATATATGTCTGAGAATGACATTGTCCTTAACAATGCCTTGATTCATGCTTATGCGAGGTGT
GGTTCAATTACTTCCTCTGAGAAAGTATTTGATCAAATGAAACATCGAGATTTGGTTTCTTGGAACACAATGATGAAGGCCTATGCTGTCCATGGGCAAGCTGAGAAAGC
TTTGCAGGTTTTTTCAAAGATGAATGTTCCACCTGATTCTACTACATTTGTCTCTCTCCTTTCAGCCTGTAGCCATGCTGGGCTCGTGGAAGAAGGGACCAGACTTTTCA
ATTCAATAGCAAATTATGGTATTGCTTGTCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGAGCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAA
ATGCCTGTGGAACCTGATTTTGTTGTTTGGAGTTCATTCCTCGGATCATGTAGAAAGCATGGTGCAACGCAATTGGCCAAATTATCATCTAATAAATTGAAAGAGTTAGA
TCCTAGCAATTCTCTGGCTTATGTGCAAATGTCAAATCTATATTGCTTCAGTGGTAGCTTTTATGATGCAGACTTAATTAGGACGGAAATGAAAGGGTCTAGAGTGAGAA
AGGAACCTGGATTAAGTTGGGTAGAAATAGAAAATCAAGTGCATGAGTTTGCATCTGGAGGTCGCCGTCATCCACAGAGAGAGGTAATATGCAATGAGCTTGAAGAACTC
GTTGGGAGGTTAAAGGAGATTGGTTATGTGCCTGAGACAAGCTTAGCATTGCATGACGTGGAGCAAGAGCAAAAGGAGGAGCAACTATATCATCATAGCGAGAAGCTGGC
TTTGGTTTTCTCTGTAATGAATGATAGTAACTTGTGTCGCATTGGTACTCCTATAAGGATTATGAAAAACATCCGAATTTGTGTGGATTGTCATAATTTCATGAAGTTAG
CTTCAACGCTACTTAAGAAGGAGATTGTCATTAGAGACTCTAATCGTTTTCATCATTTCACAGCAGGTTTATGCTCTTGCAATGATTATTGGTAATTAATTGGCTTCAAA
CTTTCAAATACCTAAGGTTCACCTGCATTTACCAGTAGCTTGTAACCTCCAATATGAATAGAGACAAGTGATGATCTAGAATATTGACACTATATATCAATAAACTCTCA
GTTCCTTTCACAACTGAGGACATTTTAGGAGGTGAAGATGTGGTTAGTCCAAACACATCCCAGACAAGATGGATACAGGAAAGATCAATTCCAGACTCCACGAAGTTGCT
TTTACAAATTTGCAAACATGAAGCTTCTCCATTTGAGCGAGTGTAAATTACTACCAATCATTATACAACCTCCAATTCATTTGGATATCCTAATCCTTGTCTTTATATTA
ACTAACTAGGTTCATCCGAACTTCCTGTTTTGTCTTTCAGCAGATTTCCTCTAGTTTGGTCATAAA
Protein sequence
MKVTIHCPFRAKRNLVSYPSKHAFGSHLRYWRSAAEGDIVPFRTEDIDNDYLLDTHRISTHGQLWQALSLFYSSRQPHSRQTYAHLFHACARLRCLQEGVGLHRYMMSRD
PMDSFDLFVTNHLINMYCKCGHLIYAYQLFNDMPRRNLVSWTALISGLSQNGHVDECFLLFSRMLVDHQPNEFTVASLLTSFGEHDGERGKQVHGFALKTSLDASVYVAN
ALITMYSKSFYKGGAFNDNKDDAWTMFKSIENPSLITWNSMIAGFCFQKLGNQAIHLFIQMNRQGIGFDRATILSTLSSVSLCNWDEFGLGLSFCHELHCQALKTAFISE
VEIITALVKTYAELEGDIADSYRLFVEAGYNRDIVLWTSIMTAFVEHDPGKTLSLFHQFRQEGLIPDGHNFSIVLKACAGFLTEKHASTYHSLLIKYMSENDIVLNNALI
HAYARCGSITSSEKVFDQMKHRDLVSWNTMMKAYAVHGQAEKALQVFSKMNVPPDSTTFVSLLSACSHAGLVEEGTRLFNSIANYGIACQLDHYACMVDILGRAGRIQEA
EDFISKMPVEPDFVVWSSFLGSCRKHGATQLAKLSSNKLKELDPSNSLAYVQMSNLYCFSGSFYDADLIRTEMKGSRVRKEPGLSWVEIENQVHEFASGGRRHPQREVIC
NELEELVGRLKEIGYVPETSLALHDVEQEQKEEQLYHHSEKLALVFSVMNDSNLCRIGTPIRIMKNIRICVDCHNFMKLASTLLKKEIVIRDSNRFHHFTAGLCSCNDYW
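
The CDS and protein sequences listed above should correspond codon by codon. The following is a minimal sketch, not part of the database record, of how that correspondence could be checked. It assumes the standard genetic code (NCBI translation table 1); the function name `translate` and the constants are hypothetical names introduced for illustration. Because the CDS is wrapped at a line width that is not a multiple of three, the wrapped lines are joined before reading codons.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a wrapped sequence
    can be pasted in directly. Codons containing unexpected characters
    are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon: end of the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGAAAGTCACGATTCAT"))
# Last three codons plus the stop codon of the CDS listed above.
print(translate("GATTATTGGTAA"))
```

For the first six codons of the CDS this gives MKVTIH, and for the final codons it gives DYW, matching the start and end of the protein sequence shown in this record.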