; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020735 (gene) of Snake gourd v1 genome

Gene IDTan0020735
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG09:74085664..74103041
RNA-Seq ExpressionTan0020735
SyntenyTan0020735
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135134.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Momordica charantia]7.1e-25485.91Show/hide
Query:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV
        YQTLY VLEACR S NSKTAIETHARIIKFGYG+YPTLITSLVSTYQR GC N V++LL LLCSKHLDLVAMN+ I+NFMKIGE KFAKKVFYKMPYRDV
Subjt:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV

Query:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD
        +TWNSIIGGCVKNA Y+EAFRFF +MLISNI  DGFTFASIL A AQLGALSN Q VHA+MT +K+ELNSIL+SALI  YSKCGSIQIAKEIFSSVPHSD
Subjt:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD

Query:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI
        ISVWNAMIKGLAIHGLAMDALLVFSMMER++V PDA+TFLG LTACNHGGL EQG  YFDWM+SRYSI+PQLEHYGVMVDLYSRAGFLEEAYS I AMPI
Subjt:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI

Query:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV
        EPDVVTWRTLLSGC+IYRNQELAEVAIAN+   KSGDYVLLSNIYCS +RW ++ETVREMMK+K V K CGKSWIELAG+I  F+SGDRSHPEI+AV++V
Subjt:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV

Query:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
        LGSL+KRTRSEGYMP TE VFMDISEEEKEENLSFHSEKLALAYAI+KTSPG KI ISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
Subjt:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC

Query:  GDCW
        GDCW
Subjt:  GDCW

XP_022135141.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Momordica charantia]7.1e-25485.91Show/hide
Query:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV
        YQTLY VLEACR S NSKTAIETHARIIKFGYG+YPTLITSLVSTYQR GC N V++LL LLCSKHLDLVAMN+ I+NFMKIGE KFAKKVFYKMPYRDV
Subjt:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV

Query:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD
        +TWNSIIGGCVKNA Y+EAFRFF +MLISNI  DGFTFASIL A AQLGALSN Q VHA+MT +K+ELNSIL+SALI  YSKCGSIQIAKEIFSSVPHSD
Subjt:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD

Query:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI
        ISVWNAMIKGLAIHGLAMDALLVFSMMER++V PDA+TFLG LTACNHGGL EQG  YFDWM+SRYSI+PQLEHYGVMVDLYSRAGFLEEAYS I AMPI
Subjt:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI

Query:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV
        EPDVVTWRTLLSGC+IYRNQELAEVAIAN+   KSGDYVLLSNIYCS +RW ++ETVREMMK+K V K CGKSWIELAG+I  F+SGDRSHPEI+AV++V
Subjt:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV

Query:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
        LGSL+KRTRSEGYMP TE VFMDISEEEKEENLSFHSEKLALAYAI+KTSPG KI ISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
Subjt:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC

Query:  GDCW
        GDCW
Subjt:  GDCW

XP_022988253.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita maxima]2.7e-25385.35Show/hide
Query:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD
        YQTLYRVLEACR+S  NSKTA ETHAR+IKFGYGNYPTL+TSLVS YQR  C NRVHQLL+LLCSKHLDLVAMNL I NFMKIGE K AK+VF KMPYRD
Subjt:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD

Query:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS
        VVTWNSIIGGCVKNARY+EAF+FFRQML+SNI  DGFTFAS+LNACAQLGA SNTQWVHALMT++KI+LNSILS ALIDAYSKCGSIQIAKEIFSSVP S
Subjt:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDA+TFLGILTACNHGGL EQG  +FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII+AMP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK
        IE DVVTWR LLSGCRIYRNQELAEVAIANM HR SGDYVLLSNIYCSLNRW H+E VRE MK+  V K+CGKSWIEL G+IQ+FRSGDRSHPE DAV+K
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK

Query:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        V+  LMKR+RSEGYMP T+LV MDISEEEKEENLS+HSEKLALAYAI+KTSPGAKI ISKNLR+CDDCHRWIK+VS +LCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

XP_022988254.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucurbita maxima]2.7e-25385.35Show/hide
Query:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD
        YQTLYRVLEACR+S  NSKTA ETHAR+IKFGYGNYPTL+TSLVS YQR  C NRVHQLL+LLCSKHLDLVAMNL I NFMKIGE K AK+VF KMPYRD
Subjt:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD

Query:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS
        VVTWNSIIGGCVKNARY+EAF+FFRQML+SNI  DGFTFAS+LNACAQLGA SNTQWVHALMT++KI+LNSILS ALIDAYSKCGSIQIAKEIFSSVP S
Subjt:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDA+TFLGILTACNHGGL EQG  +FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII+AMP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK
        IE DVVTWR LLSGCRIYRNQELAEVAIANM HR SGDYVLLSNIYCSLNRW H+E VRE MK+  V K+CGKSWIEL G+IQ+FRSGDRSHPE DAV+K
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK

Query:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        V+  LMKR+RSEGYMP T+LV MDISEEEKEENLS+HSEKLALAYAI+KTSPGAKI ISKNLR+CDDCHRWIK+VS +LCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

XP_038879432.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Benincasa hispida]1.5e-25686.14Show/hide
Query:  YQTLYRVLEACR-ISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD
        YQTL+RVLEACR + L+SKT IETHARIIKFGYG+YP LITSLVSTYQR GC NRVHQLLDLLCSKHLDLV MNLLI+NF K+GE KFA +VFYKMPYRD
Subjt:  YQTLYRVLEACR-ISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD

Query:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS
        VVTWNSIIGGCVKNA YDEAFRFFRQML SNI  DGFTFAS+LNACAQLG  SNTQWVHALMT++KIELNSILS ALIDAYSKCGSIQIAKE+FS VPHS
Subjt:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP
        D+SVWNAMIKGLAIHGLA DAL +F MMERENVLPDA+TFLGILTACNHGGL +QG  YF+WM+SRYSIQPQLEHYGV+VDLYSRAGFLEEAYS+I+AMP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK
        IEPDVVTWRTLLSGCRIYRNQELAEVAI NM HRKSGDYVLLSNIYCSLN+W H+ TVR+MMK   V K CGKSWIEL G IQNF+SGDRSHPE DAV++
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK

Query:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        VL SLMKRTRSEGYMP TELVFMDISEEEKEENLSFHSEK+ALAYAI+KTSPGAKI ISKNLRICDDCH WIKLVSRVLCR IVVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

TrEMBL top hitse value%identityAlignment
A0A6J1BZT0 pentatricopeptide repeat-containing protein At5g50990 isoform X23.4e-25485.91Show/hide
Query:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV
        YQTLY VLEACR S NSKTAIETHARIIKFGYG+YPTLITSLVSTYQR GC N V++LL LLCSKHLDLVAMN+ I+NFMKIGE KFAKKVFYKMPYRDV
Subjt:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV

Query:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD
        +TWNSIIGGCVKNA Y+EAFRFF +MLISNI  DGFTFASIL A AQLGALSN Q VHA+MT +K+ELNSIL+SALI  YSKCGSIQIAKEIFSSVPHSD
Subjt:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD

Query:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI
        ISVWNAMIKGLAIHGLAMDALLVFSMMER++V PDA+TFLG LTACNHGGL EQG  YFDWM+SRYSI+PQLEHYGVMVDLYSRAGFLEEAYS I AMPI
Subjt:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI

Query:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV
        EPDVVTWRTLLSGC+IYRNQELAEVAIAN+   KSGDYVLLSNIYCS +RW ++ETVREMMK+K V K CGKSWIELAG+I  F+SGDRSHPEI+AV++V
Subjt:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV

Query:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
        LGSL+KRTRSEGYMP TE VFMDISEEEKEENLSFHSEKLALAYAI+KTSPG KI ISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
Subjt:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC

Query:  GDCW
        GDCW
Subjt:  GDCW

A0A6J1C1T7 pentatricopeptide repeat-containing protein At5g50990 isoform X13.4e-25485.91Show/hide
Query:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV
        YQTLY VLEACR S NSKTAIETHARIIKFGYG+YPTLITSLVSTYQR GC N V++LL LLCSKHLDLVAMN+ I+NFMKIGE KFAKKVFYKMPYRDV
Subjt:  YQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDV

Query:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD
        +TWNSIIGGCVKNA Y+EAFRFF +MLISNI  DGFTFASIL A AQLGALSN Q VHA+MT +K+ELNSIL+SALI  YSKCGSIQIAKEIFSSVPHSD
Subjt:  VTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSD

Query:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI
        ISVWNAMIKGLAIHGLAMDALLVFSMMER++V PDA+TFLG LTACNHGGL EQG  YFDWM+SRYSI+PQLEHYGVMVDLYSRAGFLEEAYS I AMPI
Subjt:  ISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPI

Query:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV
        EPDVVTWRTLLSGC+IYRNQELAEVAIAN+   KSGDYVLLSNIYCS +RW ++ETVREMMK+K V K CGKSWIELAG+I  F+SGDRSHPEI+AV++V
Subjt:  EPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKV

Query:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
        LGSL+KRTRSEGYMP TE VFMDISEEEKEENLSFHSEKLALAYAI+KTSPG KI ISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC
Subjt:  LGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSC

Query:  GDCW
        GDCW
Subjt:  GDCW

A0A6J1E1Y0 pentatricopeptide repeat-containing protein At5g50990 isoform X11.1e-25285.35Show/hide
Query:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD
        YQTLYRVLEACR+S  NSKTA ETHAR+IKFGYGNYPTL+TSLVS YQR  C NRVHQLL+LLCSKHLDLVAMNL I NFMKIGE K AK+VF KMPYRD
Subjt:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD

Query:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS
        VVTWNSIIGGCVKNARY+EAF+FFRQML SNI  DGFTFAS+LNACAQLGA SNTQWVHALMT++KIELNSILS ALIDAYSKCGSIQIAKEIFSSVP S
Subjt:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDA+TFLGILTACNHGGL EQG  +FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII+AMP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK
        IE DVVTWR LLSGCRIYRNQELAEVAIANM HR SGDYVLLSNIYCSLNRW H+E VRE MK+  V K+CGKSWIEL G+IQ+F+SGDRSHPE DAV+K
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK

Query:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        V+  LMKR+RSEGYMP T+LV MDISEEEKEENLS+HSEKLALAYAI+KT PGAKI ISKNLR+CDDCHRWIKLVS +LCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

A0A6J1JL20 pentatricopeptide repeat-containing protein At5g50990 isoform X21.3e-25385.35Show/hide
Query:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD
        YQTLYRVLEACR+S  NSKTA ETHAR+IKFGYGNYPTL+TSLVS YQR  C NRVHQLL+LLCSKHLDLVAMNL I NFMKIGE K AK+VF KMPYRD
Subjt:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD

Query:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS
        VVTWNSIIGGCVKNARY+EAF+FFRQML+SNI  DGFTFAS+LNACAQLGA SNTQWVHALMT++KI+LNSILS ALIDAYSKCGSIQIAKEIFSSVP S
Subjt:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDA+TFLGILTACNHGGL EQG  +FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII+AMP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK
        IE DVVTWR LLSGCRIYRNQELAEVAIANM HR SGDYVLLSNIYCSLNRW H+E VRE MK+  V K+CGKSWIEL G+IQ+FRSGDRSHPE DAV+K
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK

Query:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        V+  LMKR+RSEGYMP T+LV MDISEEEKEENLS+HSEKLALAYAI+KTSPGAKI ISKNLR+CDDCHRWIK+VS +LCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

A0A6J1JLQ2 pentatricopeptide repeat-containing protein At5g50990 isoform X11.3e-25385.35Show/hide
Query:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD
        YQTLYRVLEACR+S  NSKTA ETHAR+IKFGYGNYPTL+TSLVS YQR  C NRVHQLL+LLCSKHLDLVAMNL I NFMKIGE K AK+VF KMPYRD
Subjt:  YQTLYRVLEACRIS-LNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRD

Query:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS
        VVTWNSIIGGCVKNARY+EAF+FFRQML+SNI  DGFTFAS+LNACAQLGA SNTQWVHALMT++KI+LNSILS ALIDAYSKCGSIQIAKEIFSSVP S
Subjt:  VVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHS

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDA+TFLGILTACNHGGL EQG  +FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII+AMP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK
        IE DVVTWR LLSGCRIYRNQELAEVAIANM HR SGDYVLLSNIYCSLNRW H+E VRE MK+  V K+CGKSWIEL G+IQ+FRSGDRSHPE DAV+K
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFK

Query:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        V+  LMKR+RSEGYMP T+LV MDISEEEKEENLS+HSEKLALAYAI+KTSPGAKI ISKNLR+CDDCHRWIK+VS +LCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210652.6e-10541.56Show/hide
Query:  YPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMD
        YP LI + V+T   V     +H ++       L +   N L+  +   G+   A KVF KMP +D+V WNS+I G  +N + +EA   + +M    I  D
Subjt:  YPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMD

Query:  GFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDISVWNAMIKGLAIHGLAMDALLVFSMME-RENVL
        GFT  S+L+ACA++GAL+  + VH  M +  +  N   S+ L+D Y++CG ++ AK +F  +   +   W ++I GLA++G   +A+ +F  ME  E +L
Subjt:  GFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDISVWNAMIKGLAIHGLAMDALLVFSMME-RENVL

Query:  PDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANM
        P  ITF+GIL AC+H G+ ++G  YF  M+  Y I+P++EH+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ + +LAE A   I  +
Subjt:  PDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANM

Query:  PHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEGYMPATELVFMDISEEEKE
            SGDYVLLSN+Y S  RW   + +R+ M    V K  G S +E+   +  F  GD+SHP+ DA++  L  +  R RSEGY+P    V++D+ EEEKE
Subjt:  PHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEGYMPATELVFMDISEEEKE

Query:  ENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW
          + +HSEK+A+A+ ++ T   + I + KNLR+C DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  ENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW

Q683I9 Pentatricopeptide repeat-containing protein At3g628907.2e-10840.85Show/hide
Query:  THARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRF
        THA+I+ FG    P + TSL++ Y   G      ++ D   SK  DL A N ++  + K G    A+K+F +MP R+V++W+ +I G V   +Y EA   
Subjt:  THARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRF

Query:  FRQMLISN-----IHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSV-PHSDISVWNAMIKGLAIHGL
        FR+M +       +  + FT +++L+AC +LGAL   +WVHA + +  +E++ +L +ALID Y+KCGS++ AK +F+++    D+  ++AMI  LA++GL
Subjt:  FRQMLISN-----IHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSV-PHSDISVWNAMIKGLAIHGL

Query:  AMDALLVFS-MMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCR
          +   +FS M   +N+ P+++TF+GIL AC H GL  +G +YF  M   + I P ++HYG MVDLY R+G ++EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALLVFS-MMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCR

Query:  IYRNQELAEVA---IANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEG
        +  + +  E A   +  +    SG YVLLSN+Y    RW   + +R  M+ K ++K  G S++E+ G +  F  GD S  E + ++ +L  +M+R R  G
Subjt:  IYRNQELAEVA---IANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEG

Query:  YMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW
        Y+  T+ V +D++E++KE  LS+HSEKLA+A+ +MKT PG  +RI KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW

Q9FI49 Pentatricopeptide repeat-containing protein At5g509902.8e-16054.38Show/hide
Query:  LYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTW
        L +VLE+C+   NSK  ++ HA+I K GYG YP+L+ S V+ Y+R        +LL    S    +  +NL+I++ MKIGES  AKKV      ++V+TW
Subjt:  LYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTW

Query:  NSIIGGCVKNARYDEAFRFFRQML-ISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS
        N +IGG V+N +Y+EA +  + ML  ++I  + F+FAS L ACA+LG L + +WVH+LM    IELN+ILSSAL+D Y+KCG I  ++E+F SV  +D+S
Subjt:  NSIIGGCVKNARYDEAFRFFRQML-ISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP
        +WNAMI G A HGLA +A+ VFS ME E+V PD+ITFLG+LT C+H GL E+G  YF  M  R+SIQP+LEHYG MVDL  RAG ++EAY +I +MPIEP
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP

Query:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLG
        DVV WR+LLS  R Y+N EL E+AI N+   KSGDYVLLSNIY S  +W  ++ VRE+M  + + K  GKSW+E  G I  F++GD SH E  A++KVL 
Subjt:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLG

Query:  SLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGD
         L+++T+S+G++  T+LV MD+SEEEKEENL++HSEKLALAY I+K+SPG +IRI KN+R+C DCH WIK VS++L RVI++RDRIRFH+FE G+CSC D
Subjt:  SLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGD

Query:  CW
         W
Subjt:  CW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665204.0e-10638.26Show/hide
Query:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT
        T   +L+AC      +   + HA+I K GY N    + SL+++Y   G F   H L D +     D V+ N +I+ ++K G+   A  +F KM  ++ ++
Subjt:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT

Query:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS
        W ++I G V+     EA + F +M  S++  D  + A+ L+ACAQLGAL   +W+H+ + + +I ++S+L   LID Y+KCG ++ A E+F ++    + 
Subjt:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP
         W A+I G A HG   +A+  F  M++  + P+ ITF  +LTAC++ GL E+G   F  M+  Y+++P +EHYG +VDL  RAG L+EA   I  MP++P
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP

Query:  DVVTWRTLLSGCRIYRN----QELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVF
        + V W  LL  CRI++N    +E+ E+ IA  P+   G YV  +NI+    +W  +   R +MK + V K  G S I L G    F +GDRSHPEI+ + 
Subjt:  DVVTWRTLLSGCRIYRN----QELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVF

Query:  KVLGSLMKRTRSEGYMPATELVFMD-ISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGM
             + ++    GY+P  E + +D + ++E+E  +  HSEKLA+ Y ++KT PG  IRI KNLR+C DCH+  KL+S++  R IV+RDR RFH F  G 
Subjt:  KVLGSLMKRTRSEGYMPATELVFMD-ISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGM

Query:  CSCGDCW
        CSCGD W
Subjt:  CSCGDCW

Q9FND7 Putative pentatricopeptide repeat-containing protein At5g404052.1e-10738.78Show/hide
Query:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT
        T+  +++AC      +T ++ H   I+ G+ N P + T L+S Y  +GC +  H++ + +     D V    ++    + G+  FA+K+F  MP RD + 
Subjt:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT

Query:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS
        WN++I G  +     EA   F  M +  + ++G    S+L+AC QLGAL   +W H+ + R KI++   L++ L+D Y+KCG ++ A E+F  +   ++ 
Subjt:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP
         W++ + GLA++G     L +FS+M+++ V P+A+TF+ +L  C+  G  ++G  +FD M++ + I+PQLEHYG +VDLY+RAG LE+A SII  MP++P
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP

Query:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKS---GDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHP---EIDA
            W +LL   R+Y+N EL  +A   M   ++   G YVLLSNIY   N W +   VR+ MK+K V K  G S +E+ G +  F  GD+SHP   +IDA
Subjt:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKS---GDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHP---EIDA

Query:  VFKVLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGG
        V+K    + +R R  GY   T  V  DI EEEKE+ L  HSEK A+A+ IM       IRI KNLR+C DCH+   ++S++  R I+VRDR RFH F+ G
Subjt:  VFKVLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGG

Query:  MCSCGDCW
         CSC   W
Subjt:  MCSCGDCW

Arabidopsis top hitse value%identityAlignment
AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein5.1e-10940.85Show/hide
Query:  THARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRF
        THA+I+ FG    P + TSL++ Y   G      ++ D   SK  DL A N ++  + K G    A+K+F +MP R+V++W+ +I G V   +Y EA   
Subjt:  THARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRF

Query:  FRQMLISN-----IHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSV-PHSDISVWNAMIKGLAIHGL
        FR+M +       +  + FT +++L+AC +LGAL   +WVHA + +  +E++ +L +ALID Y+KCGS++ AK +F+++    D+  ++AMI  LA++GL
Subjt:  FRQMLISN-----IHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSV-PHSDISVWNAMIKGLAIHGL

Query:  AMDALLVFS-MMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCR
          +   +FS M   +N+ P+++TF+GIL AC H GL  +G +YF  M   + I P ++HYG MVDLY R+G ++EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALLVFS-MMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCR

Query:  IYRNQELAEVA---IANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEG
        +  + +  E A   +  +    SG YVLLSN+Y    RW   + +R  M+ K ++K  G S++E+ G +  F  GD S  E + ++ +L  +M+R R  G
Subjt:  IYRNQELAEVA---IANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEG

Query:  YMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW
        Y+  T+ V +D++E++KE  LS+HSEKLA+A+ +MKT PG  +RI KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-10641.56Show/hide
Query:  YPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMD
        YP LI + V+T   V     +H ++       L +   N L+  +   G+   A KVF KMP +D+V WNS+I G  +N + +EA   + +M    I  D
Subjt:  YPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSIIGGCVKNARYDEAFRFFRQMLISNIHMD

Query:  GFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDISVWNAMIKGLAIHGLAMDALLVFSMME-RENVL
        GFT  S+L+ACA++GAL+  + VH  M +  +  N   S+ L+D Y++CG ++ AK +F  +   +   W ++I GLA++G   +A+ +F  ME  E +L
Subjt:  GFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDISVWNAMIKGLAIHGLAMDALLVFSMME-RENVL

Query:  PDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANM
        P  ITF+GIL AC+H G+ ++G  YF  M+  Y I+P++EH+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ + +LAE A   I  +
Subjt:  PDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANM

Query:  PHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEGYMPATELVFMDISEEEKE
            SGDYVLLSN+Y S  RW   + +R+ M    V K  G S +E+   +  F  GD+SHP+ DA++  L  +  R RSEGY+P    V++D+ EEEKE
Subjt:  PHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEGYMPATELVFMDISEEEKE

Query:  ENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW
          + +HSEK+A+A+ ++ T   + I + KNLR+C DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  ENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW

AT5G40405.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-10838.78Show/hide
Query:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT
        T+  +++AC      +T ++ H   I+ G+ N P + T L+S Y  +GC +  H++ + +     D V    ++    + G+  FA+K+F  MP RD + 
Subjt:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT

Query:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS
        WN++I G  +     EA   F  M +  + ++G    S+L+AC QLGAL   +W H+ + R KI++   L++ L+D Y+KCG ++ A E+F  +   ++ 
Subjt:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP
         W++ + GLA++G     L +FS+M+++ V P+A+TF+ +L  C+  G  ++G  +FD M++ + I+PQLEHYG +VDLY+RAG LE+A SII  MP++P
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP

Query:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKS---GDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHP---EIDA
            W +LL   R+Y+N EL  +A   M   ++   G YVLLSNIY   N W +   VR+ MK+K V K  G S +E+ G +  F  GD+SHP   +IDA
Subjt:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKS---GDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHP---EIDA

Query:  VFKVLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGG
        V+K    + +R R  GY   T  V  DI EEEKE+ L  HSEK A+A+ IM       IRI KNLR+C DCH+   ++S++  R I+VRDR RFH F+ G
Subjt:  VFKVLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGG

Query:  MCSCGDCW
         CSC   W
Subjt:  MCSCGDCW

AT5G50990.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-16154.38Show/hide
Query:  LYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTW
        L +VLE+C+   NSK  ++ HA+I K GYG YP+L+ S V+ Y+R        +LL    S    +  +NL+I++ MKIGES  AKKV      ++V+TW
Subjt:  LYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTW

Query:  NSIIGGCVKNARYDEAFRFFRQML-ISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS
        N +IGG V+N +Y+EA +  + ML  ++I  + F+FAS L ACA+LG L + +WVH+LM    IELN+ILSSAL+D Y+KCG I  ++E+F SV  +D+S
Subjt:  NSIIGGCVKNARYDEAFRFFRQML-ISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP
        +WNAMI G A HGLA +A+ VFS ME E+V PD+ITFLG+LT C+H GL E+G  YF  M  R+SIQP+LEHYG MVDL  RAG ++EAY +I +MPIEP
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP

Query:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLG
        DVV WR+LLS  R Y+N EL E+AI N+   KSGDYVLLSNIY S  +W  ++ VRE+M  + + K  GKSW+E  G I  F++GD SH E  A++KVL 
Subjt:  DVVTWRTLLSGCRIYRNQELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLG

Query:  SLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGD
         L+++T+S+G++  T+LV MD+SEEEKEENL++HSEKLALAY I+K+SPG +IRI KN+R+C DCH WIK VS++L RVI++RDRIRFH+FE G+CSC D
Subjt:  SLMKRTRSEGYMPATELVFMDISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGD

Query:  CW
         W
Subjt:  CW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-10738.26Show/hide
Query:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT
        T   +L+AC      +   + HA+I K GY N    + SL+++Y   G F   H L D +     D V+ N +I+ ++K G+   A  +F KM  ++ ++
Subjt:  TLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVT

Query:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS
        W ++I G V+     EA + F +M  S++  D  + A+ L+ACAQLGAL   +W+H+ + + +I ++S+L   LID Y+KCG ++ A E+F ++    + 
Subjt:  WNSIIGGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP
         W A+I G A HG   +A+  F  M++  + P+ ITF  +LTAC++ GL E+G   F  M+  Y+++P +EHYG +VDL  RAG L+EA   I  MP++P
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEP

Query:  DVVTWRTLLSGCRIYRN----QELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVF
        + V W  LL  CRI++N    +E+ E+ IA  P+   G YV  +NI+    +W  +   R +MK + V K  G S I L G    F +GDRSHPEI+ + 
Subjt:  DVVTWRTLLSGCRIYRN----QELAEVAIANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVF

Query:  KVLGSLMKRTRSEGYMPATELVFMD-ISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGM
             + ++    GY+P  E + +D + ++E+E  +  HSEKLA+ Y ++KT PG  IRI KNLR+C DCH+  KL+S++  R IV+RDR RFH F  G 
Subjt:  KVLGSLMKRTRSEGYMPATELVFMD-ISEEEKEENLSFHSEKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGM

Query:  CSCGDCW
        CSCGD W
Subjt:  CSCGDCW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTATTATCAAACCCTTTATCGTGTTCTTGAAGCATGCAGAATCTCACTGAATTCCAAAACTGCTATTGAAACACATGCAAGGATTATTAAGTTTGGATATGGAAA
CTACCCAACTCTCATCACCTCTCTAGTATCTACTTATCAACGTGTTGGTTGCTTTAATCGCGTCCATCAACTTCTTGATCTACTCTGCTCTAAGCATCTTGATTTAGTTG
CAATGAACTTACTTATTCAAAATTTTATGAAAATCGGGGAAAGCAAATTTGCTAAAAAGGTATTTTATAAAATGCCTTACCGTGATGTGGTAACATGGAACTCAATCATT
GGAGGTTGTGTGAAGAATGCACGCTATGACGAGGCATTTAGATTCTTTAGACAGATGCTGATTTCAAATATTCATATGGACGGATTTACATTTGCTTCCATATTGAATGC
ATGTGCACAACTCGGAGCTCTAAGTAACACTCAGTGGGTCCATGCTCTAATGACTCGGGAAAAAATTGAACTTAATTCAATATTAAGTTCTGCACTCATAGACGCATACT
CCAAGTGCGGTAGCATCCAAATTGCAAAGGAAATCTTTAGTAGCGTCCCTCATAGTGATATCTCAGTTTGGAATGCGATGATCAAAGGGCTTGCAATTCACGGACTTGCA
ATGGATGCATTATTGGTATTCTCGATGATGGAGCGTGAGAATGTTCTACCTGATGCTATCACCTTCTTGGGTATTTTAACAGCCTGCAACCATGGTGGTTTAACTGAACA
GGGTCACACATATTTTGATTGGATGAAAAGCCGTTATTCAATTCAGCCACAGCTTGAGCATTATGGAGTCATGGTTGATCTCTATAGCCGGGCTGGGTTTCTTGAAGAGG
CCTATTCCATAATCATGGCAATGCCAATAGAGCCCGATGTTGTCACATGGAGGACTCTTCTCAGTGGTTGTAGAATTTACAGAAATCAAGAACTAGCAGAAGTAGCTATT
GCAAACATGCCTCATCGTAAGAGTGGAGATTACGTGTTATTATCAAATATCTATTGTTCCCTAAACAGATGGGGGCATTCAGAAACAGTTAGAGAGATGATGAAAAACAA
GAGAGTTCATAAGACTTGTGGAAAAAGTTGGATTGAGTTGGCAGGTAACATTCAAAACTTCAGGTCAGGGGATCGATCACATCCAGAAATCGATGCAGTATTCAAAGTGC
TGGGCAGTTTGATGAAGAGAACTCGGTCAGAGGGATATATGCCTGCGACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAAAAAGAAGAGAACTTATCATTTCATAGC
GAAAAGTTGGCATTGGCTTATGCGATCATGAAAACTAGTCCTGGGGCAAAAATCAGGATATCAAAGAACCTACGGATCTGTGATGATTGTCATAGATGGATAAAACTAGT
TTCAAGAGTGCTTTGCAGAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAGGTGGCATGTGTTCCTGCGGGGATTGTTGGTAG
mRNA sequenceShow/hide mRNA sequence
CTAAAGATGATCCTTTGTTTTCTGTGTGGGATGCTGAAAACTCCATGGTGATGACGTGGCTAGTCAATTCAACGGTGGAAGACATTAGTTCTAACTATATGTGCTATGCT
ACTGCACAGGAACTATGGGATAGTGTGACGAAGATGTATTCAGATATGGGTAATCAGTCACAAATATTTGAGCTGAATCTCAAATTAGGCGACATACGACAAGGAGGTAA
CTCAATTACGCAATATTTTCATTCCCTAAAGAGGATCCGACAAGATCTCGATTTGTTTGATTCATATGAATGGAAGTTTACAGATGATCAGAAACACTACAGGAAACTCG
TCGAAGATGGTCGTATTTATAAATTTCTGGCAAGCCTCAATGTTGAATTCGATGAGGTTAGAGGCAGAATACTTGGTAAGACCACCTTGCCAACTATCAGCGAAGTGTTT
TCTGAAGTGCGTAGGGAAGAAAGTCGTCGGAATTTTATGATTAGAAAGAAGCCTATCGATTCAGTTGAAAGTTCAACGTTGGTGGTTGAAGCTACAACCTACAAGGCCTC
TGATCAATCAAACAAGAACCATGAAAACCCTCGTGTCTGGTGTGACCACTGTAATAAACCCCGACATACTCGGGATACTTGTTGGAAACTGCACGAAAAACCTGCAAATT
GGAAGAGTTCTAAGCAAAGTGAGAAAAGTAGCAATCAACAATCTTCCAGTGCTAATGTTGTTGACTCCAACCCATTTAGCAAAGAGCAAGTTGATCAACTTCTGAAGCTG
CTAAAGGCTATTCATCTTCTGGTAATTCTAGTGTTTCTTTGGCACAAACAGGTAATTCTCCTAAAACCCTCTCTTGTTTTAACTCCTCTCCGTGGATTATAGATTCTGGA
GCCTCTGATCACATGACTGGTTCCTCAAAACTATTTGACTCATACTCTCCCATGTATTGCAATGAAAAAATTCGAATTGCAGATGGTAGTTTTAATTCTATTGCAAGAAA
GGAATTATTCGTTTGACCTCACATATTACTCTACATTCTGTCCTTCGTGTCCCAAAATTAGCTTGTAATTTGTTATCTGTTAGTAAAATCTCAAAGGATGCTAATTCTCG
TGTTACCTTTTTTGACTCTTATTACACCTTTCAGGATCGGGATTCGGGGGAGATGATTGGGTGTGCTAAGATGCTCGATGGCCTCTACTATTTTGATGAGTCTCCAACTA
GTCATAAAAAAGTTCAGGGCTTATGTGGTATTATCAAACCCTTTATCGTGTTCTTGAAGCATGCAGAATCTCACTGAATTCCAAAACTGCTATTGAAACACATGCAAGGA
TTATTAAGTTTGGATATGGAAACTACCCAACTCTCATCACCTCTCTAGTATCTACTTATCAACGTGTTGGTTGCTTTAATCGCGTCCATCAACTTCTTGATCTACTCTGC
TCTAAGCATCTTGATTTAGTTGCAATGAACTTACTTATTCAAAATTTTATGAAAATCGGGGAAAGCAAATTTGCTAAAAAGGTATTTTATAAAATGCCTTACCGTGATGT
GGTAACATGGAACTCAATCATTGGAGGTTGTGTGAAGAATGCACGCTATGACGAGGCATTTAGATTCTTTAGACAGATGCTGATTTCAAATATTCATATGGACGGATTTA
CATTTGCTTCCATATTGAATGCATGTGCACAACTCGGAGCTCTAAGTAACACTCAGTGGGTCCATGCTCTAATGACTCGGGAAAAAATTGAACTTAATTCAATATTAAGT
TCTGCACTCATAGACGCATACTCCAAGTGCGGTAGCATCCAAATTGCAAAGGAAATCTTTAGTAGCGTCCCTCATAGTGATATCTCAGTTTGGAATGCGATGATCAAAGG
GCTTGCAATTCACGGACTTGCAATGGATGCATTATTGGTATTCTCGATGATGGAGCGTGAGAATGTTCTACCTGATGCTATCACCTTCTTGGGTATTTTAACAGCCTGCA
ACCATGGTGGTTTAACTGAACAGGGTCACACATATTTTGATTGGATGAAAAGCCGTTATTCAATTCAGCCACAGCTTGAGCATTATGGAGTCATGGTTGATCTCTATAGC
CGGGCTGGGTTTCTTGAAGAGGCCTATTCCATAATCATGGCAATGCCAATAGAGCCCGATGTTGTCACATGGAGGACTCTTCTCAGTGGTTGTAGAATTTACAGAAATCA
AGAACTAGCAGAAGTAGCTATTGCAAACATGCCTCATCGTAAGAGTGGAGATTACGTGTTATTATCAAATATCTATTGTTCCCTAAACAGATGGGGGCATTCAGAAACAG
TTAGAGAGATGATGAAAAACAAGAGAGTTCATAAGACTTGTGGAAAAAGTTGGATTGAGTTGGCAGGTAACATTCAAAACTTCAGGTCAGGGGATCGATCACATCCAGAA
ATCGATGCAGTATTCAAAGTGCTGGGCAGTTTGATGAAGAGAACTCGGTCAGAGGGATATATGCCTGCGACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAAAAAGA
AGAGAACTTATCATTTCATAGCGAAAAGTTGGCATTGGCTTATGCGATCATGAAAACTAGTCCTGGGGCAAAAATCAGGATATCAAAGAACCTACGGATCTGTGATGATT
GTCATAGATGGATAAAACTAGTTTCAAGAGTGCTTTGCAGAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAGGTGGCATGTGTTCCTGCGGGGATTGT
TGGTAGTTTAGTCCCAAGATACCAAAGTTGCTTTATCTGAGTTTTTTTTAGTTCATCAACATAATCGGGTGGAATGATTCAAACCTCTGATCTCATGGTCGATGGTATGT
GCCTAAATCAATTGAGGTATGCTCAAGTTAGCATCTGAGCTTTTTAATAATGATAAATTTAACTACATTTGCAAGGGTCCCATGGATAATGTCAACCTGTCAACAAATTG
ATAATCTTTCTTTATGAAAATTGTAATGGCCGATGAAACTGAGGATCATATTTGCAAATTTATTACACAGAAAAGTAAAGTGTAAATGTGGCAA
Protein sequenceShow/hide protein sequence
MWYYQTLYRVLEACRISLNSKTAIETHARIIKFGYGNYPTLITSLVSTYQRVGCFNRVHQLLDLLCSKHLDLVAMNLLIQNFMKIGESKFAKKVFYKMPYRDVVTWNSII
GGCVKNARYDEAFRFFRQMLISNIHMDGFTFASILNACAQLGALSNTQWVHALMTREKIELNSILSSALIDAYSKCGSIQIAKEIFSSVPHSDISVWNAMIKGLAIHGLA
MDALLVFSMMERENVLPDAITFLGILTACNHGGLTEQGHTYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMAMPIEPDVVTWRTLLSGCRIYRNQELAEVAI
ANMPHRKSGDYVLLSNIYCSLNRWGHSETVREMMKNKRVHKTCGKSWIELAGNIQNFRSGDRSHPEIDAVFKVLGSLMKRTRSEGYMPATELVFMDISEEEKEENLSFHS
EKLALAYAIMKTSPGAKIRISKNLRICDDCHRWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDCW