; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007656 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007656
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold2:60334..67774
RNA-Seq ExpressionSpg007656
SyntenySpg007656
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022922012.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita moschata]3.9e-25386.14Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKT PG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

XP_022922013.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucurbita moschata]3.9e-25386.14Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKT PG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

XP_022988253.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita maxima]2.3e-25385.74Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML+SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKI+LNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+F+SGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIK+VS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

XP_022988254.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucurbita maxima]2.3e-25385.74Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML+SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKI+LNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+F+SGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIK+VS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

XP_038879432.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Benincasa hispida]2.2e-25686.34Show/hide
Query:  FQTLYRVLEACRL-SLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTL+RVLEACRL  L SKT IETHARIIKFGYG+YP LITSLVSTYQ AG LNRVHQLLDLLCSKHLDLV MNLLIENF K+ E K A +VFY MPYRD
Subjt:  FQTLYRVLEACRL-SLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AF+FFRQML SNIQPDGFTFAS+LNACA+LG  SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKE+F+ VPH+
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        D+SVWNAMIKGLAIHGLA DAL +F MMERENVLPDAVTFLGILTACNHGG+I++GRRYF+WM+SRYSIQPQLEHYGV+VDLYSRAGFLEEAYS+I++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IEPDVVTWRTLLSGCRIYRNQELAEVAI NMSHRKSGDYVLLSNIYCSLN+WEHA TVR+MMK  GVRK CGKSWIEL GTIQNFKSGDRSHPE DAVY+
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        VL SLMK+TR+EGYMPVTELVFMDISEEEKEENLSFHSEK+ALAYAILKTSPG KI+ISKNLRICDDCH WIKLVS++LCR IVVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

TrEMBL top hitse value%identityAlignment
A0A6J1BZT0 pentatricopeptide repeat-containing protein At5g50990 isoform X24.2e-25383.82Show/hide
Query:  YYARRRLLHFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV
        +++   +  +QTLY VLEACR S +SKTAIETHARIIKFGYG+YPTLITSLVSTYQ AG LN V++LL LLCSKHLDLVAMN+ IENFMKI E K A+KV
Subjt:  YYARRRLLHFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV

Query:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE
        FY MPYRDV+TWNSIIGGCVKNA Y +AF+FF +MLISNIQPDGFTFASIL A A+LGALSN Q VHA+MT+KK+ELNSIL SALI  YSKCGSIQIAKE
Subjt:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE

Query:  IFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA
        IF+SVPH+DISVWNAMIKGLAIHGLAMDALLVFSMMER++V PDAVTFLG LTACNHGG++E+GR+YFDWM+SRYSI+PQLEHYGVMVDLYSRAGFLEEA
Subjt:  IFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH
        YS I +MPIEPDVVTWRTLLSGC+IYRNQELAEVAIAN+S  KSGDYVLLSNIYCS +RWE+AETVREMMK+KGVRK CGKSWIELAG+I  FKSGDRSH
Subjt:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH

Query:  PEIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH
        PEI+AVY+VLGSL+K+TR+EGYMPVTE VFMDISEEEKEENLSFHSEKLALAYAILKTSPG KI+ISKNLRICDDCHRWIKLVS++LCRVIVVRDRIRFH
Subjt:  PEIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH

Query:  QFEGGMCSCGDCW
        QFEGGMCSCGDCW
Subjt:  QFEGGMCSCGDCW

A0A6J1E1Y0 pentatricopeptide repeat-containing protein At5g50990 isoform X11.9e-25386.14Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKT PG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

A0A6J1E5D9 pentatricopeptide repeat-containing protein At5g50990 isoform X21.9e-25386.14Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKT PG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

A0A6J1JL20 pentatricopeptide repeat-containing protein At5g50990 isoform X21.1e-25385.74Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML+SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKI+LNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+F+SGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIK+VS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

A0A6J1JLQ2 pentatricopeptide repeat-containing protein At5g50990 isoform X11.1e-25385.74Show/hide
Query:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN
        VVTWNSIIGGCVKNA Y +AFKFFRQML+SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKI+LNSIL  ALIDAYSKCGSIQIAKEIF+SVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+F+SGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS
        V+  LMK++R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIK+VS LLCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

SwissProt top hitse value%identityAlignment
Q683I9 Pentatricopeptide repeat-containing protein At3g628903.3e-10941.67Show/hide
Query:  THARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKF
        THA+I+ FG    P + TSL++ Y   G L    ++ D   SK  DL A N ++  + K      A K+F  MP R+V++W+ +I G V    Y +A   
Subjt:  THARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKF

Query:  FRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSV-PHNDISVWNAMIKGLAIHGL
        FR+M +       ++P+ FT +++L+AC +LGAL   +WVHA + +  +E++ +L +ALID Y+KCGS++ AK +F ++    D+  ++AMI  LA++GL
Subjt:  FRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSV-PHNDISVWNAMIKGLAIHGL

Query:  AMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCR
          +   +FS M   +N+ P++VTF+GIL AC H G+I EG+ YF  M   + I P ++HYG MVDLY R+G ++EA S I SMP+EPDV+ W +LLSG R
Subjt:  AMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCR

Query:  IYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEG
        +  + +  E A   +  +    SG YVLLSN+Y    RW   + +R  M+ KG+ K+ G S++E+ G +  F  GD S  E + +Y +L  +M++ R  G
Subjt:  IYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW
        Y+  T+ V +D++E++KE  LS+HSEKLA+A+ ++KT PG  + I KNLRIC DCH  +K++SKL  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW

Q9FI49 Pentatricopeptide repeat-containing protein At5g509902.1e-16455.98Show/hide
Query:  LYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTW
        L +VLE+C+   +SK  ++ HA+I K GYG YP+L+ S V+ Y+         +LL    S    +  +NL+IE+ MKI ES LA+KV      ++V+TW
Subjt:  LYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTW

Query:  NSIIGGCVKNAWYGKAFKFFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDIS
        N +IGG V+N  Y +A K  + ML  ++I+P+ F+FAS L ACA+LG L + +WVH+LM    IELN+IL SAL+D Y+KCG I  ++E+F SV  ND+S
Subjt:  NSIIGGCVKNAWYGKAFKFFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP
        +WNAMI G A HGLA +A+ VFS ME E+V PD++TFLG+LT C+H G++EEG+ YF  M  R+SIQP+LEHYG MVDL  RAG ++EAY +I SMPIEP
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP

Query:  DVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLG
        DVV WR+LLS  R Y+N EL E+AI N+S  KSGDYVLLSNIY S  +WE A+ VRE+M  +G+RK  GKSW+E  G I  FK+GD SH E  A+YKVL 
Subjt:  DVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLG

Query:  SLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGD
         L++KT+++G++  T+LV MD+SEEEKEENL++HSEKLALAY ILK+SPG +I I KN+R+C DCH WIK VSKLL RVI++RDRIRFH+FE G+CSC D
Subjt:  SLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGD

Query:  CW
         W
Subjt:  CW

Q9FI80 Pentatricopeptide repeat-containing protein At5g489101.2e-10839.85Show/hide
Query:  TLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK
        T   VL+AC  +   +   + H   +K+G+G    ++++LV  Y   G++     L        D++       +  ++V  N++I+ +M++ + K A  
Subjt:  TLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK

Query:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK
        +F  M  R VV+WN++I G   N ++  A + FR+M   +I+P+  T  S+L A ++LG+L   +W+H       I ++ +L SALID YSKCG I+ A 
Subjt:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK

Query:  EIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE
         +F  +P  ++  W+AMI G AIHG A DA+  F  M +  V P  V ++ +LTAC+HGG++EEGRRYF  M S   ++P++EHYG MVDL  R+G L+E
Subjt:  EIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE

Query:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG
        A   I++MPI+PD V W+ LL  CR+  N E+ + VA  + +M    SG YV LSN+Y S   W     +R  MK K +RK  G S I++ G +  F   
Subjt:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG

Query:  DRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR
        D SHP+   +  +L  +  K R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH  IKL+SK+  R I VRDR
Subjt:  DRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR

Query:  IRFHQFEGGMCSCGDCW
         RFH F+ G CSC D W
Subjt:  IRFHQFEGGMCSCGDCW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665201.2e-10636.7Show/hide
Query:  YDVGNDPEGGYYARRRLL------HFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL
        +   ++PE      +R+L      +  T   +L+AC      +   + HA+I K GY N    + SL+++Y   G     H L D +     D V+ N +
Subjt:  YDVGNDPEGGYYARRRLL------HFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL

Query:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA
        I+ ++K  +  +A  +F  M  ++ ++W ++I G V+     +A + F +M  S+++PD  + A+ L+ACA+LGAL   +W+H+ + + +I ++S+L   
Subjt:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA

Query:  LIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY
        LID Y+KCG ++ A E+F ++    +  W A+I G A HG   +A+  F  M++  + P+ +TF  +LTAC++ G++EEG+  F  M+  Y+++P +EHY
Subjt:  LIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY

Query:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICG
        G +VDL  RAG L+EA   I  MP++P+ V W  LL  CRI++N    +E+ E+ IA +     G YV  +NI+    +W+ A   R +MK +GV K+ G
Subjt:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICG

Query:  KSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRW
         S I L GT   F +GDRSHPEI+ +      + +K    GY+P  E + +D + ++E+E  +  HSEKLA+ Y ++KT PG  I I KNLR+C DCH+ 
Subjt:  KSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRW

Query:  IKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW
         KL+SK+  R IV+RDR RFH F  G CSCGD W
Subjt:  IKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW

Q9FND7 Putative pentatricopeptide repeat-containing protein At5g404053.4e-10637.55Show/hide
Query:  IPEKTPRSIFVYFRGLFYDVGNDPEGGYYARRRLLHFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCS
        +PEK+    F ++R +    GND +   Y         T+  +++AC      +T ++ H   I+ G+ N P + T L+S Y   G L+  H++ + +  
Subjt:  IPEKTPRSIFVYFRGLFYDVGNDPEGGYYARRRLLHFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCS

Query:  KHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQK
           D V    ++    +  +   A K+F  MP RD + WN++I G  +     +A   F  M +  ++ +G    S+L+AC +LGAL   +W H+ + + 
Subjt:  KHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQK

Query:  KIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKS
        KI++   L + L+D Y+KCG ++ A E+F  +   ++  W++ + GLA++G     L +FS+M+++ V P+AVTF+ +L  C+  G ++EG+R+FD M++
Subjt:  KIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKS

Query:  RYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMM
         + I+PQLEHYG +VDLY+RAG LE+A SII  MP++P    W +LL   R+Y+N EL  +A   M   ++   G YVLLSNIY   N W++   VR+ M
Subjt:  RYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMM

Query:  KNKGVRKICGKSWIELAGTIQNFKSGDRSHP---EIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINIS
        K+KGVRK  G S +E+ G +  F  GD+SHP   +IDAV+K    + ++ R  GY   T  V  DI EEEKE+ L  HSEK A+A+ I+     V I I 
Subjt:  KNKGVRKICGKSWIELAGTIQNFKSGDRSHP---EIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINIS

Query:  KNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW
        KNLR+C DCH+   ++SK+  R I+VRDR RFH F+ G CSC   W
Subjt:  KNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW

Arabidopsis top hitse value%identityAlignment
AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-11041.67Show/hide
Query:  THARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKF
        THA+I+ FG    P + TSL++ Y   G L    ++ D   SK  DL A N ++  + K      A K+F  MP R+V++W+ +I G V    Y +A   
Subjt:  THARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKF

Query:  FRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSV-PHNDISVWNAMIKGLAIHGL
        FR+M +       ++P+ FT +++L+AC +LGAL   +WVHA + +  +E++ +L +ALID Y+KCGS++ AK +F ++    D+  ++AMI  LA++GL
Subjt:  FRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSV-PHNDISVWNAMIKGLAIHGL

Query:  AMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCR
          +   +FS M   +N+ P++VTF+GIL AC H G+I EG+ YF  M   + I P ++HYG MVDLY R+G ++EA S I SMP+EPDV+ W +LLSG R
Subjt:  AMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCR

Query:  IYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEG
        +  + +  E A   +  +    SG YVLLSN+Y    RW   + +R  M+ KG+ K+ G S++E+ G +  F  GD S  E + +Y +L  +M++ R  G
Subjt:  IYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW
        Y+  T+ V +D++E++KE  LS+HSEKLA+A+ ++KT PG  + I KNLRIC DCH  +K++SKL  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW

AT5G40405.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-10737.55Show/hide
Query:  IPEKTPRSIFVYFRGLFYDVGNDPEGGYYARRRLLHFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCS
        +PEK+    F ++R +    GND +   Y         T+  +++AC      +T ++ H   I+ G+ N P + T L+S Y   G L+  H++ + +  
Subjt:  IPEKTPRSIFVYFRGLFYDVGNDPEGGYYARRRLLHFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCS

Query:  KHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQK
           D V    ++    +  +   A K+F  MP RD + WN++I G  +     +A   F  M +  ++ +G    S+L+AC +LGAL   +W H+ + + 
Subjt:  KHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQK

Query:  KIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKS
        KI++   L + L+D Y+KCG ++ A E+F  +   ++  W++ + GLA++G     L +FS+M+++ V P+AVTF+ +L  C+  G ++EG+R+FD M++
Subjt:  KIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKS

Query:  RYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMM
         + I+PQLEHYG +VDLY+RAG LE+A SII  MP++P    W +LL   R+Y+N EL  +A   M   ++   G YVLLSNIY   N W++   VR+ M
Subjt:  RYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMM

Query:  KNKGVRKICGKSWIELAGTIQNFKSGDRSHP---EIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINIS
        K+KGVRK  G S +E+ G +  F  GD+SHP   +IDAV+K    + ++ R  GY   T  V  DI EEEKE+ L  HSEK A+A+ I+     V I I 
Subjt:  KNKGVRKICGKSWIELAGTIQNFKSGDRSHP---EIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINIS

Query:  KNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW
        KNLR+C DCH+   ++SK+  R I+VRDR RFH F+ G CSC   W
Subjt:  KNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein8.8e-11039.85Show/hide
Query:  TLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK
        T   VL+AC  +   +   + H   +K+G+G    ++++LV  Y   G++     L        D++       +  ++V  N++I+ +M++ + K A  
Subjt:  TLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK

Query:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK
        +F  M  R VV+WN++I G   N ++  A + FR+M   +I+P+  T  S+L A ++LG+L   +W+H       I ++ +L SALID YSKCG I+ A 
Subjt:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK

Query:  EIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE
         +F  +P  ++  W+AMI G AIHG A DA+  F  M +  V P  V ++ +LTAC+HGG++EEGRRYF  M S   ++P++EHYG MVDL  R+G L+E
Subjt:  EIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE

Query:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG
        A   I++MPI+PD V W+ LL  CR+  N E+ + VA  + +M    SG YV LSN+Y S   W     +R  MK K +RK  G S I++ G +  F   
Subjt:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG

Query:  DRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR
        D SHP+   +  +L  +  K R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH  IKL+SK+  R I VRDR
Subjt:  DRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR

Query:  IRFHQFEGGMCSCGDCW
         RFH F+ G CSC D W
Subjt:  IRFHQFEGGMCSCGDCW

AT5G50990.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-16555.98Show/hide
Query:  LYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTW
        L +VLE+C+   +SK  ++ HA+I K GYG YP+L+ S V+ Y+         +LL    S    +  +NL+IE+ MKI ES LA+KV      ++V+TW
Subjt:  LYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVTW

Query:  NSIIGGCVKNAWYGKAFKFFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDIS
        N +IGG V+N  Y +A K  + ML  ++I+P+ F+FAS L ACA+LG L + +WVH+LM    IELN+IL SAL+D Y+KCG I  ++E+F SV  ND+S
Subjt:  NSIIGGCVKNAWYGKAFKFFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP
        +WNAMI G A HGLA +A+ VFS ME E+V PD++TFLG+LT C+H G++EEG+ YF  M  R+SIQP+LEHYG MVDL  RAG ++EAY +I SMPIEP
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP

Query:  DVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLG
        DVV WR+LLS  R Y+N EL E+AI N+S  KSGDYVLLSNIY S  +WE A+ VRE+M  +G+RK  GKSW+E  G I  FK+GD SH E  A+YKVL 
Subjt:  DVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLG

Query:  SLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGD
         L++KT+++G++  T+LV MD+SEEEKEENL++HSEKLALAY ILK+SPG +I I KN+R+C DCH WIK VSKLL RVI++RDRIRFH+FE G+CSC D
Subjt:  SLMKKTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGD

Query:  CW
         W
Subjt:  CW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-10836.7Show/hide
Query:  YDVGNDPEGGYYARRRLL------HFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL
        +   ++PE      +R+L      +  T   +L+AC      +   + HA+I K GY N    + SL+++Y   G     H L D +     D V+ N +
Subjt:  YDVGNDPEGGYYARRRLL------HFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL

Query:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA
        I+ ++K  +  +A  +F  M  ++ ++W ++I G V+     +A + F +M  S+++PD  + A+ L+ACA+LGAL   +W+H+ + + +I ++S+L   
Subjt:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA

Query:  LIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY
        LID Y+KCG ++ A E+F ++    +  W A+I G A HG   +A+  F  M++  + P+ +TF  +LTAC++ G++EEG+  F  M+  Y+++P +EHY
Subjt:  LIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY

Query:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICG
        G +VDL  RAG L+EA   I  MP++P+ V W  LL  CRI++N    +E+ E+ IA +     G YV  +NI+    +W+ A   R +MK +GV K+ G
Subjt:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICG

Query:  KSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRW
         S I L GT   F +GDRSHPEI+ +      + +K    GY+P  E + +D + ++E+E  +  HSEKLA+ Y ++KT PG  I I KNLR+C DCH+ 
Subjt:  KSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRW

Query:  IKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW
         KL+SK+  R IV+RDR RFH F  G CSCGD W
Subjt:  IKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAGTTTGAGCTGGGTTACTTCCGTTCCTCTGCTTGCTGTCTTCCTGACCACAACTGATGCAGTATTGCGGGAAAGGAGTCAACACACAGAACGAATTTCGGGTAC
GATTTCAGATCCCTGTTTAAAAGGATTGATTAGCACAGTTATGCACAAGTTCTTGATCTTGATTGAATTCTCGTTGCGGAATAAGAGTGGTAAGGCATACGCCATATTCA
CTGTCCTGATTATTGGTCTCCTTAACGAGGAGACTGTTGAATGTGCTGTTTGGACTCGAAATCGCTCTAATCTTGTGACCCATTCTCTGTTTGGCGTTAAACTAGAGCAT
CCACTGCCTAAGGACTCTAGACAAGTGGTTTCGAATCACTTAGGTTGTTGGACAGCACGTGTTTGGGCCTATTTGGATACTAAACGAGCTCTCGACTGTTTCAGGAAGAG
AAGGCTGTTGAAAGAGGAATCCTTCCTTTGCTACAACGTGCTACCTTGGTTCAGACTTTTTGGACAACGAAATCATGTTTGCTTGAAGGAGGGATTAATTACAATTCATC
CTTATGCTCCTCCTCAGAAAATGCACGCCCGCCTCATTCCTGAAAAAACTCCTAGGTCCATCTTTGTTTACTTTCGTGGACTATTCTATGATGTTGGAAATGATCCAGAA
GGTGGTTATTACGCAAGAAGGCGTTTATTACATTTTCAAACCCTTTATCGTGTTCTTGAAGCCTGCAGACTCTCCTTGGATTCCAAAACTGCTATTGAAACGCATGCGAG
AATTATTAAATTTGGATATGGAAACTACCCAACTCTCATCACCTCTCTTGTATCTACTTATCAACATGCTGGTTACCTTAATCGTGTCCATCAACTTCTTGATCTACTCT
GCTCTAAGCATCTTGATTTAGTTGCAATGAACTTACTTATTGAAAATTTTATGAAAATCAGGGAAAGCAAACTTGCTGAAAAGGTATTTTATATAATGCCTTACCGTGAT
GTGGTAACATGGAACTCAATCATTGGAGGTTGTGTGAAGAATGCATGGTATGGCAAGGCATTTAAATTCTTTAGACAGATGCTGATCTCAAATATTCAGCCGGACGGATT
TACATTTGCTTCTATATTGAATGCATGTGCCAAGCTCGGAGCTCTAAGTAATACTCAGTGGGTTCATGCTCTAATGACTCAGAAAAAAATTGAGCTTAATTCCATATTGA
TTTCTGCACTCATAGACGCGTACTCTAAGTGTGGTAGCATCCAAATTGCAAAGGAAATCTTTACTAGTGTCCCTCATAATGATATATCAGTTTGGAATGCGATGATCAAA
GGGCTTGCAATTCACGGGCTTGCGATGGATGCATTATTGGTATTCTCGATGATGGAGCGTGAGAATGTTCTCCCCGATGCTGTCACCTTTTTGGGTATTTTAACAGCATG
CAACCATGGTGGTGTAATTGAAGAGGGTCGCAGGTATTTTGATTGGATGAAAAGCCGTTATTCAATTCAGCCACAGCTTGAGCATTACGGAGTCATGGTTGATCTCTATA
GCCGGGCTGGGTTTCTCGAAGAGGCCTATTCCATAATCATGTCAATGCCAATAGAGCCAGATGTTGTCACATGGAGGACGCTTCTCAGTGGTTGTAGAATTTACAGAAAT
CAAGAACTCGCAGAAGTTGCTATAGCGAACATGTCTCATCGTAAGAGTGGAGATTACGTGTTATTATCAAATATCTATTGTTCCCTCAATAGATGGGAGCATGCAGAAAC
AGTAAGAGAGATGATGAAAAACAAGGGAGTTCGTAAGATTTGTGGAAAAAGCTGGATTGAGTTGGCAGGTACCATTCAAAACTTCAAGTCAGGTGATCGATCACATCCAG
AAATCGATGCAGTATACAAAGTGCTGGGCAGTTTGATGAAGAAAACTCGGGCAGAGGGTTATATGCCTGTCACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAAAAG
GAAGAAAACCTATCATTTCATAGCGAAAAGTTGGCATTGGCTTATGCCATCCTGAAAACTAGTCCTGGGGTAAAAATCAATATATCAAAGAACCTACGGATCTGTGATGA
TTGTCATAGATGGATAAAACTAGTTTCAAAACTGCTCTGCAGAGTTATAGTAGTGAGGGATCGGATCCGGTTCCATCAATTTGAAGGTGGCATGTGTTCCTGTGGGGATT
GTTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTAGTTTGAGCTGGGTTACTTCCGTTCCTCTGCTTGCTGTCTTCCTGACCACAACTGATGCAGTATTGCGGGAAAGGAGTCAACACACAGAACGAATTTCGGGTAC
GATTTCAGATCCCTGTTTAAAAGGATTGATTAGCACAGTTATGCACAAGTTCTTGATCTTGATTGAATTCTCGTTGCGGAATAAGAGTGGTAAGGCATACGCCATATTCA
CTGTCCTGATTATTGGTCTCCTTAACGAGGAGACTGTTGAATGTGCTGTTTGGACTCGAAATCGCTCTAATCTTGTGACCCATTCTCTGTTTGGCGTTAAACTAGAGCAT
CCACTGCCTAAGGACTCTAGACAAGTGGTTTCGAATCACTTAGGTTGTTGGACAGCACGTGTTTGGGCCTATTTGGATACTAAACGAGCTCTCGACTGTTTCAGGAAGAG
AAGGCTGTTGAAAGAGGAATCCTTCCTTTGCTACAACGTGCTACCTTGGTTCAGACTTTTTGGACAACGAAATCATGTTTGCTTGAAGGAGGGATTAATTACAATTCATC
CTTATGCTCCTCCTCAGAAAATGCACGCCCGCCTCATTCCTGAAAAAACTCCTAGGTCCATCTTTGTTTACTTTCGTGGACTATTCTATGATGTTGGAAATGATCCAGAA
GGTGGTTATTACGCAAGAAGGCGTTTATTACATTTTCAAACCCTTTATCGTGTTCTTGAAGCCTGCAGACTCTCCTTGGATTCCAAAACTGCTATTGAAACGCATGCGAG
AATTATTAAATTTGGATATGGAAACTACCCAACTCTCATCACCTCTCTTGTATCTACTTATCAACATGCTGGTTACCTTAATCGTGTCCATCAACTTCTTGATCTACTCT
GCTCTAAGCATCTTGATTTAGTTGCAATGAACTTACTTATTGAAAATTTTATGAAAATCAGGGAAAGCAAACTTGCTGAAAAGGTATTTTATATAATGCCTTACCGTGAT
GTGGTAACATGGAACTCAATCATTGGAGGTTGTGTGAAGAATGCATGGTATGGCAAGGCATTTAAATTCTTTAGACAGATGCTGATCTCAAATATTCAGCCGGACGGATT
TACATTTGCTTCTATATTGAATGCATGTGCCAAGCTCGGAGCTCTAAGTAATACTCAGTGGGTTCATGCTCTAATGACTCAGAAAAAAATTGAGCTTAATTCCATATTGA
TTTCTGCACTCATAGACGCGTACTCTAAGTGTGGTAGCATCCAAATTGCAAAGGAAATCTTTACTAGTGTCCCTCATAATGATATATCAGTTTGGAATGCGATGATCAAA
GGGCTTGCAATTCACGGGCTTGCGATGGATGCATTATTGGTATTCTCGATGATGGAGCGTGAGAATGTTCTCCCCGATGCTGTCACCTTTTTGGGTATTTTAACAGCATG
CAACCATGGTGGTGTAATTGAAGAGGGTCGCAGGTATTTTGATTGGATGAAAAGCCGTTATTCAATTCAGCCACAGCTTGAGCATTACGGAGTCATGGTTGATCTCTATA
GCCGGGCTGGGTTTCTCGAAGAGGCCTATTCCATAATCATGTCAATGCCAATAGAGCCAGATGTTGTCACATGGAGGACGCTTCTCAGTGGTTGTAGAATTTACAGAAAT
CAAGAACTCGCAGAAGTTGCTATAGCGAACATGTCTCATCGTAAGAGTGGAGATTACGTGTTATTATCAAATATCTATTGTTCCCTCAATAGATGGGAGCATGCAGAAAC
AGTAAGAGAGATGATGAAAAACAAGGGAGTTCGTAAGATTTGTGGAAAAAGCTGGATTGAGTTGGCAGGTACCATTCAAAACTTCAAGTCAGGTGATCGATCACATCCAG
AAATCGATGCAGTATACAAAGTGCTGGGCAGTTTGATGAAGAAAACTCGGGCAGAGGGTTATATGCCTGTCACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAAAAG
GAAGAAAACCTATCATTTCATAGCGAAAAGTTGGCATTGGCTTATGCCATCCTGAAAACTAGTCCTGGGGTAAAAATCAATATATCAAAGAACCTACGGATCTGTGATGA
TTGTCATAGATGGATAAAACTAGTTTCAAAACTGCTCTGCAGAGTTATAGTAGTGAGGGATCGGATCCGGTTCCATCAATTTGAAGGTGGCATGTGTTCCTGTGGGGATT
GTTGGTAG
Protein sequenceShow/hide protein sequence
MSSLSWVTSVPLLAVFLTTTDAVLRERSQHTERISGTISDPCLKGLISTVMHKFLILIEFSLRNKSGKAYAIFTVLIIGLLNEETVECAVWTRNRSNLVTHSLFGVKLEH
PLPKDSRQVVSNHLGCWTARVWAYLDTKRALDCFRKRRLLKEESFLCYNVLPWFRLFGQRNHVCLKEGLITIHPYAPPQKMHARLIPEKTPRSIFVYFRGLFYDVGNDPE
GGYYARRRLLHFQTLYRVLEACRLSLDSKTAIETHARIIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
VVTWNSIIGGCVKNAWYGKAFKFFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFTSVPHNDISVWNAMIK
GLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRN
QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKKTRAEGYMPVTELVFMDISEEEK
EENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEGGMCSCGDCW