CuGenDBv2

Lag0033539 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033539
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr3:166702..168297
RNA-Seq Expression: Lag0033539
Synteny: Lag0033539
Gene Ontology terms:
    GO:0009451 - RNA modification (biological process)
    GO:0043231 - intracellular membrane-bounded organelle (cellular component)
    GO:0003723 - RNA binding (molecular function)
    GO:0005515 - protein binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR032867 - DYW domain
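
The genome location above is given as a sequence name followed by a start..end range. As a minimal sketch of how such a string might be split and the gene span computed, the Python below uses a hypothetical helper named parse_location; it is not part of CuGenDB, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str):
    """Split a string like 'chr3:166702..168297' into (seqid, start, end).

    Hypothetical helper for illustration only; not a CuGenDB API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("chr3:166702..168297")

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the span.
span = end - start + 1
print(seqid, start, end, span)
```

If the coordinates were instead 0-based and half-open, the span would be end - start, so the convention should be checked against the database documentation before relying on the number.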


Homology
GenBank top hits
XP_022135141.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Momordica charantia] (e value: 4.0e-255; identity: 82.99%)
Query:  LPLLLNLSALFTGLLTFFLINSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL
        LPL+ +    F   + FF  NS+SD+QTLY VLEACR S +SKTAIETHAR+IKFGYG+YPTLITSLVSTYQ AG LN V++LL LLCSKHLDLVAMN+ 
Subjt:  LPLLLNLSALFTGLLTFFLINSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL

Query:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA
        IENFMKI E K A+KVFY MPYRDV+TWNSIIGGCVKNA Y +AF+ F +MLISNIQPDGFTFASIL A A+LGALSN Q VHA+MT+KK+ELNSIL SA
Subjt:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA

Query:  LIDAYSKCGSIQIAKEIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY
        LI  YSKCGSIQIAKEIFSSVPH+DISVWNAMIKGLAIHGLAMDALLVFSMMER++V PDAVTFLG LTACNHGG++E+GR+YFDWM+SRYSI+PQLEHY
Subjt:  LIDAYSKCGSIQIAKEIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY

Query:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWI
        GVMVDLYSRAGFLEEAYS I +MPIEPDVVTWRTLLSGC+IYRNQELAEVAIAN+S  KSGDYVLLSNIYCS +RWE+AETVREMMK+KGVRK CGKSWI
Subjt:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWI

Query:  ELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVS
        ELAG+I  FKSGDRSHPEI+AVY+VLGSL+KRTR+EGYMPVTE VFMDISEEEKEENLSFHSEKLALAYAILKTSPG KI+ISKNLRICDDCHRWIKLVS
Subjt:  ELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVS

Query:  KLLCRVIVVRDRIRFHQFEAGMCSCGDCW
        ++LCRVIVVRDRIRFHQFE GMCSCGDCW
Subjt:  KLLCRVIVVRDRIRFHQFEAGMCSCGDCW

XP_022922012.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita moschata] (e value: 6.3e-253; identity: 84.99%)
Query:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV
        F + + + +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++V
Subjt:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV

Query:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE
        F  MPYRDVVTWNSIIGGCVKNA Y +AFK FRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKE
Subjt:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE

Query:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA
        IFSSVP ++ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEA
Subjt:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH
        YSII++MPIE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSH
Subjt:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH

Query:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH
        PE DAVYKV+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKT PG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFH
Subjt:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH

Query:  QFEAGMCSCGDCW
        QFE GMCSCGD W
Subjt:  QFEAGMCSCGDCW

XP_022988253.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita maxima] (e value: 3.7e-253; identity: 84.6%)
Query:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV
        F + + + +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++V
Subjt:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV

Query:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE
        F  MPYRDVVTWNSIIGGCVKNA Y +AFK FRQML+SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKI+LNSIL  ALIDAYSKCGSIQIAKE
Subjt:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE

Query:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA
        IFSSVP ++ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEA
Subjt:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH
        YSII++MPIE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+F+SGDRSH
Subjt:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH

Query:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH
        PE DAVYKV+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIK+VS LLCRV+VVRDRIRFH
Subjt:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH

Query:  QFEAGMCSCGDCW
        QFE GMCSCGD W
Subjt:  QFEAGMCSCGDCW

XP_023516963.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 8.3e-253; identity: 84.99%)
Query:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV
        F + + + +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++V
Subjt:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV

Query:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE
        F  MPYRDVVTWNSIIGGCVKNA Y +AFK FRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKE
Subjt:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE

Query:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA
        IFSSVP ++ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEA
Subjt:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH
        YSII++MPIE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSN YCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSH
Subjt:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH

Query:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH
        PE DAVYKV+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFH
Subjt:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH

Query:  QFEAGMCSCGDCW
        QFE GMCSCGD W
Subjt:  QFEAGMCSCGDCW

XP_038879432.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Benincasa hispida] (e value: 2.7e-256; identity: 86.17%)
Query:  DFQTLYRVLEACRL-SLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYR
        D+QTL+RVLEACRL  L SKT IETHAR+IKFGYG+YP LITSLVSTYQ AG LNRVHQLLDLLCSKHLDLV MNLLIENF K+ E K A +VFY MPYR
Subjt:  DFQTLYRVLEACRL-SLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYR

Query:  DVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPH
        DVVTWNSIIGGCVKNA Y +AF+ FRQML SNIQPDGFTFAS+LNACA+LG  SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKE+FS VPH
Subjt:  DVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPH

Query:  NDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSM
        +D+SVWNAMIKGLAIHGLA DAL +F MMERENVLPDAVTFLGILTACNHGG+I++GRRYF+WM+SRYSIQPQLEHYGV+VDLYSRAGFLEEAYS+I++M
Subjt:  NDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVY
        PIEPDVVTWRTLLSGCRIYRNQELAEVAI NMSHRKSGDYVLLSNIYCSLN+WEHA TVR+MMK  GVRK CGKSWIEL GTIQNFKSGDRSHPE DAVY
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVY

Query:  KVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGMC
        +VL SLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEK+ALAYAILKTSPG KI+ISKNLRICDDCH WIKLVS++LCR IVVRDRIRFHQFE GMC
Subjt:  KVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGMC

Query:  SCGDCW
        SCGD W
Subjt:  SCGDCW

TrEMBL top hits
A0A6J1BZT0 pentatricopeptide repeat-containing protein At5g50990 isoform X2 (e value: 1.9e-255; identity: 82.99%)
Query:  LPLLLNLSALFTGLLTFFLINSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL
        LPL+ +    F   + FF  NS+SD+QTLY VLEACR S +SKTAIETHAR+IKFGYG+YPTLITSLVSTYQ AG LN V++LL LLCSKHLDLVAMN+ 
Subjt:  LPLLLNLSALFTGLLTFFLINSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLL

Query:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA
        IENFMKI E K A+KVFY MPYRDV+TWNSIIGGCVKNA Y +AF+ F +MLISNIQPDGFTFASIL A A+LGALSN Q VHA+MT+KK+ELNSIL SA
Subjt:  IENFMKIRESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISA

Query:  LIDAYSKCGSIQIAKEIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY
        LI  YSKCGSIQIAKEIFSSVPH+DISVWNAMIKGLAIHGLAMDALLVFSMMER++V PDAVTFLG LTACNHGG++E+GR+YFDWM+SRYSI+PQLEHY
Subjt:  LIDAYSKCGSIQIAKEIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHY

Query:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWI
        GVMVDLYSRAGFLEEAYS I +MPIEPDVVTWRTLLSGC+IYRNQELAEVAIAN+S  KSGDYVLLSNIYCS +RWE+AETVREMMK+KGVRK CGKSWI
Subjt:  GVMVDLYSRAGFLEEAYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWI

Query:  ELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVS
        ELAG+I  FKSGDRSHPEI+AVY+VLGSL+KRTR+EGYMPVTE VFMDISEEEKEENLSFHSEKLALAYAILKTSPG KI+ISKNLRICDDCHRWIKLVS
Subjt:  ELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVS

Query:  KLLCRVIVVRDRIRFHQFEAGMCSCGDCW
        ++LCRVIVVRDRIRFHQFE GMCSCGDCW
Subjt:  KLLCRVIVVRDRIRFHQFEAGMCSCGDCW

A0A6J1E1Y0 pentatricopeptide repeat-containing protein At5g50990 isoform X1 (e value: 3.1e-253; identity: 84.99%)
Query:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV
        F + + + +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++V
Subjt:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV

Query:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE
        F  MPYRDVVTWNSIIGGCVKNA Y +AFK FRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKE
Subjt:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE

Query:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA
        IFSSVP ++ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEA
Subjt:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH
        YSII++MPIE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSH
Subjt:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH

Query:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH
        PE DAVYKV+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKT PG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFH
Subjt:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH

Query:  QFEAGMCSCGDCW
        QFE GMCSCGD W
Subjt:  QFEAGMCSCGDCW

A0A6J1E5D9 pentatricopeptide repeat-containing protein At5g50990 isoform X2 (e value: 1.2e-252; identity: 86.14%)
Query:  FQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN
        VVTWNSIIGGCVKNA Y +AFK FRQML SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKIELNSIL  ALIDAYSKCGSIQIAKEIFSSVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+FKSGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGMCS
        V+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKT PG KI+ISKNLR+CDDCHRWIKLVS LLCRV+VVRDRIRFHQFE GMCS
Subjt:  VLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

A0A6J1JL20 pentatricopeptide repeat-containing protein At5g50990 isoform X2 (e value: 6.8e-253; identity: 85.74%)
Query:  FQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++VF  MPYRD
Subjt:  FQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN
        VVTWNSIIGGCVKNA Y +AFK FRQML+SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKI+LNSIL  ALIDAYSKCGSIQIAKEIFSSVP +
Subjt:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        +ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEAYSII++MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+F+SGDRSHPE DAVYK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYK

Query:  VLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGMCS
        V+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIK+VS LLCRV+VVRDRIRFHQFE GMCS
Subjt:  VLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGMCS

Query:  CGDCW
        CGD W
Subjt:  CGDCW

A0A6J1JLQ2 pentatricopeptide repeat-containing protein At5g50990 isoform X1 (e value: 1.8e-253; identity: 84.6%)
Query:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV
        F + + + +QTLYRVLEACRLS  +SKTA ETHAR+IKFGYGNYPTL+TSLVS YQ A  LNRVHQLL+LLCSKHLDLVAMNL I+NFMKI E KLA++V
Subjt:  FLINSVSDFQTLYRVLEACRLS-LDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKV

Query:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE
        F  MPYRDVVTWNSIIGGCVKNA Y +AFK FRQML+SNIQPDGFTFAS+LNACA+LGA SNTQWVHALMTQKKI+LNSIL  ALIDAYSKCGSIQIAKE
Subjt:  FYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKE

Query:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA
        IFSSVP ++ISVWNAMIKGLAIHGL+MDAL VF MMERENVLPDAVTFLGILTACNHGG+IE+GRR+FDWMK+RYSIQPQLEHYGVMVDLYSRAGFLEEA
Subjt:  IFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH
        YSII++MPIE DVVTWR LLSGCRIYRNQELAEVAIANMSHR SGDYVLLSNIYCSLNRWEHAE VRE MK+ GVRK CGKSWIEL G+IQ+F+SGDRSH
Subjt:  YSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSH

Query:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH
        PE DAVYKV+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEKLALAYAILKTSPG KI+ISKNLR+CDDCHRWIK+VS LLCRV+VVRDRIRFH
Subjt:  PEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFH

Query:  QFEAGMCSCGDCW
        QFE GMCSCGD W
Subjt:  QFEAGMCSCGDCW

SwissProt top hits
Q683I9 Pentatricopeptide repeat-containing protein At3g62890 (e value: 9.5e-111; identity: 40.97%)
Query:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        DF T   +L +    L       THA+++ FG    P + TSL++ Y   G L    ++ D   SK  DL A N ++  + K      A K+F  MP R+
Subjt:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS
        V++W+ +I G V    Y +A  LFR+M +       ++P+ FT +++L+AC +LGAL   +WVHA + +  +E++ +L +ALID Y+KCGS++ AK +F+
Subjt:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS

Query:  SV-PHNDISVWNAMIKGLAIHGLAMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAY
        ++    D+  ++AMI  LA++GL  +   +FS M   +N+ P++VTF+GIL AC H G+I EG+ YF  M   + I P ++HYG MVDLY R+G ++EA 
Subjt:  SV-PHNDISVWNAMIKGLAIHGLAMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAY

Query:  SIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDR
        S I SMP+EPDV+ W +LLSG R+  + +  E A   +  +    SG YVLLSN+Y    RW   + +R  M+ KG+ K+ G S++E+ G +  F  GD 
Subjt:  SIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDR

Query:  SHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIR
        S  E + +Y +L  +M+R R  GY+  T+ V +D++E++KE  LS+HSEKLA+A+ ++KT PG  + I KNLRIC DCH  +K++SKL  R IVVRD  R
Subjt:  SHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIR

Query:  FHQFEAGMCSCGDCW
        FH F  G CSC D W
Subjt:  FHQFEAGMCSCGDCW

Q9FI49 Pentatricopeptide repeat-containing protein At5g50990 (e value: 8.8e-165; identity: 54.9%)
Query:  NSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIM
        ++++D   L +VLE+C+   +SK  ++ HA++ K GYG YP+L+ S V+ Y+         +LL    S    +  +NL+IE+ MKI ES LA+KV    
Subjt:  NSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIM

Query:  PYRDVVTWNSIIGGCVKNAWYGKAFKLFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS
          ++V+TWN +IGG V+N  Y +A K  + ML  ++I+P+ F+FAS L ACA+LG L + +WVH+LM    IELN+IL SAL+D Y+KCG I  ++E+F 
Subjt:  PYRDVVTWNSIIGGCVKNAWYGKAFKLFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS

Query:  SVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSI
        SV  ND+S+WNAMI G A HGLA +A+ VFS ME E+V PD++TFLG+LT C+H G++EEG+ YF  M  R+SIQP+LEHYG MVDL  RAG ++EAY +
Subjt:  SVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSI

Query:  IMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEI
        I SMPIEPDVV WR+LLS  R Y+N EL E+AI N+S  KSGDYVLLSNIY S  +WE A+ VRE+M  +G+RK  GKSW+E  G I  FK+GD SH E 
Subjt:  IMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEI

Query:  DAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFE
         A+YKVL  L+++T+++G++  T+LV MD+SEEEKEENL++HSEKLALAY ILK+SPG +I I KN+R+C DCH WIK VSKLL RVI++RDRIRFH+FE
Subjt:  DAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFE

Query:  AGMCSCGDCW
         G+CSC D W
Subjt:  AGMCSCGDCW

Q9FI80 Pentatricopeptide repeat-containing protein At5g48910 (e value: 1.5e-108; identity: 39.65%)
Query:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK
        T   VL+AC  +   +   + H   +K+G+G    ++++LV  Y   G++     L        D++       +  ++V  N++I+ +M++ + K A  
Subjt:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK

Query:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK
        +F  M  R VV+WN++I G   N ++  A ++FR+M   +I+P+  T  S+L A ++LG+L   +W+H       I ++ +L SALID YSKCG I+ A 
Subjt:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK

Query:  EIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE
         +F  +P  ++  W+AMI G AIHG A DA+  F  M +  V P  V ++ +LTAC+HGG++EEGRRYF  M S   ++P++EHYG MVDL  R+G L+E
Subjt:  EIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE

Query:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG
        A   I++MPI+PD V W+ LL  CR+  N E+ + VA  + +M    SG YV LSN+Y S   W     +R  MK K +RK  G S I++ G +  F   
Subjt:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG

Query:  DRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR
        D SHP+   +  +L  +  + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH  IKL+SK+  R I VRDR
Subjt:  DRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR

Query:  IRFHQFEAGMCSCGDCW
         RFH F+ G CSC D W
Subjt:  IRFHQFEAGMCSCGDCW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 (e value: 5.4e-106; identity: 37.67%)
Query:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVT
        T   +L+AC      +   + HA++ K GY N    + SL+++Y   G     H L D +     D V+ N +I+ ++K  +  +A  +F  M  ++ ++
Subjt:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVT

Query:  WNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHNDIS
        W ++I G V+     +A +LF +M  S+++PD  + A+ L+ACA+LGAL   +W+H+ + + +I ++S+L   LID Y+KCG ++ A E+F ++    + 
Subjt:  WNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHNDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP
         W A+I G A HG   +A+  F  M++  + P+ +TF  +LTAC++ G++EEG+  F  M+  Y+++P +EHYG +VDL  RAG L+EA   I  MP++P
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP

Query:  DVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVY
        + V W  LL  CRI++N    +E+ E+ IA +     G YV  +NI+    +W+ A   R +MK +GV K+ G S I L GT   F +GDRSHPEI+ + 
Subjt:  DVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVY

Query:  KVLGSLMKRTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGM
             + ++    GY+P  E + +D + ++E+E  +  HSEKLA+ Y ++KT PG  I I KNLR+C DCH+  KL+SK+  R IV+RDR RFH F  G 
Subjt:  KVLGSLMKRTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGM

Query:  CSCGDCW
        CSCGD W
Subjt:  CSCGDCW

Q9FND7 Putative pentatricopeptide repeat-containing protein At5g40405 (e value: 4.2e-106; identity: 38.75%)
Query:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        D  T+  +++AC      +T ++ H   I+ G+ N P + T L+S Y   G L+  H++ + +     D V    ++    +  +   A K+F  MP RD
Subjt:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN
         + WN++I G  +     +A  +F  M +  ++ +G    S+L+AC +LGAL   +W H+ + + KI++   L + L+D Y+KCG ++ A E+F  +   
Subjt:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        ++  W++ + GLA++G     L +FS+M+++ V P+AVTF+ +L  C+  G ++EG+R+FD M++ + I+PQLEHYG +VDLY+RAG LE+A SII  MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHP---E
        ++P    W +LL   R+Y+N EL  +A   M   ++   G YVLLSNIY   N W++   VR+ MK+KGVRK  G S +E+ G +  F  GD+SHP   +
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHP---E

Query:  IDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQF
        IDAV+K    + +R R  GY   T  V  DI EEEKE+ L  HSEK A+A+ I+     V I I KNLR+C DCH+   ++SK+  R I+VRDR RFH F
Subjt:  IDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQF

Query:  EAGMCSCGDCW
        + G CSC   W
Subjt:  EAGMCSCGDCW

Arabidopsis top hits
AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.8e-112; identity: 40.97%)
Query:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        DF T   +L +    L       THA+++ FG    P + TSL++ Y   G L    ++ D   SK  DL A N ++  + K      A K+F  MP R+
Subjt:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS
        V++W+ +I G V    Y +A  LFR+M +       ++P+ FT +++L+AC +LGAL   +WVHA + +  +E++ +L +ALID Y+KCGS++ AK +F+
Subjt:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISN-----IQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS

Query:  SV-PHNDISVWNAMIKGLAIHGLAMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAY
        ++    D+  ++AMI  LA++GL  +   +FS M   +N+ P++VTF+GIL AC H G+I EG+ YF  M   + I P ++HYG MVDLY R+G ++EA 
Subjt:  SV-PHNDISVWNAMIKGLAIHGLAMDALLVFS-MMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAY

Query:  SIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDR
        S I SMP+EPDV+ W +LLSG R+  + +  E A   +  +    SG YVLLSN+Y    RW   + +R  M+ KG+ K+ G S++E+ G +  F  GD 
Subjt:  SIIMSMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDR

Query:  SHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIR
        S  E + +Y +L  +M+R R  GY+  T+ V +D++E++KE  LS+HSEKLA+A+ ++KT PG  + I KNLRIC DCH  +K++SKL  R IVVRD  R
Subjt:  SHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIR

Query:  FHQFEAGMCSCGDCW
        FH F  G CSC D W
Subjt:  FHQFEAGMCSCGDCW

AT5G40405.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.9e-107; identity: 38.75%)
Query:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD
        D  T+  +++AC      +T ++ H   I+ G+ N P + T L+S Y   G L+  H++ + +     D V    ++    +  +   A K+F  MP RD
Subjt:  DFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRD

Query:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN
         + WN++I G  +     +A  +F  M +  ++ +G    S+L+AC +LGAL   +W H+ + + KI++   L + L+D Y+KCG ++ A E+F  +   
Subjt:  VVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHN

Query:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP
        ++  W++ + GLA++G     L +FS+M+++ V P+AVTF+ +L  C+  G ++EG+R+FD M++ + I+PQLEHYG +VDLY+RAG LE+A SII  MP
Subjt:  DISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHP---E
        ++P    W +LL   R+Y+N EL  +A   M   ++   G YVLLSNIY   N W++   VR+ MK+KGVRK  G S +E+ G +  F  GD+SHP   +
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKS---GDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHP---E

Query:  IDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQF
        IDAV+K    + +R R  GY   T  V  DI EEEKE+ L  HSEK A+A+ I+     V I I KNLR+C DCH+   ++SK+  R I+VRDR RFH F
Subjt:  IDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQF

Query:  EAGMCSCGDCW
        + G CSC   W
Subjt:  EAGMCSCGDCW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.1e-109; identity: 39.65%)
Query:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK
        T   VL+AC  +   +   + H   +K+G+G    ++++LV  Y   G++     L        D++       +  ++V  N++I+ +M++ + K A  
Subjt:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLL-------DLLC-----SKHLDLVAMNLLIENFMKIRESKLAEK

Query:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK
        +F  M  R VV+WN++I G   N ++  A ++FR+M   +I+P+  T  S+L A ++LG+L   +W+H       I ++ +L SALID YSKCG I+ A 
Subjt:  VFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAK

Query:  EIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE
         +F  +P  ++  W+AMI G AIHG A DA+  F  M +  V P  V ++ +LTAC+HGG++EEGRRYF  M S   ++P++EHYG MVDL  R+G L+E
Subjt:  EIFSSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEE

Query:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG
        A   I++MPI+PD V W+ LL  CR+  N E+ + VA  + +M    SG YV LSN+Y S   W     +R  MK K +RK  G S I++ G +  F   
Subjt:  AYSIIMSMPIEPDVVTWRTLLSGCRIYRNQELAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSG

Query:  DRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR
        D SHP+   +  +L  +  + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH  IKL+SK+  R I VRDR
Subjt:  DRSHPEIDAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDR

Query:  IRFHQFEAGMCSCGDCW
         RFH F+ G CSC D W
Subjt:  IRFHQFEAGMCSCGDCW

AT5G50990.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.2e-166; %identity: 54.9)
Query:  NSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIM
        ++++D   L +VLE+C+   +SK  ++ HA++ K GYG YP+L+ S V+ Y+         +LL    S    +  +NL+IE+ MKI ES LA+KV    
Subjt:  NSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIM

Query:  PYRDVVTWNSIIGGCVKNAWYGKAFKLFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS
          ++V+TWN +IGG V+N  Y +A K  + ML  ++I+P+ F+FAS L ACA+LG L + +WVH+LM    IELN+IL SAL+D Y+KCG I  ++E+F 
Subjt:  PYRDVVTWNSIIGGCVKNAWYGKAFKLFRQML-ISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFS

Query:  SVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSI
        SV  ND+S+WNAMI G A HGLA +A+ VFS ME E+V PD++TFLG+LT C+H G++EEG+ YF  M  R+SIQP+LEHYG MVDL  RAG ++EAY +
Subjt:  SVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSI

Query:  IMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEI
        I SMPIEPDVV WR+LLS  R Y+N EL E+AI N+S  KSGDYVLLSNIY S  +WE A+ VRE+M  +G+RK  GKSW+E  G I  FK+GD SH E 
Subjt:  IMSMPIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEI

Query:  DAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFE
         A+YKVL  L+++T+++G++  T+LV MD+SEEEKEENL++HSEKLALAY ILK+SPG +I I KN+R+C DCH WIK VSKLL RVI++RDRIRFH+FE
Subjt:  DAVYKVLGSLMKRTRAEGYMPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFE

Query:  AGMCSCGDCW
         G+CSC D W
Subjt:  AGMCSCGDCW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.9e-107; %identity: 37.67)
Query:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVT
        T   +L+AC      +   + HA++ K GY N    + SL+++Y   G     H L D +     D V+ N +I+ ++K  +  +A  +F  M  ++ ++
Subjt:  TLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIRESKLAEKVFYIMPYRDVVT

Query:  WNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHNDIS
        W ++I G V+     +A +LF +M  S+++PD  + A+ L+ACA+LGAL   +W+H+ + + +I ++S+L   LID Y+KCG ++ A E+F ++    + 
Subjt:  WNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIFSSVPHNDIS

Query:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP
         W A+I G A HG   +A+  F  M++  + P+ +TF  +LTAC++ G++EEG+  F  M+  Y+++P +EHYG +VDL  RAG L+EA   I  MP++P
Subjt:  VWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEP

Query:  DVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVY
        + V W  LL  CRI++N    +E+ E+ IA +     G YV  +NI+    +W+ A   R +MK +GV K+ G S I L GT   F +GDRSHPEI+ + 
Subjt:  DVVTWRTLLSGCRIYRN----QELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVY

Query:  KVLGSLMKRTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGM
             + ++    GY+P  E + +D + ++E+E  +  HSEKLA+ Y ++KT PG  I I KNLR+C DCH+  KL+SK+  R IV+RDR RFH F  G 
Subjt:  KVLGSLMKRTRAEGYMPVTELVFMD-ISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGM

Query:  CSCGDCW
        CSCGD W
Subjt:  CSCGDCW


Sequences
CDS sequence
ATGACGCTTCCCTTGCTATTGAATTTGAGTGCCCTTTTCACCGGCCTATTAACTTTTTTCCTCATAAACTCTGTTTCAGATTTTCAAACCCTTTATCGTGTTCTTGAAGC
CTGCAGACTCTCCTTGGATTCCAAAACTGCTATTGAAACGCATGCGAGAGTTATTAAATTTGGATATGGAAACTACCCAACTCTCATCACCTCTCTAGTATCTACTTATC
AACATGCTGGTTACCTTAATCGTGTCCATCAACTTCTTGATCTACTCTGCTCGAAGCATCTCGATTTAGTTGCAATGAACTTACTTATTGAAAATTTTATGAAAATCAGG
GAAAGCAAACTTGCTGAAAAGGTATTTTATATAATGCCTTACCGTGATGTGGTAACATGGAACTCAATCATTGGAGGTTGTGTGAAGAATGCATGGTATGGCAAGGCATT
CAAACTCTTTAGACAGATGCTGATCTCAAATATTCAGCCGGACGGATTTACATTTGCTTCTATATTGAATGCATGTGCCAAGCTCGGAGCTCTAAGTAATACTCAGTGGG
TTCATGCTCTAATGACTCAGAAAAAAATTGAGCTTAATTCCATATTGATTTCTGCACTCATAGACGCGTACTCTAAGTGTGGTAGCATCCAAATTGCGAAGGAAATCTTT
TCTAGTGTCCCTCATAATGATATATCAGTTTGGAATGCGATGATCAAAGGGCTTGCAATTCACGGGCTTGCGATGGATGCATTATTGGTATTCTCGATGATGGAGCGTGA
GAATGTTCTCCCCGATGCTGTCACCTTTTTGGGTATTTTAACAGCATGCAACCATGGTGGTGTAATTGAAGAGGGTCGCAGGTATTTTGATTGGATGAAAAGCCGTTATT
CAATTCAGCCACAGCTTGAGCATTACGGAGTCATGGTTGATCTCTATAGCCGGGCTGGGTTTCTCGAAGAGGCCTATTCCATAATCATGTCAATGCCAATAGAGCCAGAT
GTTGTCACATGGAGGACGCTTCTCAGTGGTTGTAGAATTTACAGAAATCAAGAACTCGCAGAAGTTGCTATAGCGAACATGTCTCATCGTAAGAGTGGAGATTACGTGTT
ATTATCAAATATCTATTGTTCCCTCAATAGATGGGAGCATGCAGAAACAGTAAGAGAGATGATGAAAAACAAGGGAGTTCGTAAGATTTGTGGAAAAAGCTGGATTGAGT
TGGCAGGTACCATTCAAAACTTCAAGTCAGGTGATCGATCACATCCAGAAATCGATGCAGTATACAAAGTGCTGGGCAGTTTGATGAAGAGAACTCGGGCAGAGGGTTAT
ATGCCTGTCACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAAAAGGAAGAAAACCTATCGTTTCATAGCGAAAAGTTGGCATTGGCTTATGCCATCCTGAAAACTAG
TCCTGGGGTAAAAATCAATATATCAAAGAACCTACGGATCTGTGATGATTGTCATAGATGGATAAAACTAGTTTCAAAACTGCTCTGCAGAGTTATAGTAGTGAGGGATC
GGATCCGGTTTCATCAATTTGAAGCTGGCATGTGTTCCTGTGGGGATTGTTGGTAG
mRNA sequence
ATGACGCTTCCCTTGCTATTGAATTTGAGTGCCCTTTTCACCGGCCTATTAACTTTTTTCCTCATAAACTCTGTTTCAGATTTTCAAACCCTTTATCGTGTTCTTGAAGC
CTGCAGACTCTCCTTGGATTCCAAAACTGCTATTGAAACGCATGCGAGAGTTATTAAATTTGGATATGGAAACTACCCAACTCTCATCACCTCTCTAGTATCTACTTATC
AACATGCTGGTTACCTTAATCGTGTCCATCAACTTCTTGATCTACTCTGCTCGAAGCATCTCGATTTAGTTGCAATGAACTTACTTATTGAAAATTTTATGAAAATCAGG
GAAAGCAAACTTGCTGAAAAGGTATTTTATATAATGCCTTACCGTGATGTGGTAACATGGAACTCAATCATTGGAGGTTGTGTGAAGAATGCATGGTATGGCAAGGCATT
CAAACTCTTTAGACAGATGCTGATCTCAAATATTCAGCCGGACGGATTTACATTTGCTTCTATATTGAATGCATGTGCCAAGCTCGGAGCTCTAAGTAATACTCAGTGGG
TTCATGCTCTAATGACTCAGAAAAAAATTGAGCTTAATTCCATATTGATTTCTGCACTCATAGACGCGTACTCTAAGTGTGGTAGCATCCAAATTGCGAAGGAAATCTTT
TCTAGTGTCCCTCATAATGATATATCAGTTTGGAATGCGATGATCAAAGGGCTTGCAATTCACGGGCTTGCGATGGATGCATTATTGGTATTCTCGATGATGGAGCGTGA
GAATGTTCTCCCCGATGCTGTCACCTTTTTGGGTATTTTAACAGCATGCAACCATGGTGGTGTAATTGAAGAGGGTCGCAGGTATTTTGATTGGATGAAAAGCCGTTATT
CAATTCAGCCACAGCTTGAGCATTACGGAGTCATGGTTGATCTCTATAGCCGGGCTGGGTTTCTCGAAGAGGCCTATTCCATAATCATGTCAATGCCAATAGAGCCAGAT
GTTGTCACATGGAGGACGCTTCTCAGTGGTTGTAGAATTTACAGAAATCAAGAACTCGCAGAAGTTGCTATAGCGAACATGTCTCATCGTAAGAGTGGAGATTACGTGTT
ATTATCAAATATCTATTGTTCCCTCAATAGATGGGAGCATGCAGAAACAGTAAGAGAGATGATGAAAAACAAGGGAGTTCGTAAGATTTGTGGAAAAAGCTGGATTGAGT
TGGCAGGTACCATTCAAAACTTCAAGTCAGGTGATCGATCACATCCAGAAATCGATGCAGTATACAAAGTGCTGGGCAGTTTGATGAAGAGAACTCGGGCAGAGGGTTAT
ATGCCTGTCACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAAAAGGAAGAAAACCTATCGTTTCATAGCGAAAAGTTGGCATTGGCTTATGCCATCCTGAAAACTAG
TCCTGGGGTAAAAATCAATATATCAAAGAACCTACGGATCTGTGATGATTGTCATAGATGGATAAAACTAGTTTCAAAACTGCTCTGCAGAGTTATAGTAGTGAGGGATC
GGATCCGGTTTCATCAATTTGAAGCTGGCATGTGTTCCTGTGGGGATTGTTGGTAG
Protein sequence
MTLPLLLNLSALFTGLLTFFLINSVSDFQTLYRVLEACRLSLDSKTAIETHARVIKFGYGNYPTLITSLVSTYQHAGYLNRVHQLLDLLCSKHLDLVAMNLLIENFMKIR
ESKLAEKVFYIMPYRDVVTWNSIIGGCVKNAWYGKAFKLFRQMLISNIQPDGFTFASILNACAKLGALSNTQWVHALMTQKKIELNSILISALIDAYSKCGSIQIAKEIF
SSVPHNDISVWNAMIKGLAIHGLAMDALLVFSMMERENVLPDAVTFLGILTACNHGGVIEEGRRYFDWMKSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIMSMPIEPD
VVTWRTLLSGCRIYRNQELAEVAIANMSHRKSGDYVLLSNIYCSLNRWEHAETVREMMKNKGVRKICGKSWIELAGTIQNFKSGDRSHPEIDAVYKVLGSLMKRTRAEGY
MPVTELVFMDISEEEKEENLSFHSEKLALAYAILKTSPGVKINISKNLRICDDCHRWIKLVSKLLCRVIVVRDRIRFHQFEAGMCSCGDCW
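
The CDS and protein sequences listed above can be cross-checked against each other with a short script. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` and the variables `cds_start` and `cds_end` are hypothetical and introduced only for illustration. Only the first 27 and last 24 nucleotides of the CDS are used here, copied from the record, so that the example stays short.

```python
# Minimal sketch: translate nucleotides to amino acids with the standard
# genetic code (assumed here; not stated in the record itself).

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str) -> str:
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 27 and last 24 nucleotides of the CDS as listed in this record.
cds_start = "ATGACGCTTCCCTTGCTATTGAATTTG"
cds_end = "TGTTCCTGTGGGGATTGTTGGTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS into `translate` and dropping the trailing `*` should reproduce the protein sequence above, since the CDS length reported by the genome location (1596 bp) is a multiple of three.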