; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh09G012060 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh09G012060
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCma_Chr09:7953904..7955626
RNA-Seq ExpressionCmaCh09G012060
SyntenyCmaCh09G012060
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592283.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.3e-28996.81Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MAS+P+QPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE DSKAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRG PLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRT+IGIARQVFDEMLERSVVSWTAM+SGYARLGDIDNAMAL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALS+CGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGV+PD VTFVGVLNACTHGGLVEKGYS+F+MMRQDYDIEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE
        LGRAGRFEEAMEVVRGM+IEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVR+VRKLLKEQNAYKIPGCSWIE +
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE

Query:  D
        +
Subjt:  D

KAG7025121.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-28691.34Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MAS+P+QPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE DSKAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRG PLPNHFIYPHVLKSCPE+LESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRT+IGIARQVFDEMLERSVVSWTAM+SGYARLGDIDNAMAL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALS+CGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGV+PD                             DYDIEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE--
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVR+VRKLLKEQNAYKIPGCSWIE  
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE--

Query:  --QEDISNSSGEKFEPREDRQALPSSSPLPLAPSSIDNRCKTE
          QEDI +SS EK EPREDRQALPSSSPLPLAPSSIDNRCKTE
Subjt:  --QEDISNSSGEKFEPREDRQALPSSSPLPLAPSSIDNRCKTE

XP_022930176.1 pentatricopeptide repeat-containing protein At1g33350 [Cucurbita moschata]1.2e-28896.81Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MAS+P+QPHLNQLVLSVLEKCSHLNHLKQLQGFLISLG SQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE DSKAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRG PLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRT+IGIARQVFDEMLERSVVSWTAM+SGYARLGDIDNAMAL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALS+CGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQC DGV+PD VTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE
        LGRAGRFEEAMEVV+GMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVR+VRKLLKEQNAYKIPGCSWIE +
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE

Query:  D
        +
Subjt:  D

XP_022974048.1 pentatricopeptide repeat-containing protein At1g33350 [Cucurbita maxima]1.4e-29599.4Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE +
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE

Query:  D
        +
Subjt:  D

XP_023521314.1 pentatricopeptide repeat-containing protein At1g33350-like [Cucurbita pepo subsp. pepo]3.6e-28896.61Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MASVP+QPHLNQLVLSVLEKCSHLN LKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE DSKAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRG PLPNHFIYPHVLKSCPELLESNGTKMVH QVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAM+SGYARLGDIDNAMAL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERDIPAWNALIAGCAQNGFF EAIGLFKRMVSLALEGNKERETKPNK+TVASALS+CGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQC DGV+PD VTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE
        LGRAGRFEEAMEVV+GMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVR+VRKLLKEQNAYKIPGCSWIE +
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE

Query:  D
        +
Subjt:  D

TrEMBL top hitse value%identityAlignment
A0A1S3CAA7 pentatricopeptide repeat-containing protein At1g333501.1e-25886.85Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        M+SV +QPHL QLV++ LEKCS+LNHLKQLQGFLIS GHSQTQF+AFKLVRFCNLTLTDLCY+R+IFD+L+SPNVYLYTAMITAYA+ PD KAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVR GA  PNHFIYPHVLKSCP++L SN TKMVH QVLKSGFG YPVVQTAIVD+YSRF + IG ARQ+FDEM+ERSVVSWTAMISGYARLG+ D+A+ L
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEG-NKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGK
        FESMPERD+PAWNALIAGCAQNGFFCEAI LFK+MVSLALEG N +RE KPNK T+ASALS+CG+TGMLHLGKWIHGYVFKTY GQDSFISNALLDMYGK
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEG-NKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGK

Query:  CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLID
        CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLF ELVQC DGV+PD VTFVGVLNACTHGGLVEKGYSYF+MMR+DYDIEPQIEHFGCLID
Subjt:  CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLID

Query:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQ
        LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLN CKIHGR DLAEYSVKKLIEMDP+NGGYRIMLANIYAEL  WDEVRKVRKLLKE+NAYK PGCSWIE 
Subjt:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQ

Query:  ED
        ++
Subjt:  ED

A0A5A7T7Y7 Pentatricopeptide repeat-containing protein4.9e-25986.53Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        M+SV +QPHL QLV++ LEKCS+LNHLKQLQGFLIS GHSQTQF+AFKLVRFCNLTLTDLCY+R+IFD+L+SPNVYLYTAMITAYA+ PD KAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVR GA  PNHFIYPHVLKSCP++L SN TKMVH QVLKSGFG YPVVQTAIVD+YSRF + IG ARQ+FDEM+ERSVVSWTAMISGYARLG+ D+A+ L
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEG-NKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGK
        FESMPERD+PAWNALIAGCAQNGFFCEAI LFK+MVSLALEG N +RE KPNK T+ASALS+CG+TGMLHLGKWIHGYVFKTY GQDSFISNALLDMYGK
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEG-NKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGK

Query:  CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLID
        CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLF ELVQC DGV+PD VTFVGVLNACTHGGLVEKGYSYF+MMR+DYDIEPQIEHFGCLID
Subjt:  CGNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLID

Query:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQ
        LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLN CKIHGR DLAEYSVKKLIEMDP+NGGYRIMLANIYAEL  WDEVRKVRKLLKE+NAYK PGCSWIE 
Subjt:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQ

Query:  EDISN
        ++  N
Subjt:  EDISN

A0A6J1DQX1 pentatricopeptide repeat-containing protein At1g333503.9e-26488.02Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MASVP+QPHLNQLVLSVLEKCSHLNHLKQ+QGFLISLGHSQTQF+AFKLVRFCNLTL +L Y+RFIFD L+SPNVYLYTAMITAYAS+PDSKAAF+LYR+
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRG P PNHFIYPHVLKSCPE+ ESN T+MVHAQ+LKSGFG YPVVQTAIVD+YS+F +DIGIARQ+FDEM+ERSVVSWTAMISGYARLG+ID+A+AL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERD+PAWNA+IAG AQNGFFCEAI LF+RMVSLA+E ++ERE KPNKITVASALS+CGHTGMLHLGKWIHGYVFK+ L QDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLK+A+RVFDMITLKSLTSWNSLINCLALHGHS SAIDLF++LV+C DGV+PD VTFVGVLNACTHGGLVEKGYSYF+MMRQDY IEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELE WDEVRKVRKLLKEQNAYK PGCSWIE +
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE

Query:  D
        +
Subjt:  D

A0A6J1EQ76 pentatricopeptide repeat-containing protein At1g333506.0e-28996.81Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MAS+P+QPHLNQLVLSVLEKCSHLNHLKQLQGFLISLG SQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE DSKAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRG PLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRT+IGIARQVFDEMLERSVVSWTAM+SGYARLGDIDNAMAL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALS+CGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQC DGV+PD VTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE
        LGRAGRFEEAMEVV+GMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVR+VRKLLKEQNAYKIPGCSWIE +
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE

Query:  D
        +
Subjt:  D

A0A6J1IF22 pentatricopeptide repeat-containing protein At1g333506.6e-29699.4Show/hide
Query:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
        MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN
Subjt:  MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRN

Query:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
        MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL
Subjt:  MVRRGAPLPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMAL

Query:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
        FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC
Subjt:  FESMPERDIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKC

Query:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
        GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL
Subjt:  GNLKVARRVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE +
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQE

Query:  D
        +
Subjt:  D

SwissProt top hitse value%identityAlignment
Q9C501 Pentatricopeptide repeat-containing protein At1g333501.1e-16255.49Show/hide
Query:  LNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE--PDSKAAFLLYRNMVRRGAP
        LNQ + +V+ K  HLNHLKQ+Q F+I  G S + F  FKL+RFC L L +L Y+RFIFD  S PN +LY A++TAY+S     + +AF  +R MV R  P
Subjt:  LNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE--PDSKAAFLLYRNMVRRGAP

Query:  LPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPER
         PNHFIYP VLKS P L  +  T +VH  + KSGF  Y VVQTA++ +Y+   + I +ARQ+FDEM ER+VVSWTAM+SGYAR GDI NA+ALFE MPER
Subjt:  LPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPER

Query:  DIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVAR
        D+P+WNA++A C QNG F EA+ LF+RM++       E   +PN++TV   LS+C  TG L L K IH + ++  L  D F+SN+L+D+YGKCGNL+ A 
Subjt:  DIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVAR

Query:  RVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCR-DGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGR
         VF M + KSLT+WNS+INC ALHG S  AI +F E+++   + ++PD +TF+G+LNACTHGGLV KG  YF +M   + IEP+IEH+GCLIDLLGRAGR
Subjt:  RVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCR-DGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGR

Query:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE
        F+EA+EV+  M ++ DE +WGSLLN CKIHG LDLAE +VK L+ ++P NGGY  M+AN+Y E+ NW+E R+ RK++K QNAYK PG S IE
Subjt:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE

Q9FIF7 Putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic8.0e-9736.89Show/hide
Query:  VLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFI
        ++SVL  C ++ H+  +   +I   H Q  F  F+L+R C+ TL  + Y+  +F ++S+PNVYLYTAMI  + S   S     LY  M+     LP++++
Subjt:  VLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFI

Query:  YPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWN
           VLK+C    +    + +HAQVLK GFG    V   +++ Y +   ++  A+++FDEM +R  V+ T MI+ Y+  G I  A+ LF+ +  +D   W 
Subjt:  YPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWN

Query:  ALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI
        A+I G  +N    +A+ LF+ M        +      N+ T    LS+C   G L LG+W+H +V    +   +F+ NAL++MY +CG++  ARRVF ++
Subjt:  ALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI

Query:  TLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEV
          K + S+N++I+ LA+HG S  AI+ F ++V    G +P+ VT V +LNAC+HGGL++ G   F  M++ +++EPQIEH+GC++DLLGR GR EEA   
Subjt:  TLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEV

Query:  VRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQED
        +  + IEPD ++ G+LL+ CKIHG ++L E   K+L E +  + G  ++L+N+YA    W E  ++R+ +++    K PGCS IE ++
Subjt:  VRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQED

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665203.0e-9635.38Show/hide
Query:  LSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFC-NLTLTD-LCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHF
        +S L++CS    LKQ+   ++  G  Q  +   K + FC + T +D L Y++ +FD    P+ +L+  MI  ++   + + + LLY+ M+   AP  N +
Subjt:  LSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFC-NLTLTD-LCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHF

Query:  IYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAW
         +P +LK+C  L     T  +HAQ+ K G+        +++++Y+    +  +A  +FD + E   VSW ++I GY + G +D A+ LF  M E++  +W
Subjt:  IYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAW

Query:  NALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDM
          +I+G  Q     EA+ LF  M        +  + +P+ +++A+ALS+C   G L  GKWIH Y+ KT +  DS +   L+DMY KCG ++ A  VF  
Subjt:  NALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDM

Query:  ITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAME
        I  KS+ +W +LI+  A HGH   AI  F+E+ +   G++P+ +TF  VL AC++ GLVE+G   F  M +DY+++P IEH+GC++DLLGRAG  +EA  
Subjt:  ITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAME

Query:  VVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKF
         ++ M ++P+ V+WG+LL  C+IH  ++L E   + LI +DP +GG  +  ANI+A  + WD+  + R+L+KEQ   K+PGCS I  E  ++   +G++ 
Subjt:  VVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKF

Query:  EPREDR
         P  ++
Subjt:  EPREDR

Q9FNN7 Pentatricopeptide repeat-containing protein At5g085103.0e-9637.13Show/hide
Query:  LNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHVLKSCPE
        +N +KQL    +  G  +T+    +L     L + +L Y+R +FDH  +   +LY  +I AY        + +LY N++      P+H  +  +  +   
Subjt:  LNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHVLKSCPE

Query:  LLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIAGCAQNG
           +   +++H+Q  +SGF       T ++ AY++    +  AR+VFDEM +R V  W AMI+GY R GD+  AM LF+SMP +++ +W  +I+G +QNG
Subjt:  LLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIAGCAQNG

Query:  FFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI-TLKSLTSWN
         + EA+ +F  M        K++  KPN ITV S L +C + G L +G+ + GY  +     + ++ NA ++MY KCG + VA+R+F+ +   ++L SWN
Subjt:  FFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI-TLKSLTSWN

Query:  SLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPD
        S+I  LA HG    A+ LF +++  R+G +PDAVTFVG+L AC HGG+V KG   FK M + + I P++EH+GC+IDLLGR G+ +EA ++++ M ++PD
Subjt:  SLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPD

Query:  EVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSW
         VVWG+LL  C  HG +++AE + + L +++P N G  ++++NIYA  E WD V ++RKL+K++   K  G S+
Subjt:  EVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSW

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205401.0e-10438.43Show/hide
Query:  LEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHV
        L++    N  K++   +I  G SQ+ F   K+V FC+  + D+ Y+  +F+ +S+PNV+LY ++I AY           +Y+ ++R+   LP+ F +P +
Subjt:  LEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHV

Query:  LKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIA
         KSC  L      K VH  + K G   + V + A++D Y +F  D+  A +VFDEM ER V+SW +++SGYARLG +  A  LF  M ++ I +W A+I+
Subjt:  LKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIA

Query:  GCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMITLKS
        G    G + EA+  F+ M    +E        P++I++ S L SC   G L LGKWIH Y  +    + + + NAL++MY KCG +  A ++F  +  K 
Subjt:  GCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMITLKS

Query:  LTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM
        + SW+++I+  A HG++  AI+ F E+ + +  V+P+ +TF+G+L+AC+H G+ ++G  YF MMRQDY IEP+IEH+GCLID+L RAG+ E A+E+ + M
Subjt:  LTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM

Query:  NIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKFEP
         ++PD  +WGSLL+ C+  G LD+A  ++  L+E++PE+ G  ++LANIYA+L  W++V ++RK+++ +N  K PG S IE  +I     SG+  +P
Subjt:  NIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKFEP

Arabidopsis top hitse value%identityAlignment
AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein7.8e-16455.49Show/hide
Query:  LNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE--PDSKAAFLLYRNMVRRGAP
        LNQ + +V+ K  HLNHLKQ+Q F+I  G S + F  FKL+RFC L L +L Y+RFIFD  S PN +LY A++TAY+S     + +AF  +R MV R  P
Subjt:  LNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASE--PDSKAAFLLYRNMVRRGAP

Query:  LPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPER
         PNHFIYP VLKS P L  +  T +VH  + KSGF  Y VVQTA++ +Y+   + I +ARQ+FDEM ER+VVSWTAM+SGYAR GDI NA+ALFE MPER
Subjt:  LPNHFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPER

Query:  DIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVAR
        D+P+WNA++A C QNG F EA+ LF+RM++       E   +PN++TV   LS+C  TG L L K IH + ++  L  D F+SN+L+D+YGKCGNL+ A 
Subjt:  DIPAWNALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVAR

Query:  RVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCR-DGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGR
         VF M + KSLT+WNS+INC ALHG S  AI +F E+++   + ++PD +TF+G+LNACTHGGLV KG  YF +M   + IEP+IEH+GCLIDLLGRAGR
Subjt:  RVFDMITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCR-DGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGR

Query:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE
        F+EA+EV+  M ++ DE +WGSLLN CKIHG LDLAE +VK L+ ++P NGGY  M+AN+Y E+ NW+E R+ RK++K QNAYK PG S IE
Subjt:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIE

AT2G20540.1 mitochondrial editing factor 217.4e-10638.43Show/hide
Query:  LEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHV
        L++    N  K++   +I  G SQ+ F   K+V FC+  + D+ Y+  +F+ +S+PNV+LY ++I AY           +Y+ ++R+   LP+ F +P +
Subjt:  LEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHV

Query:  LKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIA
         KSC  L      K VH  + K G   + V + A++D Y +F  D+  A +VFDEM ER V+SW +++SGYARLG +  A  LF  M ++ I +W A+I+
Subjt:  LKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIA

Query:  GCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMITLKS
        G    G + EA+  F+ M    +E        P++I++ S L SC   G L LGKWIH Y  +    + + + NAL++MY KCG +  A ++F  +  K 
Subjt:  GCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMITLKS

Query:  LTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM
        + SW+++I+  A HG++  AI+ F E+ + +  V+P+ +TF+G+L+AC+H G+ ++G  YF MMRQDY IEP+IEH+GCLID+L RAG+ E A+E+ + M
Subjt:  LTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM

Query:  NIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKFEP
         ++PD  +WGSLL+ C+  G LD+A  ++  L+E++PE+ G  ++LANIYA+L  W++V ++RK+++ +N  K PG S IE  +I     SG+  +P
Subjt:  NIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKFEP

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-9737.13Show/hide
Query:  LNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHVLKSCPE
        +N +KQL    +  G  +T+    +L     L + +L Y+R +FDH  +   +LY  +I AY        + +LY N++      P+H  +  +  +   
Subjt:  LNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFIYPHVLKSCPE

Query:  LLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIAGCAQNG
           +   +++H+Q  +SGF       T ++ AY++    +  AR+VFDEM +R V  W AMI+GY R GD+  AM LF+SMP +++ +W  +I+G +QNG
Subjt:  LLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIAGCAQNG

Query:  FFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI-TLKSLTSWN
         + EA+ +F  M        K++  KPN ITV S L +C + G L +G+ + GY  +     + ++ NA ++MY KCG + VA+R+F+ +   ++L SWN
Subjt:  FFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI-TLKSLTSWN

Query:  SLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPD
        S+I  LA HG    A+ LF +++  R+G +PDAVTFVG+L AC HGG+V KG   FK M + + I P++EH+GC+IDLLGR G+ +EA ++++ M ++PD
Subjt:  SLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPD

Query:  EVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSW
         VVWG+LL  C  HG +++AE + + L +++P N G  ++++NIYA  E WD V ++RKL+K++   K  G S+
Subjt:  EVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSW

AT5G59200.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-9836.89Show/hide
Query:  VLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFI
        ++SVL  C ++ H+  +   +I   H Q  F  F+L+R C+ TL  + Y+  +F ++S+PNVYLYTAMI  + S   S     LY  M+     LP++++
Subjt:  VLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHFI

Query:  YPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWN
           VLK+C    +    + +HAQVLK GFG    V   +++ Y +   ++  A+++FDEM +R  V+ T MI+ Y+  G I  A+ LF+ +  +D   W 
Subjt:  YPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWN

Query:  ALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI
        A+I G  +N    +A+ LF+ M        +      N+ T    LS+C   G L LG+W+H +V    +   +F+ NAL++MY +CG++  ARRVF ++
Subjt:  ALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMI

Query:  TLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEV
          K + S+N++I+ LA+HG S  AI+ F ++V    G +P+ VT V +LNAC+HGGL++ G   F  M++ +++EPQIEH+GC++DLLGR GR EEA   
Subjt:  TLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEV

Query:  VRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQED
        +  + IEPD ++ G+LL+ CKIHG ++L E   K+L E +  + G  ++L+N+YA    W E  ++R+ +++    K PGCS IE ++
Subjt:  VRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQED

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-9735.38Show/hide
Query:  LSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFC-NLTLTD-LCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHF
        +S L++CS    LKQ+   ++  G  Q  +   K + FC + T +D L Y++ +FD    P+ +L+  MI  ++   + + + LLY+ M+   AP  N +
Subjt:  LSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFC-NLTLTD-LCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPNHF

Query:  IYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAW
         +P +LK+C  L     T  +HAQ+ K G+        +++++Y+    +  +A  +FD + E   VSW ++I GY + G +D A+ LF  M E++  +W
Subjt:  IYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAW

Query:  NALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDM
          +I+G  Q     EA+ LF  M        +  + +P+ +++A+ALS+C   G L  GKWIH Y+ KT +  DS +   L+DMY KCG ++ A  VF  
Subjt:  NALIAGCAQNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDM

Query:  ITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAME
        I  KS+ +W +LI+  A HGH   AI  F+E+ +   G++P+ +TF  VL AC++ GLVE+G   F  M +DY+++P IEH+GC++DLLGRAG  +EA  
Subjt:  ITLKSLTSWNSLINCLALHGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAME

Query:  VVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKF
         ++ M ++P+ V+WG+LL  C+IH  ++L E   + LI +DP +GG  +  ANI+A  + WD+  + R+L+KEQ   K+PGCS I  E  ++   +G++ 
Subjt:  VVRGMNIEPDEVVWGSLLNGCKIHGRLDLAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNS--SGEKF

Query:  EPREDR
         P  ++
Subjt:  EPREDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTGTTCCAATGCAACCTCATTTGAATCAATTAGTTCTATCGGTGCTCGAAAAATGCAGCCATCTCAATCACCTCAAGCAGCTCCAAGGCTTTCTCATTTCTCT
CGGTCACTCACAAACCCAGTTCTACGCCTTCAAGCTCGTTCGCTTCTGTAACCTTACTCTTACTGACTTATGTTATTCTCGCTTCATTTTTGATCATCTATCTTCCCCGA
ATGTCTATCTCTATACTGCAATGATCACTGCTTATGCCTCGGAGCCCGATTCTAAAGCCGCGTTTCTCTTGTACCGTAACATGGTCCGCCGAGGAGCTCCTCTACCCAAC
CATTTTATTTATCCCCATGTGCTGAAGTCCTGCCCTGAGCTTTTGGAGTCCAATGGCACGAAAATGGTTCATGCCCAGGTTCTGAAATCTGGATTTGGTGGATACCCAGT
TGTCCAAACGGCCATTGTTGATGCCTATTCGAGATTCCGTACGGATATTGGAATTGCCCGACAGGTGTTCGACGAAATGCTTGAGAGAAGTGTAGTGTCTTGGACGGCTA
TGATTTCAGGGTATGCGAGGCTTGGGGACATTGATAATGCAATGGCGTTGTTTGAGAGTATGCCTGAGAGGGATATCCCTGCTTGGAATGCTCTTATTGCTGGATGTGCT
CAAAATGGATTCTTCTGTGAAGCAATTGGGCTGTTCAAAAGAATGGTTTCATTGGCTTTGGAGGGTAATAAGGAGCGTGAAACCAAGCCGAATAAGATCACAGTTGCATC
TGCACTATCCTCTTGTGGACATACTGGGATGCTTCATCTTGGTAAGTGGATTCATGGTTATGTTTTCAAAACTTATCTTGGTCAAGATTCATTTATCTCAAATGCTCTGT
TAGATATGTATGGGAAATGTGGCAATTTGAAAGTTGCTAGGAGAGTTTTCGATATGATTACTTTAAAAAGCTTGACATCATGGAATTCCTTGATAAATTGTCTTGCACTC
CATGGTCATAGTGGAAGTGCAATTGATTTGTTCTTAGAGTTAGTTCAATGTAGGGATGGCGTGCAGCCAGATGCGGTTACTTTTGTGGGTGTGTTGAATGCTTGTACTCA
TGGAGGATTAGTTGAAAAGGGTTACTCATACTTCAAAATGATGAGGCAGGATTACGACATCGAGCCTCAGATCGAACACTTTGGGTGCTTGATCGACCTTCTTGGTCGTG
CAGGGCGGTTCGAGGAAGCAATGGAAGTTGTGAGGGGAATGAATATTGAACCAGATGAAGTTGTATGGGGTTCTTTACTAAATGGATGCAAAATCCATGGCCGTCTAGAT
TTAGCTGAATACTCGGTTAAAAAGTTGATCGAGATGGATCCAGAAAATGGCGGTTATAGAATTATGCTAGCAAATATATACGCCGAGCTTGAAAACTGGGACGAGGTTCG
TAAGGTTCGGAAACTTTTGAAGGAGCAAAATGCTTACAAAATACCAGGTTGCAGTTGGATTGAGCAAGAGGATATCAGCAATTCATCAGGCGAGAAATTCGAACCACGAG
AGGATCGGCAAGCTCTACCGAGCTCTTCTCCCCTTCCATTGGCCCCGTCTTCCATTGACAACAGGTGCAAGACAGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCTGTTCCAATGCAACCTCATTTGAATCAATTAGTTCTATCGGTGCTCGAAAAATGCAGCCATCTCAATCACCTCAAGCAGCTCCAAGGCTTTCTCATTTCTCT
CGGTCACTCACAAACCCAGTTCTACGCCTTCAAGCTCGTTCGCTTCTGTAACCTTACTCTTACTGACTTATGTTATTCTCGCTTCATTTTTGATCATCTATCTTCCCCGA
ATGTCTATCTCTATACTGCAATGATCACTGCTTATGCCTCGGAGCCCGATTCTAAAGCCGCGTTTCTCTTGTACCGTAACATGGTCCGCCGAGGAGCTCCTCTACCCAAC
CATTTTATTTATCCCCATGTGCTGAAGTCCTGCCCTGAGCTTTTGGAGTCCAATGGCACGAAAATGGTTCATGCCCAGGTTCTGAAATCTGGATTTGGTGGATACCCAGT
TGTCCAAACGGCCATTGTTGATGCCTATTCGAGATTCCGTACGGATATTGGAATTGCCCGACAGGTGTTCGACGAAATGCTTGAGAGAAGTGTAGTGTCTTGGACGGCTA
TGATTTCAGGGTATGCGAGGCTTGGGGACATTGATAATGCAATGGCGTTGTTTGAGAGTATGCCTGAGAGGGATATCCCTGCTTGGAATGCTCTTATTGCTGGATGTGCT
CAAAATGGATTCTTCTGTGAAGCAATTGGGCTGTTCAAAAGAATGGTTTCATTGGCTTTGGAGGGTAATAAGGAGCGTGAAACCAAGCCGAATAAGATCACAGTTGCATC
TGCACTATCCTCTTGTGGACATACTGGGATGCTTCATCTTGGTAAGTGGATTCATGGTTATGTTTTCAAAACTTATCTTGGTCAAGATTCATTTATCTCAAATGCTCTGT
TAGATATGTATGGGAAATGTGGCAATTTGAAAGTTGCTAGGAGAGTTTTCGATATGATTACTTTAAAAAGCTTGACATCATGGAATTCCTTGATAAATTGTCTTGCACTC
CATGGTCATAGTGGAAGTGCAATTGATTTGTTCTTAGAGTTAGTTCAATGTAGGGATGGCGTGCAGCCAGATGCGGTTACTTTTGTGGGTGTGTTGAATGCTTGTACTCA
TGGAGGATTAGTTGAAAAGGGTTACTCATACTTCAAAATGATGAGGCAGGATTACGACATCGAGCCTCAGATCGAACACTTTGGGTGCTTGATCGACCTTCTTGGTCGTG
CAGGGCGGTTCGAGGAAGCAATGGAAGTTGTGAGGGGAATGAATATTGAACCAGATGAAGTTGTATGGGGTTCTTTACTAAATGGATGCAAAATCCATGGCCGTCTAGAT
TTAGCTGAATACTCGGTTAAAAAGTTGATCGAGATGGATCCAGAAAATGGCGGTTATAGAATTATGCTAGCAAATATATACGCCGAGCTTGAAAACTGGGACGAGGTTCG
TAAGGTTCGGAAACTTTTGAAGGAGCAAAATGCTTACAAAATACCAGGTTGCAGTTGGATTGAGCAAGAGGATATCAGCAATTCATCAGGCGAGAAATTCGAACCACGAG
AGGATCGGCAAGCTCTACCGAGCTCTTCTCCCCTTCCATTGGCCCCGTCTTCCATTGACAACAGGTGCAAGACAGAGTGA
Protein sequenceShow/hide protein sequence
MASVPMQPHLNQLVLSVLEKCSHLNHLKQLQGFLISLGHSQTQFYAFKLVRFCNLTLTDLCYSRFIFDHLSSPNVYLYTAMITAYASEPDSKAAFLLYRNMVRRGAPLPN
HFIYPHVLKSCPELLESNGTKMVHAQVLKSGFGGYPVVQTAIVDAYSRFRTDIGIARQVFDEMLERSVVSWTAMISGYARLGDIDNAMALFESMPERDIPAWNALIAGCA
QNGFFCEAIGLFKRMVSLALEGNKERETKPNKITVASALSSCGHTGMLHLGKWIHGYVFKTYLGQDSFISNALLDMYGKCGNLKVARRVFDMITLKSLTSWNSLINCLAL
HGHSGSAIDLFLELVQCRDGVQPDAVTFVGVLNACTHGGLVEKGYSYFKMMRQDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRLD
LAEYSVKKLIEMDPENGGYRIMLANIYAELENWDEVRKVRKLLKEQNAYKIPGCSWIEQEDISNSSGEKFEPREDRQALPSSSPLPLAPSSIDNRCKTE