CuGenDBv2

Tan0013835 (gene) of Snake gourd v1 genome

Gene ID: Tan0013835
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG10:57021285..57023018
RNA-Seq Expression: Tan0013835
Synteny: Tan0013835
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | E value | % identity
KAG6592283.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | 1.2e-275 | 87.52
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        MAS+ IQPHLNQLVL+VLEKCSH NHLKQLQGFLISLGHSQTQF+AFKLVRFCN TLTDLCY+RFIFDHL+SPNVYLYTAMITAYAS+ D KAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRG P PNHFIYPHVLKSCPE+LESNGT++VHAQVLKSGFG YPVVQTAIVD+YSRF + IGIARQ+FDEM+ERSVVSWTAM+SGYARLG++D+A+AL
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC
        FESMPERD+PAWNALIAGCAQNGFFCEAI LFKRMVS+ALEG+KERE KPNKIT+ASALSACGHTGMLHLGKWIHGYVFK+YLGQDSFISNALLDMYGKC
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC

Query:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL
        GNLK+ARRVFDMIT+KSLTSWNSLINCLALHGHS SAIDLF+ELVQC DGVKPDGVTFVGVLNACTHGGLVEKGYS+F+MM +DYDIEPQIEHFGCLIDL
Subjt:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV-
        LGRAGRFEEAMEVVRGM+IEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELE WDEVR+VRKLLKEQNAYK PGCSWIEV 
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV-

Query:  ---------RRSNDKRIVKLYPALLSFHW
                 R+SN +RI KLY ALL FHW
Subjt:  ---------RRSNDKRIVKLYPALLSFHW

XP_022156007.1 pentatricopeptide repeat-containing protein At1g33350 [Momordica charantia] | 2.4e-273 | 90.02
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        MASV IQPHLNQLVL+VLEKCSH NHLKQ+QGFLISLGHSQTQFFAFKLVRFCN TL +L YARFIFD L SPNVYLYTAMITAYASQPD KAAF+LYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRGTPRPNHFIYPHVLKSCPEV ESN TQ+VHAQ+LKSGFGRYPVVQTAIVDSYS+FCS+IGIARQMFDEMIERSVVSWTAMISGYARLGN+D A+AL
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC
        FESMPERDVPAWNA+IAG AQNGFFCEAIWLF+RMVS+A+E D+ERENKPNKIT+ASALSACGHTGMLHLGKWIHGYVFKS L QDSFISNALLDMYGKC
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC

Query:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL
        GNLKIA+RVFDMIT+KSLTSWNSLINCLALHGHSESAIDLFV+LV+CGDGVKPDGVTFVGVLNACTHGGLVEKGYSYF+MM +DY IEPQIEHFGCLIDL
Subjt:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV 
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR

Query:  RSNDKRIVKLY
           D ++ + Y
Subjt:  RSNDKRIVKLY

XP_022930176.1 pentatricopeptide repeat-containing protein At1g33350 [Cucurbita moschata] | 8.6e-271 | 88.65
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        MAS+ IQPHLNQLVL+VLEKCSH NHLKQLQGFLISLG SQTQF+AFKLVRFCN TLTDLCY+RFIFDHL+SPNVYLYTAMITAYAS+ D KAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRG P PNHFIYPHVLKSCPE+LESNGT++VHAQVLKSGFG YPVVQTAIVD+YSRF + IGIARQ+FDEM+ERSVVSWTAM+SGYARLG++D+A+AL
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC
        FESMPERD+PAWNALIAGCAQNGFFCEAI LFKRMVS+ALEG+KERE KPNKIT+ASALSACGHTGMLHLGKWIHGYVFK+YLGQDSFISNALLDMYGKC
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC

Query:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL
        GNLK+ARRVFDMIT+KSLTSWNSLINCLALHGHS SAIDLF+ELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMM +DYDIEPQIEHFGCLIDL
Subjt:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR
        LGRAGRFEEAMEVV+GMNIEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELE WDEVR+VRKLLKEQNAYK PGCSWIEV 
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR

Query:  RSNDKRIVKLY
           D ++ + Y
Subjt:  RSNDKRIVKLY

XP_022974048.1 pentatricopeptide repeat-containing protein At1g33350 [Cucurbita maxima] | 2.3e-271 | 88.85
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        MASV +QPHLNQLVL+VLEKCSH NHLKQLQGFLISLGHSQTQF+AFKLVRFCN TLTDLCY+RFIFDHL+SPNVYLYTAMITAYAS+PD KAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRG P PNHFIYPHVLKSCPE+LESNGT++VHAQVLKSGFG YPVVQTAIVD+YSRF ++IGIARQ+FDEM+ERSVVSWTAMISGYARLG++D+A+AL
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC
        FESMPERD+PAWNALIAGCAQNGFFCEAI LFKRMVS+ALEG+KERE KPNKIT+ASALS+CGHTGMLHLGKWIHGYVFK+YLGQDSFISNALLDMYGKC
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC

Query:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL
        GNLK+ARRVFDMIT+KSLTSWNSLINCLALHGHS SAIDLF+ELVQC DGV+PD VTFVGVLNACTHGGLVEKGYSYFKMM +DYDIEPQIEHFGCLIDL
Subjt:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELE WDEVRKVRKLLKEQNAYK PGCSWIEV 
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR

Query:  RSNDKRIVKLY
           D  + + Y
Subjt:  RSNDKRIVKLY

XP_038889416.1 pentatricopeptide repeat-containing protein At1g33350-like [Benincasa hispida] | 2.3e-279 | 91.8
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        M+SV IQPHLNQLVLA LEKCS  NHLKQLQGFLISLGHS+TQFFAFKLVRFCN TL DLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVR G PRPNHFIYPHVLKSCPEVLESNGT++VH QVLKSGFG+YPVVQTAIVDSYSRFCS++G ARQMFDEM+ERSVVSWTAMISGYARLGNVDSAI L
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEG-DKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGK
        FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVS+ALEG + ERENKPNKIT+ASALSACGHTGMLHLGKWIHGYVFK+Y GQDSFISNALLDMYGK
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEG-DKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGK

Query:  CGNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLID
        CGNLK+ARRVFDMI++KSLTSWNSLINCLALHGHS SAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYF+MM RDYDIEPQIEHFGCLID
Subjt:  CGNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLID

Query:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
        LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
Subjt:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

Query:  RRSNDKRIVKLY
            D +I + Y
Subjt:  RRSNDKRIVKLY

TrEMBL top hits | E value | % identity
A0A0A0KUN2 Uncharacterized protein | 3.7e-267 | 86.02
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        M+SV I PHLNQL +A LEKCS+ NHLKQLQGFLIS GHSQTQFFAFKLVRFCN TL DLCYAR+IFD+LTSPNV+LYTAMITAYAS PDPKAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRG  RPN+FIYPHVL+SCP+VL SN T++VH QVLKSGFG YPVVQTAIVDSYSRF S+IG ARQMFDEM+ER+VVSWTAMISGYARLGN DSAI L
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEG-DKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGK
        FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMV +ALEG + +RENKPNK TL SALSACGHTGMLHLGKWIHGYVFK+Y GQDSFISNALLDMYGK
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEG-DKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGK

Query:  CGNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLID
        CGNLK+ARRVFDMIT+K+LTSWNSLINCLALHGHS SAIDLF EL+ CGDGVKP+ VTFVGVLNACTHGGLVEKGYSYF+MM RDYDIEPQIEHFGCLID
Subjt:  CGNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLID

Query:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
        LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLN CKIHGR DLAEYSVKKLIEMDP+NGGYRIMLANIYAE  KWDEVRKVR+LLKE+NAYKTPGCSWIE+
Subjt:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

Query:  RRSNDKRIVKLYPALLSFHWQS
        R+SN +RI+KLY   LSF   S
Subjt:  RRSNDKRIVKLYPALLSFHWQS

A0A5A7T7Y7 Pentatricopeptide repeat-containing protein | 8.2e-267 | 84.38
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        M+SV IQPHL QLV+A LEKCS+ NHLKQLQGFLIS GHSQTQFFAFKLVRFCN TLTDLCYAR+IFD+LTSPNVYLYTAMITAYA+ PDPKAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVR G  RPNHFIYPHVLKSCP+VL SN T++VH QVLKSGFGRYPVVQTAIVDSYSRF S IG ARQMFDEM+ERSVVSWTAMISGYARLGN DSAI L
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEG-DKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGK
        FESMPERDVPAWNALIAGCAQNGFFCEAIWLFK+MVS+ALEG + +RENKPNK TLASALSACG+TGMLHLGKWIHGYVFK+Y GQDSFISNALLDMYGK
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEG-DKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGK

Query:  CGNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLID
        CGNLK+ARRVFDMIT+KSLTSWNSLINCLALHGHS SAIDLF ELVQCGDGVKPD VTFVGVLNACTHGGLVEKGYSYF+MM RDYDIEPQIEHFGCLID
Subjt:  CGNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLID

Query:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
        LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLN CKIHGR DLAEYSVKKLIEMDP+NGGYRIMLANIYAEL KWDEVRKVRKLLKE+NAYKTPGCSWIEV
Subjt:  LLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

Query:  ------------------------------RRSNDKRIVKLYPA
                                      R+SN KRI+KLY A
Subjt:  ------------------------------RRSNDKRIVKLYPA

A0A6J1DQX1 pentatricopeptide repeat-containing protein At1g33350 | 1.2e-273 | 90.02
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        MASV IQPHLNQLVL+VLEKCSH NHLKQ+QGFLISLGHSQTQFFAFKLVRFCN TL +L YARFIFD L SPNVYLYTAMITAYASQPD KAAF+LYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRGTPRPNHFIYPHVLKSCPEV ESN TQ+VHAQ+LKSGFGRYPVVQTAIVDSYS+FCS+IGIARQMFDEMIERSVVSWTAMISGYARLGN+D A+AL
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC
        FESMPERDVPAWNA+IAG AQNGFFCEAIWLF+RMVS+A+E D+ERENKPNKIT+ASALSACGHTGMLHLGKWIHGYVFKS L QDSFISNALLDMYGKC
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC

Query:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL
        GNLKIA+RVFDMIT+KSLTSWNSLINCLALHGHSESAIDLFV+LV+CGDGVKPDGVTFVGVLNACTHGGLVEKGYSYF+MM +DY IEPQIEHFGCLIDL
Subjt:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV 
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR

Query:  RSNDKRIVKLY
           D ++ + Y
Subjt:  RSNDKRIVKLY

A0A6J1EQ76 pentatricopeptide repeat-containing protein At1g33350 | 4.2e-271 | 88.65
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        MAS+ IQPHLNQLVL+VLEKCSH NHLKQLQGFLISLG SQTQF+AFKLVRFCN TLTDLCY+RFIFDHL+SPNVYLYTAMITAYAS+ D KAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRG P PNHFIYPHVLKSCPE+LESNGT++VHAQVLKSGFG YPVVQTAIVD+YSRF + IGIARQ+FDEM+ERSVVSWTAM+SGYARLG++D+A+AL
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC
        FESMPERD+PAWNALIAGCAQNGFFCEAI LFKRMVS+ALEG+KERE KPNKIT+ASALSACGHTGMLHLGKWIHGYVFK+YLGQDSFISNALLDMYGKC
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC

Query:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL
        GNLK+ARRVFDMIT+KSLTSWNSLINCLALHGHS SAIDLF+ELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMM +DYDIEPQIEHFGCLIDL
Subjt:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR
        LGRAGRFEEAMEVV+GMNIEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELE WDEVR+VRKLLKEQNAYK PGCSWIEV 
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR

Query:  RSNDKRIVKLY
           D ++ + Y
Subjt:  RSNDKRIVKLY

A0A6J1IF22 pentatricopeptide repeat-containing protein At1g33350 | 1.1e-271 | 88.85
Query:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT
        MASV +QPHLNQLVL+VLEKCSH NHLKQLQGFLISLGHSQTQF+AFKLVRFCN TLTDLCY+RFIFDHL+SPNVYLYTAMITAYAS+PD KAAFLLYR 
Subjt:  MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRT

Query:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL
        MVRRG P PNHFIYPHVLKSCPE+LESNGT++VHAQVLKSGFG YPVVQTAIVD+YSRF ++IGIARQ+FDEM+ERSVVSWTAMISGYARLG++D+A+AL
Subjt:  MVRRGTPRPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIAL

Query:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC
        FESMPERD+PAWNALIAGCAQNGFFCEAI LFKRMVS+ALEG+KERE KPNKIT+ASALS+CGHTGMLHLGKWIHGYVFK+YLGQDSFISNALLDMYGKC
Subjt:  FESMPERDVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKC

Query:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL
        GNLK+ARRVFDMIT+KSLTSWNSLINCLALHGHS SAIDLF+ELVQC DGV+PD VTFVGVLNACTHGGLVEKGYSYFKMM +DYDIEPQIEHFGCLIDL
Subjt:  GNLKIARRVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDL

Query:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR
        LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGR DLAEYSVKKLIEMDPENGGYRIMLANIYAELE WDEVRKVRKLLKEQNAYK PGCSWIEV 
Subjt:  LGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVR

Query:  RSNDKRIVKLY
           D  + + Y
Subjt:  RSNDKRIVKLY

SwissProt top hits | E value | % identity
Q9C501 Pentatricopeptide repeat-containing protein At1g33350 | 1.2e-161 | 55.17
Query:  LNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQ--PDPKAAFLLYRTMVRRGTP
        LNQ + AV+ K  H NHLKQ+Q F+I  G S + F  FKL+RFC   L +L YARFIFD  + PN +LY A++TAY+S       +AF  +R MV R  P
Subjt:  LNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQ--PDPKAAFLLYRTMVRRGTP

Query:  RPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPER
        RPNHFIYP VLKS P +  +  T +VH  + KSGF  Y VVQTA++ SY+   S+I +ARQ+FDEM ER+VVSWTAM+SGYAR G++ +A+ALFE MPER
Subjt:  RPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPER

Query:  DVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIAR
        DVP+WNA++A C QNG F EA+ LF+RM++       E   +PN++T+   LSAC  TG L L K IH + ++  L  D F+SN+L+D+YGKCGNL+ A 
Subjt:  DVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIAR

Query:  RVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCG-DGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGR
         VF M + KSLT+WNS+INC ALHG SE AI +F E+++   + +KPD +TF+G+LNACTHGGLV KG  YF +M   + IEP+IEH+GCLIDLLGRAGR
Subjt:  RVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCG-DGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGR

Query:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
        F+EA+EV+  M ++ DE +WGSLLN CKIHG  DLAE +VK L+ ++P NGGY  M+AN+Y E+  W+E R+ RK++K QNAYK PG S IE+
Subjt:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

Q9FIF7 Putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic | 3.2e-95 | 37.58
Query:  VLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFI
        +++VL  C +  H+  +   +I   H Q  F  F+L+R C+ TL  + YA  +F ++++PNVYLYTAMI  + S         LY  M+      P++++
Subjt:  VLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFI

Query:  YPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWN
           VLK+C    +    + +HAQVLK GFG    V   +++ Y +    +  A++MFDEM +R  V+ T MI+ Y+  G +  A+ LF+ +  +D   W 
Subjt:  YPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWN

Query:  ALIAGCAQNGFFCEAIWLFKRMVSMALEGDKEREN-KPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM
        A+I G  +N    +A+ LF+ M         + EN   N+ T    LSAC   G L LG+W+H +V    +   +F+ NAL++MY +CG++  ARRVF +
Subjt:  ALIAGCAQNGFFCEAIWLFKRMVSMALEGDKEREN-KPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM

Query:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME
        +  K + S+N++I+ LA+HG S  AI+ F ++V    G +P+ VT V +LNAC+HGGL++ G   F  M R +++EPQIEH+GC++DLLGR GR EEA  
Subjt:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME

Query:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
         +  + IEPD ++ G+LL+ CKIHG  +L E   K+L E +  + G  ++L+N+YA   KW E  ++R+ +++    K PGCS IEV
Subjt:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 | 9.1e-98 | 37.37
Query:  LAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFC-NHTLTD-LCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHF
        ++ L++CS    LKQ+   ++  G  Q  +   K + FC + T +D L YA+ +FD    P+ +L+  MI  ++   +P+ + LLY+ M+    P  N +
Subjt:  LAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFC-NHTLTD-LCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHF

Query:  IYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAW
         +P +LK+C  +     T  +HAQ+ K G+        ++++SY+    N  +A  +FD + E   VSW ++I GY + G +D A+ LF  M E++  +W
Subjt:  IYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAW

Query:  NALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM
          +I+G  Q     EA+ LF  M +  +E        P+ ++LA+ALSAC   G L  GKWIH Y+ K+ +  DS +   L+DMY KCG ++ A  VF  
Subjt:  NALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM

Query:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME
        I  KS+ +W +LI+  A HGH   AI  F+E+ +   G+KP+ +TF  VL AC++ GLVE+G   F  M RDY+++P IEH+GC++DLLGRAG  +EA  
Subjt:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME

Query:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
         ++ M ++P+ V+WG+LL  C+IH   +L E   + LI +DP +GG  +  ANI+A  +KWD+  + R+L+KEQ   K PGCS I +
Subjt:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

Q9FNN7 Pentatricopeptide repeat-containing protein At5g08510 | 3.2e-95 | 36.79
Query:  NHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHVLKSCPEV
        N +KQL    +  G  +T+    +L+      + +L YAR +FDH  +   +LY  +I AY     P  + +LY  +   G  RP+H  +  +  +    
Subjt:  NHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHVLKSCPEV

Query:  LESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIAGCAQNGF
          +   +++H+Q  +SGF       T ++ +Y++    +  AR++FDEM +R V  W AMI+GY R G++ +A+ LF+SMP ++V +W  +I+G +QNG 
Subjt:  LESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIAGCAQNGF

Query:  FCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMI-TVKSLTSWNS
        + EA+ +F  M       +K++  KPN IT+ S L AC + G L +G+ + GY  ++    + ++ NA ++MY KCG + +A+R+F+ +   ++L SWNS
Subjt:  FCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMI-TVKSLTSWNS

Query:  LINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDE
        +I  LA HG  + A+ LF ++++  +G KPD VTFVG+L AC HGG+V KG   FK M   + I P++EH+GC+IDLLGR G+ +EA ++++ M ++PD 
Subjt:  LINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDE

Query:  VVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSW
        VVWG+LL  C  HG  ++AE + + L +++P N G  ++++NIYA  EKWD V ++RKL+K++   K  G S+
Subjt:  VVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSW

Q9SIL5 Pentatricopeptide repeat-containing protein At2g20540 | 2.3e-101 | 37.97
Query:  LEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHV
        L++    N  K++   +I  G SQ+ F   K+V FC+  + D+ YA  +F+ +++PNV+LY ++I AY           +Y+ ++R+    P+ F +P +
Subjt:  LEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHV

Query:  LKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIA
         KSC  +      + VH  + K G   + V + A++D Y +F  ++  A ++FDEM ER V+SW +++SGYARLG +  A  LF  M ++ + +W A+I+
Subjt:  LKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIA

Query:  GCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMITVKS
        G    G + EA+  F+ M    +E        P++I+L S L +C   G L LGKWIH Y  +    + + + NAL++MY KCG +  A ++F  +  K 
Subjt:  GCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMITVKS

Query:  LTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM
        + SW+++I+  A HG++  AI+ F E+ +    VKP+G+TF+G+L+AC+H G+ ++G  YF MM +DY IEP+IEH+GCLID+L RAG+ E A+E+ + M
Subjt:  LTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM

Query:  NIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
         ++PD  +WGSLL+ C+  G  D+A  ++  L+E++PE+ G  ++LANIYA+L KW++V ++RK+++ +N  KTPG S IEV
Subjt:  NIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

Arabidopsis top hits | E value | % identity
AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein | 8.3e-163 | 55.17
Query:  LNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQ--PDPKAAFLLYRTMVRRGTP
        LNQ + AV+ K  H NHLKQ+Q F+I  G S + F  FKL+RFC   L +L YARFIFD  + PN +LY A++TAY+S       +AF  +R MV R  P
Subjt:  LNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQ--PDPKAAFLLYRTMVRRGTP

Query:  RPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPER
        RPNHFIYP VLKS P +  +  T +VH  + KSGF  Y VVQTA++ SY+   S+I +ARQ+FDEM ER+VVSWTAM+SGYAR G++ +A+ALFE MPER
Subjt:  RPNHFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPER

Query:  DVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIAR
        DVP+WNA++A C QNG F EA+ LF+RM++       E   +PN++T+   LSAC  TG L L K IH + ++  L  D F+SN+L+D+YGKCGNL+ A 
Subjt:  DVPAWNALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIAR

Query:  RVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCG-DGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGR
         VF M + KSLT+WNS+INC ALHG SE AI +F E+++   + +KPD +TF+G+LNACTHGGLV KG  YF +M   + IEP+IEH+GCLIDLLGRAGR
Subjt:  RVFDMITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCG-DGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGR

Query:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
        F+EA+EV+  M ++ DE +WGSLLN CKIHG  DLAE +VK L+ ++P NGGY  M+AN+Y E+  W+E R+ RK++K QNAYK PG S IE+
Subjt:  FEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

AT2G20540.1 mitochondrial editing factor 21 | 1.6e-102 | 37.97
Query:  LEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHV
        L++    N  K++   +I  G SQ+ F   K+V FC+  + D+ YA  +F+ +++PNV+LY ++I AY           +Y+ ++R+    P+ F +P +
Subjt:  LEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHV

Query:  LKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIA
         KSC  +      + VH  + K G   + V + A++D Y +F  ++  A ++FDEM ER V+SW +++SGYARLG +  A  LF  M ++ + +W A+I+
Subjt:  LKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIA

Query:  GCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMITVKS
        G    G + EA+  F+ M    +E        P++I+L S L +C   G L LGKWIH Y  +    + + + NAL++MY KCG +  A ++F  +  K 
Subjt:  GCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMITVKS

Query:  LTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM
        + SW+++I+  A HG++  AI+ F E+ +    VKP+G+TF+G+L+AC+H G+ ++G  YF MM +DY IEP+IEH+GCLID+L RAG+ E A+E+ + M
Subjt:  LTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGM

Query:  NIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
         ++PD  +WGSLL+ C+  G  D+A  ++  L+E++PE+ G  ++LANIYA+L KW++V ++RK+++ +N  KTPG S IEV
Subjt:  NIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.3e-96 | 36.79
Query:  NHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHVLKSCPEV
        N +KQL    +  G  +T+    +L+      + +L YAR +FDH  +   +LY  +I AY     P  + +LY  +   G  RP+H  +  +  +    
Subjt:  NHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFIYPHVLKSCPEV

Query:  LESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIAGCAQNGF
          +   +++H+Q  +SGF       T ++ +Y++    +  AR++FDEM +R V  W AMI+GY R G++ +A+ LF+SMP ++V +W  +I+G +QNG 
Subjt:  LESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIAGCAQNGF

Query:  FCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMI-TVKSLTSWNS
        + EA+ +F  M       +K++  KPN IT+ S L AC + G L +G+ + GY  ++    + ++ NA ++MY KCG + +A+R+F+ +   ++L SWNS
Subjt:  FCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMI-TVKSLTSWNS

Query:  LINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDE
        +I  LA HG  + A+ LF ++++  +G KPD VTFVG+L AC HGG+V KG   FK M   + I P++EH+GC+IDLLGR G+ +EA ++++ M ++PD 
Subjt:  LINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDE

Query:  VVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSW
        VVWG+LL  C  HG  ++AE + + L +++P N G  ++++NIYA  EKWD V ++RKL+K++   K  G S+
Subjt:  VVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSW

AT5G59200.1 Tetratricopeptide repeat (TPR)-like superfamily protein, e value 2.3e-96, %identity 37.58
Query:  VLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFI
        +++VL  C +  H+  +   +I   H Q  F  F+L+R C+ TL  + YA  +F ++++PNVYLYTAMI  + S         LY  M+      P++++
Subjt:  VLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHFI

Query:  YPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWN
           VLK+C    +    + +HAQVLK GFG    V   +++ Y +    +  A++MFDEM +R  V+ T MI+ Y+  G +  A+ LF+ +  +D   W 
Subjt:  YPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWN

Query:  ALIAGCAQNGFFCEAIWLFKRMVSMALEGDKEREN-KPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM
        A+I G  +N    +A+ LF+ M         + EN   N+ T    LSAC   G L LG+W+H +V    +   +F+ NAL++MY +CG++  ARRVF +
Subjt:  ALIAGCAQNGFFCEAIWLFKRMVSMALEGDKEREN-KPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM

Query:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME
        +  K + S+N++I+ LA+HG S  AI+ F ++V    G +P+ VT V +LNAC+HGGL++ G   F  M R +++EPQIEH+GC++DLLGR GR EEA  
Subjt:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME

Query:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
         +  + IEPD ++ G+LL+ CKIHG  +L E   K+L E +  + G  ++L+N+YA   KW E  ++R+ +++    K PGCS IEV
Subjt:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein, e value 6.5e-99, %identity 37.37
Query:  LAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFC-NHTLTD-LCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHF
        ++ L++CS    LKQ+   ++  G  Q  +   K + FC + T +D L YA+ +FD    P+ +L+  MI  ++   +P+ + LLY+ M+    P  N +
Subjt:  LAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFC-NHTLTD-LCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPNHF

Query:  IYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAW
         +P +LK+C  +     T  +HAQ+ K G+        ++++SY+    N  +A  +FD + E   VSW ++I GY + G +D A+ LF  M E++  +W
Subjt:  IYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAW

Query:  NALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM
          +I+G  Q     EA+ LF  M +  +E        P+ ++LA+ALSAC   G L  GKWIH Y+ K+ +  DS +   L+DMY KCG ++ A  VF  
Subjt:  NALIAGCAQNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDM

Query:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME
        I  KS+ +W +LI+  A HGH   AI  F+E+ +   G+KP+ +TF  VL AC++ GLVE+G   F  M RDY+++P IEH+GC++DLLGRAG  +EA  
Subjt:  ITVKSLTSWNSLINCLALHGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAME

Query:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV
         ++ M ++P+ V+WG+LL  C+IH   +L E   + LI +DP +GG  +  ANI+A  +KWD+  + R+L+KEQ   K PGCS I +
Subjt:  VVRGMNIEPDEVVWGSLLNGCKIHGRPDLAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEV


Sequences
CDS sequence
ATGGCATCGGTTCAAATCCAACCTCATTTGAACCAATTAGTTCTAGCAGTGCTTGAAAAATGCAGTCATCACAATCACCTCAAGCAACTCCAAGGCTTTCTCATTTCACT
TGGTCACTCACAAACACAGTTCTTCGCCTTCAAGCTCGTCCGCTTCTGTAACCATACTCTTACTGACTTATGTTACGCTCGGTTCATTTTTGATCATTTAACTTCCCCAA
ATGTCTATCTATATACTGCAATGATCACAGCTTATGCCTCTCAGCCCGATCCTAAAGCCGCATTTCTTTTATATCGTACCATGGTCCGCCGAGGAACTCCTCGACCCAAC
CATTTTATCTATCCTCATGTTCTAAAATCTTGCCCTGAGGTTTTGGAGTCCAATGGCACGCAAGTGGTTCATGCCCAGGTGTTGAAATCTGGATTTGGTCGATACCCAGT
TGTCCAAACGGCCATTGTTGATTCCTATTCGAGATTCTGTTCGAATATTGGAATTGCAAGACAGATGTTCGATGAAATGATTGAGAGAAGTGTAGTGTCTTGGACGGCTA
TGATTTCGGGGTATGCGAGGCTTGGGAACGTTGATAGTGCAATAGCGTTGTTTGAAAGTATGCCGGAGAGAGATGTGCCTGCCTGGAATGCTCTTATTGCTGGATGTGCT
CAAAATGGATTCTTCTGTGAAGCAATTTGGCTGTTCAAAAGAATGGTGTCAATGGCGCTGGAGGGTGATAAGGAGCGTGAAAACAAGCCGAATAAGATCACACTTGCTTC
TGCACTTTCAGCTTGTGGTCATACTGGGATGCTTCATCTTGGCAAGTGGATTCATGGTTATGTTTTCAAAAGTTATCTTGGTCAGGATTCATTTATCTCAAATGCTCTAC
TAGATATGTATGGAAAATGTGGCAATTTGAAAATTGCAAGGAGGGTTTTCGACATGATTACTGTAAAAAGCTTGACATCATGGAATTCCTTGATAAATTGTCTTGCACTC
CATGGCCATAGTGAAAGTGCAATTGATTTATTTGTAGAGTTGGTTCAATGTGGGGACGGCGTGAAGCCAGATGGGGTTACTTTTGTTGGTGTGTTGAATGCTTGTACTCA
TGGAGGATTAGTTGAAAAAGGTTACTCCTACTTCAAAATGATGGGGCGGGATTATGACATTGAGCCTCAGATTGAACACTTTGGATGCTTGATAGACCTTCTTGGCCGTG
CAGGGCGGTTTGAAGAAGCGATGGAAGTTGTGAGGGGAATGAATATCGAACCAGATGAGGTTGTATGGGGTTCTTTACTAAATGGATGCAAAATCCATGGCCGTCCTGAT
TTAGCAGAATATTCGGTGAAAAAGTTGATCGAGATGGATCCAGAAAACGGTGGTTATAGAATTATGCTAGCAAACATATATGCTGAGCTTGAAAAGTGGGACGAAGTTCG
CAAGGTTCGGAAACTTTTGAAGGAGCAAAATGCTTATAAAACACCAGGTTGCAGTTGGATTGAGGTGAGAAGGTCGAACGACAAGAGGATAGTCAAGCTCTATCCTGCTC
TTCTCTCCTTCCATTGGCAGTCTTGTTGA
mRNA sequence
ATGGCATCGGTTCAAATCCAACCTCATTTGAACCAATTAGTTCTAGCAGTGCTTGAAAAATGCAGTCATCACAATCACCTCAAGCAACTCCAAGGCTTTCTCATTTCACT
TGGTCACTCACAAACACAGTTCTTCGCCTTCAAGCTCGTCCGCTTCTGTAACCATACTCTTACTGACTTATGTTACGCTCGGTTCATTTTTGATCATTTAACTTCCCCAA
ATGTCTATCTATATACTGCAATGATCACAGCTTATGCCTCTCAGCCCGATCCTAAAGCCGCATTTCTTTTATATCGTACCATGGTCCGCCGAGGAACTCCTCGACCCAAC
CATTTTATCTATCCTCATGTTCTAAAATCTTGCCCTGAGGTTTTGGAGTCCAATGGCACGCAAGTGGTTCATGCCCAGGTGTTGAAATCTGGATTTGGTCGATACCCAGT
TGTCCAAACGGCCATTGTTGATTCCTATTCGAGATTCTGTTCGAATATTGGAATTGCAAGACAGATGTTCGATGAAATGATTGAGAGAAGTGTAGTGTCTTGGACGGCTA
TGATTTCGGGGTATGCGAGGCTTGGGAACGTTGATAGTGCAATAGCGTTGTTTGAAAGTATGCCGGAGAGAGATGTGCCTGCCTGGAATGCTCTTATTGCTGGATGTGCT
CAAAATGGATTCTTCTGTGAAGCAATTTGGCTGTTCAAAAGAATGGTGTCAATGGCGCTGGAGGGTGATAAGGAGCGTGAAAACAAGCCGAATAAGATCACACTTGCTTC
TGCACTTTCAGCTTGTGGTCATACTGGGATGCTTCATCTTGGCAAGTGGATTCATGGTTATGTTTTCAAAAGTTATCTTGGTCAGGATTCATTTATCTCAAATGCTCTAC
TAGATATGTATGGAAAATGTGGCAATTTGAAAATTGCAAGGAGGGTTTTCGACATGATTACTGTAAAAAGCTTGACATCATGGAATTCCTTGATAAATTGTCTTGCACTC
CATGGCCATAGTGAAAGTGCAATTGATTTATTTGTAGAGTTGGTTCAATGTGGGGACGGCGTGAAGCCAGATGGGGTTACTTTTGTTGGTGTGTTGAATGCTTGTACTCA
TGGAGGATTAGTTGAAAAAGGTTACTCCTACTTCAAAATGATGGGGCGGGATTATGACATTGAGCCTCAGATTGAACACTTTGGATGCTTGATAGACCTTCTTGGCCGTG
CAGGGCGGTTTGAAGAAGCGATGGAAGTTGTGAGGGGAATGAATATCGAACCAGATGAGGTTGTATGGGGTTCTTTACTAAATGGATGCAAAATCCATGGCCGTCCTGAT
TTAGCAGAATATTCGGTGAAAAAGTTGATCGAGATGGATCCAGAAAACGGTGGTTATAGAATTATGCTAGCAAACATATATGCTGAGCTTGAAAAGTGGGACGAAGTTCG
CAAGGTTCGGAAACTTTTGAAGGAGCAAAATGCTTATAAAACACCAGGTTGCAGTTGGATTGAGGTGAGAAGGTCGAACGACAAGAGGATAGTCAAGCTCTATCCTGCTC
TTCTCTCCTTCCATTGGCAGTCTTGTTGA
Protein sequence
MASVQIQPHLNQLVLAVLEKCSHHNHLKQLQGFLISLGHSQTQFFAFKLVRFCNHTLTDLCYARFIFDHLTSPNVYLYTAMITAYASQPDPKAAFLLYRTMVRRGTPRPN
HFIYPHVLKSCPEVLESNGTQVVHAQVLKSGFGRYPVVQTAIVDSYSRFCSNIGIARQMFDEMIERSVVSWTAMISGYARLGNVDSAIALFESMPERDVPAWNALIAGCA
QNGFFCEAIWLFKRMVSMALEGDKERENKPNKITLASALSACGHTGMLHLGKWIHGYVFKSYLGQDSFISNALLDMYGKCGNLKIARRVFDMITVKSLTSWNSLINCLAL
HGHSESAIDLFVELVQCGDGVKPDGVTFVGVLNACTHGGLVEKGYSYFKMMGRDYDIEPQIEHFGCLIDLLGRAGRFEEAMEVVRGMNIEPDEVVWGSLLNGCKIHGRPD
LAEYSVKKLIEMDPENGGYRIMLANIYAELEKWDEVRKVRKLLKEQNAYKTPGCSWIEVRRSNDKRIVKLYPALLSFHWQSC
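The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard nuclear genetic code; the `translate` function and `CODON_TABLE` name are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from the pasted sequence.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGCATCGGTTCAAATCCAACCTCAT"))  # prints MASVQIQPH
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick way to confirm that the two records agree.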