CuGenDBv2

Sgr023072 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023072
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00000729:2502527..2505328
RNA-Seq Expression: Sgr023072
Synteny: Sgr023072
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6598662.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.8e-228 | % identity: 67.4
Query:  MATLKDGFLSSNNASPG-LPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSDV-------------------VELNARRV
        MATL DGFLSSNN SP  LP SSKLN D +PSFRFSRNSMNV CRMH T  SA NR + +FAP+AK PD +D                    V+LNARRV
Subjt:  MATLKDGFLSSNNASPG-LPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSDV-------------------VELNARRV

Query:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL
        DS  GN L KF  K A CVD D K+FDE+PER +PAYTALIRAYCRSEKWNELFAA  SMV+EGILPDKYLVPTILKACS RQ VKTGKM+HGY  R RL
Subjt:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL

Query:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK-
        VSDIFIGNALMDFYGNCGDLRFSINVF+SMSEKDVVSWTALV+AYMEEGLLDEA+EAFHSMQSSGLKPDLISWNALVSGFA +G+I TAL+YLE  +++ 
Subjt:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK-

Query:  ------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAK
                                                        ACAGLR LGLG A+HAYALKCELC NIYVEGS+V+MYSKC QDDYAE++FAK
Subjt:  ------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAK

Query:  AEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR
        AEKKNITLWNEIIA YVNQG+TSQALERFRSMQHHGL+PDVVTYNTLLAG+AKNGQKVEAY LL+EMLQ DL PNVVSLNVLVSGFQQ GLSYEAL+LF+
Subjt:  AEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR

Query:  TMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMK-------------------------------I
        TML T CL++KVIT PIRPN VTITA LAACA LNL HKGKEIHGYMLRNGFED+H V    +  + K                               +
Subjt:  TMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMK-------------------------------I

Query:  MQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADL
        MQPK+A+ELFC+MLVEGIKPSS TFSIL  AL  R DL
Subjt:  MQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADL

KAG7029606.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.0e-227 | % identity: 67.24
Query:  MATLKDGFLSSNNASPG-LPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSDV-------------------VELNARRV
        MATL DGFLSSNN SP  LP SSKLN D +PSFRFSRNSMNV CRMH T  SA NR + +FAP+AK PD +D                    V+LNARRV
Subjt:  MATLKDGFLSSNNASPG-LPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSDV-------------------VELNARRV

Query:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL
        DS  GN L KF  K A CVD D K+FDE+PER +PAYTALIRAYCRSEKWNELFAA  SMV+EGILPDKYLVPTILKACS RQ VKTGKM+HGY  R RL
Subjt:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL

Query:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK-
        VSDIFIGNALMDFYGNCGDLRFSINVF+SMSEKDVVSWTALV+AYMEEGLLDEA+EAFHSMQSSGLKPDLISWNALVSGFA +G+I TAL+YLE  +++ 
Subjt:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK-

Query:  ------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAK
                                                        ACAGLR LGLG A+HAY LKCELC NIYVEGS+V+MYSKC QDDYAE++FAK
Subjt:  ------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAK

Query:  AEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR
        AEKKNITLWNEIIA YVNQG+TSQALERFRSMQHHGL+PDVVTYNTLLAG+AKNGQKVEAY LL+EMLQ DL PNVVSLNVLVSGFQQ GLSYEAL+LF+
Subjt:  AEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR

Query:  TMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMK-------------------------------I
        TML T CL++KVIT PIRPN VTITA LAACA LNL HKGKEIHGYMLRNGFED+H V    +  + K                               +
Subjt:  TMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMK-------------------------------I

Query:  MQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADL
        MQPK+A+ELFC+MLVEGIKPSS TFSIL  AL  R DL
Subjt:  MQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADL

XP_008445371.1 PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Cucumis melo] | e value: 3.5e-224 | % identity: 65.52
Query:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPD-----------------RSDVVELNARRVDSS
        MAT  DGF+SSNNASP LP   K +FD +P+  FSRNSMNV CRMHF    A+NR   QF+PIA   D                  S VV+LNA RVD+ 
Subjt:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPD-----------------RSDVVELNARRVDSS

Query:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD
        FG KL KFY KD KCVDGD K+FDEIPERT+P Y ALIRAYCRSEKWNELFAA RSMVDEGILPDKYLVPT+LKACS RQ+VKTGKMVHGY  R R+VSD
Subjt:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD

Query:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----
        I IGNALMDFYGNC DL  SINVF+SMSEKDVVSWTALV+AY+EEGLL+EA++ FHSMQSSGLKPDLISWNALVSGFA YGE +TAL YLE  +++    
Subjt:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----

Query:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK
                                                     ACAGLR+LGLG A+HAYALKCELC NIYVEGS+VDMYSKC QDD+AE+VFAKAEK
Subjt:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK

Query:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML
        KN+TLWNEIIA YVNQGK SQALERFRSMQHHGLKPDVVTYNTLLAG+AKNG+KVEAY+LLS+ML+ +L PNV+SLNVLVSGFQ  GLSYEAL+L +TML
Subjt:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML

Query:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIMQP
        CTG LLNK I  P+ P+TVTITAALAACA LNL HKGKEIHGYMLRN FE+NHF                  ++V S  ++             ++IMQ 
Subjt:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIMQP

Query:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK
        +VA+ELFC+MLVEGIKPSS TFSILL AL ERADLKV+
Subjt:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK

XP_022131620.1 pentatricopeptide repeat-containing protein At1g19720-like [Momordica charantia] | e value: 1.5e-243 | % identity: 70.35
Query:  MATLKDGFLSSNNASPGL--PPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSD---------------------------
        MATLKD FLS NNASP L  P SSKLNFD HPS  FSRNSMN++CRMHFT  SA N  RGQF P AKS DR+D                           
Subjt:  MATLKDGFLSSNNASPGL--PPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSD---------------------------

Query:  -VVELNARRVDSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKM
         VV+ N  RVDS FGNKLPKF A+D KCVD DCKLFDEIPERT+PAY ALIRAYCRS+KWNELFAA RSMVDEGI PDKYLVPTILKACSGRQLVKTGKM
Subjt:  -VVELNARRVDSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKM

Query:  VHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL
        VHG+V R   VSDIF+GNALM+FYGNCGDLR SI VF+SMSEKDVVSWTALV+AYMEEGLLDEA+E FH+MQSSGLKPDLISWNALVSGFA YGEID AL
Subjt:  VHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL

Query:  QYLEQCKKK-------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQ
        QYLE+ ++K                                                 ACAGLRD+GLG AIHAYALK ELC N+YVEGS+VDMYSKC Q
Subjt:  QYLEQCKKK-------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQ

Query:  DDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYG
        D  AEKVFA+AEKKNITLWNEIIAAYVNQGK SQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQ DLTPNVVSLNVLVSGFQQ+G
Subjt:  DDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYG

Query:  LSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLS----------
        LSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSH+GKEIHGYMLRNGF DNHF                  +RV            
Subjt:  LSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLS----------

Query:  ---LTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK
           +  HMK  QPKVAIELFCEMLVEGIKPSSVT SIL  AL    DLKV+
Subjt:  ---LTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK

XP_038884429.1 pentatricopeptide repeat-containing protein At1g19720-like [Benincasa hispida] | e value: 2.9e-234 | % identity: 68.03
Query:  TLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDR-------------------SDVVELNARRVDSS
        T  DGF+SS+NASP LP S K NFD  PSFR SRNSM V CRMHFT  SA +R +GQF+PIAK  DR                   + VV+LNA RVD+ 
Subjt:  TLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDR-------------------SDVVELNARRVDSS

Query:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD
        FG KL  FYAKD  CVD D KLFDEIPERT+ AY+ALIRAYCRSEKWNELFAA RSMVDEGILP KYLVPTILKACS RQ+VKTGKMVHGY  R RLVSD
Subjt:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD

Query:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----
        IFIGNAL+D YGNCGDLRFSINVF+SMSEKDVVSWTALV+AY+EEGLLDE +E FHSMQSSGLKPDLISWNALVSGFA YGE +TAL YLE  +++    
Subjt:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----

Query:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK
                                                     ACAGLRDLGLG AIHAYALKCELC NIYVEGS+VDMYSKC QDDYAE+VFAKAEK
Subjt:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK

Query:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML
        KNITLWNEIIA YVNQ KTSQALE FRS+QHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQ DL PNVVSLNVLVSGFQQ GLSYEAL+LF+TML
Subjt:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML

Query:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRH-------------------------------MKIMQP
        C GCL NK+IT PIRP+TVTITAAL ACA LNL HKGKEIHGYM RN FEDNHF+    +  +                               M+IMQP
Subjt:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRH-------------------------------MKIMQP

Query:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK
        K+A+ELFC+MLVEG+KPSSVTFSILL AL E+ADLK +
Subjt:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KFW8 Uncharacterized protein | e value: 9.6e-220 | % identity: 64.69
Query:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDR-------------------SDVVELNARRVD
        MAT   GF SSNNAS  LP   K +FD +P+  FSRNSMNV CRMHF   SA NR   QF+PIA   DR                   + VV+LN  RVD
Subjt:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDR-------------------SDVVELNARRVD

Query:  SSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLV
        + FG KL KFY KD KCVD D K+FDEIPERT+PAY ALIRAYCRSEKWNELFAA RSMVDEGILPDKYLVPTILKACS RQ+VKTGKM HGY  R R+V
Subjt:  SSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLV

Query:  SDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK--
        SDI I NALMDFYGNCGDL  SINVF+SMSEKDVVSWTALV+AY+EEGLL+EA+E FHSMQSSGLKPDLISWNALVSGFA YGE +TAL YLE  +++  
Subjt:  SDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK--

Query:  -----------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKA
                                                       ACAGLRDLGLG A+HAYALKCELC NIYVEGS+VDMYSKC QDD AE++FAKA
Subjt:  -----------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKA

Query:  EKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRT
        EKKNITLWNEIIA Y+NQGK S ALE FRSMQHHGLKPDVVTYNTLLAG+AKNGQKVEAY+LLS+MLQ +L PNV+SLNVLVSGFQQ GL+YEAL+L +T
Subjt:  EKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRT

Query:  MLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIM
        MLCTG LLNK I  P+ PNTVT+TAALAACA LNL HKGKEIHGYMLRN F +N+F                  ++V S  ++             ++ M
Subjt:  MLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIM

Query:  QPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK
        Q K+A+ELFC+MLVEGIKPSS TFSILL AL ERADLKV+
Subjt:  QPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK

A0A1S3BDB0 pentatricopeptide repeat-containing protein At1g19720-like | e value: 1.7e-224 | % identity: 65.52
Query:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPD-----------------RSDVVELNARRVDSS
        MAT  DGF+SSNNASP LP   K +FD +P+  FSRNSMNV CRMHF    A+NR   QF+PIA   D                  S VV+LNA RVD+ 
Subjt:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPD-----------------RSDVVELNARRVDSS

Query:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD
        FG KL KFY KD KCVDGD K+FDEIPERT+P Y ALIRAYCRSEKWNELFAA RSMVDEGILPDKYLVPT+LKACS RQ+VKTGKMVHGY  R R+VSD
Subjt:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD

Query:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----
        I IGNALMDFYGNC DL  SINVF+SMSEKDVVSWTALV+AY+EEGLL+EA++ FHSMQSSGLKPDLISWNALVSGFA YGE +TAL YLE  +++    
Subjt:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----

Query:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK
                                                     ACAGLR+LGLG A+HAYALKCELC NIYVEGS+VDMYSKC QDD+AE+VFAKAEK
Subjt:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK

Query:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML
        KN+TLWNEIIA YVNQGK SQALERFRSMQHHGLKPDVVTYNTLLAG+AKNG+KVEAY+LLS+ML+ +L PNV+SLNVLVSGFQ  GLSYEAL+L +TML
Subjt:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML

Query:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIMQP
        CTG LLNK I  P+ P+TVTITAALAACA LNL HKGKEIHGYMLRN FE+NHF                  ++V S  ++             ++IMQ 
Subjt:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIMQP

Query:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK
        +VA+ELFC+MLVEGIKPSS TFSILL AL ERADLKV+
Subjt:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK

A0A5A7VGH4 Pentatricopeptide repeat-containing protein | e value: 2.2e-224 | % identity: 65.52
Query:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPD-----------------RSDVVELNARRVDSS
        MAT  DGF+SSNNASP LP   K +FD +P+  FSRNSMNV CRMHF    A+NR   QF+PIA   D                  S VV+LNA RVD+ 
Subjt:  MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPD-----------------RSDVVELNARRVDSS

Query:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD
        FG KL KFY KD KCVDGD K+FDEIPER +P Y ALIRAYCRSEKWNELFAA RSMVDEGILPDKYLVPT+LKACS RQ+VKTGKMVHGY  R R+VSD
Subjt:  FGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD

Query:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----
        I IGNALMDFYGNC DL  SINVF+SMSEKDVVSWTALV+AY+EEGLL+EA++ FHSMQSSGLKPDLISWNALVSGFA YGE +TAL YLE  +++    
Subjt:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----

Query:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK
                                                     ACAGLR+LGLG A+HAYALKCELC NIYVEGS+VDMYSKC QDD+AE+VFAKAEK
Subjt:  ---------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEK

Query:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML
        KN+TLWNEIIA YVNQGK SQALERFRSMQHHGLKPDVVTYNTLLAG+AKNG+KVEAY+LLS+ML+ +L PNV+SLNVLVSGFQ  GLSYEAL+L +TML
Subjt:  KNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTML

Query:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIMQP
        CTG LLNKVI  P+ P+TVTITAALAACA LNL HKGKEIHGYMLRN FE+NHF                  ++V S  ++             ++IMQ 
Subjt:  CTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLSLTRH-------------MKIMQP

Query:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK
        +VA+ELFC+MLVEGIKPSS TFSILL AL ERADLKV+
Subjt:  KVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK

A0A6J1BQ73 pentatricopeptide repeat-containing protein At1g19720-like | e value: 7.3e-244 | % identity: 70.35
Query:  MATLKDGFLSSNNASPGL--PPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSD---------------------------
        MATLKD FLS NNASP L  P SSKLNFD HPS  FSRNSMN++CRMHFT  SA N  RGQF P AKS DR+D                           
Subjt:  MATLKDGFLSSNNASPGL--PPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSD---------------------------

Query:  -VVELNARRVDSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKM
         VV+ N  RVDS FGNKLPKF A+D KCVD DCKLFDEIPERT+PAY ALIRAYCRS+KWNELFAA RSMVDEGI PDKYLVPTILKACSGRQLVKTGKM
Subjt:  -VVELNARRVDSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKM

Query:  VHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL
        VHG+V R   VSDIF+GNALM+FYGNCGDLR SI VF+SMSEKDVVSWTALV+AYMEEGLLDEA+E FH+MQSSGLKPDLISWNALVSGFA YGEID AL
Subjt:  VHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL

Query:  QYLEQCKKK-------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQ
        QYLE+ ++K                                                 ACAGLRD+GLG AIHAYALK ELC N+YVEGS+VDMYSKC Q
Subjt:  QYLEQCKKK-------------------------------------------------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQ

Query:  DDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYG
        D  AEKVFA+AEKKNITLWNEIIAAYVNQGK SQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQ DLTPNVVSLNVLVSGFQQ+G
Subjt:  DDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYG

Query:  LSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLS----------
        LSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSH+GKEIHGYMLRNGF DNHF                  +RV            
Subjt:  LSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHF------------------VRVLS----------

Query:  ---LTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK
           +  HMK  QPKVAIELFCEMLVEGIKPSSVT SIL  AL    DLKV+
Subjt:  ---LTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVK

A0A6J1K3Z4 pentatricopeptide repeat-containing protein At1g19720-like | e value: 1.6e-211 | % identity: 66.5
Query:  MNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSDV-------------------VELNARRVDSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTA
        MNV CRMH T  SA NR + +FAP+AK PD +D                    V+LNARRVDS  GNKL K  AK A CVD D K+FDE+PER +PAYTA
Subjt:  MNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSDV-------------------VELNARRVDSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTA

Query:  LIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWT
        LIRAYCRSEKWNELFAA  SMV+EGILPDKYLVPTILKACS  Q VKTGKM+HGY  R RLVSDIFIGNALMDFYGNCGDLRFSINVF+SMSEKDVVSWT
Subjt:  LIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWT

Query:  ALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----------------------------------------
        ALV+AYMEEGLLDEA+EAFHSMQSSGLKPDLISWNALVSGFA +G+I TAL+YLE  +++                                        
Subjt:  ALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKK----------------------------------------

Query:  ---------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKP
                 ACAGLR LGLG A+HAYALKCELC NIYVEGS+V+MYSKC QDDYAE++FAKAEKKNITLWNEIIA YVNQG+TSQALERFRSMQHHGL+P
Subjt:  ---------ACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKP

Query:  DVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPN-TVTITAALAACADLNLSH
        DVVTYNTLLAG+AKNGQKVEAY LL+EMLQ DL PNVVSLN LVSGFQQ GLSYEAL+LF+TML T CL++KVIT PIRPN  +TITAALAACA LNL H
Subjt:  DVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPN-TVTITAALAACADLNLSH

Query:  KGKEIHGYMLRNGFEDNHFVRVLSLTRHMK-------------------------------IMQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADL
        KGKEIHGYMLRNGFEDNH V    +  + K                               +MQPK+A+ELFC+MLVEGIKPSS +FSILL AL  R DL
Subjt:  KGKEIHGYMLRNGFEDNHFVRVLSLTRHMK-------------------------------IMQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADL

SwissProt top hits (e value, % identity, alignment)
Q9FM64 Pentatricopeptide repeat-containing protein At5g55740, chloroplastic | e value: 2.5e-55 | % identity: 32.04
Query:  KC--VDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFY
        KC  +D   K+FDEIP+R   A+ AL+  Y ++ K  E       M  +G+ P +  V T L A +    V+ GK  H     N +  D  +G +L++FY
Subjt:  KC--VDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFY

Query:  GNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLRDLGLGSAIH
           G + ++  VF+ M EKDVV+W  +++ Y+++GL+++A+     M+   LK D ++   L+S                     A A   +L LG  + 
Subjt:  GNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLRDLGLGSAIH

Query:  AYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKL
         Y ++    ++I +  +++DMY+KC     A+KVF    +K++ LWN ++AAY   G + +AL  F  MQ  G+ P+V+T+N ++    +NGQ  EA  +
Subjt:  AYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKL

Query:  LSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRN
          +M  + + PN++S   +++G  Q G S EA+   R M  +G          +RPN  +IT AL+ACA L   H G+ IHGY++RN
Subjt:  LSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRN

Q9FXH1 Pentatricopeptide repeat-containing protein At1g19720 | e value: 1.5e-65 | % identity: 31.5
Query:  KLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFI
        KL   YAK   C+    K+FD + ER +  ++A+I AY R  +W E+    R M+ +G+LPD +L P IL+ C+    V+ GK++H  V +  + S + +
Subjt:  KLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFI

Query:  GNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL----------------
         N+++  Y  CG+L F+   F  M E+DV++W +++ AY + G  +EAVE    M+  G+ P L++WN L+ G+   G+ D A+                
Subjt:  GNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL----------------

Query:  ----------------QYLEQCKK-----------------KACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNI
                        Q L+  +K                  AC+ L+ +  GS +H+ A+K     ++ V  S+VDMYSKC + + A KVF   + K++
Subjt:  ----------------QYLEQCKK-----------------KACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNI

Query:  TLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQT-DLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCT
          WN +I  Y   G   +A E F  MQ   L+P+++T+NT+++G+ KNG + EA  L   M +   +  N  + N++++G+ Q G   EAL+LFR M  +
Subjt:  TLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQT-DLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCT

Query:  GCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVR
          +          PN+VTI + L ACA+L  +   +EIHG +LR   +  H V+
Subjt:  GCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVR

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic | e value: 8.8e-45 | % identity: 26.61
Query:  GNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMV-DEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD
        GN     + +    VD    +F ++ ER + ++  L+  Y +   ++E       M+   G+ PD Y  P +L+ C G   +  GK VH +V R     D
Subjt:  GNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMV-DEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD

Query:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAG
        I + NAL+  Y  CGD++ +  +F+ M  +D++SW A+++ Y E G+  E +E F +M+   + PDL++  +++S                     AC  
Subjt:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAG

Query:  LRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHA
        L D  LG  IHAY +      +I V  S+  MY        AEK+F++ E+K+I  W  +I+ Y       +A++ +R M    +KPD +T   +L+  A
Subjt:  LRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHA

Query:  KNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR----------TMLCTGCLLN----------KVITLPIRPNTVTITAALAACA
          G      +L    ++  L   V+  N L++ + +     +AL +F           T +  G  LN          + + + ++PN +T+TAALAACA
Subjt:  KNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR----------TMLCTGCLLN----------KVITLPIRPNTVTITAALAACA

Query:  DLNLSHKGKEIHGYMLRN--GFED-------NHFVRV---------------------LSLTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILL
         +     GKEIH ++LR   G +D       + +VR                      + LT + +  Q  + +ELF  M+   ++P  +TF  LL
Subjt:  DLNLSHKGKEIHGYMLRN--GFED-------NHFVRV---------------------LSLTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILL

Q9SV26 Pentatricopeptide repeat-containing protein At4g01030, mitochondrial | e value: 2.0e-57 | % identity: 31.92
Query:  KLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFS
        KLFDE+P+R   A+  ++    RS  W +     R M   G       +  +L+ CS ++    G+ +HGYV R  L S++ + N+L+  Y   G L  S
Subjt:  KLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFS

Query:  INVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLR-----------------D
          VF SM ++++ SW +++++Y + G +D+A+     M+  GLKPD+++WN+L+SG+A  G    A+  L   K+   AGL+                  
Subjt:  INVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLR-----------------D

Query:  LGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIA--AYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAK
        L LG AIH Y L+ +L  ++YVE +++DMY K     YA  VF   + KNI  WN +++  +Y    K ++AL     M+  G+KPD +T+N+L +G+A 
Subjt:  LGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIA--AYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAK

Query:  NGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFE
         G+  +A  ++ +M +  + PNVVS   + SG  + G    ALK+F  M   G          + PN  T++  L     L+L H GKE+HG+ LR    
Subjt:  NGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFE

Query:  DNHFVRVLSLTRHMKIMQPKVAIELF
         + +V    +  + K    + AIE+F
Subjt:  DNHFVRVLSLTRHMKIMQPKVAIELF

Q9SVA5 Pentatricopeptide repeat-containing protein At4g39530 | e value: 1.1e-42 | % identity: 25.97
Query:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL
        D   G  L  FY KD   +D    +FD +PE++   +T +I    +  +          ++++ ++PD Y++ T+L ACS    ++ GK +H ++ R  L
Subjt:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL

Query:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKA
          D  + N L+D Y  CG +  +  +F  M  K+++SWT L++ Y +  L  EA+E F SM   GLKPD+ + +++++                     +
Subjt:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKA

Query:  CAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQG---KTSQALERFRSMQHHGLKPDVVTYNT
        CA L  LG G+ +HAY +K  L  + YV  S++DMY+KC     A KVF      ++ L+N +I  Y   G   +  +AL  FR M+   ++P ++T+ +
Subjt:  CAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQG---KTSQALERFRSMQHHGLKPDVVTYNT

Query:  LL-----------------------------AGHAKNGQKVEAYKLLSEMLQTD--LTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITL
        LL                             AG A        Y L    L  D     ++V  N + +G+ Q   + EAL LF  +  +          
Subjt:  LL-----------------------------AGHAKNGQKVEAYKLLSEMLQTD--LTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITL

Query:  PIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMKIMQP-------------------------------KVAIELFCEMLV
          RP+  T    + A  +L     G+E H  +L+ G E N ++    L  + K   P                               K A+++  +M+ 
Subjt:  PIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMKIMQP-------------------------------KVAIELFCEMLV

Query:  EGIKPSSVTFSILLSA
        EGI+P+ +TF  +LSA
Subjt:  EGIKPSSVTFSILLSA

Arabidopsis top hits (e value, % identity, alignment)
AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.3e-46 | % identity: 26.61
Query:  GNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMV-DEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD
        GN     + +    VD    +F ++ ER + ++  L+  Y +   ++E       M+   G+ PD Y  P +L+ C G   +  GK VH +V R     D
Subjt:  GNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMV-DEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSD

Query:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAG
        I + NAL+  Y  CGD++ +  +F+ M  +D++SW A+++ Y E G+  E +E F +M+   + PDL++  +++S                     AC  
Subjt:  IFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAG

Query:  LRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHA
        L D  LG  IHAY +      +I V  S+  MY        AEK+F++ E+K+I  W  +I+ Y       +A++ +R M    +KPD +T   +L+  A
Subjt:  LRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHA

Query:  KNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR----------TMLCTGCLLN----------KVITLPIRPNTVTITAALAACA
          G      +L    ++  L   V+  N L++ + +     +AL +F           T +  G  LN          + + + ++PN +T+TAALAACA
Subjt:  KNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFR----------TMLCTGCLLN----------KVITLPIRPNTVTITAALAACA

Query:  DLNLSHKGKEIHGYMLRN--GFED-------NHFVRV---------------------LSLTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILL
         +     GKEIH ++LR   G +D       + +VR                      + LT + +  Q  + +ELF  M+   ++P  +TF  LL
Subjt:  DLNLSHKGKEIHGYMLRN--GFED-------NHFVRV---------------------LSLTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILL

AT1G19720.1 Pentatricopeptide repeat (PPR-like) superfamily protein    1.1e-66    31.5
Query:  KLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFI
        KL   YAK   C+    K+FD + ER +  ++A+I AY R  +W E+    R M+ +G+LPD +L P IL+ C+    V+ GK++H  V +  + S + +
Subjt:  KLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFI

Query:  GNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL----------------
         N+++  Y  CG+L F+   F  M E+DV++W +++ AY + G  +EAVE    M+  G+ P L++WN L+ G+   G+ D A+                
Subjt:  GNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTAL----------------

Query:  ----------------QYLEQCKK-----------------KACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNI
                        Q L+  +K                  AC+ L+ +  GS +H+ A+K     ++ V  S+VDMYSKC + + A KVF   + K++
Subjt:  ----------------QYLEQCKK-----------------KACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNI

Query:  TLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQT-DLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCT
          WN +I  Y   G   +A E F  MQ   L+P+++T+NT+++G+ KNG + EA  L   M +   +  N  + N++++G+ Q G   EAL+LFR M  +
Subjt:  TLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQT-DLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCT

Query:  GCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVR
          +          PN+VTI + L ACA+L  +   +EIHG +LR   +  H V+
Subjt:  GCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVR

AT4G01030.1 pentatricopeptide (PPR) repeat-containing protein    1.4e-58    31.92
Query:  KLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFS
        KLFDE+P+R   A+  ++    RS  W +     R M   G       +  +L+ CS ++    G+ +HGYV R  L S++ + N+L+  Y   G L  S
Subjt:  KLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFS

Query:  INVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLR-----------------D
          VF SM ++++ SW +++++Y + G +D+A+     M+  GLKPD+++WN+L+SG+A  G    A+  L   K+   AGL+                  
Subjt:  INVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLR-----------------D

Query:  LGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIA--AYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAK
        L LG AIH Y L+ +L  ++YVE +++DMY K     YA  VF   + KNI  WN +++  +Y    K ++AL     M+  G+KPD +T+N+L +G+A 
Subjt:  LGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIA--AYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAK

Query:  NGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFE
         G+  +A  ++ +M +  + PNVVS   + SG  + G    ALK+F  M   G          + PN  T++  L     L+L H GKE+HG+ LR    
Subjt:  NGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFE

Query:  DNHFVRVLSLTRHMKIMQPKVAIELF
         + +V    +  + K    + AIE+F
Subjt:  DNHFVRVLSLTRHMKIMQPKVAIELF

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein    7.7e-44    25.97
Query:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL
        D   G  L  FY KD   +D    +FD +PE++   +T +I    +  +          ++++ ++PD Y++ T+L ACS    ++ GK +H ++ R  L
Subjt:  DSSFGNKLPKFYAKDAKCVDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRL

Query:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKA
          D  + N L+D Y  CG +  +  +F  M  K+++SWT L++ Y +  L  EA+E F SM   GLKPD+ + +++++                     +
Subjt:  VSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKA

Query:  CAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQG---KTSQALERFRSMQHHGLKPDVVTYNT
        CA L  LG G+ +HAY +K  L  + YV  S++DMY+KC     A KVF      ++ L+N +I  Y   G   +  +AL  FR M+   ++P ++T+ +
Subjt:  CAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQG---KTSQALERFRSMQHHGLKPDVVTYNT

Query:  LL-----------------------------AGHAKNGQKVEAYKLLSEMLQTD--LTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITL
        LL                             AG A        Y L    L  D     ++V  N + +G+ Q   + EAL LF  +  +          
Subjt:  LL-----------------------------AGHAKNGQKVEAYKLLSEMLQTD--LTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITL

Query:  PIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMKIMQP-------------------------------KVAIELFCEMLV
          RP+  T    + A  +L     G+E H  +L+ G E N ++    L  + K   P                               K A+++  +M+ 
Subjt:  PIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMKIMQP-------------------------------KVAIELFCEMLV

Query:  EGIKPSSVTFSILLSA
        EGI+P+ +TF  +LSA
Subjt:  EGIKPSSVTFSILLSA

AT5G55740.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.8e-56    32.04
Query:  KC--VDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFY
        KC  +D   K+FDEIP+R   A+ AL+  Y ++ K  E       M  +G+ P +  V T L A +    V+ GK  H     N +  D  +G +L++FY
Subjt:  KC--VDGDCKLFDEIPERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFY

Query:  GNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLRDLGLGSAIH
           G + ++  VF+ M EKDVV+W  +++ Y+++GL+++A+     M+   LK D ++   L+S                     A A   +L LG  + 
Subjt:  GNCGDLRFSINVFESMSEKDVVSWTALVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLRDLGLGSAIH

Query:  AYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKL
         Y ++    ++I +  +++DMY+KC     A+KVF    +K++ LWN ++AAY   G + +AL  F  MQ  G+ P+V+T+N ++    +NGQ  EA  +
Subjt:  AYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFAKAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKL

Query:  LSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRN
          +M  + + PN++S   +++G  Q G S EA+   R M  +G          +RPN  +IT AL+ACA L   H G+ IHGY++RN
Subjt:  LSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLLNKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRN


Sequences
CDS sequence
ATGGCCACTCTAAAAGATGGGTTTCTTTCGTCAAACAATGCGTCTCCTGGCCTTCCACCGTCTTCCAAGTTAAACTTCGACCCTCATCCCAGTTTTAGATTTTCTCGAAA
TTCTATGAATGTACATTGTAGGATGCATTTCACCACGCCATCGGCCCAGAATAGACACCGGGGTCAATTTGCTCCAATTGCGAAAAGTCCTGACCGTAGTGATGTTGTTG
AATTAAATGCTCGTCGAGTTGATAGTTCGTTTGGAAACAAGCTACCGAAGTTTTATGCCAAGGATGCGAAGTGCGTGGATGGCGACTGTAAGCTGTTCGATGAAATTCCC
GAGAGAACAGTGCCAGCCTATACAGCTTTGATAAGGGCGTATTGTCGGTCAGAGAAGTGGAATGAACTCTTTGCGGCATTAAGATCGATGGTTGATGAGGGCATACTACC
TGATAAATACCTCGTACCTACGATTCTGAAAGCATGTTCCGGAAGACAATTGGTGAAGACGGGTAAAATGGTTCATGGGTATGTTTTTAGGAATAGGCTGGTCTCTGATA
TTTTTATTGGGAATGCTCTTATGGACTTCTACGGTAATTGTGGGGATTTGAGATTTTCGATCAATGTTTTTGAATCGATGAGTGAAAAAGATGTGGTTTCGTGGACTGCG
CTTGTTACAGCTTACATGGAAGAAGGTCTTTTGGATGAGGCGGTGGAAGCTTTTCACTCCATGCAGTCAAGTGGGTTGAAGCCTGATTTGATATCTTGGAATGCACTGGT
CTCAGGGTTTGCTCACTATGGAGAGATTGACACTGCTCTCCAATACTTGGAACAATGCAAGAAAAAGGCTTGTGCAGGGTTGAGAGATCTAGGCTTAGGCAGCGCTATTC
ATGCATATGCTCTCAAGTGCGAGCTGTGTGCGAACATTTATGTTGAAGGATCAATAGTTGATATGTATTCGAAATGCAGACAAGATGATTATGCTGAAAAAGTTTTTGCC
AAAGCAGAGAAGAAAAACATTACATTGTGGAACGAAATTATTGCAGCTTACGTGAATCAGGGAAAAACTAGCCAGGCATTAGAACGTTTTAGATCAATGCAGCATCATGG
ACTAAAACCTGATGTTGTAACCTACAATACACTGCTGGCTGGACATGCAAAAAATGGGCAAAAAGTTGAAGCATATAAGTTGCTATCTGAGATGTTACAAACAGATTTGA
CACCCAATGTTGTATCTTTAAATGTTTTAGTATCTGGATTTCAACAATATGGGCTTAGTTATGAAGCTCTAAAATTATTCCGAACCATGCTATGTACTGGTTGCCTTCTT
AATAAGGTGATTACTTTGCCAATTAGACCAAATACTGTCACCATAACTGCTGCTCTGGCTGCTTGTGCTGACTTGAATTTATCACACAAGGGGAAGGAAATCCATGGATA
TATGTTGAGGAATGGTTTTGAAGACAACCACTTCGTTCGAGTGCTCTCATTGACACGTCATATGAAAATTATGCAGCCCAAAGTGGCAATTGAACTCTTCTGTGAAATGC
TAGTAGAAGGCATAAAACCAAGTTCAGTCACCTTTTCGATACTTCTCTCTGCCTTAGGTGAAAGGGCAGATTTGAAAGTGAAAGACAACTACATTCCTATATCATCAAAA
TTCAGGGGTAACTTTCTTACAGGATGGAATGATGTTAAAAAAAAATCACCCAGTGCTCAGGTGCATCCTATTGGGGGTCCAGGACAAGTTAAGGTTCTCACTCCAATGGA
GGCTTCAAAATATACATCCCAGCCCTTCAGCCTTAAGAGGACCATAGCAACCAATACTTCTCATCAAAGTTTCCTATCTCAGGACTATATGCAAGATGAAAATTCATGCA
ATACTTACCCCCTAAGAGGAGTACTGACAGTAGGTGGTGGGTTTTGGGATGGTATGACTGGCTCTCTGAGGTGTAGCAACCAATACTTCCCATCAGATTTTCCTACTTCG
GGACTATATACAAATGCAAATCCATGCAAAGCTTAG
mRNA sequence
ATGGCCACTCTAAAAGATGGGTTTCTTTCGTCAAACAATGCGTCTCCTGGCCTTCCACCGTCTTCCAAGTTAAACTTCGACCCTCATCCCAGTTTTAGATTTTCTCGAAA
TTCTATGAATGTACATTGTAGGATGCATTTCACCACGCCATCGGCCCAGAATAGACACCGGGGTCAATTTGCTCCAATTGCGAAAAGTCCTGACCGTAGTGATGTTGTTG
AATTAAATGCTCGTCGAGTTGATAGTTCGTTTGGAAACAAGCTACCGAAGTTTTATGCCAAGGATGCGAAGTGCGTGGATGGCGACTGTAAGCTGTTCGATGAAATTCCC
GAGAGAACAGTGCCAGCCTATACAGCTTTGATAAGGGCGTATTGTCGGTCAGAGAAGTGGAATGAACTCTTTGCGGCATTAAGATCGATGGTTGATGAGGGCATACTACC
TGATAAATACCTCGTACCTACGATTCTGAAAGCATGTTCCGGAAGACAATTGGTGAAGACGGGTAAAATGGTTCATGGGTATGTTTTTAGGAATAGGCTGGTCTCTGATA
TTTTTATTGGGAATGCTCTTATGGACTTCTACGGTAATTGTGGGGATTTGAGATTTTCGATCAATGTTTTTGAATCGATGAGTGAAAAAGATGTGGTTTCGTGGACTGCG
CTTGTTACAGCTTACATGGAAGAAGGTCTTTTGGATGAGGCGGTGGAAGCTTTTCACTCCATGCAGTCAAGTGGGTTGAAGCCTGATTTGATATCTTGGAATGCACTGGT
CTCAGGGTTTGCTCACTATGGAGAGATTGACACTGCTCTCCAATACTTGGAACAATGCAAGAAAAAGGCTTGTGCAGGGTTGAGAGATCTAGGCTTAGGCAGCGCTATTC
ATGCATATGCTCTCAAGTGCGAGCTGTGTGCGAACATTTATGTTGAAGGATCAATAGTTGATATGTATTCGAAATGCAGACAAGATGATTATGCTGAAAAAGTTTTTGCC
AAAGCAGAGAAGAAAAACATTACATTGTGGAACGAAATTATTGCAGCTTACGTGAATCAGGGAAAAACTAGCCAGGCATTAGAACGTTTTAGATCAATGCAGCATCATGG
ACTAAAACCTGATGTTGTAACCTACAATACACTGCTGGCTGGACATGCAAAAAATGGGCAAAAAGTTGAAGCATATAAGTTGCTATCTGAGATGTTACAAACAGATTTGA
CACCCAATGTTGTATCTTTAAATGTTTTAGTATCTGGATTTCAACAATATGGGCTTAGTTATGAAGCTCTAAAATTATTCCGAACCATGCTATGTACTGGTTGCCTTCTT
AATAAGGTGATTACTTTGCCAATTAGACCAAATACTGTCACCATAACTGCTGCTCTGGCTGCTTGTGCTGACTTGAATTTATCACACAAGGGGAAGGAAATCCATGGATA
TATGTTGAGGAATGGTTTTGAAGACAACCACTTCGTTCGAGTGCTCTCATTGACACGTCATATGAAAATTATGCAGCCCAAAGTGGCAATTGAACTCTTCTGTGAAATGC
TAGTAGAAGGCATAAAACCAAGTTCAGTCACCTTTTCGATACTTCTCTCTGCCTTAGGTGAAAGGGCAGATTTGAAAGTGAAAGACAACTACATTCCTATATCATCAAAA
TTCAGGGGTAACTTTCTTACAGGATGGAATGATGTTAAAAAAAAATCACCCAGTGCTCAGGTGCATCCTATTGGGGGTCCAGGACAAGTTAAGGTTCTCACTCCAATGGA
GGCTTCAAAATATACATCCCAGCCCTTCAGCCTTAAGAGGACCATAGCAACCAATACTTCTCATCAAAGTTTCCTATCTCAGGACTATATGCAAGATGAAAATTCATGCA
ATACTTACCCCCTAAGAGGAGTACTGACAGTAGGTGGTGGGTTTTGGGATGGTATGACTGGCTCTCTGAGGTGTAGCAACCAATACTTCCCATCAGATTTTCCTACTTCG
GGACTATATACAAATGCAAATCCATGCAAAGCTTAG
Protein sequence
MATLKDGFLSSNNASPGLPPSSKLNFDPHPSFRFSRNSMNVHCRMHFTTPSAQNRHRGQFAPIAKSPDRSDVVELNARRVDSSFGNKLPKFYAKDAKCVDGDCKLFDEIP
ERTVPAYTALIRAYCRSEKWNELFAALRSMVDEGILPDKYLVPTILKACSGRQLVKTGKMVHGYVFRNRLVSDIFIGNALMDFYGNCGDLRFSINVFESMSEKDVVSWTA
LVTAYMEEGLLDEAVEAFHSMQSSGLKPDLISWNALVSGFAHYGEIDTALQYLEQCKKKACAGLRDLGLGSAIHAYALKCELCANIYVEGSIVDMYSKCRQDDYAEKVFA
KAEKKNITLWNEIIAAYVNQGKTSQALERFRSMQHHGLKPDVVTYNTLLAGHAKNGQKVEAYKLLSEMLQTDLTPNVVSLNVLVSGFQQYGLSYEALKLFRTMLCTGCLL
NKVITLPIRPNTVTITAALAACADLNLSHKGKEIHGYMLRNGFEDNHFVRVLSLTRHMKIMQPKVAIELFCEMLVEGIKPSSVTFSILLSALGERADLKVKDNYIPISSK
FRGNFLTGWNDVKKKSPSAQVHPIGGPGQVKVLTPMEASKYTSQPFSLKRTIATNTSHQSFLSQDYMQDENSCNTYPLRGVLTVGGGFWDGMTGSLRCSNQYFPSDFPTS
GLYTNANPCKA