CuGenDBv2

Sgr030196 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr030196
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153574:1219906..1221215
RNA-Seq Expression: Sgr030196
Synteny: Sgr030196
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
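The genome location string in the record above can be split into contig, start, and end for downstream use. The following is a minimal sketch, not part of the database itself; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


contig, start, end = parse_location("tig00153574:1219906..1221215")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```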


Homology
GenBank top hits (e value, % identity, alignment)
KAG6583710.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.1e-133; identity 62.09%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR     +   +      SMTYASVLSACANIYD QWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++ DYFD+MPERNVISWNSML AYFQNGFWEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQIVSQA ++
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        G GSDVSVANSA+TLYSRCGKIE+A  +FDSIQEKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHITYVAILSGCSHSGLVKE KHYFNSMT+
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGISAT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

XP_022142381.1 pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia]; e value 1.1e-139; identity 69.43%
Query:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------------------------
        S+TYASVLSACAN+YDLQWGKHLHARIVR EPFLDVLVGNGLVDMYAKCG +EASR+                                           
Subjt:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------------------------

Query:  --------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-----------------IYSLWRC-----------RKSTDYFDQMPERNV
                       ENIS+GEQLHGFTVKTGMDSSVPVGNATVT+   C                 I S W              K+ DYFD+MPERNV
Subjt:  --------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-----------------IYSLWRC-----------RKSTDYFDQMPERNV

Query:  ISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKN
        ISWNSMLGA FQNGFWEEGLKLYILMLRQE RPDWITFATTIS+CSEL  LKLGTQIVSQA+K+GLGSDVSVANSA+TLYSRCGKIEEAQNIFDSIQEKN
Subjt:  ISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKN

Query:  LISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG
        LISWNSIMGGYAQNGQGRKVIEVFQNML+VGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGIS TSEHFACMVDLF   G
Subjt:  LISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG

XP_022927554.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata]; e value 6.7e-134; identity 62.32%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR  S  +   +      SMTYASVLSACANIYD QWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++ DYFD+MPERNVISWNSML AYFQNG+WEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQIVSQA + 
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        GLGSDVSVANSA+TLYSRCGKIE+A  +FDSIQEKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHITYVAILSGCSHSGLVKE KHYFNSMT+
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGISAT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

XP_023520697.1 pentatricopeptide repeat-containing protein At2g13600-like isoform X1 [Cucurbita pepo subsp. pepo]; e value 4.3e-133; identity 61.61%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR  S  +   +      SMTYASVLSACANIYD QWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++ DYFD+MPERNV+SWNSML AYFQNGFWEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQI+SQA + 
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        GLGSDVSVANSA+TLYSRCGKIE+A  +FDSI+EKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHIT+VAILSGCSHSGLVKE KHYFNSMT+
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGISAT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

XP_023520702.1 pentatricopeptide repeat-containing protein At2g13600-like isoform X2 [Cucurbita pepo subsp. pepo]; e value 4.3e-133; identity 61.61%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR  S  +   +      SMTYASVLSACANIYD QWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++ DYFD+MPERNV+SWNSML AYFQNGFWEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQI+SQA + 
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        GLGSDVSVANSA+TLYSRCGKIE+A  +FDSI+EKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHIT+VAILSGCSHSGLVKE KHYFNSMT+
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGISAT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CM06 pentatricopeptide repeat-containing protein At2g13600-like; e value 5.2e-140; identity 69.43%
Query:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------------------------
        S+TYASVLSACAN+YDLQWGKHLHARIVR EPFLDVLVGNGLVDMYAKCG +EASR+                                           
Subjt:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------------------------

Query:  --------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-----------------IYSLWRC-----------RKSTDYFDQMPERNV
                       ENIS+GEQLHGFTVKTGMDSSVPVGNATVT+   C                 I S W              K+ DYFD+MPERNV
Subjt:  --------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-----------------IYSLWRC-----------RKSTDYFDQMPERNV

Query:  ISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKN
        ISWNSMLGA FQNGFWEEGLKLYILMLRQE RPDWITFATTIS+CSEL  LKLGTQIVSQA+K+GLGSDVSVANSA+TLYSRCGKIEEAQNIFDSIQEKN
Subjt:  ISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKN

Query:  LISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG
        LISWNSIMGGYAQNGQGRKVIEVFQNML+VGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGIS TSEHFACMVDLF   G
Subjt:  LISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG

A0A6J1EIC0 pentatricopeptide repeat-containing protein At2g13600-like; e value 3.2e-134; identity 62.32%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR  S  +   +      SMTYASVLSACANIYD QWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++ DYFD+MPERNVISWNSML AYFQNG+WEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQIVSQA + 
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        GLGSDVSVANSA+TLYSRCGKIE+A  +FDSIQEKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHITYVAILSGCSHSGLVKE KHYFNSMT+
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGISAT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

A0A6J1I7S2 pentatricopeptide repeat-containing protein At2g13600-like isoform X1; e value 2.6e-131; identity 61.14%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR  S  +   +      SMTYASVLSACANIYD QWGKHLHARI+RIEP LDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++  YFD+MPERNV+SWNSML AYFQNGFWEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQIVSQA + 
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        GLGSDVSVANSA+TLYSRCGKIE+A  +FDSIQEKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHITYVAILSGCSHSGLVKE KHYFNSM++
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGI AT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

A0A6J1IC44 pentatricopeptide repeat-containing protein At2g13600-like isoform X2; e value 2.6e-131; identity 61.14%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR  S  +   +      SMTYASVLSACANIYD QWGKHLHARI+RIEP LDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++  YFD+MPERNV+SWNSML AYFQNGFWEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQIVSQA + 
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        GLGSDVSVANSA+TLYSRCGKIE+A  +FDSIQEKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHITYVAILSGCSHSGLVKE KHYFNSM++
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGI AT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

A0A6J1IDK4 pentatricopeptide repeat-containing protein At2g13600-like isoform X3; e value 2.6e-131; identity 61.14%
Query:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------
        N I S  +Q  LH      TR  S  +   +      SMTYASVLSACANIYD QWGKHLHARI+RIEP LDVLVGNGLVDMYAKCG IEAS++      
Subjt:  NVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK------

Query:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------
                                                            ENIS+GEQLHGF VKTGMDSS+PVGNAT+T+   C             
Subjt:  ---------------------------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC-------------

Query:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS
                   I S  R     ++  YFD+MPERNV+SWNSML AYFQNGFWEEGLKLYI MLRQE RPDW+TF TTISSCSEL + KLGTQIVSQA + 
Subjt:  -----------IYSLWR---CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKS

Query:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
        GLGSDVSVANSA+TLYSRCGKIE+A  +FDSIQEKNLISWNSIMGGYAQNGQGRKVIE+FQNML+VGCKPDHITYVAILSGCSHSGLVKE KHYFNSM++
Subjt:  GLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE

Query:  DFGISATSEHFACMVDLFVELG
        DFGI AT EHFACMVDLF   G
Subjt:  DFGISATSEHFACMVDLFVELG

SwissProt top hits (e value, % identity, alignment)
O49287 Putative pentatricopeptide repeat-containing protein At1g77010, mitochondrial; e value 9.0e-49; identity 33%
Query:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGC-IEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWR
        S T A+V++AC  +  L+ GK +H    +     D++V + L+DMY+KCG  +EA +    +   + +        ++S + V  +   IDD        
Subjt:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGC-IEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWR

Query:  CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCG
           +   F+++  +++ISWNSM   + QNG   E L+ +  M + +   D ++ ++ IS+C+ +  L+LG Q+ ++A   GL SD  V++S + LY +CG
Subjt:  CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCG

Query:  KIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVE
         +E  + +FD++ + + + WNS++ GYA NGQG + I++F+ M + G +P  IT++ +L+ C++ GLV+EG+  F SM  D G     EHF+CMVDL   
Subjt:  KIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVE

Query:  LGY
         GY
Subjt:  LGY

Q9FRI5 Pentatricopeptide repeat-containing protein At1g25360; e value 4.5e-48; identity 34.09%
Query:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC
        S +     TY SV+ ACA    LQ GK +HA ++R E F      N LV +Y KCG  + +R   E +   + +    + +G  SS  +G A +      
Subjt:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC

Query:  IYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVT
                     F +M E+N++SW  M+    +NGF EEGLKL+  M R+ F P    F+  I SC+ LG    G Q  +Q  K G  S +S  N+ +T
Subjt:  IYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVT

Query:  LYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACM
        +Y++CG +EEA+ +F ++   + +SWN+++    Q+G G + ++V++ ML  G +PD IT + +L+ CSH+GLV +G+ YF+SM   + I   ++H+A +
Subjt:  LYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACM

Query:  VDLFVELG
        +DL    G
Subjt:  VDLFVELG

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070; e value 1.4e-49; identity 35.81%
Query:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK--RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDH
        S LS    T ASVLSACAN+  L  GK +H+ IV     +  +V N L+ MY++CG +E +R+   +  +   ++ GFT    +D  + +G+        
Subjt:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK--RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDH

Query:  CIYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAV
                 ++ + F  + +R+V++W +M+  Y Q+G + E + L+  M+    RP+  T A  +S  S L  L  G QI   A KSG    VSV+N+ +
Subjt:  CIYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAV

Query:  TLYSRCGKIEEAQNIFDSIQ-EKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFA
        T+Y++ G I  A   FD I+ E++ +SW S++   AQ+G   + +E+F+ ML+ G +PDHITYV + S C+H+GLV +G+ YF+ M +   I  T  H+A
Subjt:  TLYSRCGKIEEAQNIFDSIQ-EKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFA

Query:  CMVDLFVELG
        CMVDLF   G
Subjt:  CMVDLFVELG

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600; e value 1.1e-57; identity 36.71%
Query:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFL-DVLVGNGLVDMYAKCGCIEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC
        SR+    +T ASV+SACA++  ++ G+ +H R+V+ +    D+++ N  VDMYAKC  I+ +R                   +  S+P+ N         
Subjt:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFL-DVLVGNGLVDMYAKCGCIEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC

Query:  IYSLWRCRKSTD-YFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEK------SGLGSDVS
         Y++    K+    F +M ERNV+SWN+++  Y QNG  EE L L+ L+ R+   P   +FA  + +C++L  L LG Q      K      SG   D+ 
Subjt:  IYSLWRCRKSTD-YFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEK------SGLGSDVS

Query:  VANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISAT
        V NS + +Y +CG +EE   +F  + E++ +SWN+++ G+AQNG G + +E+F+ ML  G KPDHIT + +LS C H+G V+EG+HYF+SMT DFG++  
Subjt:  VANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISAT

Query:  SEHFACMVDLFVELGY
         +H+ CMVDL    G+
Subjt:  SEHFACMVDLFVELGY

Q9SVA5 Pentatricopeptide repeat-containing protein At4g39530; e value 1.3e-47; identity 28.5%
Query:  KASVRLLRCVSRLSAKSMTYA--SVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------
        K ++ L   +S+   K   YA  S+L++CA+++ L +G  +HA  ++     D  V N L+DMYAKC C+  +RK                         
Subjt:  KASVRLLRCVSRLSAKSMTYA--SVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------

Query:  -----------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWRCRKSTDYFDQMPERNVISWNSML
                                             ++ + +Q+HG   K G++  +  G+A + +  +C Y L   + S   FD+M  ++++ WNSM 
Subjt:  -----------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWRCRKSTDYFDQMPERNVISWNSML

Query:  GAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSI
          Y Q    EE L L++ +     RPD  TFA  +++   L  ++LG +   Q  K GL  +  + N+ + +Y++CG  E+A   FDS   ++++ WNS+
Subjt:  GAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSI

Query:  MGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG
        +  YA +G+G+K +++ + M+  G +P++IT+V +LS CSH+GLV++G   F  M   FGI   +EH+ CMV L    G
Subjt:  MGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG

Arabidopsis top hits (e value, % identity, alignment)
AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 3.2e-49; identity 34.09%
Query:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC
        S +     TY SV+ ACA    LQ GK +HA ++R E F      N LV +Y KCG  + +R   E +   + +    + +G  SS  +G A +      
Subjt:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC

Query:  IYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVT
                     F +M E+N++SW  M+    +NGF EEGLKL+  M R+ F P    F+  I SC+ LG    G Q  +Q  K G  S +S  N+ +T
Subjt:  IYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVT

Query:  LYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACM
        +Y++CG +EEA+ +F ++   + +SWN+++    Q+G G + ++V++ ML  G +PD IT + +L+ CSH+GLV +G+ YF+SM   + I   ++H+A +
Subjt:  LYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACM

Query:  VDLFVELG
        +DL    G
Subjt:  VDLFVELG

AT1G77010.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 6.4e-50; identity 33%
Query:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGC-IEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWR
        S T A+V++AC  +  L+ GK +H    +     D++V + L+DMY+KCG  +EA +    +   + +        ++S + V  +   IDD        
Subjt:  SMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGC-IEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWR

Query:  CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCG
           +   F+++  +++ISWNSM   + QNG   E L+ +  M + +   D ++ ++ IS+C+ +  L+LG Q+ ++A   GL SD  V++S + LY +CG
Subjt:  CRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCG

Query:  KIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVE
         +E  + +FD++ + + + WNS++ GYA NGQG + I++F+ M + G +P  IT++ +L+ C++ GLV+EG+  F SM  D G     EHF+CMVDL   
Subjt:  KIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVE

Query:  LGY
         GY
Subjt:  LGY

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 7.6e-59; identity 36.71%
Query:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFL-DVLVGNGLVDMYAKCGCIEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC
        SR+    +T ASV+SACA++  ++ G+ +H R+V+ +    D+++ N  VDMYAKC  I+ +R                   +  S+P+ N         
Subjt:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFL-DVLVGNGLVDMYAKCGCIEASRKRENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHC

Query:  IYSLWRCRKSTD-YFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEK------SGLGSDVS
         Y++    K+    F +M ERNV+SWN+++  Y QNG  EE L L+ L+ R+   P   +FA  + +C++L  L LG Q      K      SG   D+ 
Subjt:  IYSLWRCRKSTD-YFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEK------SGLGSDVS

Query:  VANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISAT
        V NS + +Y +CG +EE   +F  + E++ +SWN+++ G+AQNG G + +E+F+ ML  G KPDHIT + +LS C H+G V+EG+HYF+SMT DFG++  
Subjt:  VANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISAT

Query:  SEHFACMVDLFVELGY
         +H+ CMVDL    G+
Subjt:  SEHFACMVDLFVELGY

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein; e value 9.9e-51; identity 35.81%
Query:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK--RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDH
        S LS    T ASVLSACAN+  L  GK +H+ IV     +  +V N L+ MY++CG +E +R+   +  +   ++ GFT    +D  + +G+        
Subjt:  SRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK--RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDH

Query:  CIYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAV
                 ++ + F  + +R+V++W +M+  Y Q+G + E + L+  M+    RP+  T A  +S  S L  L  G QI   A KSG    VSV+N+ +
Subjt:  CIYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAV

Query:  TLYSRCGKIEEAQNIFDSIQ-EKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFA
        T+Y++ G I  A   FD I+ E++ +SW S++   AQ+G   + +E+F+ ML+ G +PDHITYV + S C+H+GLV +G+ YF+ M +   I  T  H+A
Subjt:  TLYSRCGKIEEAQNIFDSIQ-EKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFA

Query:  CMVDLFVELG
        CMVDLF   G
Subjt:  CMVDLFVELG

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 9.3e-49; identity 28.5%
Query:  KASVRLLRCVSRLSAKSMTYA--SVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------
        K ++ L   +S+   K   YA  S+L++CA+++ L +G  +HA  ++     D  V N L+DMYAKC C+  +RK                         
Subjt:  KASVRLLRCVSRLSAKSMTYA--SVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRK-------------------------

Query:  -----------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWRCRKSTDYFDQMPERNVISWNSML
                                             ++ + +Q+HG   K G++  +  G+A + +  +C Y L   + S   FD+M  ++++ WNSM 
Subjt:  -----------------------------------RENISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWRCRKSTDYFDQMPERNVISWNSML

Query:  GAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSI
          Y Q    EE L L++ +     RPD  TFA  +++   L  ++LG +   Q  K GL  +  + N+ + +Y++CG  E+A   FDS   ++++ WNS+
Subjt:  GAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLGTQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSI

Query:  MGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG
        +  YA +G+G+K +++ + M+  G +P++IT+V +LS CSH+GLV++G   F  M   FGI   +EH+ CMV L    G
Subjt:  MGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTEDFGISATSEHFACMVDLFVELG
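Each alignment above shows a query line, a middle line that repeats the residue wherever query and subject are identical, and a subject line. As a rough illustration of how a per-block identity count can be read off the middle line, here is a minimal sketch; the function name `block_identity` is hypothetical, and the result covers a single block only, so it will not reproduce the whole-alignment % identity reported for a hit.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column counts as identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and q == m)
    return identical, len(query)


# Final block of the KAG6583710.1 alignment, copied from this page.
identical, columns = block_identity(
    "DFGISATSEHFACMVDLFVELG",
    "DFGISAT EHFACMVDLF   G",
)
print(f"{identical}/{columns} identical positions in this block")
```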


Sequences
CDS sequence
ATGGGATGGGACGAGCTCTCGACATTTTCATACAAATGCCTGAACGTGATTCAGTCTCTTGGAACACAATCATTTCTGCATTTTCTCAACATGGTCTGCACGCGCAAAGC
CTCAGTACGTTTGTTGAGATGTGTTTCAAGATTGTCAGCCAAATCAATGACATATGCAAGTGTTCTTAGTGCATGTGCCAATATCTATGATCTTCAATGGGGTAAACATT
TGCATGCCCGAATCGTCCGCATCGAACCCTTTCTTGATGTTTTGGTGGGCAATGGGCTGGTCGATATGTATGCAAAATGTGGATGCATTGAAGCTTCAAGAAAGAGAGAA
AATATTTCAGTTGGGGAGCAGCTACATGGGTTTACGGTGAAGACTGGGATGGATTCATCTGTGCCTGTAGGTAATGCTACCGTGACAATCGATGATCACTGCATTTACTC
ACTGTGGCGATGTAGAAAAAGCACAGATTATTTTGACCAAATGCCAGAGCGTAATGTCATTAGTTGGAATTCAATGTTGGGAGCGTATTTTCAAAATGGTTTTTGGGAAG
AAGGTCTAAAATTGTACATTCTTATGCTTAGACAAGAGTTCAGGCCTGATTGGATCACCTTTGCTACCACAATCAGTTCTTGTTCTGAGTTAGGAATGTTAAAACTTGGA
ACACAAATAGTATCCCAAGCAGAAAAATCAGGGCTTGGCTCTGATGTTTCAGTTGCTAATAGTGCAGTTACCTTGTACTCTAGATGTGGTAAAATTGAAGAAGCACAGAA
CATCTTTGACTCAATACAGGAGAAAAACTTGATCTCTTGGAACTCAATAATGGGAGGATATGCTCAAAATGGACAAGGCAGAAAGGTGATTGAAGTTTTTCAGAACATGT
TGATAGTTGGTTGCAAACCTGATCATATAACCTATGTAGCAATTCTCTCAGGTTGCAGCCATTCAGGGCTTGTAAAAGAAGGAAAGCATTACTTCAACTCCATGACTGAA
GATTTTGGCATCTCTGCAACTTCCGAGCATTTTGCGTGTATGGTAGATCTGTTCGTCGAGCTGGGTTACTAA
mRNA sequence
ATGGGATGGGACGAGCTCTCGACATTTTCATACAAATGCCTGAACGTGATTCAGTCTCTTGGAACACAATCATTTCTGCATTTTCTCAACATGGTCTGCACGCGCAAAGC
CTCAGTACGTTTGTTGAGATGTGTTTCAAGATTGTCAGCCAAATCAATGACATATGCAAGTGTTCTTAGTGCATGTGCCAATATCTATGATCTTCAATGGGGTAAACATT
TGCATGCCCGAATCGTCCGCATCGAACCCTTTCTTGATGTTTTGGTGGGCAATGGGCTGGTCGATATGTATGCAAAATGTGGATGCATTGAAGCTTCAAGAAAGAGAGAA
AATATTTCAGTTGGGGAGCAGCTACATGGGTTTACGGTGAAGACTGGGATGGATTCATCTGTGCCTGTAGGTAATGCTACCGTGACAATCGATGATCACTGCATTTACTC
ACTGTGGCGATGTAGAAAAAGCACAGATTATTTTGACCAAATGCCAGAGCGTAATGTCATTAGTTGGAATTCAATGTTGGGAGCGTATTTTCAAAATGGTTTTTGGGAAG
AAGGTCTAAAATTGTACATTCTTATGCTTAGACAAGAGTTCAGGCCTGATTGGATCACCTTTGCTACCACAATCAGTTCTTGTTCTGAGTTAGGAATGTTAAAACTTGGA
ACACAAATAGTATCCCAAGCAGAAAAATCAGGGCTTGGCTCTGATGTTTCAGTTGCTAATAGTGCAGTTACCTTGTACTCTAGATGTGGTAAAATTGAAGAAGCACAGAA
CATCTTTGACTCAATACAGGAGAAAAACTTGATCTCTTGGAACTCAATAATGGGAGGATATGCTCAAAATGGACAAGGCAGAAAGGTGATTGAAGTTTTTCAGAACATGT
TGATAGTTGGTTGCAAACCTGATCATATAACCTATGTAGCAATTCTCTCAGGTTGCAGCCATTCAGGGCTTGTAAAAGAAGGAAAGCATTACTTCAACTCCATGACTGAA
GATTTTGGCATCTCTGCAACTTCCGAGCATTTTGCGTGTATGGTAGATCTGTTCGTCGAGCTGGGTTACTAA
Protein sequence
MGWDELSTFSYKCLNVIQSLGTQSFLHFLNMVCTRKASVRLLRCVSRLSAKSMTYASVLSACANIYDLQWGKHLHARIVRIEPFLDVLVGNGLVDMYAKCGCIEASRKRE
NISVGEQLHGFTVKTGMDSSVPVGNATVTIDDHCIYSLWRCRKSTDYFDQMPERNVISWNSMLGAYFQNGFWEEGLKLYILMLRQEFRPDWITFATTISSCSELGMLKLG
TQIVSQAEKSGLGSDVSVANSAVTLYSRCGKIEEAQNIFDSIQEKNLISWNSIMGGYAQNGQGRKVIEVFQNMLIVGCKPDHITYVAILSGCSHSGLVKEGKHYFNSMTE
DFGISATSEHFACMVDLFVELGY
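The protein sequence above corresponds to the CDS read codon by codon. The sketch below shows that relationship under the assumption that the standard genetic code applies (the page does not say which translation table was used); `translate` and `CODON_TABLE` are illustrative names, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Unknown bases (for example N) raise KeyError in this simple sketch.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_codons = translate("ATGGGATGGGACGAGCTC")  # first 18 nt of the CDS
last_codons = translate("GGTTACTAA")            # last 9 nt of the CDS
print(first_codons, last_codons)
```

Passing the full CDS text to `translate` and stripping the trailing `*` should give a sequence comparable to the protein listed above, provided the standard-code assumption holds.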