CuGenDBv2

MC10g0329 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC10g0329
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1005)
Genome location: MC10:2655198..2656641
RNA-Seq Expression: MC10g0329
Synteny: MC10g0329
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010410 - Protein of unknown function DUF1005
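The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the field can be parsed and the span computed as follows. The names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re

# Pattern for "MC10:2655198..2656641"-style locations (format inferred from this record).
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Return (chromosome, start, end) parsed from a 'chrom:start..end' string."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC10:2655198..2656641")
print(chrom, span_length(start, end))
```

Under the stated assumption, the gene model for MC10g0329 spans 1444 bp on MC10.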


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008454429.1 PREDICTED: uncharacterized protein LOC103494838 [Cucumis melo] (e-value 1.86e-263, identity 83.41%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSHS-LAACFSLNKSQMEKLLSKRKD
        MDPCPF+R+LVGNLALKFPVAAKPSFSGVHPS+SPCFCKIKL DFPTQFVT+PL+V+G+ S           S+QSHS LAACFSLNKSQ+EKL+ KRKD
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSHS-LAACFSLNKSQMEKLLSKRKD

Query:  PAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         +VKIEVYTGR GPA C  DV GSSAKLLGR+ VP+TGS L+ETKP VFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVMSSTVEG
        WRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSG      SGDFGYL+ Y Y+GFVMS+ VEG
Subjt:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVMSSTVEG

Query:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        MKKK R+PEVEV VQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_022922983.1 uncharacterized protein LOC111430804 [Cucurbita moschata] (e-value 4.24e-260, identity 83.48%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNLALKFP+AAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S   S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C     GSSAKLLGR+VVP+TGSSL+ETKP VFQNGWT I   +KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+GS D GSGSG      DFGYL+ Y Y+GFVMS+ VEGMKKK R+ EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_023516135.1 uncharacterized protein LOC111780086 [Cucurbita pepo subsp. pepo] (e-value 1.92e-260, identity 83.07%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGD--------NSSAQSHS-LAACFSLNKSQMEKLLSKRKDPAV
        MDPC F+RVLVGNLA+KFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPL+V+G+        +SS+QSHS LA+CFSLNKSQ+EKL+SKRKD +V
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGD--------NSSAQSHS-LAACFSLNKSQMEKLLSKRKDPAV

Query:  KIEVYTGRRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQ
        KIEV+TG R PA+C    + SSAKLLGR+VVP+T SSLAETKP +FQNGWT IGE K+G SSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQG+V Q
Subjt:  KIEVYTGRRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQ

Query:  PVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRP
        PVF+CKFGFRNERDWDRSRSSISE +STSKSWLPKIRS+KDQSA ERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRP
Subjt:  PVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRP

Query:  WGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGS------GSGDFGYLSPYSYRGFVMSSTVEGMKK
        WGRLEAWRESGGSDS+GYRFELLP  SAAA LATSTISS  GGKFTID+T SASP  SP+GS D  S      GSGDFGYLS Y Y+GFVMS+TVEGMKK
Subjt:  WGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGS------GSGDFGYLSPYSYRGFVMSSTVEGMKK

Query:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        + R+PEVEVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_023553155.1 uncharacterized protein LOC111810646 [Cucurbita pepo subsp. pepo] (e-value 2.99e-260, identity 83.94%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S   S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C     GSSAKLLGR+VVP+TGSSL+ETKP VFQNGWT I    KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+GS D GSGSG      DFGYL+ Y Y+GFVMS+ VEGMKKK RK EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_038906212.1 uncharacterized protein LOC120092083 [Benincasa hispida] (e-value 2.66e-266, identity 83.96%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS--------------SAQSHS-LAACFSLNKSQMEKLLSK
        MDPCPF+R+LVGNLALKFPVAAKPSFSGV+PSSSPCFCKIKL DFPTQFVTVPLVV+G+ S              S+QSHS LAACFSLNKSQ+EKL+ K
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS--------------SAQSHS-LAACFSLNKSQMEKLLSK

Query:  RKDPAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQV
        RKDP+VKIEVYTGR GPA C  DV GSSAKLLGR++VP+TGSSL+ETKP VFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQV
Subjt:  RKDPAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQV

Query:  QGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPV
        QG+V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPV
Subjt:  QGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPV

Query:  DGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSST
        DGSWRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSGSG      DFGYL+ Y Y+GFVMS+ 
Subjt:  DGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSST

Query:  VEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        VEGMKKK R+PEVEVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  VEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KY09 Uncharacterized protein (e-value 2.17e-261, identity 81.66%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSA-----------------QSHS-LAACFSLNKSQMEKL
        MDPCPF+R+LVGNLALKFPVAA+PSFS VHPS+SPC+CKIKL DFPTQFVT+PL+V+G+ S A                 QSHS ++A FSLNKSQ+EKL
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSA-----------------QSHS-LAACFSLNKSQMEKL

Query:  LSKRKDPAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQV
        + KRKDP+VKIEVYTGR GPA+C  DV GSSAKLLGR+ VP+TGS L+ETKP VFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQV
Subjt:  LSKRKDPAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQV

Query:  FQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
        FQVQG+V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
Subjt:  FQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL

Query:  RPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVM
        RPVDGSWRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTIDMT SASPA SP+GS D GSG      SGDFGYL+ Y Y+GFVM
Subjt:  RPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVM

Query:  SSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        S+ VEGMKKK R+PEVEVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  SSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A1S3BYK8 uncharacterized protein LOC103494838 (e-value 9.02e-264, identity 83.41%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSHS-LAACFSLNKSQMEKLLSKRKD
        MDPCPF+R+LVGNLALKFPVAAKPSFSGVHPS+SPCFCKIKL DFPTQFVT+PL+V+G+ S           S+QSHS LAACFSLNKSQ+EKL+ KRKD
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSHS-LAACFSLNKSQMEKLLSKRKD

Query:  PAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         +VKIEVYTGR GPA C  DV GSSAKLLGR+ VP+TGS L+ETKP VFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVMSSTVEG
        WRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSG      SGDFGYL+ Y Y+GFVMS+ VEG
Subjt:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVMSSTVEG

Query:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        MKKK R+PEVEV VQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A5D3E122 Formin-like protein 18 (e-value 9.02e-264, identity 83.41%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSHS-LAACFSLNKSQMEKLLSKRKD
        MDPCPF+R+LVGNLALKFPVAAKPSFSGVHPS+SPCFCKIKL DFPTQFVT+PL+V+G+ S           S+QSHS LAACFSLNKSQ+EKL+ KRKD
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSHS-LAACFSLNKSQMEKLLSKRKD

Query:  PAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         +VKIEVYTGR GPA C  DV GSSAKLLGR+ VP+TGS L+ETKP VFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PAVKIEVYTGRRGPANC--DV-GSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVMSSTVEG
        WRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSG      SGDFGYL+ Y Y+GFVMS+ VEG
Subjt:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSG------SGDFGYLSPYSYRGFVMSSTVEG

Query:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        MKKK R+PEVEV VQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A6J1E8C1 uncharacterized protein LOC111430804 (e-value 2.05e-260, identity 83.48%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNLALKFP+AAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S   S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C     GSSAKLLGR+VVP+TGSSL+ETKP VFQNGWT I   +KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+GS D GSGSG      DFGYL+ Y Y+GFVMS+ VEGMKKK R+ EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSG------DFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A6J1FT46 uncharacterized protein LOC111447899 (e-value 1.54e-259, identity 83.07%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSA--------QSHSL-AACFSLNKSQMEKLLSKRKDPAV
        MDPC F+RVLVGNLA+KFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPL+V+G+ S A        QSHS  AA FSLNKSQ+EKL+SKRKD +V
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSA--------QSHSL-AACFSLNKSQMEKLLSKRKDPAV

Query:  KIEVYTGRRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQ
        KIEV+TG R PA+C    + SSAKLLGR+VVP+TGSSLAETKP +FQNGWT IGE K+G SSAQLHLTVRAEPDPRFVFRF GEPECSPQVFQVQG+V Q
Subjt:  KIEVYTGRRGPANCD---VGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQ

Query:  PVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRP
        PVF+CKFGFRNERDWDRSRSSISEP+STSKSWLPKIRS+KDQSA ERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRP
Subjt:  PVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRP

Query:  WGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGS------GSGDFGYLSPYSYRGFVMSSTVEGMKK
        WGRLEAWRESGGSDS+GYRFELLP  SAAA LATSTISS  GGKFTID+T SASP  SP+GS D  S      GSGDFGYLS Y Y+GFVMS+TVEGMKK
Subjt:  WGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGS------GSGDFGYLSPYSYRGFVMSSTVEGMKK

Query:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        + R+PEVEVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G10020.1 Protein of unknown function (DUF1005) (e-value 1.1e-131, identity 55.91%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK---RKDPAVKIEVYT
        MDPCPF+R+ +GNLALK P+AAK + S VHPSSSPCFCKIKLK+FP Q   +P +        +  +LAA F L+ S +++L S+      P +KI +YT
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK---RKDPAVKIEVYT

Query:  GRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGE-SKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFG
        GR G A C V  S +LL +V VPL  S   ++KP VF NGW  +G+ + K  SSAQ HL V+AEPDPRFVF+FDGEPECSPQV Q+QGN+RQPVF+CKF 
Subjt:  GRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGE-SKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFG

Query:  FRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAWR
         R+  D  +   S+    S S+SWL    S++++  KERKGWSIT+HDLSGSPVA AS+VTPFV SPG+ RVSRSNPG+WLILRP D +WRPWGRLEAWR
Subjt:  FRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAWR

Query:  ESGG-SDSVGYRFELLP--ATSAAATLATSTISSKTGGKFTIDMTSS--------------------------ASPANSPHGSCDFGSGSGDFGY-LSPY
        E GG +D +GYRFEL+P  ++ A   LA STISS  GGKF+I++ SS                          ASPANSP G      GSGD+GY L P+
Subjt:  ESGG-SDSVGYRFELLP--ATSAAATLATSTISSKTGGKFTIDMTSS--------------------------ASPANSPHGSCDFGSGSGDFGY-LSPY

Query:  S-YRGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL
        + Y+GFVMS++VEG + K  KP VEV+VQHV+C EDAA +VAL+AA+DLS+DACRLF+Q++RKEL
Subjt:  S-YRGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL

AT1G50040.1 Protein of unknown function (DUF1005) (e-value 1.1e-115, identity 52.35%)
Query:  MDPCPFVRVLVGNLALKFP----------VAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS------SAQSHSLAACFSLNKSQMEKLLS
        MDPC FVR++VGNLA++FP           ++ PS S V  SS  C+CKIK K FP Q V+VP+++  ++       S    ++AACFSL+KSQ+E  L 
Subjt:  MDPCPFVRVLVGNLALKFP----------VAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS------SAQSHSLAACFSLNKSQMEKLLS

Query:  KRKDPAVKIEVYTGRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIG----ESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVF
        K K   + +EVY+ R         S  KL+GR  V L     AE+K  +  NGW  +G     +KK  S  +LH++VR EPD RFVF+FDGEPECSPQVF
Subjt:  KRKDPAVKIEVYTGRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIG----ESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVF

Query:  QVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILR
        QVQGN +Q VF+CKFGFRN  D + S S            L  + S K+Q +KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS+RVSRS+PGAWLILR
Subjt:  QVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILR

Query:  PVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTID---MTSSASPANSPHGSCDFG-----------SGSGD---FGYL
        P   +W+PW RL+AWRE G SD +GYRFEL     A A  A+S+IS+K GG F ID    T++ +  +S  GS D             SGSG    F   
Subjt:  PVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTID---MTSSASPANSPHGSCDFG-----------SGSGD---FGYL

Query:  SPYSYRGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
              GFVMS+ V+G++K+  KP+VEV V+HVTCTEDAA  VALAAAVDLS+DACRLFSQKLR ELR
Subjt:  SPYSYRGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

AT3G19680.1 Protein of unknown function (DUF1005) (e-value 3.6e-127, identity 53.18%)
Query:  MDPCPFVRVLVGNLALKFPVAAK-------PSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGD-------NSSAQSHSLAACFSLNKSQMEKLLSKR
        MDPC FVR++VGNLA++FP ++        PS SG++P++  C+CKI+ K+FP + V+VP++   +       +SS    ++AACFSL+K+Q+E  L K 
Subjt:  MDPCPFVRVLVGNLALKFPVAAK-------PSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGD-------NSSAQSHSLAACFSLNKSQMEKLLSKR

Query:  KDPAVKIEVYTGRRGPANCDVGSSA----------KLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESK---KGYSSAQLHLTVRAEPDPRFVFRFDGEP
        K   + +E Y+  RG ++ D G S           KLLGR  V L   S AETK ++  NGW  +   K   K  S  +LH++VR EPDPRFVF+FDGEP
Subjt:  KDPAVKIEVYTGRRGPANCDVGSSA----------KLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESK---KGYSSAQLHLTVRAEPDPRFVFRFDGEP

Query:  ECSPQVFQVQGNVRQPVFSCKFGFRNERDWDRS---RSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSR
        ECSPQVFQVQGN +Q VF+CKFG RN    DR+    SS+    S+++S +  + S+K+Q +KERKGWSIT+HDLSGSPVA ASMVTPFVPSPGS+RV+R
Subjt:  ECSPQVFQVQGNVRQPVFSCKFGFRNERDWDRS---RSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSR

Query:  SNPGAWLILRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDM-----TSSASPANSPHGSCDFGSGS--------
        S+PGAWLILRP   +W+PWGRLEAWRE+G SD++GYRFEL     A A  A+S+IS K GG F ID+     T++++P  SP GS D GSGS        
Subjt:  SNPGAWLILRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDM-----TSSASPANSPHGSCDFGSGS--------

Query:  -------GDFGYLSPY------SYRGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
                DFGYL P         RGFVMS+TVEG+ K+  KPEVEV V HVTCTEDAA  VALAAAVDLS+DACRLFS KLRKELR
Subjt:  -------GDFGYLSPY------SYRGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

AT4G29310.1 Protein of unknown function (DUF1005) (e-value 8.0e-103, identity 51.02%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSG--VHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSS-AQSHSLAACFSLNKSQMEKLLSKRKDPAVKIEVYT
        MDPCPFVR+ + +LAL+ P  A     G  VHPSS+PC+CK+++K FP+Q   +PL    D SS  +S + A  F L+   + ++  K+   ++++ VY 
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSG--VHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSS-AQSHSLAACFSLNKSQMEKLLSKRKDPAVKIEVYT

Query:  GRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFGF
        GR G   C V +S KLLG+V V +   + A ++   F NGW ++G       SA+LHL V AEPDPRFVF+F GEPECSP V+Q+Q N++QPVFSCKF  
Subjt:  GRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFGF

Query:  RNERDWDRSRSSISEPNSTSKSWLPKIRSD---KDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRPWGR
         ++R+  RSRS  S    +S+ W+ +  S    + + A+ERKGW ITIHDLSGSPVAAASM+TPFV SPGS RVSRSNPGAWLILRP      SW+PWGR
Subjt:  RNERDWDRSRSSISEPNSTSKSWLPKIRSD---KDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRPWGR

Query:  LEAWRESGGSDSVGYRFELL--PATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGDFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV
        LEAWRE G  D +GY+FEL+   +TS    +A  T+S+K GGKF+ID                  SG G+   +S    +GFVM S+VEG + K  KP V
Subjt:  LEAWRESGGSDSVGYRFELL--PATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGDFGYLSPYSYRGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL
         V  QHVTC  DAA+FVAL+AAVDLSVDAC+LFS+KLRKEL
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL

AT5G17640.1 Protein of unknown function (DUF1005) (e-value 1.1e-80, identity 42.09%)
Query:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPS---SSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK----RKDPAVKI
        MDP  F+R+ VG+LAL+ P     S S  +     SS C C+IKL+ FP Q  ++PL+ + D ++   HS++  F L +S +  LL+          ++I
Subjt:  MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPS---SSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK----RKDPAVKI

Query:  EVYTGRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSC
         V+TG++   NC VG   + +G   + + G    E KP +  NGW  IG++K+   +A+LHL V+ +PDPR+VF+F+     SPQ+ Q++G+V+QP+FSC
Subjt:  EVYTGRRGPANCDVGSSAKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSC

Query:  KFGFRNERDWDRSRSSISEPNSTSKSWLPK-IRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRPW
        KF          SR  +S+ +  +  W      ++ +   +ERKGW + IHDLSGS VAAA + TPFVPS G   V++SNPGAWL++RP      SW+PW
Subjt:  KFGFRNERDWDRSRSSISEPNSTSKSWLPK-IRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRPW

Query:  GRLEAWRESGGSDSVGYRFELLPATSAAATLATS--TISSKTGGKFTID-----MTSSASPANSPHGSCDFGSGSGDFGYLSPYSYRGFVMSSTVEGMKK
        G+LEAWRE G  DSV  RF LL        +  S   IS++ GG+F ID     +T +A+P  SP  S DF SG G        S  GFVMSS V+G + 
Subjt:  GRLEAWRESGGSDSVGYRFELLPATSAAATLATS--TISSKTGGKFTID-----MTSSASPANSPHGSCDFGSGSGDFGYLSPYSYRGFVMSSTVEGMKK

Query:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        K  KP V++A++HVTC EDAA+F+ALAAAVDLS+ AC+ F +  R+  R
Subjt:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
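
Each alignment block above has a query row, an unlabeled match row, and a subject row. In BLAST-style match rows, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch, assuming that convention holds for this page, for tallying identities in one block (the helper name `midline_stats` is hypothetical):

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block.

    query   -- the aligned query row (may contain '-' for gaps)
    midline -- the match row printed between query and subject
    Returns (identities, positives, columns).
    """
    columns = len(query)
    # Trailing spaces on the match row are often stripped; pad them back.
    midline = midline.ljust(columns)[:columns]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, columns


# First 12 columns of the Cucumis melo alignment listed under the GenBank hits.
ident, pos, cols = midline_stats("MDPCPFVRVLVG", "MDPCPF+R+LVG")
print(ident, pos, cols)
```

Summing identities and columns over all blocks of a hit should approximate the reported % identity, although the exact denominator used by the site is not stated here.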


Sequences
CDS sequence
ATGGATCCGTGCCCTTTCGTCCGGGTTCTCGTCGGAAACTTGGCTCTCAAGTTTCCGGTCGCTGCGAAACCGTCCTTTTCCGGCGTGCATCCGTCGAGTTCTCCGTGCTT
CTGCAAAATCAAACTCAAGGATTTTCCGACGCAGTTCGTCACCGTTCCTCTCGTCGTCAACGGCGATAATTCTTCTGCTCAATCTCACTCACTCGCCGCCTGCTTCAGCC
TCAACAAATCTCAGATGGAGAAGCTTCTCTCGAAGCGGAAGGATCCGGCCGTGAAAATCGAAGTCTACACCGGCCGCCGTGGTCCGGCCAATTGCGACGTCGGAAGCTCC
GCCAAGTTGCTTGGCCGAGTCGTCGTGCCGTTGACCGGCTCGAGCCTCGCCGAAACCAAGCCGTGGGTGTTCCAGAACGGCTGGACCAGAATCGGCGAGAGCAAGAAAGG
CTACTCGTCGGCGCAATTGCACTTGACGGTTCGCGCAGAGCCGGATCCGAGATTCGTGTTTCGATTCGACGGCGAGCCGGAGTGTAGCCCGCAGGTTTTTCAGGTGCAAG
GAAACGTGAGGCAGCCGGTTTTCTCTTGCAAATTCGGTTTCAGAAACGAGCGCGATTGGGACCGATCAAGGTCATCAATTTCTGAACCTAATAGCACGTCAAAGAGTTGG
TTACCGAAGATTCGATCCGATAAGGACCAATCCGCGAAAGAACGAAAAGGATGGTCCATTACGATCCACGACCTCTCCGGATCGCCGGTCGCCGCCGCGTCGATGGTGAC
GCCGTTCGTTCCGTCGCCGGGATCGCATCGTGTAAGCCGCTCAAATCCCGGCGCGTGGCTCATCCTCCGGCCGGTCGACGGCAGCTGGAGGCCGTGGGGCCGCCTCGAGG
CCTGGCGTGAGAGCGGCGGCTCCGATTCAGTCGGCTACCGGTTCGAGCTCCTTCCGGCGACCTCCGCCGCCGCGACACTCGCGACCTCCACCATCAGCTCGAAGACCGGC
GGGAAGTTCACGATCGACATGACCTCGAGCGCGTCGCCGGCGAACAGCCCCCACGGGAGCTGCGACTTCGGGTCCGGATCCGGAGACTTCGGGTACTTGTCGCCGTATTC
GTACAGGGGATTCGTGATGTCGTCGACGGTGGAGGGGATGAAGAAAAAGGGGAGGAAGCCGGAGGTGGAGGTGGCGGTGCAGCACGTGACTTGCACGGAGGACGCGGCGG
TGTTCGTTGCGTTGGCGGCGGCGGTGGACCTGAGCGTCGACGCCTGCAGGTTGTTCTCTCAAAAGCTAAGGAAGGAGCTGAGG
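
The CDS above can be checked against the protein sequence in this record by translating it codon by codon. A minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, not a CuGenDB function:

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (NCBI table 1 layout).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X', stops become '*'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


# First eight codons of the CDS listed above.
print(translate("ATGGATCCGTGCCCTTTCGTCCGG"))  # MDPCPFVR
```

Passing the full CDS (line breaks included) to `translate` gives a string that can be compared directly with the listed protein sequence.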
mRNA sequence
ATGGATCCGTGCCCTTTCGTCCGGGTTCTCGTCGGAAACTTGGCTCTCAAGTTTCCGGTCGCTGCGAAACCGTCCTTTTCCGGCGTGCATCCGTCGAGTTCTCCGTGCTT
CTGCAAAATCAAACTCAAGGATTTTCCGACGCAGTTCGTCACCGTTCCTCTCGTCGTCAACGGCGATAATTCTTCTGCTCAATCTCACTCACTCGCCGCCTGCTTCAGCC
TCAACAAATCTCAGATGGAGAAGCTTCTCTCGAAGCGGAAGGATCCGGCCGTGAAAATCGAAGTCTACACCGGCCGCCGTGGTCCGGCCAATTGCGACGTCGGAAGCTCC
GCCAAGTTGCTTGGCCGAGTCGTCGTGCCGTTGACCGGCTCGAGCCTCGCCGAAACCAAGCCGTGGGTGTTCCAGAACGGCTGGACCAGAATCGGCGAGAGCAAGAAAGG
CTACTCGTCGGCGCAATTGCACTTGACGGTTCGCGCAGAGCCGGATCCGAGATTCGTGTTTCGATTCGACGGCGAGCCGGAGTGTAGCCCGCAGGTTTTTCAGGTGCAAG
GAAACGTGAGGCAGCCGGTTTTCTCTTGCAAATTCGGTTTCAGAAACGAGCGCGATTGGGACCGATCAAGGTCATCAATTTCTGAACCTAATAGCACGTCAAAGAGTTGG
TTACCGAAGATTCGATCCGATAAGGACCAATCCGCGAAAGAACGAAAAGGATGGTCCATTACGATCCACGACCTCTCCGGATCGCCGGTCGCCGCCGCGTCGATGGTGAC
GCCGTTCGTTCCGTCGCCGGGATCGCATCGTGTAAGCCGCTCAAATCCCGGCGCGTGGCTCATCCTCCGGCCGGTCGACGGCAGCTGGAGGCCGTGGGGCCGCCTCGAGG
CCTGGCGTGAGAGCGGCGGCTCCGATTCAGTCGGCTACCGGTTCGAGCTCCTTCCGGCGACCTCCGCCGCCGCGACACTCGCGACCTCCACCATCAGCTCGAAGACCGGC
GGGAAGTTCACGATCGACATGACCTCGAGCGCGTCGCCGGCGAACAGCCCCCACGGGAGCTGCGACTTCGGGTCCGGATCCGGAGACTTCGGGTACTTGTCGCCGTATTC
GTACAGGGGATTCGTGATGTCGTCGACGGTGGAGGGGATGAAGAAAAAGGGGAGGAAGCCGGAGGTGGAGGTGGCGGTGCAGCACGTGACTTGCACGGAGGACGCGGCGG
TGTTCGTTGCGTTGGCGGCGGCGGTGGACCTGAGCGTCGACGCCTGCAGGTTGTTCTCTCAAAAGCTAAGGAAGGAGCTGAGG
Protein sequence
MDPCPFVRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSKRKDPAVKIEVYTGRRGPANCDVGSS
AKLLGRVVVPLTGSSLAETKPWVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSW
LPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTG
GKFTIDMTSSASPANSPHGSCDFGSGSGDFGYLSPYSYRGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR