; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012371 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012371
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationscaffold63:255663..257131
RNA-Seq ExpressionMS012371
SyntenyMS012371
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014924.1 hypothetical protein SDJN02_22555, partial [Cucurbita argyrosperma subsp. argyrosperma]5.8e-21285.07Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNL LKFP+AAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S   S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C  D+ GSSAKLLGR+VVP+TGSSL+ETKPCVFQNGWT I   +KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+GS D GSGSGSRPGSGDFGYL+ Y YKGFVMS+ VEGMKKK R+ EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAV+LS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_008454429.1 PREDICTED: uncharacterized protein LOC103494838 [Cucumis melo]4.3e-21584.73Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSH-SLAACFSLNKSQMEKLLSKRKD
        MDPCPF+R+LVGNL LKFPVAAKPSFSGVHPS+SPCFCKIKL DFPTQFVT+PL+V+G+ S           S+QSH SLAACFSLNKSQ+EKL+ KRKD
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSH-SLAACFSLNKSQMEKLLSKRKD

Query:  PAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         +VKIEVYTGR GPA C  DV GSSAKLLGR+ VP+TGS L+ETKPCVFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEG
        WRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSG+GSRPGSGDFGYL+ Y YKGFVMS+ VEG
Subjt:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEG

Query:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        MKKK R+PEVEV VQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_022922983.1 uncharacterized protein LOC111430804 [Cucurbita moschata]1.5e-21285.29Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNL LKFP+AAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S   S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C  D+ GSSAKLLGR+VVP+TGSSL+ETKPCVFQNGWT I   +KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+GS D GSGSGSRPGSGDFGYL+ Y YKGFVMS+ VEGMKKK R+ EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_023553155.1 uncharacterized protein LOC111810646 [Cucurbita pepo subsp. pepo]1.2e-21285.75Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNL LKFPVAAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S   S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C  D+ GSSAKLLGR+VVP+TGSSL+ETKPCVFQNGWT I    KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+GS D GSGSGSRPGSGDFGYL+ Y YKGFVMS+ VEGMKKK RK EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

XP_038906212.1 uncharacterized protein LOC120092083 [Benincasa hispida]1.2e-21785.49Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS--------------SAQSH-SLAACFSLNKSQMEKLLSK
        MDPCPF+R+LVGNL LKFPVAAKPSFSGV+PSSSPCFCKIKL DFPTQFVTVPLVV+G+ S              S+QSH SLAACFSLNKSQ+EKL+ K
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS--------------SAQSH-SLAACFSLNKSQMEKLLSK

Query:  RKDPAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQV
        RKDP+VKIEVYTGR GPA C  DV GSSAKLLGR++VP+TGSSL+ETKPCVFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQV
Subjt:  RKDPAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQV

Query:  QGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPV
        QG+V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPV
Subjt:  QGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPV

Query:  DGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSST
        DGSWRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSGSGSRPGSGDFGYL+ Y YKGFVMS+ 
Subjt:  DGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSST

Query:  VEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        VEGMKKK R+PEVEVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  VEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

TrEMBL top hitse value%identityAlignment
A0A0A0KY09 Uncharacterized protein1.1e-21382.97Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------------SAQSH-SLAACFSLNKSQMEKL
        MDPCPF+R+LVGNL LKFPVAA+PSFS VHPS+SPC+CKIKL DFPTQFVT+PL+V+G+ S                 S QSH S++A FSLNKSQ+EKL
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------------SAQSH-SLAACFSLNKSQMEKL

Query:  LSKRKDPAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQV
        + KRKDP+VKIEVYTGR GPA+C  DV GSSAKLLGR+ VP+TGS L+ETKPCVFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQV
Subjt:  LSKRKDPAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQV

Query:  FQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
        FQVQG+V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
Subjt:  FQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL

Query:  RPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVM
        RPVDGSWRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTIDMT SASPA SP+GS D GSG+GSRPGSGDFGYL+ Y YKGFVM
Subjt:  RPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVM

Query:  SSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        S+ VEGMKKK R+PEVEVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  SSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A1S3BYK8 uncharacterized protein LOC1034948382.1e-21584.73Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSH-SLAACFSLNKSQMEKLLSKRKD
        MDPCPF+R+LVGNL LKFPVAAKPSFSGVHPS+SPCFCKIKL DFPTQFVT+PL+V+G+ S           S+QSH SLAACFSLNKSQ+EKL+ KRKD
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSH-SLAACFSLNKSQMEKLLSKRKD

Query:  PAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         +VKIEVYTGR GPA C  DV GSSAKLLGR+ VP+TGS L+ETKPCVFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEG
        WRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSG+GSRPGSGDFGYL+ Y YKGFVMS+ VEG
Subjt:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEG

Query:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        MKKK R+PEVEV VQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A5D3E122 Formin-like protein 182.1e-21584.73Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSH-SLAACFSLNKSQMEKLLSKRKD
        MDPCPF+R+LVGNL LKFPVAAKPSFSGVHPS+SPCFCKIKL DFPTQFVT+PL+V+G+ S           S+QSH SLAACFSLNKSQ+EKL+ KRKD
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS-----------SAQSH-SLAACFSLNKSQMEKLLSKRKD

Query:  PAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN
         +VKIEVYTGR GPA C  DV GSSAKLLGR+ VP+TGS L+ETKPCVFQNGWT IGE KKGYSSAQLHLTVR+EPDPRFVFRFDGEPECSPQVFQVQG+
Subjt:  PAVKIEVYTGRRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGN

Query:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
        V+QPVF+CKFGFRNERDWDRSRSSI+E +STSKSWLPKIRS++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Subjt:  VRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS

Query:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEG
        WRPWGRLEAWRESGGSDS+GYRFELLPATSAAATLA STISS +GG+FTIDMT SASPA SP+GS D GSG+GSRPGSGDFGYL+ Y YKGFVMS+ VEG
Subjt:  WRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEG

Query:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        MKKK R+PEVEV VQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  MKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A6J1E8C1 uncharacterized protein LOC1114308047.4e-21385.29Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNL LKFP+AAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S   S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C  D+ GSSAKLLGR+VVP+TGSSL+ETKPCVFQNGWT I   +KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+GS D GSGSGSRPGSGDFGYL+ Y YKGFVMS+ VEGMKKK R+ EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

A0A6J1JBG6 uncharacterized protein LOC1114829431.4e-21185.29Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG
        MDPCPF+RVLVGNL LKFPVAAKPSFSGVHPSSSPCFCKIKL DFPTQF TVPLVV+G+ S A S S  LAACFSLNKSQ+EKL+SKRKD  VKIEVYTG
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHS--LAACFSLNKSQMEKLLSKRKDPAVKIEVYTG

Query:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        R GP  C  D+ GSSAKLLGR+VVP+TGSSL+ETKPCVF NGWT I    KGYSSAQLHLTVRAE DPRFVFRFDGEPECSPQVFQVQG+V+QPVF+CKF
Subjt:  RRGPANC--DVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
        GFRNERDWDRSRSSI+E +STS+SWLPKI S++DQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW PWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV
        RESGGSDS+GYRFELLPATSAAATLA STISS +GGKFTID T SASP  SP+ S D GSGSGSRPGSGDFGYL+ Y YKGFVMS+ VEGMKKK R+ EV
Subjt:  RESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEV

Query:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        EVAVQHVTCTEDAAVFVALAAAVDLS+DACRLFSQKLRKELR
Subjt:  EVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)4.9e-13256.62Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK---RKDPAVKIEVYT
        MDPCPF+R+ +GNL LK P+AAK + S VHPSSSPCFCKIKLK+FP Q   +P +        +  +LAA F L+ S +++L S+      P +KI +YT
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK---RKDPAVKIEVYT

Query:  GRRGPANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGE-SKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF
        GR G A C V   S +LL +V VPL  S   ++KPCVF NGW  +G+ + K  SSAQ HL V+AEPDPRFVF+FDGEPECSPQV Q+QGN+RQPVF+CKF
Subjt:  GRRGPANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGE-SKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKF

Query:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW
          R+  D  +   S+    S S+SWL    S++++  KERKGWSIT+HDLSGSPVA AS+VTPFV SPG+ RVSRSNPG+WLILRP D +WRPWGRLEAW
Subjt:  GFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAW

Query:  RESGG-SDSVGYRFELLP--ATSAAATLATSTISSKTGGKFTIDMTSSASPAN---------SPHGSCDFGSGSGSRP------GSGDFGY-LSPYS-YK
        RE GG +D +GYRFEL+P  ++ A   LA STISS  GGKF+I++ SS S ++         S  G    GSG G+ P      GSGD+GY L P++ YK
Subjt:  RESGG-SDSVGYRFELLP--ATSAAATLATSTISSKTGGKFTIDMTSSASPAN---------SPHGSCDFGSGSGSRP------GSGDFGY-LSPYS-YK

Query:  GFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL
        GFVMS++VEG + K  KP VEV+VQHV+C EDAA +VAL+AA+DLS+DACRLF+Q++RKEL
Subjt:  GFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL

AT1G50040.1 Protein of unknown function (DUF1005)1.5e-11752.98Show/hide
Query:  MDPCPFVRVLVGNLVLKFP----------VAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS------SAQSHSLAACFSLNKSQMEKLLS
        MDPC FVR++VGNL ++FP           ++ PS S V  SS  C+CKIK K FP Q V+VP+++  ++       S    ++AACFSL+KSQ+E  L 
Subjt:  MDPCPFVRVLVGNLVLKFP----------VAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNS------SAQSHSLAACFSLNKSQMEKLLS

Query:  KRKDPAVKIEVYTGRRGPANCDVVGSSA-KLLGRVVVPLTGSSLAETKPCVFQNGWTRIG----ESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQ
        K K   + +EVY+  R  A+C  V +S  KL+GR  V L     AE+K C+  NGW  +G     +KK  S  +LH++VR EPD RFVF+FDGEPECSPQ
Subjt:  KRKDPAVKIEVYTGRRGPANCDVVGSSA-KLLGRVVVPLTGSSLAETKPCVFQNGWTRIG----ESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQ

Query:  VFQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLI
        VFQVQGN +Q VF+CKFGFRN  D + S S            L  + S K+Q +KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS+RVSRS+PGAWLI
Subjt:  VFQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLI

Query:  LRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTID---MTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYK
        LRP   +W+PW RL+AWRE G SD +GYRFEL     A A  A+S+IS+K GG F ID    T++ +  +S  GS D  S S  R    D G  S + + 
Subjt:  LRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTID---MTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYK

Query:  --------GFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
                GFVMS+ V+G++K+  KP+VEV V+HVTCTEDAA  VALAAAVDLS+DACRLFSQKLR ELR
Subjt:  --------GFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

AT3G19680.1 Protein of unknown function (DUF1005)8.6e-12954.02Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAK-------PSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGD-------NSSAQSHSLAACFSLNKSQMEKLLSKR
        MDPC FVR++VGNL ++FP ++        PS SG++P++  C+CKI+ K+FP + V+VP++   +       +SS    ++AACFSL+K+Q+E  L K 
Subjt:  MDPCPFVRVLVGNLVLKFPVAAK-------PSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGD-------NSSAQSHSLAACFSLNKSQMEKLLSKR

Query:  KDPAVKIEVYT-----GRRG--PANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESK---KGYSSAQLHLTVRAEPDPRFVFRFDGEPEC
        K   + +E Y+     G  G   A+C +  +  KLLGR  V L   S AETK  +  NGW  +   K   K  S  +LH++VR EPDPRFVF+FDGEPEC
Subjt:  KDPAVKIEVYT-----GRRG--PANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESK---KGYSSAQLHLTVRAEPDPRFVFRFDGEPEC

Query:  SPQVFQVQGNVRQPVFSCKFGFRNERDWDRS---RSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSN
        SPQVFQVQGN +Q VF+CKFG RN    DR+    SS+    S+++S +  + S+K+Q +KERKGWSIT+HDLSGSPVA ASMVTPFVPSPGS+RV+RS+
Subjt:  SPQVFQVQGNVRQPVFSCKFGFRNERDWDRS---RSSISEPNSTSKSWLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSN

Query:  PGAWLILRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDM-----TSSASPANSPHGSCDFGSGS------GSRP
        PGAWLILRP   +W+PWGRLEAWRE+G SD++GYRFEL     A A  A+S+IS K GG F ID+     T++++P  SP GS D GSGS       SRP
Subjt:  PGAWLILRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKTGGKFTIDM-----TSSASPANSPHGSCDFGSGS------GSRP

Query:  GSG---DFGYLSPY------SYKGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
        GSG   DFGYL P         +GFVMS+TVEG+ K+  KPEVEV V HVTCTEDAA  VALAAAVDLS+DACRLFS KLRKELR
Subjt:  GSG---DFGYLSPY------SYKGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR

AT4G29310.1 Protein of unknown function (DUF1005)4.0e-10250.67Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSG--VHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSS-AQSHSLAACFSLNKSQMEKLLSKRKDPAVKIEVYT
        MDPCPFVR+ + +L L+ P  A     G  VHPSS+PC+CK+++K FP+Q   +PL    D SS  +S + A  F L+   + ++  K+   ++++ VY 
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSG--VHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSS-AQSHSLAACFSLNKSQMEKLLSKRKDPAVKIEVYT

Query:  GRRGPANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFG
        GR G   C V  +S KLLG+V V +   + A ++   F NGW ++G       SA+LHL V AEPDPRFVF+F GEPECSP V+Q+Q N++QPVFSCKF 
Subjt:  GRRGPANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFG

Query:  FRNERDWDRSRSSISEPNSTSKSWLPKIRSD---KDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRPWG
          ++R+  RSRS  S    +S+ W+ +  S    + + A+ERKGW ITIHDLSGSPVAAASM+TPFV SPGS RVSRSNPGAWLILRP      SW+PWG
Subjt:  FRNERDWDRSRSSISEPNSTSKSWLPKIRSD---KDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRPWG

Query:  RLEAWRESGGSDSVGYRFELL--PATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKK
        RLEAWRE G  D +GY+FEL+   +TS    +A  T+S+K GGKF+ID                  SG G  P        SP   KGFVM S+VEG + 
Subjt:  RLEAWRESGGSDSVGYRFELL--PATSAAATLATSTISSKTGGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKK

Query:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL
        K  KP V V  QHVTC  DAA+FVAL+AAVDLSVDAC+LFS+KLRKEL
Subjt:  KGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKEL

AT5G17640.1 Protein of unknown function (DUF1005)4.1e-7841.23Show/hide
Query:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPS---SSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK----RKDPAVKI
        MDP  F+R+ VG+L L+ P     S S  +     SS C C+IKL+ FP Q  ++PL+ + D ++   HS++  F L +S +  LL+          ++I
Subjt:  MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPS---SSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSK----RKDPAVKI

Query:  EVYTGRRGPANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFS
         V+TG++   NC  VG   + +G   + + G    E KP +  NGW  IG++K+   +A+LHL V+ +PDPR+VF+F+     SPQ+ Q++G+V+QP+FS
Subjt:  EVYTGRRGPANCDVVGSSAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFS

Query:  CKFGFRNERDWDRSRSSISEPNSTSKSWLPK-IRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRP
        CKF          SR  +S+ +  +  W      ++ +   +ERKGW + IHDLSGS VAAA + TPFVPS G   V++SNPGAWL++RP      SW+P
Subjt:  CKFGFRNERDWDRSRSSISEPNSTSKSWLPK-IRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSWRP

Query:  WGRLEAWRESGGSDSVGYRFELLPATSAAATLATS--TISSKTGGKFTID-----MTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSS
        WG+LEAWRE G  DSV  RF LL        +  S   IS++ GG+F ID     +T +A+P  SP  S DF SG G     G           GFVMSS
Subjt:  WGRLEAWRESGGSDSVGYRFELLPATSAAATLATS--TISSKTGGKFTID-----MTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSS

Query:  TVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR
         V+G + K  KP V++A++HVTC EDAA+F+ALAAAVDLS+ AC+ F +  R+  R
Subjt:  TVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCGTGCCCTTTCGTCCGGGTTCTCGTCGGAAACTTGGTTCTCAAGTTTCCGGTCGCTGCGAAACCGTCCTTTTCCGGCGTGCATCCGTCGAGTTCTCCGTGCTT
CTGCAAAATCAAACTCAAGGATTTTCCGACGCAGTTCGTCACCGTTCCTCTCGTCGTCAACGGCGATAATTCTTCTGCTCAATCTCACTCACTCGCCGCCTGCTTCAGCC
TCAACAAATCTCAGATGGAGAAGCTTCTCTCGAAGCGGAAGGATCCGGCCGTGAAAATCGAAGTCTACACCGGCCGCCGTGGTCCGGCCAATTGCGACGTCGTCGGTAGC
TCCGCCAAGTTGCTTGGCCGAGTCGTCGTGCCGTTGACCGGCTCGAGCCTCGCCGAAACCAAGCCGTGCGTGTTCCAGAACGGCTGGACCAGAATCGGCGAGAGCAAGAA
AGGCTACTCGTCGGCGCAATTGCACTTGACGGTTCGCGCAGAGCCGGATCCGAGATTCGTGTTTCGATTCGACGGCGAGCCGGAGTGTAGCCCGCAGGTTTTTCAGGTGC
AAGGAAATGTGAGGCAGCCGGTTTTCTCTTGCAAATTCGGTTTCAGAAACGAGCGCGATTGGGACCGATCAAGGTCATCAATTTCTGAACCTAATAGCACGTCGAAGAGT
TGGTTACCGAAGATTCGATCCGATAAGGACCAATCCGCGAAAGAACGAAAAGGATGGTCCATTACGATCCATGACCTCTCCGGATCGCCGGTCGCCGCCGCGTCGATGGT
GACGCCGTTCGTTCCGTCGCCGGGATCGCATCGTGTAAGCCGCTCAAATCCCGGCGCGTGGCTCATCCTCCGGCCGGTCGACGGCAGCTGGAGGCCGTGGGGCCGCCTCG
AGGCCTGGCGTGAGAGCGGCGGCTCCGATTCAGTCGGCTACCGGTTCGAGCTCCTCCCGGCGACCTCCGCCGCCGCGACACTCGCGACCTCCACCATCAGCTCGAAGACC
GGCGGGAAGTTCACGATCGACATGACCTCGAGCGCGTCGCCGGCGAACAGCCCCCACGGGAGCTGCGACTTCGGGTCCGGATCGGGGTCAAGGCCCGGATCCGGAGACTT
CGGGTACTTGTCGCCGTATTCGTACAAGGGATTCGTGATGTCGTCGACGGTGGAGGGGATGAAGAAGAAGGGGAGGAAGCCGGAGGTGGAGGTGGCGGTGCAGCACGTGA
CTTGCACGGAGGACGCGGCGGTGTTCGTGGCGTTGGCGGCGGCGGTGGACCTGAGCGTCGACGCCTGCAGGTTGTTCTCTCAAAAGCTAAGGAAGGAGCTGAGGCCA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCGTGCCCTTTCGTCCGGGTTCTCGTCGGAAACTTGGTTCTCAAGTTTCCGGTCGCTGCGAAACCGTCCTTTTCCGGCGTGCATCCGTCGAGTTCTCCGTGCTT
CTGCAAAATCAAACTCAAGGATTTTCCGACGCAGTTCGTCACCGTTCCTCTCGTCGTCAACGGCGATAATTCTTCTGCTCAATCTCACTCACTCGCCGCCTGCTTCAGCC
TCAACAAATCTCAGATGGAGAAGCTTCTCTCGAAGCGGAAGGATCCGGCCGTGAAAATCGAAGTCTACACCGGCCGCCGTGGTCCGGCCAATTGCGACGTCGTCGGTAGC
TCCGCCAAGTTGCTTGGCCGAGTCGTCGTGCCGTTGACCGGCTCGAGCCTCGCCGAAACCAAGCCGTGCGTGTTCCAGAACGGCTGGACCAGAATCGGCGAGAGCAAGAA
AGGCTACTCGTCGGCGCAATTGCACTTGACGGTTCGCGCAGAGCCGGATCCGAGATTCGTGTTTCGATTCGACGGCGAGCCGGAGTGTAGCCCGCAGGTTTTTCAGGTGC
AAGGAAATGTGAGGCAGCCGGTTTTCTCTTGCAAATTCGGTTTCAGAAACGAGCGCGATTGGGACCGATCAAGGTCATCAATTTCTGAACCTAATAGCACGTCGAAGAGT
TGGTTACCGAAGATTCGATCCGATAAGGACCAATCCGCGAAAGAACGAAAAGGATGGTCCATTACGATCCATGACCTCTCCGGATCGCCGGTCGCCGCCGCGTCGATGGT
GACGCCGTTCGTTCCGTCGCCGGGATCGCATCGTGTAAGCCGCTCAAATCCCGGCGCGTGGCTCATCCTCCGGCCGGTCGACGGCAGCTGGAGGCCGTGGGGCCGCCTCG
AGGCCTGGCGTGAGAGCGGCGGCTCCGATTCAGTCGGCTACCGGTTCGAGCTCCTCCCGGCGACCTCCGCCGCCGCGACACTCGCGACCTCCACCATCAGCTCGAAGACC
GGCGGGAAGTTCACGATCGACATGACCTCGAGCGCGTCGCCGGCGAACAGCCCCCACGGGAGCTGCGACTTCGGGTCCGGATCGGGGTCAAGGCCCGGATCCGGAGACTT
CGGGTACTTGTCGCCGTATTCGTACAAGGGATTCGTGATGTCGTCGACGGTGGAGGGGATGAAGAAGAAGGGGAGGAAGCCGGAGGTGGAGGTGGCGGTGCAGCACGTGA
CTTGCACGGAGGACGCGGCGGTGTTCGTGGCGTTGGCGGCGGCGGTGGACCTGAGCGTCGACGCCTGCAGGTTGTTCTCTCAAAAGCTAAGGAAGGAGCTGAGGCCA
Protein sequenceShow/hide protein sequence
MDPCPFVRVLVGNLVLKFPVAAKPSFSGVHPSSSPCFCKIKLKDFPTQFVTVPLVVNGDNSSAQSHSLAACFSLNKSQMEKLLSKRKDPAVKIEVYTGRRGPANCDVVGS
SAKLLGRVVVPLTGSSLAETKPCVFQNGWTRIGESKKGYSSAQLHLTVRAEPDPRFVFRFDGEPECSPQVFQVQGNVRQPVFSCKFGFRNERDWDRSRSSISEPNSTSKS
WLPKIRSDKDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWRPWGRLEAWRESGGSDSVGYRFELLPATSAAATLATSTISSKT
GGKFTIDMTSSASPANSPHGSCDFGSGSGSRPGSGDFGYLSPYSYKGFVMSSTVEGMKKKGRKPEVEVAVQHVTCTEDAAVFVALAAAVDLSVDACRLFSQKLRKELRP