; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020224 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020224
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionglycine-rich cell wall structural protein 1.8-like
Genome locationtig00153449:1335585..1336507
RNA-Seq ExpressionSgr020224
SyntenySgr020224
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448673.1 PREDICTED: glycine-rich cell wall structural protein 2-like [Cucumis melo]7.6e-2752.2Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH----------YPGVGQNPNYRDESYDA-----------YGGGPGRGHGVG---GSALVGSGYGSGG
        MAI+K  S GFLLLV LGLASAAR+LL YD P H          +P V     Y  + +D            YGGG G G+G G   GS+L GSGYGSGG
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH----------YPGVGQNPNYRDESYDA-----------YGGGPGRGHGVG---GSALVGSGYGSGG

Query:  GGGSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSV
        GG  GSGYGG G+HGVGYGSGGG GYG GVGS LGGSGYGSGGGGGSG GYG + GHG GYGSGGGGG G G G    G GYGSGGGGG G         
Subjt:  GGGSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSV

Query:  MLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVG-GHGGGGGIGSGG-----GYGSGGGVHEGGY
                                S V    V+ G+G             G G GGGAGGGYGG +G G G GGGGG G GG     GYGSGGG      
Subjt:  MLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVG-GHGGGGGIGSGG-----GYGSGGGVHEGGY

Query:  ENSKGSGENGGYDGGYAP
            GSGE GGYDGGYAP
Subjt:  ENSKGSGENGGYDGGYAP

XP_011650342.1 glycine-rich cell wall structural protein 2 [Cucumis sativus]1.9e-2551.85Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRD------ESYDA-YGGGPGRGHGVG---GSALVGSGYGSGGGG
        MAI+K  S GFLLLV LGLASAAR+LLSYD P H             P VG   + RD      + +DA YGGG G G+G G   GS+L GSGYGSGGGG
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRD------ESYDA-YGGGPGRGHGVG---GSALVGSGYGSGGGG

Query:  GSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDM
          GSGYGG G+H VGYGSGGG GYG GVGS LGGSGYGSGGGGGSG GYG + G G GYGSGGGGG        +G GYGS  GG GYGSG GGG GV  
Subjt:  GSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDM

Query:  EVEQSVMLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGG-----GYGSGGG
                                             V+ G+G             G G GGGAG GYGG +G  GG GGGGG G GG     GYGSGGG
Subjt:  EVEQSVMLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGG-----GYGSGGG

Query:  VHEGGYENSKGSGENGGYDGGYAP
                  GSGE GGYDGGYAP
Subjt:  VHEGGYENSKGSGENGGYDGGYAP

XP_022145469.1 glycine-rich cell wall structural protein 1.8-like isoform X1 [Momordica charantia]1.3e-3154.3Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGRGDHGVGYGSGGG
        MAI+KAFSL FLLL+G GLASAARTLL ++P  + P   +    RD    AYGGG G G+G G GS+L GSGYGSGGGGG GSGYGG GDHGVGYGSGGG
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGRGDHGVGYGSGGG

Query:  SGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAE
         GYGGG+GSGLGG+GYGSGGGGGSG GYG   G G GYGSGGGGG        +G+GYGS  GG GYGSGGGGG G                        
Subjt:  SGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAE

Query:  AVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKG-------SGENGGYDG
                         E G+G             G G GGG+GGGYGG +G  G G GGG G GGGYG+ GG H GGY    G       SGE GGYDG
Subjt:  AVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKG-------SGENGGYDG

Query:  GY
        GY
Subjt:  GY

XP_022145473.1 glycine-rich cell wall structural protein 2-like [Momordica charantia]1.4e-2866.29Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGR
        MA  +  SLGFLLLVGLGLASAAR LL+YDPP H             P VG +P + D  Y AYGGG G G+G G GSAL GSGYGSGGGGG GSGYGG 
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGR

Query:  GDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGG
         DHG GYGSGGG GYGGG+GSGLGG+GYGSGGGGGSG GYG V+G G GYGSG GGG GS       G GYGS G GG
Subjt:  GDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGG

XP_038905926.1 glycine-rich cell wall structural protein 1.8-like [Benincasa hispida]3.4e-2753.7Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLS------------YDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGR
        MAI +  S GFLLLVGLGLASA R LLS            YD PVH P VG     RD     YGGG G G+G G GS+L GSGYGSGGGG  GSGY G 
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLS------------YDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGR

Query:  GDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLV
        GDHGVGYGSGGG GYG GVGS LGGSGYGSGGGGGSG GYG + GHG GYGSGGGGG        +G GYGS  GG GYGSGGG G G            
Subjt:  GDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLV

Query:  MEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGGG-YGSGGGVHEGGYENSKGSG
                                    VE G+G             G G GGG+G GYGG +G  GG GGGGG G GGG +GSGGG   G      GSG
Subjt:  MEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGGG-YGSGGGVHEGGYENSKGSG

Query:  ENGGYDGGYAP
        E GGYDGGYAP
Subjt:  ENGGYDGGYAP

TrEMBL top hitse value%identityAlignment
A0A0A0L1U7 Uncharacterized protein9.1e-2651.85Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRD------ESYDA-YGGGPGRGHGVG---GSALVGSGYGSGGGG
        MAI+K  S GFLLLV LGLASAAR+LLSYD P H             P VG   + RD      + +DA YGGG G G+G G   GS+L GSGYGSGGGG
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRD------ESYDA-YGGGPGRGHGVG---GSALVGSGYGSGGGG

Query:  GSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDM
          GSGYGG G+H VGYGSGGG GYG GVGS LGGSGYGSGGGGGSG GYG + G G GYGSGGGGG        +G GYGS  GG GYGSG GGG GV  
Subjt:  GSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDM

Query:  EVEQSVMLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGG-----GYGSGGG
                                             V+ G+G             G G GGGAG GYGG +G  GG GGGGG G GG     GYGSGGG
Subjt:  EVEQSVMLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGG-----GYGSGGG

Query:  VHEGGYENSKGSGENGGYDGGYAP
                  GSGE GGYDGGYAP
Subjt:  VHEGGYENSKGSGENGGYDGGYAP

A0A1S3BJM7 glycine-rich cell wall structural protein 2-like3.7e-2752.2Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH----------YPGVGQNPNYRDESYDA-----------YGGGPGRGHGVG---GSALVGSGYGSGG
        MAI+K  S GFLLLV LGLASAAR+LL YD P H          +P V     Y  + +D            YGGG G G+G G   GS+L GSGYGSGG
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH----------YPGVGQNPNYRDESYDA-----------YGGGPGRGHGVG---GSALVGSGYGSGG

Query:  GGGSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSV
        GG  GSGYGG G+HGVGYGSGGG GYG GVGS LGGSGYGSGGGGGSG GYG + GHG GYGSGGGGG G G G    G GYGSGGGGG G         
Subjt:  GGGSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSV

Query:  MLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVG-GHGGGGGIGSGG-----GYGSGGGVHEGGY
                                S V    V+ G+G             G G GGGAGGGYGG +G G G GGGGG G GG     GYGSGGG      
Subjt:  MLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVG-GHGGGGGIGSGG-----GYGSGGGVHEGGY

Query:  ENSKGSGENGGYDGGYAP
            GSGE GGYDGGYAP
Subjt:  ENSKGSGENGGYDGGYAP

A0A6J1CW11 glycine-rich cell wall structural protein 2-like6.7e-2966.29Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGR
        MA  +  SLGFLLLVGLGLASAAR LL+YDPP H             P VG +P + D  Y AYGGG G G+G G GSAL GSGYGSGGGGG GSGYGG 
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGR

Query:  GDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGG
         DHG GYGSGGG GYGGG+GSGLGG+GYGSGGGGGSG GYG V+G G GYGSG GGG GS       G GYGS G GG
Subjt:  GDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGG

A0A6J1CWP3 glycine-rich cell wall structural protein 1.8-like isoform X16.5e-3254.3Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGRGDHGVGYGSGGG
        MAI+KAFSL FLLL+G GLASAARTLL ++P  + P   +    RD    AYGGG G G+G G GS+L GSGYGSGGGGG GSGYGG GDHGVGYGSGGG
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVG-GSALVGSGYGSGGGGGSGSGYGGRGDHGVGYGSGGG

Query:  SGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAE
         GYGGG+GSGLGG+GYGSGGGGGSG GYG   G G GYGSGGGGG        +G+GYGS  GG GYGSGGGGG G                        
Subjt:  SGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAE

Query:  AVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKG-------SGENGGYDG
                         E G+G             G G GGG+GGGYGG +G  G G GGG G GGGYG+ GG H GGY    G       SGE GGYDG
Subjt:  AVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKG-------SGENGGYDG

Query:  GY
        GY
Subjt:  GY

Q9ZWM2 Glycine-rich protein-2 (Fragment)2.0e-2551.54Show/hide
Query:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRD------ESYDA-YGGGPGRGHGVG---GSALVGSGYGSGGGG
        MAI+K  S GFLLLV LGLASAAR+LLSYD P H             P VG   + RD      + +DA YGGG G G+G G   GS+L GSGYGSGGGG
Subjt:  MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVH------------YPGVGQNPNYRD------ESYDA-YGGGPGRGHGVG---GSALVGSGYGSGGGG

Query:  GSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDM
          GSGYGG G+H VGYGSGGG GYG GVGS LGGSGYGSGGGGG+G GYG + G G GYGSGGGGG        +G GYGS  GG GYGSG GGG GV  
Subjt:  GSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGLGGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGG--------YGSGYGSSLGGSGYGSGGGGGRGVDM

Query:  EVEQSVMLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGG-----GYGSGGG
                                             V+ G+G             G G GGGAG GYGG +G  GG GGGGG G GG     GYGSGGG
Subjt:  EVEQSVMLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGV-GGHGGGGGIGSGG-----GYGSGGG

Query:  VHEGGYENSKGSGENGGYDGGYAP
                  GSGE GGYDGGYAP
Subjt:  VHEGGYENSKGSGENGGYDGGYAP

SwissProt top hitse value%identityAlignment
Q9SIH2 Glycine-rich protein DOT14.2e-0443.11Show/hide
Query:  LVGLGLASAARTLL-SYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVGGSALVGSG--------YGSGGGGGSGSGYGGRGDHGVGYGSGGGSGYGG
        L+GLGL SA R LL S +        G N           GGGPG G G GG +  G G         G GGGGG G G GG G  G G G GGGSG GG
Subjt:  LVGLGLASAARTLL-SYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVGGSALVGSG--------YGSGGGGGSGSGYGGRGDHGVGYGSGGGSGYGG

Query:  GVGSGLG-GSGYGSGGGGGSGVGYGGVEG-HGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAEAVVDMAQEESM
        G G G G   G+G GGGGG+G G GG  G HG GYG G G G G GYG   G  G+G GGGGG G                                   
Subjt:  GVGSGLG-GSGYGSGGGGGSGVGYGGVEG-HGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAEAVVDMAQEESM

Query:  VLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKGSGENGGYDGG
                  G    + E      GYG GGGAG GYGGG G GGHGGGGG G G G G GGG   GGY  + G G  GG  GG
Subjt:  VLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKGSGENGGYDGG

Arabidopsis top hitse value%identityAlignment
AT2G36120.1 Glycine-rich protein family3.0e-0543.11Show/hide
Query:  LVGLGLASAARTLL-SYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVGGSALVGSG--------YGSGGGGGSGSGYGGRGDHGVGYGSGGGSGYGG
        L+GLGL SA R LL S +        G N           GGGPG G G GG +  G G         G GGGGG G G GG G  G G G GGGSG GG
Subjt:  LVGLGLASAARTLL-SYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVGGSALVGSG--------YGSGGGGGSGSGYGGRGDHGVGYGSGGGSGYGG

Query:  GVGSGLG-GSGYGSGGGGGSGVGYGGVEG-HGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAEAVVDMAQEESM
        G G G G   G+G GGGGG+G G GG  G HG GYG G G G G GYG   G  G+G GGGGG G                                   
Subjt:  GVGSGLG-GSGYGSGGGGGSGVGYGGVEG-HGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAEAVVDMAQEESM

Query:  VLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKGSGENGGYDGG
                  G    + E      GYG GGGAG GYGGG G GGHGGGGG G G G G GGG   GGY  + G G  GG  GG
Subjt:  VLAMEVEEGLGLAAAAMEVPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKGSGENGGYDGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATCGCTAAAGCTTTCTCCCTTGGTTTTCTTCTGTTGGTGGGTTTAGGCTTAGCTTCGGCTGCCAGAACCCTTCTCAGCTATGATCCTCCTGTGCATTACCCGGG
TGTAGGGCAAAATCCTAACTATCGTGATGAATCCTATGATGCATATGGTGGTGGCCCTGGTAGAGGACATGGAGTTGGAGGCTCTGCTCTGGTAGGTTCGGGATATGGAA
GTGGTGGAGGTGGAGGAAGTGGTTCTGGATATGGAGGTCGAGGAGACCATGGGGTTGGATATGGTAGTGGTGGAGGTAGTGGATATGGAGGAGGGGTTGGCTCTGGCCTC
GGAGGTTCTGGATACGGAAGCGGTGGCGGCGGAGGAAGTGGTGTAGGTTATGGAGGTGTAGAAGGACATGGAGATGGCTACGGTAGCGGTGGTGGTGGAGGATATGGGAG
TGGGTATGGCTCTTCCCTTGGAGGTTCAGGATACGGAAGTGGAGGAGGTGGAGGAAGAGGAGTGGATATGGAGGTGGAGCAGAGCGTGATGTTGGTTATGGAAGTGGTGG
AGGTGGAGGATATGGAAGCGGAGGCGGTGGTGGATATGGCCCAGGAGGAGAGCATGGTGTTGGCTATGGAAGTGGAGGAGGGGCTGGGTCTGGCAGCGGCGGCTATGGAG
GTTCCCGTGGATATGGACGGATATGGGAATGGTGGAGGAGCCGGCGGTGGCTATGGAGGGGGAGAGGGAGTAGGAGGTCATGGTGGTGGCGGTGGAATTGGCTCCGGTGG
TGGTTACGGCAGCGGTGGAGGCGTGCATGAAGGAGGGTATGAAAATAGCAAAGGAAGTGGAGAGAATGGAGGATATGATGGTGGATATGCACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTATCGCTAAAGCTTTCTCCCTTGGTTTTCTTCTGTTGGTGGGTTTAGGCTTAGCTTCGGCTGCCAGAACCCTTCTCAGCTATGATCCTCCTGTGCATTACCCGGG
TGTAGGGCAAAATCCTAACTATCGTGATGAATCCTATGATGCATATGGTGGTGGCCCTGGTAGAGGACATGGAGTTGGAGGCTCTGCTCTGGTAGGTTCGGGATATGGAA
GTGGTGGAGGTGGAGGAAGTGGTTCTGGATATGGAGGTCGAGGAGACCATGGGGTTGGATATGGTAGTGGTGGAGGTAGTGGATATGGAGGAGGGGTTGGCTCTGGCCTC
GGAGGTTCTGGATACGGAAGCGGTGGCGGCGGAGGAAGTGGTGTAGGTTATGGAGGTGTAGAAGGACATGGAGATGGCTACGGTAGCGGTGGTGGTGGAGGATATGGGAG
TGGGTATGGCTCTTCCCTTGGAGGTTCAGGATACGGAAGTGGAGGAGGTGGAGGAAGAGGAGTGGATATGGAGGTGGAGCAGAGCGTGATGTTGGTTATGGAAGTGGTGG
AGGTGGAGGATATGGAAGCGGAGGCGGTGGTGGATATGGCCCAGGAGGAGAGCATGGTGTTGGCTATGGAAGTGGAGGAGGGGCTGGGTCTGGCAGCGGCGGCTATGGAG
GTTCCCGTGGATATGGACGGATATGGGAATGGTGGAGGAGCCGGCGGTGGCTATGGAGGGGGAGAGGGAGTAGGAGGTCATGGTGGTGGCGGTGGAATTGGCTCCGGTGG
TGGTTACGGCAGCGGTGGAGGCGTGCATGAAGGAGGGTATGAAAATAGCAAAGGAAGTGGAGAGAATGGAGGATATGATGGTGGATATGCACCTTAG
Protein sequenceShow/hide protein sequence
MAIAKAFSLGFLLLVGLGLASAARTLLSYDPPVHYPGVGQNPNYRDESYDAYGGGPGRGHGVGGSALVGSGYGSGGGGGSGSGYGGRGDHGVGYGSGGGSGYGGGVGSGL
GGSGYGSGGGGGSGVGYGGVEGHGDGYGSGGGGGYGSGYGSSLGGSGYGSGGGGGRGVDMEVEQSVMLVMEVVEVEDMEAEAVVDMAQEESMVLAMEVEEGLGLAAAAME
VPVDMDGYGNGGGAGGGYGGGEGVGGHGGGGGIGSGGGYGSGGGVHEGGYENSKGSGENGGYDGGYAP