; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030184 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030184
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLEA_2 domain-containing protein
Genome locationtig00153574:1120992..1121714
RNA-Seq ExpressionSgr030184
SyntenySgr030184
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463664.1 PREDICTED: uncharacterized protein LOC103501757 [Cucumis melo]9.1e-5853.1Show/hide
Query:  PPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNN
        PPHP+ Q + HC                    +SPFLR FAAGM L++ +A ++Y++++L F P+LPA R+DSL L+NFS AAA    +  W+VGFS+NN
Subjt:  PPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNN

Query:  PNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLA
        PNKKL IS+ N++SSIYYK  I++QARI  F +R RN T L+ PF+A S  D SVLNDINGDLARG INFTV VLG   F+  +W+WRG   +V+CSDL+
Subjt:  PNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLA

Query:  VGISPPLSYSGGYGELVGGSRQCQVR
        VG S P S +G  G+LVGGS+QCQ++
Subjt:  VGISPPLSYSGGYGELVGGSRQCQVR

XP_022943707.1 uncharacterized protein At1g08160-like [Cucurbita moschata]1.6e-3845.19Show/hide
Query:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS
        + A P G   P + QPQ   +    PP   P  Y NP +         FLRA  AG+I++ I+  I+  + +L+ RP LP FRVDS  ++NFS  AA QS
Subjt:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS

Query:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTAL-ITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR
        LSALW VGFSV NPNKK+TISY  +E +++YK   LSQ R+  F   KR QTA+  T    N+  + S +N+IN D  RG + F V V  RV F  G WR
Subjt:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTAL-ITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR

Query:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV
         R  L RVLC DL+VG+S   S S   G+L+G  R C+V
Subjt:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV

XP_022957733.1 NDR1/HIN1-like protein 2 [Cucurbita moschata]8.0e-3844.81Show/hide
Query:  FTANPGGGRQPPHPQP--QPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAP
        + A P G   P + QP  Q YP+    PP   P  Y NP +   +     FLRA  AG+I+I I+  ++  + +L+ RP LP FRVDS  ++NFS  AA 
Subjt:  FTANPGGGRQPPHPQP--QPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAP

Query:  QSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGV
        QSLSA W VGFSV NPNKK++ISY  ++S+++YK +ILS+ R+  F   KR  T +   F + N+  DAS +N+IN D  RG + F V +  RV F  G 
Subjt:  QSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGV

Query:  WRWRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV
        WR R  L RVLC DL+VG+S   S S   G+L+G SR C+V
Subjt:  WRWRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV

XP_022995131.1 NDR1/HIN1-like protein 2 [Cucurbita maxima]1.6e-3844.77Show/hide
Query:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS
        + A P G   P + QP    +    PP   P  Y NP +         FLRA  AG+I+I I+  ++  + +L+ RP LP FRVDS  ++NFS  AA QS
Subjt:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS

Query:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR
        LSA W VGFSV NPNKK++ISY  ++S+++YK +ILS+ R+  F   KR  T +   F + N+  DAS +NDIN D  RG + F V +  RV F  G WR
Subjt:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR

Query:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV
         R  L RVLC DL+VG+S  LS S   G+L+G SR C+V
Subjt:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV

XP_023512211.1 uncharacterized protein At1g08160-like [Cucurbita pepo subsp. pepo]4.7e-3845.64Show/hide
Query:  FTANPGGGRQPPHPQP--QPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAP
        + A P G   P + QP  Q YP+    PP   P  Y NP +         FLRA  AG+I++ I+  I+  + +L+ RP LP FRVDS  ++NFS  AA 
Subjt:  FTANPGGGRQPPHPQP--QPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAP

Query:  QSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTAL-ITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGV
        QSLSALW VGFSV NPNKK+TISY  +E +++YK   LSQ R+  F   KR QTA+  T    N+  + S +N+IN D  RG + F V V  RV F  G 
Subjt:  QSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTAL-ITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGV

Query:  WRWRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV
        WR R  L RVLC DL+VG+S   S S   G+L+G  R C+V
Subjt:  WRWRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV

TrEMBL top hitse value%identityAlignment
A0A0A0LT76 LEA_2 domain-containing protein1.6e-5251.71Show/hide
Query:  ANPGGGR-QPPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSAL
        AN   GR  PPHP+ Q +PHC                     SPF+R FAAG+ L++ +  I+Y ++YLIFRP+L A R+DSL   NFS  AA  S    
Subjt:  ANPGGGR-QPPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSAL

Query:  WVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNL
        WVVGFS+NNPNKKL IS+ N+ESSIYYK  I++QAR   F +  RN T L++PF+A+   D SVLNDI+GDL RG I+FTV VLG    E GVWR  G  
Subjt:  WVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNL

Query:  CRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV
         RV+CSDL+V  S P   SG  G+LVGGSRQC +
Subjt:  CRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV

A0A1S3CJS5 uncharacterized protein LOC1035017574.4e-5853.1Show/hide
Query:  PPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNN
        PPHP+ Q + HC                    +SPFLR FAAGM L++ +A ++Y++++L F P+LPA R+DSL L+NFS AAA    +  W+VGFS+NN
Subjt:  PPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNN

Query:  PNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLA
        PNKKL IS+ N++SSIYYK  I++QARI  F +R RN T L+ PF+A S  D SVLNDINGDLARG INFTV VLG   F+  +W+WRG   +V+CSDL+
Subjt:  PNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLA

Query:  VGISPPLSYSGGYGELVGGSRQCQVR
        VG S P S +G  G+LVGGS+QCQ++
Subjt:  VGISPPLSYSGGYGELVGGSRQCQVR

A0A5D3E5N0 Protein YLS9 isoform X24.4e-5853.1Show/hide
Query:  PPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNN
        PPHP+ Q + HC                    +SPFLR FAAGM L++ +A ++Y++++L F P+LPA R+DSL L+NFS AAA    +  W+VGFS+NN
Subjt:  PPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNN

Query:  PNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLA
        PNKKL IS+ N++SSIYYK  I++QARI  F +R RN T L+ PF+A S  D SVLNDINGDLARG INFTV VLG   F+  +W+WRG   +V+CSDL+
Subjt:  PNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLA

Query:  VGISPPLSYSGGYGELVGGSRQCQVR
        VG S P S +G  G+LVGGS+QCQ++
Subjt:  VGISPPLSYSGGYGELVGGSRQCQVR

A0A6J1FTS7 uncharacterized protein At1g08160-like7.8e-3945.19Show/hide
Query:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS
        + A P G   P + QPQ   +    PP   P  Y NP +         FLRA  AG+I++ I+  I+  + +L+ RP LP FRVDS  ++NFS  AA QS
Subjt:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS

Query:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTAL-ITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR
        LSALW VGFSV NPNKK+TISY  +E +++YK   LSQ R+  F   KR QTA+  T    N+  + S +N+IN D  RG + F V V  RV F  G WR
Subjt:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTAL-ITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR

Query:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV
         R  L RVLC DL+VG+S   S S   G+L+G  R C+V
Subjt:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV

A0A6J1K154 NDR1/HIN1-like protein 27.8e-3944.77Show/hide
Query:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS
        + A P G   P + QP    +    PP   P  Y NP +         FLRA  AG+I+I I+  ++  + +L+ RP LP FRVDS  ++NFS  AA QS
Subjt:  FTANPGGGRQPPHPQPQPYPHCLQQPP---PPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQS

Query:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR
        LSA W VGFSV NPNKK++ISY  ++S+++YK +ILS+ R+  F   KR  T +   F + N+  DAS +NDIN D  RG + F V +  RV F  G WR
Subjt:  LSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWR

Query:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV
         R  L RVLC DL+VG+S  LS S   G+L+G SR C+V
Subjt:  WRGNLCRVLCSDLAVGISPPLSYSGGYGELVGGSRQCQV

SwissProt top hitse value%identityAlignment
Q9SJ52 NDR1/HIN1-like protein 107.6e-0726.4Show/hide
Query:  LRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKR
        L  F   +I ++++  +   + +LI RP    F V    L+ F   +    L     +   V NPNK++ + Y  IE+  YY+ +  S   +  F    +
Subjt:  LRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKR

Query:  NQTALITPFVANSPA--DASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLAVGISPPLSYSGG
        N T L   F   +    +A     +N +   G  N  +    RVRF+ G  ++R    +V C DL +    PLS S G
Subjt:  NQTALITPFVANSPA--DASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLAVGISPPLSYSGG

Arabidopsis top hitse value%identityAlignment
AT1G54540.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.8e-0427.27Show/hide
Query:  ETFTANPGGGRQPPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSL
        +T T  P  G+    P  +P        PPP  P+ +    N+    F    +  +I ++ +AI V AV Y +F P LP++ V+SL ++N  +     SL
Subjt:  ETFTANPGGGRQPPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSL

Query:  SALWVVGFSVNNPNKKLTISY---GNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENG
        SA + V  +  NPN+K+ I Y   G+I   ++Y    L +  I  F    RN T L       +    +VL  +      G +   + V   V  + G
Subjt:  SALWVVGFSVNNPNKKLTISY---GNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENG

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.4e-0826.4Show/hide
Query:  LRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKR
        L  F   +I ++++  +   + +LI RP    F V    L+ F   +    L     +   V NPNK++ + Y  IE+  YY+ +  S   +  F    +
Subjt:  LRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKR

Query:  NQTALITPFVANSPA--DASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLAVGISPPLSYSGG
        N T L   F   +    +A     +N +   G  N  +    RVRF+ G  ++R    +V C DL +    PLS S G
Subjt:  NQTALITPFVANSPA--DASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLAVGISPPLSYSGG

AT3G52460.1 hydroxyproline-rich glycoprotein family protein7.5e-1025.55Show/hide
Query:  GGRQPPHPQP-------QPYPHCLQQPPPPF----YPNPSNP-RTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAA
        G  QPP P P       Q YP+  Q PP  +    YP   NP      +S F+R    G+I++V++  I   + +L+ RP +P F V++  +SNF+V   
Subjt:  GGRQPPHPQP-------QPYPHCLQQPPPPF----YPNPSNP-RTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAA

Query:  PQSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRI-----LSQARIHHFNVRKRNQTALITPFVANSPADAS----VLNDINGDLARGFINFTVTVL
            SA W    ++ N N KL   +  I+  +Y++  +     L+ A      V  +    +     A           V++++  +   G + F++ + 
Subjt:  PQSLSALWVVGFSVNNPNKKLTISYGNIESSIYYKTRI-----LSQARIHHFNVRKRNQTALITPFVANSPADAS----VLNDINGDLARGFINFTVTVL

Query:  GRVRFENGVWRWRGNLCRVLCSDLAVG
          V F+   W  R +  +V C  L VG
Subjt:  GRVRFENGVWRWRGNLCRVLCSDLAVG

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.4e-0923.68Show/hide
Query:  PFYPNPSNPRTNLLTSPFLRAFAAGMIL-IVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNNPNKKLTISYGNIESSIY
        P   +P+ P    L  P L  +   +IL ++ +A + + + +L  +P    + V++  + NF++      +SA +      +NPN ++++ Y ++E  + 
Subjt:  PFYPNPSNPRTNLLTSPFLRAFAAGMIL-IVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGFSVNNPNKKLTISYGNIESSIY

Query:  YKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLAVGISPP
        +K + L+   +  F+  + N   +    +A N     S   D+    + G I F V V  RVRF+ G+W+      ++ CS + V +S P
Subjt:  YKTRILSQARIHHFNVRKRNQTALITPFVA-NSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLAVGISPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGACCTTCACTGCCAACCCCGGCGGCGGCCGGCAACCGCCTCATCCTCAACCCCAGCCCTATCCACACTGCCTTCAACAGCCGCCGCCGCCTTTTTACCCTAA
CCCCTCCAATCCACGTACTAATCTCCTCACCTCTCCTTTCCTCCGAGCTTTCGCCGCCGGCATGATTCTAATCGTCATCGTCGCCATCATCGTGTACGCCGTCGAATACC
TCATCTTTCGCCCCGTCCTCCCCGCATTCCGAGTCGACTCGCTTCATCTGTCCAACTTCTCCGTCGCCGCAGCCCCCCAGTCTCTCTCCGCCCTGTGGGTCGTCGGATTT
TCCGTCAACAACCCGAACAAGAAGCTGACGATCTCCTACGGCAACATCGAGTCGTCGATTTACTACAAAACGAGGATCCTCTCTCAGGCCCGGATTCACCATTTCAACGT
CCGGAAAAGGAACCAGACGGCGTTGATCACCCCCTTCGTCGCCAACTCGCCCGCTGACGCGTCGGTGTTGAACGACATTAATGGAGACTTGGCGCGCGGATTCATCAACT
TCACTGTGACGGTTCTCGGCCGTGTCAGGTTCGAGAACGGCGTGTGGAGGTGGAGAGGAAACTTGTGTAGGGTTTTATGCAGCGACCTGGCCGTCGGAATCTCGCCGCCG
TTGAGTTACAGCGGCGGGTACGGCGAGCTGGTGGGCGGCTCGAGGCAATGCCAAGTTCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAGACCTTCACTGCCAACCCCGGCGGCGGCCGGCAACCGCCTCATCCTCAACCCCAGCCCTATCCACACTGCCTTCAACAGCCGCCGCCGCCTTTTTACCCTAA
CCCCTCCAATCCACGTACTAATCTCCTCACCTCTCCTTTCCTCCGAGCTTTCGCCGCCGGCATGATTCTAATCGTCATCGTCGCCATCATCGTGTACGCCGTCGAATACC
TCATCTTTCGCCCCGTCCTCCCCGCATTCCGAGTCGACTCGCTTCATCTGTCCAACTTCTCCGTCGCCGCAGCCCCCCAGTCTCTCTCCGCCCTGTGGGTCGTCGGATTT
TCCGTCAACAACCCGAACAAGAAGCTGACGATCTCCTACGGCAACATCGAGTCGTCGATTTACTACAAAACGAGGATCCTCTCTCAGGCCCGGATTCACCATTTCAACGT
CCGGAAAAGGAACCAGACGGCGTTGATCACCCCCTTCGTCGCCAACTCGCCCGCTGACGCGTCGGTGTTGAACGACATTAATGGAGACTTGGCGCGCGGATTCATCAACT
TCACTGTGACGGTTCTCGGCCGTGTCAGGTTCGAGAACGGCGTGTGGAGGTGGAGAGGAAACTTGTGTAGGGTTTTATGCAGCGACCTGGCCGTCGGAATCTCGCCGCCG
TTGAGTTACAGCGGCGGGTACGGCGAGCTGGTGGGCGGCTCGAGGCAATGCCAAGTTCGGTGA
Protein sequenceShow/hide protein sequence
MAETFTANPGGGRQPPHPQPQPYPHCLQQPPPPFYPNPSNPRTNLLTSPFLRAFAAGMILIVIVAIIVYAVEYLIFRPVLPAFRVDSLHLSNFSVAAAPQSLSALWVVGF
SVNNPNKKLTISYGNIESSIYYKTRILSQARIHHFNVRKRNQTALITPFVANSPADASVLNDINGDLARGFINFTVTVLGRVRFENGVWRWRGNLCRVLCSDLAVGISPP
LSYSGGYGELVGGSRQCQVR