; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014333 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014333
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 2170 .
Genome locationtig00000289:547132..548115
RNA-Seq ExpressionSgr014333
SyntenySgr014333
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032834.1 hypothetical protein SDJN02_06884, partial [Cucurbita argyrosperma subsp. argyrosperma]5.0e-7563.82Show/hide
Query:  TTNKPFHFKYPLSLSHSNFFLHIFPENLNKKKKKKRTKAMGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDSAV-DEDSQAESGVCSPTLWG
        T NK FHF   L    SN F   FP             AMGD   K PR PI ++              ED I+FRG  SAV D+DSQ+ESGVCSPTLWG
Subjt:  TTNKPFHFKYPLSLSHSNFFLHIFPENLNKKKKKKRTKAMGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDSAV-DEDSQAESGVCSPTLWG

Query:  SNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPSKKRTENRALVTRSRS
        S+S  SP QF+R RNR+LSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHH   L +QE ++VE+ DS+SETSF RDP KKR+E RALVTRSRS
Subjt:  SNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPSKKRTENRALVTRSRS

Query:  VDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLNGSSRVSPKPPVGDGSAKGVDRDWWRKRSS-VAGESEGSVSGGSMSGSSNSTSSDRSNSR
        V+SGGFYLKMFFP+  G ++TKKK N R+DS LNGSSRVSPKPP        VDRDWWRKRSS  +GE+ GSVSG S S +SNSTSS+RSNSR
Subjt:  VDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLNGSSRVSPKPPVGDGSAKGVDRDWWRKRSS-VAGESEGSVSGGSMSGSSNSTSSDRSNSR

TYK23785.1 uncharacterized protein E5676_scaffold1607G001120 [Cucumis melo var. makuwa]1.7e-8372.41Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM
        MGD  TK P KPI    P    YYK+ E + ED I+FRGWDS  A+D+DSQ+ESGV SPTLW SNS T+P QF+R RNRSLSPTSRTQAIARGQQELMEM
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM

Query:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN
        VRNMPESSYELSLKDLVEHH+ N +RQ++  V    RDDSSSETSFRRDPSK R E RALVTRSRSVDSGGFYLKMFFPL FG ++ KKK N R DSGL+
Subjt:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN

Query:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR
        GSSRVSPKPP        VD+DWWRKRSSV+ GE++GS+SGGSM  SGSSNSTSS+RSNSR
Subjt:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR

XP_004144253.1 uncharacterized protein LOC101219576 [Cucumis sativus]9.4e-8270.88Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM
        MGD  TKLP KP          YYK+ E + ED I+FRGWDS  A+D+DSQ+ESGV SPTLW SNS T+P   +R RNRSLSPTSRTQAIARGQQELMEM
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM

Query:  VRNMPESSYELSLKDLVEHHVGNLERQEE---STVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN
        VRNMPESSYELSLKDLVEHH+ N +RQ++   +++ RDDSSSETSFRRDPSK R E RALVTRSRSVDSGGFYLKMFFPL FG ++ KKK N R DSGL+
Subjt:  VRNMPESSYELSLKDLVEHHVGNLERQEE---STVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN

Query:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR
        GSSRVSPKPP        VD+DWWRKRSSV+ GE++GS+SGGSM  SGSSNSTSS+RSNSR
Subjt:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR

XP_008441584.1 PREDICTED: uncharacterized protein LOC103485667 [Cucumis melo]1.7e-8372.41Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM
        MGD  TK P KPI    P    YYK+ E + ED I+FRGWDS  A+D+DSQ+ESGV SPTLW SNS T+P QF+R RNRSLSPTSRTQAIARGQQELMEM
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM

Query:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN
        VRNMPESSYELSLKDLVEHH+ N +RQ++  V    RDDSSSETSFRRDPSK R E RALVTRSRSVDSGGFYLKMFFPL FG ++ KKK N R DSGL+
Subjt:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN

Query:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR
        GSSRVSPKPP        VD+DWWRKRSSV+ GE++GS+SGGSM  SGSSNSTSS+RSNSR
Subjt:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR

XP_038885358.1 uncharacterized protein LOC120075766 [Benincasa hispida]4.2e-8270.23Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMV
        MGD RTK PRK   + D     YYK+ +  ED I+FRGWDS  A+D+DSQ+ESGV SPTLWGSNS TSP QF+R RNRSLSPTSR QAIARGQQELMEMV
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMV

Query:  RNMPESSYELSLKDLVEHHVGNLERQEE-----STVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGL
        RNMPESSYELSLKDLVEHH+ N +RQ++     +++ RDDSSSETSFRRD SK R+E R LVTRSRSVDSGGFYLKMF PL FG ++ KKK N R DSGL
Subjt:  RNMPESSYELSLKDLVEHHVGNLERQEE-----STVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGL

Query:  NGSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR
        +G SRVSPKPP        V++DWWRKRS+VA GE+EGS+SGGSM  SGSSNSTSS+RSNSR
Subjt:  NGSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR

TrEMBL top hitse value%identityAlignment
A0A0A0KCW4 Uncharacterized protein4.6e-8270.88Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM
        MGD  TKLP KP          YYK+ E + ED I+FRGWDS  A+D+DSQ+ESGV SPTLW SNS T+P   +R RNRSLSPTSRTQAIARGQQELMEM
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM

Query:  VRNMPESSYELSLKDLVEHHVGNLERQEE---STVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN
        VRNMPESSYELSLKDLVEHH+ N +RQ++   +++ RDDSSSETSFRRDPSK R E RALVTRSRSVDSGGFYLKMFFPL FG ++ KKK N R DSGL+
Subjt:  VRNMPESSYELSLKDLVEHHVGNLERQEE---STVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN

Query:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR
        GSSRVSPKPP        VD+DWWRKRSSV+ GE++GS+SGGSM  SGSSNSTSS+RSNSR
Subjt:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR

A0A1S3B3A7 uncharacterized protein LOC1034856678.3e-8472.41Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM
        MGD  TK P KPI    P    YYK+ E + ED I+FRGWDS  A+D+DSQ+ESGV SPTLW SNS T+P QF+R RNRSLSPTSRTQAIARGQQELMEM
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM

Query:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN
        VRNMPESSYELSLKDLVEHH+ N +RQ++  V    RDDSSSETSFRRDPSK R E RALVTRSRSVDSGGFYLKMFFPL FG ++ KKK N R DSGL+
Subjt:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN

Query:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR
        GSSRVSPKPP        VD+DWWRKRSSV+ GE++GS+SGGSM  SGSSNSTSS+RSNSR
Subjt:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR

A0A5D3DJL8 Uncharacterized protein8.3e-8472.41Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM
        MGD  TK P KPI    P    YYK+ E + ED I+FRGWDS  A+D+DSQ+ESGV SPTLW SNS T+P QF+R RNRSLSPTSRTQAIARGQQELMEM
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARE-SPEDGIVFRGWDS--AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEM

Query:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN
        VRNMPESSYELSLKDLVEHH+ N +RQ++  V    RDDSSSETSFRRDPSK R E RALVTRSRSVDSGGFYLKMFFPL FG ++ KKK N R DSGL+
Subjt:  VRNMPESSYELSLKDLVEHHVGNLERQEESTV---ERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLN

Query:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR
        GSSRVSPKPP        VD+DWWRKRSSV+ GE++GS+SGGSM  SGSSNSTSS+RSNSR
Subjt:  GSSRVSPKPPVGDGSAKGVDRDWWRKRSSVA-GESEGSVSGGSM--SGSSNSTSSDRSNSR

A0A6J1HEJ4 uncharacterized protein LOC1114621656.0e-7468.11Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDSAV-DEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVR
        MGD   K PR PI ++              ED I+FRG DSAV D+DSQ+ESGVCSPTLWGS+S  +P QF+R RNR+LSPTSRTQAIARGQQELMEMVR
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDSAV-DEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVR

Query:  NMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLNGSSRV
        NMPESSYELSLKDLVEHH   L +QE +++E+ DS+SETSF RDP KKR+E RALVTRSRSV+SGGFYLKMFFP+  G ++TKKK N R+DS LNGSSRV
Subjt:  NMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLNGSSRV

Query:  SPKPPVGDGSAKGVDRDWWRKRSS-VAGESEGSVSGGSMSGSSNSTSSDRSNSR
        SPKPP        VDRDWWRKRSS  +GE+ GSVSG S S +SNSTSS+RSNSR
Subjt:  SPKPPVGDGSAKGVDRDWWRKRSS-VAGESEGSVSGGSMSGSSNSTSSDRSNSR

A0A6J1JND4 uncharacterized protein LOC1114868787.8e-7468.5Show/hide
Query:  MGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDSAV-DEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVR
        MGD   K PR PI ++              ED I+FRG DSAV D+DSQ+ESGVCSPTLWGS+S  SP QF+R RNR+LSPTSRTQAIARGQQELMEMVR
Subjt:  MGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDSAV-DEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVR

Query:  NMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLNGSSRV
        NMPESSYELSLKDLVEHH   L +QE ++VE+ DS+SETSF RDP KKR+E RALVTRSRSV+SGGFYLKMFFP+  G ++TKKK N R+DS LNG SRV
Subjt:  NMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLNGSSRV

Query:  SPKPPVGDGSAKGVDRDWWRKRSS-VAGESEGSVSGGSMSGSSNSTSSDRSNSR
        SPKPP        VDRDWWRKRSS  +GE+ GSVSG S S +SNSTSS+RSNSR
Subjt:  SPKPPVGDGSAKGVDRDWWRKRSS-VAGESEGSVSGGSMSGSSNSTSSDRSNSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21390.1 embryo defective 21708.3e-2839.82Show/hide
Query:  AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNR-SLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSF
        ++  D  ++SGVCSPTLW ++ P SP  F+R  +  SLSP S+ QAIARGQ+ELMEMV  MPES YELSLKDLVE  V N E + +   E    ++  S 
Subjt:  AVDEDSQAESGVCSPTLWGSNSPTSPGQFYRLRNR-SLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSF

Query:  RRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPM--ATKKKRNSRNDSGLNGSSRVSPKPPVGDGSAKGVDRDWWRKRSSVAGESEGSVSGGSMS
         +   K +++ R    RS   ++ GF LK+ F +S G M   TKKK+  + D     + +VSP+P + + + K  D++WW + S  + +  GS    S +
Subjt:  RRDPSKKRTENRALVTRSRSVDSGGFYLKMFFPLSFGPM--ATKKKRNSRNDSGLNGSSRVSPKPPVGDGSAKGVDRDWWRKRSSVAGESEGSVSGGSMS

Query:  GSSNSTSSDRSNSRYHFSFLQCHRVL
         S  S SS R      FSFL   +++
Subjt:  GSSNSTSSDRSNSRYHFSFLQCHRVL

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1)3.0e-2539.55Show/hide
Query:  DSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPS
        D  ++SGVCSP LW ++ P SP   +    ++LSP ++ Q IARGQ+ELM+MV  MPES YELSLKDLVE +       EE  V  +    E   R+   
Subjt:  DSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPS

Query:  KKRTENRALVTRSRSVDSGGFYLKMFFPLSFG--PMATKKKRNSRNDSGLNG--SSRVSPKPPVGDGSAKGVDRDWWRKRSSVAGESEGSV----SGGSM
        K +++      R+  V++ GF LK+ FP+S G      KKK N  +DS +    S   SP+P + D S K  D+DWW+   S +  S+  V    SG S 
Subjt:  KKRTENRALVTRSRSVDSGGFYLKMFFPLSFG--PMATKKKRNSRNDSGLNG--SSRVSPKPPVGDGSAKGVDRDWWRKRSSVAGESEGSV----SGGSM

Query:  S--GSSNSTSSDRSNSRYHF
        S  GSS+ ++SDRS +   +
Subjt:  S--GSSNSTSSDRSNSRYHF

AT1G76980.2 FUNCTIONS IN: molecular_function unknown3.0e-2540.76Show/hide
Query:  DSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPS
        D  ++SGVCSP LW ++ P SP   +    ++LSP ++ Q IARGQ+ELM+MV  MPES YELSLKDLVE +       EE  V  +    E   R+   
Subjt:  DSQAESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPS

Query:  KKRTENRALVTRSRSVDSGGFYLKMFFPLSFG--PMATKKKRNSRNDSGLNG--SSRVSPKPPVGDGSAKGVDRDWWRKRSSVAGESEGSVSG-GSMSGS
        K +++      R+  V++ GF LK+ FP+S G      KKK N  +DS +    S   SP+P + D S K  D+DWW+   S +  S+  VS   S S  
Subjt:  KKRTENRALVTRSRSVDSGGFYLKMFFPLSFG--PMATKKKRNSRNDSGLNG--SSRVSPKPPVGDGSAKGVDRDWWRKRSSVAGESEGSVSG-GSMSGS

Query:  SNSTSSDRSNS
        S+  SS RSNS
Subjt:  SNSTSSDRSNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAATTATCTCCTCTCCCCCACCTCTCCCCCTATACCCACCCACCCACCGCTGGCCTCCTCCCACCACAAACAAACCATTTCATTTCAAGTATCCTCTCTCTCTCTC
TCACTCAAATTTCTTCCTCCATATTTTCCCAGAAAATCTAAACAAAAAAAAAAAAAAAAAAAGAACCAAAGCCATGGGTGATCCTAGGACGAAACTACCAAGAAAACCTA
TCCGTAGCTCAGATCCTGTCGCACATGGTTACTATAAAGCTCGTGAATCCCCCGAAGATGGAATCGTTTTCAGGGGCTGGGACAGCGCGGTGGATGAAGACTCTCAAGCT
GAATCGGGAGTCTGTTCTCCCACGCTCTGGGGTTCCAATTCTCCCACCAGCCCCGGCCAATTTTACCGCCTGCGTAATCGCAGCCTCTCCCCAACTTCCCGGACCCAAGC
CATAGCCAGAGGCCAGCAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAGTCCTCCTACGAGCTTTCTTTGAAAGATCTCGTCGAGCACCACGTCGGTAATCTCGAGC
GGCAGGAGGAGAGCACCGTCGAGAGAGACGATTCCAGCTCTGAAACTTCCTTCCGAAGGGACCCCAGCAAGAAAAGGACTGAAAACAGAGCGCTCGTCACCAGAAGTAGA
AGCGTCGACAGCGGCGGTTTTTACCTCAAAATGTTCTTCCCACTGTCTTTCGGGCCGATGGCGACCAAAAAGAAGAGGAACAGCAGAAACGACTCTGGGTTGAATGGGAG
TTCAAGAGTGTCTCCTAAGCCGCCGGTGGGGGATGGATCTGCAAAGGGCGTAGACAGAGACTGGTGGAGGAAGAGATCCTCGGTGGCCGGCGAGAGCGAGGGCAGCGTTT
CCGGCGGAAGCATGAGCGGTAGTAGCAATAGCACCAGCAGCGATAGAAGCAACAGCAGGTACCATTTTTCATTCCTTCAATGCCATCGAGTTCTTGAATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAATTATCTCCTCTCCCCCACCTCTCCCCCTATACCCACCCACCCACCGCTGGCCTCCTCCCACCACAAACAAACCATTTCATTTCAAGTATCCTCTCTCTCTCTC
TCACTCAAATTTCTTCCTCCATATTTTCCCAGAAAATCTAAACAAAAAAAAAAAAAAAAAAAGAACCAAAGCCATGGGTGATCCTAGGACGAAACTACCAAGAAAACCTA
TCCGTAGCTCAGATCCTGTCGCACATGGTTACTATAAAGCTCGTGAATCCCCCGAAGATGGAATCGTTTTCAGGGGCTGGGACAGCGCGGTGGATGAAGACTCTCAAGCT
GAATCGGGAGTCTGTTCTCCCACGCTCTGGGGTTCCAATTCTCCCACCAGCCCCGGCCAATTTTACCGCCTGCGTAATCGCAGCCTCTCCCCAACTTCCCGGACCCAAGC
CATAGCCAGAGGCCAGCAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAGTCCTCCTACGAGCTTTCTTTGAAAGATCTCGTCGAGCACCACGTCGGTAATCTCGAGC
GGCAGGAGGAGAGCACCGTCGAGAGAGACGATTCCAGCTCTGAAACTTCCTTCCGAAGGGACCCCAGCAAGAAAAGGACTGAAAACAGAGCGCTCGTCACCAGAAGTAGA
AGCGTCGACAGCGGCGGTTTTTACCTCAAAATGTTCTTCCCACTGTCTTTCGGGCCGATGGCGACCAAAAAGAAGAGGAACAGCAGAAACGACTCTGGGTTGAATGGGAG
TTCAAGAGTGTCTCCTAAGCCGCCGGTGGGGGATGGATCTGCAAAGGGCGTAGACAGAGACTGGTGGAGGAAGAGATCCTCGGTGGCCGGCGAGAGCGAGGGCAGCGTTT
CCGGCGGAAGCATGAGCGGTAGTAGCAATAGCACCAGCAGCGATAGAAGCAACAGCAGGTACCATTTTTCATTCCTTCAATGCCATCGAGTTCTTGAATCTTGA
Protein sequenceShow/hide protein sequence
MTIISSPPPLPLYPPTHRWPPPTTNKPFHFKYPLSLSHSNFFLHIFPENLNKKKKKKRTKAMGDPRTKLPRKPIRSSDPVAHGYYKARESPEDGIVFRGWDSAVDEDSQA
ESGVCSPTLWGSNSPTSPGQFYRLRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHVGNLERQEESTVERDDSSSETSFRRDPSKKRTENRALVTRSR
SVDSGGFYLKMFFPLSFGPMATKKKRNSRNDSGLNGSSRVSPKPPVGDGSAKGVDRDWWRKRSSVAGESEGSVSGGSMSGSSNSTSSDRSNSRYHFSFLQCHRVLES