; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012136 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012136
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153231:128170..128856
RNA-Seq ExpressionSgr012136
SyntenySgr012136
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018455.1 hypothetical protein SDJN02_20323, partial [Cucurbita argyrosperma subsp. argyrosperma]2.8e-8876.57Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC
        MEIKHKGKVHPSPSSS PSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARS++SSA TST DSRKKSA KASI GGN       HKTP FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE
        YTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKP+KN G+GKRR+R+ R QAT+K    V+   P  DECVV+P SPER+ AA        GS VK+ GE
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE

Query:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL
        SGP +EV GDHQKG    VLPDVLGFFNSRLWSLWSPNL
Subjt:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL

XP_022955886.1 uncharacterized protein LOC111457737 [Cucurbita moschata]3.3e-8976.99Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC
        MEIKHKGKVHPSPSSS PSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARS++SSA TST DSRKKSA KASINGGN       HKTP FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE
        YTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKP+KN G+GKRR+R+ R QAT+K    V+   P  DECVV+P SPER+ AA        GS VK+ GE
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE

Query:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL
        SGP +EV GDHQKG    VLPDVLGFFNSRLWSLWSPNL
Subjt:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL

XP_022979917.1 uncharacterized protein LOC111479467 [Cucurbita maxima]1.1e-8977.41Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC
        MEIKHKGKVHPSPSSS PSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARS++SSA TST DSRKKSA KASINGGN       HKTP FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE
        YTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKP+KN G+GKRR+R+GR QAT+K    V+  PP A ECVV+P SPER+ A         GS VK+ GE
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE

Query:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL
        SGP +EV GDHQKG    VLPDVLGFFNSRLWSLWSPNL
Subjt:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL

XP_023528171.1 uncharacterized protein LOC111791162 [Cucurbita pepo subsp. pepo]1.2e-8876.45Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC
        MEIKHKGKVHPSPSSS PSSSSSVFKLLPAAILALV+VLSLDEREVLAYMIARS++SSA TST DSRKKSA KASI+GGN       HKTP FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE
        YTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKP+KN G+GKRR+R+GR QAT+K    V+  PP ADECVV+P SPER+ AA        GS VK+ GE
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE

Query:  SGPVEEVTGDHQKG----VLPDVLGFFNSR---LWSLWSPNL
        SGP +EV GDHQKG    VLPDVLGFFNSR   LWSLWSPNL
Subjt:  SGPVEEVTGDHQKG----VLPDVLGFFNSR---LWSLWSPNL

XP_038904588.1 uncharacterized protein LOC120090946 [Benincasa hispida]5.6e-8171.14Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN------------HKTPTFSC
        MEIKHKGK+HPSPSSS PSSSSSVFKLLP AILALVS+LSLD+REVLAYMIARS++SSAFTST+ SRKKS  KASINGGN            HKTP FSC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN------------HKTPTFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRV
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT GEKP+KN GRGKRR+R+GR Q T K    ++  PP ADECV +P   ER            GS V
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRV

Query:  KDDGESGPVEEV--TGDHQKG----VLPDVLGFFNSRLWSLWSPNL
        K+  ESGPVEEV   GDH+KG    VLPDVLGF NSRLWSLWSPNL
Subjt:  KDDGESGPVEEV--TGDHQKG----VLPDVLGFFNSRLWSLWSPNL

TrEMBL top hitse value%identityAlignment
A0A0A0KTW2 Uncharacterized protein6.6e-7265.74Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTS-TQDSRKKSAGKASINGGN---------------HKTP
        MEIKHK K+HPSP    PSSSSSVFKLLPAAILAL S+LSLD+REVLAYMIARS++SSAFTS T+ SRKKS  K  IN GN               HKTP
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTS-TQDSRKKSAGKASINGGN---------------HKTP

Query:  TFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEA-DECVVLPSSPEREVAAVDGHGGG
         FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT GEKP+KN GRGKRR+R+GR  +T+ ++  V   P    DECV +P SP  E          
Subjt:  TFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEA-DECVVLPSSPEREVAAVDGHGGG

Query:  GGSRVKDDGESGPVEEVT--GDHQKG----VLPDVLGFFNSRLWSLWSPNL
         GS VK+  ESGPV E    G+HQKG    VLPDVLGFFNSRLWSLWSPNL
Subjt:  GGSRVKDDGESGPVEEVT--GDHQKG----VLPDVLGFFNSRLWSLWSPNL

A0A5A7TRT9 Uncharacterized protein1.1e-6964.03Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSA-FTSTQDSRKKSAGKASINGGN--------------HKTPT
        MEIKHK K+HPSP    PSSSSSVFKLLPAAILAL S+LSLD+REVLAYMIARS++SSA  TST+ SRKKS  K SIN GN              HKTP 
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSA-FTSTQDSRKKSAGKASINGGN--------------HKTPT

Query:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQA----TEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHG
        FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT GEKP+KN GRGKRR+R+GR  +      K    +      ADECV +  SP  E        
Subjt:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQA----TEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHG

Query:  GGGGSRVKDDGESGPVEEVT--GDHQKG----VLPDVLGFFNSRLWSLWSPNL
           GS VK+  E+GPV E    G+HQKG    VLPDVLGFFNSRLWSLWSPNL
Subjt:  GGGGSRVKDDGESGPVEEVT--GDHQKG----VLPDVLGFFNSRLWSLWSPNL

A0A6J1CVA9 uncharacterized protein LOC1110146985.1e-7266.26Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASING-------GNHKTPTFSCDCFYC
        MEIK KGKVHPSP   SPSSSSSVFKLLPAAILAL+SVLSL+EREVLAYMIARS++SSAF ST+DSRKKSA KASING       GNHKTP FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASING-------GNHKTPTFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADEC-VVLPSSPEREVAAVDGHGGGGGSRVKDDG
        YTAYWCRWDSSPNR+LIH AI+AFEDHL   EKP+ NA R KRR R+ R+QA +K    V+  P E D+C  VLP SPER+        GG GS   +DG
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADEC-VVLPSSPEREVAAVDGHGGGGGSRVKDDG

Query:  ESGPVEEVT----GDHQKG----VLPDVLGFFNSRLWSLWSPN
        ESG  E+V+     DHQKG    V+PDVL +F SR  SLW+PN
Subjt:  ESGPVEEVT----GDHQKG----VLPDVLGFFNSRLWSLWSPN

A0A6J1GV32 uncharacterized protein LOC1114577371.6e-8976.99Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC
        MEIKHKGKVHPSPSSS PSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARS++SSA TST DSRKKSA KASINGGN       HKTP FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE
        YTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKP+KN G+GKRR+R+ R QAT+K    V+   P  DECVV+P SPER+ AA        GS VK+ GE
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE

Query:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL
        SGP +EV GDHQKG    VLPDVLGFFNSRLWSLWSPNL
Subjt:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL

A0A6J1IQ09 uncharacterized protein LOC1114794675.4e-9077.41Show/hide
Query:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC
        MEIKHKGKVHPSPSSS PSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARS++SSA TST DSRKKSA KASINGGN       HKTP FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGN-------HKTPTFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE
        YTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKP+KN G+GKRR+R+GR QAT+K    V+  PP A ECVV+P SPER+ A         GS VK+ GE
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGE

Query:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL
        SGP +EV GDHQKG    VLPDVLGFFNSRLWSLWSPNL
Subjt:  SGPVEEVTGDHQKG----VLPDVLGFFNSRLWSLWSPNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein6.0e-2535.34Show/hide
Query:  EIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPTFSCDCFYCYTAYWCRW
        ++  KG VHPSP      S+  +  LLP AI +L +VLS ++REVLAY+I+ +  S     T    K  A K ++   NH +P F CDCF CYT+YW RW
Subjt:  EIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPTFSCDCFYCYTAYWCRW

Query:  DSSPNRELIHQAIEAFEDHLTTGEKPRKN-AGRGKRRERLGRHQ--------ATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDG
        DSSP+R+LIH+ I+AFED L   +  +KN  G+  RR+R G+          +T+      RL     + C    SS   +    DG G  GG    +  
Subjt:  DSSPNRELIHQAIEAFEDHLTTGEKPRKN-AGRGKRRERLGRHQ--------ATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDG

Query:  ESGPVEEVTGDHQKGVLPDVLGFFNSRLWSLW
         +G   E   + +KG +   + F   +++ +W
Subjt:  ESGPVEEVTGDHQKGVLPDVLGFFNSRLWSLW

AT1G24270.1 unknown protein1.3e-2747.4Show/hide
Query:  MEIKHKGKVHPSP-----SSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPTFSCDCFYCYT
        M++  KGKVHPSP     SSS+   S SVFKLL +AIL LVSVLS ++ EVLAY+I RS+ ++   S +  R            +HK P   C CF CYT
Subjt:  MEIKHKGKVHPSP-----SSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPTFSCDCFYCYT

Query:  AYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQ
        +YW +WDSS NRELI+Q IEAFEDHLT  E    +  + K ++R  + + +E+Q
Subjt:  AYWCRWDSSPNRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQ

AT1G62422.1 unknown protein3.3e-2334.25Show/hide
Query:  KGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPTFSCDCFYCYTAYWCRWDSSP
        KG VHPSP  +   +      LLP AIL+LV+ LS+++REVLAY+I+ S  S+  +  + +++           NH +P F CDCF CYT+YW RWD+SP
Subjt:  KGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPTFSCDCFYCYTAYWCRWDSSP

Query:  NRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGESGPVEEVTGDHQ
         R+LIH+ I+A+ED L   +K +       RR+R G+         T RL                 E+ +      GG S  + DG  G  E    + +
Subjt:  NRELIHQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGESGPVEEVTGDHQ

Query:  KGVLPDVLGFFNSRLWSLW
        KG +  V+ F   R   +W
Subjt:  KGVLPDVLGFFNSRLWSLW

AT5G13090.1 unknown protein1.7e-3542.44Show/hide
Query:  MEIKHKGKVHPSP-----SSSSPSSSS----------SVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPT
        M++K KGKV+PSP     SSSS SSS+          SV KLLPA IL LVSVLS +EREVLAY+I R    S       S+ K+  K++ +  NHK P 
Subjt:  MEIKHKGKVHPSP-----SSSSPSSSS----------SVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPT

Query:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEK---PRKNAGRGKRRERLGRH--QATEKQSATVRLRPPEADECVVLP------------S
        F C+CF CYT YW RWDSSPNRELIH+ IEAFE+H   GE+    R  + RGK++E+ GR    +  K +  V     +  + VV P            S
Subjt:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTTGEK---PRKNAGRGKRRERLGRH--QATEKQSATVRLRPPEADECVVLP------------S

Query:  SPEREVAAVDGHGGGGGS-RVKDDGES---------GPVEEVTGDH--QKGVLPDVLGFFNSRLWSLWSPN
        SP R   A    G       V++DG+               VTG     + VLPDVLG F+S  W LW+PN
Subjt:  SPEREVAAVDGHGGGGGS-RVKDDGES---------GPVEEVTGDH--QKGVLPDVLGFFNSRLWSLWSPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTAAGCACAAAGGCAAAGTACACCCGTCGCCATCATCCTCCTCGCCGTCGTCTTCCTCCTCCGTCTTCAAGCTACTTCCGGCGGCTATTCTGGCTCTGGTCTC
CGTCCTCTCTCTAGACGAACGGGAAGTCTTGGCTTACATGATCGCCAGGTCCATGAAATCCTCCGCATTCACTTCTACACAAGATTCCAGGAAGAAATCGGCCGGAAAAG
CTTCCATCAATGGCGGAAACCACAAAACTCCGACGTTCAGTTGCGATTGCTTCTACTGCTACACTGCCTACTGGTGCCGCTGGGACTCCTCCCCCAACCGCGAGCTCATC
CACCAAGCCATTGAAGCCTTCGAGGATCACTTGACGACCGGCGAGAAGCCGAGGAAGAACGCCGGCAGAGGCAAGAGACGAGAGAGACTCGGCCGTCATCAAGCTACCGA
GAAGCAGTCCGCAACCGTTCGACTTCGGCCGCCGGAGGCCGATGAGTGCGTCGTACTCCCGTCGTCGCCTGAGCGTGAAGTGGCGGCAGTGGATGGCCACGGCGGCGGCG
GAGGAAGTAGAGTGAAGGACGATGGCGAGAGCGGTCCGGTGGAGGAGGTGACCGGCGACCATCAGAAGGGTGTACTGCCGGACGTATTGGGGTTTTTCAATTCCCGTTTG
TGGAGTCTGTGGAGTCCGAATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATTAAGCACAAAGGCAAAGTACACCCGTCGCCATCATCCTCCTCGCCGTCGTCTTCCTCCTCCGTCTTCAAGCTACTTCCGGCGGCTATTCTGGCTCTGGTCTC
CGTCCTCTCTCTAGACGAACGGGAAGTCTTGGCTTACATGATCGCCAGGTCCATGAAATCCTCCGCATTCACTTCTACACAAGATTCCAGGAAGAAATCGGCCGGAAAAG
CTTCCATCAATGGCGGAAACCACAAAACTCCGACGTTCAGTTGCGATTGCTTCTACTGCTACACTGCCTACTGGTGCCGCTGGGACTCCTCCCCCAACCGCGAGCTCATC
CACCAAGCCATTGAAGCCTTCGAGGATCACTTGACGACCGGCGAGAAGCCGAGGAAGAACGCCGGCAGAGGCAAGAGACGAGAGAGACTCGGCCGTCATCAAGCTACCGA
GAAGCAGTCCGCAACCGTTCGACTTCGGCCGCCGGAGGCCGATGAGTGCGTCGTACTCCCGTCGTCGCCTGAGCGTGAAGTGGCGGCAGTGGATGGCCACGGCGGCGGCG
GAGGAAGTAGAGTGAAGGACGATGGCGAGAGCGGTCCGGTGGAGGAGGTGACCGGCGACCATCAGAAGGGTGTACTGCCGGACGTATTGGGGTTTTTCAATTCCCGTTTG
TGGAGTCTGTGGAGTCCGAATCTTTAA
Protein sequenceShow/hide protein sequence
MEIKHKGKVHPSPSSSSPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSMKSSAFTSTQDSRKKSAGKASINGGNHKTPTFSCDCFYCYTAYWCRWDSSPNRELI
HQAIEAFEDHLTTGEKPRKNAGRGKRRERLGRHQATEKQSATVRLRPPEADECVVLPSSPEREVAAVDGHGGGGGSRVKDDGESGPVEEVTGDHQKGVLPDVLGFFNSRL
WSLWSPNL