; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019777 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019777
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein SOGA3-like
Genome locationtig00153414:194343..197007
RNA-Seq ExpressionSgr019777
SyntenySgr019777
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058663.1 protein SOGA3-like [Cucumis melo var. makuwa]5.5e-14287.46Show/hide
Query:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK
        NDMDHTQILMN NFVVWT    PP LPPVLP ENRGMF Q SQRSLST  T+  GR FRKGKRPDMRRKD   T YK SNWNELQY NRLKSRRFYPKKK
Subjt:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK

Query:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDV
        SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYED+D    EDEVG SGDSDV
Subjt:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDV

Query:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE
        EGHLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D  I+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR HL DNNKEETAASENAF+
Subjt:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE

Query:  NGG
        NGG
Subjt:  NGG

KAE8646411.1 hypothetical protein Csa_016850 [Cucumis sativus]7.2e-14287.13Show/hide
Query:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK
        NDMDHTQILMN NFVVWT    PP LPPVLP ENRG FLQ SQRSLST  T+  GR FRKGKRPDMRRKD    +YK SNWNELQY NRLKSRRFYPKKK
Subjt:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK

Query:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDV
        SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYED+D    EDEVG SGDSDV
Subjt:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDV

Query:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE
        EGHLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D  I+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR HL+DNNKEETAASENAF+
Subjt:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE

Query:  NGG
        NGG
Subjt:  NGG

XP_004136108.1 uncharacterized protein LOC101208199 [Cucumis sativus]1.8e-14087.04Show/hide
Query:  MDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSN
        MDHTQILMN NFVVWT    PP LPPVLP ENRG FLQ SQRSLST  T+  GR FRKGKRPDMRRKD    +YK SNWNELQY NRLKSRRFYPKKKSN
Subjt:  MDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSN

Query:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDVEG
        YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYED+D    EDEVG SGDSDVEG
Subjt:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDVEG

Query:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG
        HLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D  I+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR HL+DNNKEETAASENAF+NG
Subjt:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG

Query:  G
        G
Subjt:  G

XP_022144844.1 uncharacterized protein LOC111014426 [Momordica charantia]7.9e-14992.36Show/hide
Query:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK
        NDMD  QILMN+NFVVWT QP     PP+LP ENRGMFLQGSQRSLS+A T NQGRGFRKGKRPD+RRKDS+ T YKPSNWNELQYPNRLKSRRFYPKKK
Subjt:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK

Query:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED-DDEDEVGGSGDSDVEGH
        SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED D+EDEVGGSGDSDVEGH
Subjt:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED-DDEDEVGGSGDSDVEGH

Query:  LEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVD-NNKEETAASENAFENG
        LEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDS IAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLH VD NNKEETAASENAFENG
Subjt:  LEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVD-NNKEETAASENAFENG

Query:  G
        G
Subjt:  G

XP_038899694.1 uncharacterized protein LOC120086952 [Benincasa hispida]8.5e-14387.79Show/hide
Query:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLP-AENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKK
        NDMDHTQIL+N NFVVWT    PP LPPVLP  ENR MFLQ SQRSLST  T+N GRGFRKGKRPDMRRKD   TAYKPSNWNELQY NRLKSRRFYPKK
Subjt:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLP-AENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKK

Query:  KSNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED---DDEDEVGGSGDSDV
        KSNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYED   ++EDEVG SGDSDV
Subjt:  KSNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED---DDEDEVGGSGDSDV

Query:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE
        EGHLEVERRLDHDLSRFEMICPTSG EEQSTLLESRVDD+D  I+QLEEENLTLKERVFFMERELEDLR R+QCLETEGWRL L+DNNKEETAASENAF+
Subjt:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE

Query:  NGG
        NGG
Subjt:  NGG

TrEMBL top hitse value%identityAlignment
A0A0A0K9V7 Uncharacterized protein8.6e-14187.04Show/hide
Query:  MDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSN
        MDHTQILMN NFVVWT    PP LPPVLP ENRG FLQ SQRSLST  T+  GR FRKGKRPDMRRKD    +YK SNWNELQY NRLKSRRFYPKKKSN
Subjt:  MDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSN

Query:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDVEG
        YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYED+D    EDEVG SGDSDVEG
Subjt:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDVEG

Query:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG
        HLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D  I+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR HL+DNNKEETAASENAF+NG
Subjt:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG

Query:  G
        G
Subjt:  G

A0A5D3CGI4 Protein SOGA3-like2.7e-14287.46Show/hide
Query:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK
        NDMDHTQILMN NFVVWT    PP LPPVLP ENRGMF Q SQRSLST  T+  GR FRKGKRPDMRRKD   T YK SNWNELQY NRLKSRRFYPKKK
Subjt:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK

Query:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDV
        SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYED+D    EDEVG SGDSDV
Subjt:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDD----EDEVGGSGDSDV

Query:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE
        EGHLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D  I+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR HL DNNKEETAASENAF+
Subjt:  EGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFE

Query:  NGG
        NGG
Subjt:  NGG

A0A6J1CTG2 uncharacterized protein LOC1110144263.8e-14992.36Show/hide
Query:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK
        NDMD  QILMN+NFVVWT QP     PP+LP ENRGMFLQGSQRSLS+A T NQGRGFRKGKRPD+RRKDS+ T YKPSNWNELQYPNRLKSRRFYPKKK
Subjt:  NDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKK

Query:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED-DDEDEVGGSGDSDVEGH
        SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED D+EDEVGGSGDSDVEGH
Subjt:  SNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED-DDEDEVGGSGDSDVEGH

Query:  LEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVD-NNKEETAASENAFENG
        LEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDS IAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLH VD NNKEETAASENAFENG
Subjt:  LEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVD-NNKEETAASENAFENG

Query:  G
        G
Subjt:  G

A0A6J1H988 uncharacterized protein LOC1114612445.2e-12282.35Show/hide
Query:  MNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFA
        MNQN VVW     PP LPPVLP ENRGMFL GSQR LSTA T+NQGR  RKG+  DM+RK SI +AYKPSNWNELQY NRLKSRRF+PKKK NYRFPPFA
Subjt:  MNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFA

Query:  PRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDDEDEVGGSGDSDVEGHLEVERRLDHD
        PRNTTSFLIRAKRSGGIASLVSP PVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS    ED++E E GGSGDSDVEGHLEVERRLDHD
Subjt:  PRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDDEDEVGGSGDSDVEGHLEVERRLDHD

Query:  LSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG
        LSRFEMICP SGGEEQS++LESRVDDQDS IAQL+EENLTLKERVFFMERELE+LRRRVQ LETEG  +   +NNK+ETAASENA +NG
Subjt:  LSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG

A0A6J1KRB8 uncharacterized protein LOC1114979718.0e-12382.7Show/hide
Query:  MNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFA
        MNQN VVW     PP LPPVLP ENRGMFL GSQR LSTA T+NQGR  RKG+  DMRRK SI +AYKPSNWNELQY NRLKSRRF+PKKK NYRFPPFA
Subjt:  MNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFA

Query:  PRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDDEDEVGGSGDSDVEGHLEVERRLDHD
        PRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS    ED++E E GGSGDSDVEGHLEVERRLDHD
Subjt:  PRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDDEDEVGGSGDSDVEGHLEVERRLDHD

Query:  LSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG
        LSRFEMICP SGGEEQS++LESRVDDQDS IAQL+EENLTLKERVFFMERELE+LRRRVQ LETEG  +   +NNK+ETA SENA +NG
Subjt:  LSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G19900.1 PRLI-interacting factor, putative3.0e-7767.1Show/hide
Query:  YKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLR
        YKP   NELQ  NRLK+R+FYPKKK   R+ P+APRNTTSF+IRAK+SGGIA LVSP PVTPAVLPTP+FSP REVL DMAKEEWGVDGYGSMKGLIRLR
Subjt:  YKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLR

Query:  SS----KDYEDDDEDEVGGSGDSDVEGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCL
        +     + YE+DDEDE GGS +SDVE H+EVERRLDHDLSRFEMI P  GG E + +LE+RVDDQDS IAQLEEENLTLKER+F MEREL D+RRR+Q L
Subjt:  SS----KDYEDDDEDEVGGSGDSDVEGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCL

Query:  ETEGWRLHLVDNNKEETAASENAFENGGQPA
        E         +    E  +  +  + GG  A
Subjt:  ETEGWRLHLVDNNKEETAASENAFENGGQPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAATGATATGGATCATACTCAGATCTTGATGAATCAAAACTTTGTCGTTTGGACGTCTCAGCCTCCGCCTCCGCCTCTGCCGCCAGTTCTACCGGCGGAGAACCG
AGGTATGTTTCTTCAAGGCTCTCAAAGGTCACTGTCCACTGCTCTTACTGCCAATCAAGGTCGAGGTTTCAGGAAGGGGAAAAGGCCTGACATGAGGCGAAAAGACTCGA
TTCCAACAGCGTATAAACCCTCGAATTGGAACGAATTGCAGTACCCGAACCGCTTGAAATCCAGGCGTTTTTATCCCAAGAAGAAGTCCAACTACCGTTTTCCTCCCTTC
GCGCCCCGTAATACCACTTCCTTCTTAATTCGCGCCAAAAGATCCGGTGGTATTGCCTCGCTGGTGTCGCCGTATCCGGTCACCCCTGCGGTGTTGCCGACTCCCATATT
TTCGCCTCTGAGGGAAGTTCTGGTTGATATGGCTAAAGAGGAGTGGGGTGTAGACGGCTATGGCTCGATGAAGGGTCTGATTAGGCTTCGATCATCAAAGGACTACGAAG
ACGACGACGAAGATGAAGTTGGCGGATCCGGCGATAGTGATGTGGAAGGGCATTTGGAGGTGGAGAGACGGCTGGACCATGATTTAAGTAGGTTTGAAATGATCTGCCCA
ACTTCTGGTGGTGAAGAGCAGAGTACCCTTTTGGAGAGTAGAGTTGACGATCAAGATTCTCAGATAGCGCAGCTTGAGGAGGAGAATTTGACATTGAAGGAAAGGGTATT
TTTCATGGAGAGAGAGTTGGAAGACCTAAGAAGGAGGGTTCAGTGCCTGGAGACTGAAGGTTGGCGCTTGCATCTGGTGGACAATAACAAAGAGGAAACTGCAGCGTCCG
AGAATGCATTTGAGAATGGTGGCCAGCCTGCATCTTGGAGAACTTTTCGAGGTTCACTAAGAAGGTTCTTGGAATCTGTTCTCTCCATTTCACTTGATTTTGATATCAGA
GAAATCTTGAAGCTTGATGACATAACATTGACGGCAGCAGCAGCCTTAGAAGGAAAGGTTTGTGAAAGCAGTTTCTCCGGCATTGTGTCTGCTCTTTTGGCTGCCGGAGC
CTTGGGATGTGCCGGCCGGCGTCAGATGATGGCCGTTCCCCCCGTGTTCTTTAGAAAGCTGCCTCATTTTTCTGAAGGGGCCCATGGGGATGCAATACAAGGCTTCGCAC
GTGCAATTGAGAGTACAGAAAAAGAAAGAGGACTAGAGGAGGCAGTGTTTGGAAAGGGGGTAATTTGCAACATTAAAAGAGGCGGATTGCTCGTACAGATTTTCCTGGAA
GCCAAACAACGCCTACCATCTGAAGGTAAAAAGCATGTTTATTTATCGGTTGGGGGAGCAGTGGGGCCCACCGAGCATCAATTATATGATGGGAGAGACGATGATCGCAA
TGATGGAAAAAGTGTTACGTCTAATTCTCATGATTGCACCGGACATACGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAATGATATGGATCATACTCAGATCTTGATGAATCAAAACTTTGTCGTTTGGACGTCTCAGCCTCCGCCTCCGCCTCTGCCGCCAGTTCTACCGGCGGAGAACCG
AGGTATGTTTCTTCAAGGCTCTCAAAGGTCACTGTCCACTGCTCTTACTGCCAATCAAGGTCGAGGTTTCAGGAAGGGGAAAAGGCCTGACATGAGGCGAAAAGACTCGA
TTCCAACAGCGTATAAACCCTCGAATTGGAACGAATTGCAGTACCCGAACCGCTTGAAATCCAGGCGTTTTTATCCCAAGAAGAAGTCCAACTACCGTTTTCCTCCCTTC
GCGCCCCGTAATACCACTTCCTTCTTAATTCGCGCCAAAAGATCCGGTGGTATTGCCTCGCTGGTGTCGCCGTATCCGGTCACCCCTGCGGTGTTGCCGACTCCCATATT
TTCGCCTCTGAGGGAAGTTCTGGTTGATATGGCTAAAGAGGAGTGGGGTGTAGACGGCTATGGCTCGATGAAGGGTCTGATTAGGCTTCGATCATCAAAGGACTACGAAG
ACGACGACGAAGATGAAGTTGGCGGATCCGGCGATAGTGATGTGGAAGGGCATTTGGAGGTGGAGAGACGGCTGGACCATGATTTAAGTAGGTTTGAAATGATCTGCCCA
ACTTCTGGTGGTGAAGAGCAGAGTACCCTTTTGGAGAGTAGAGTTGACGATCAAGATTCTCAGATAGCGCAGCTTGAGGAGGAGAATTTGACATTGAAGGAAAGGGTATT
TTTCATGGAGAGAGAGTTGGAAGACCTAAGAAGGAGGGTTCAGTGCCTGGAGACTGAAGGTTGGCGCTTGCATCTGGTGGACAATAACAAAGAGGAAACTGCAGCGTCCG
AGAATGCATTTGAGAATGGTGGCCAGCCTGCATCTTGGAGAACTTTTCGAGGTTCACTAAGAAGGTTCTTGGAATCTGTTCTCTCCATTTCACTTGATTTTGATATCAGA
GAAATCTTGAAGCTTGATGACATAACATTGACGGCAGCAGCAGCCTTAGAAGGAAAGGTTTGTGAAAGCAGTTTCTCCGGCATTGTGTCTGCTCTTTTGGCTGCCGGAGC
CTTGGGATGTGCCGGCCGGCGTCAGATGATGGCCGTTCCCCCCGTGTTCTTTAGAAAGCTGCCTCATTTTTCTGAAGGGGCCCATGGGGATGCAATACAAGGCTTCGCAC
GTGCAATTGAGAGTACAGAAAAAGAAAGAGGACTAGAGGAGGCAGTGTTTGGAAAGGGGGTAATTTGCAACATTAAAAGAGGCGGATTGCTCGTACAGATTTTCCTGGAA
GCCAAACAACGCCTACCATCTGAAGGTAAAAAGCATGTTTATTTATCGGTTGGGGGAGCAGTGGGGCCCACCGAGCATCAATTATATGATGGGAGAGACGATGATCGCAA
TGATGGAAAAAGTGTTACGTCTAATTCTCATGATTGCACCGGACATACGATATGA
Protein sequenceShow/hide protein sequence
MKNDMDHTQILMNQNFVVWTSQPPPPPLPPVLPAENRGMFLQGSQRSLSTALTANQGRGFRKGKRPDMRRKDSIPTAYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPF
APRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDDDEDEVGGSGDSDVEGHLEVERRLDHDLSRFEMICP
TSGGEEQSTLLESRVDDQDSQIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHLVDNNKEETAASENAFENGGQPASWRTFRGSLRRFLESVLSISLDFDIR
EILKLDDITLTAAAALEGKVCESSFSGIVSALLAAGALGCAGRRQMMAVPPVFFRKLPHFSEGAHGDAIQGFARAIESTEKERGLEEAVFGKGVICNIKRGGLLVQIFLE
AKQRLPSEGKKHVYLSVGGAVGPTEHQLYDGRDDDRNDGKSVTSNSHDCTGHTI