; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g04900 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g04900
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLiprin-alpha-2 like
Genome locationchr9:3770186..3771913
RNA-Seq ExpressionMoc09g04900
SyntenyMoc09g04900
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058663.1 protein SOGA3-like [Cucumis melo var. makuwa]2.7e-14188Show/hide
Query:  MNDMDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSN
        MNDMD  QILMN NFVVWTQP   PP+LPLENRGMF Q SQRSLS+ PT   GR FRKGKRPD+RRKD  ATPYK SNWNELQY NRLKSRRFYPKKKSN
Subjt:  MNDMDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSN

Query:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEG
        YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYEDED   EEDEVG SGDSDVEG
Subjt:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEG

Query:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        HLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D HI+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR H  D NNKEETAASENAF+N
Subjt:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

KAE8646411.1 hypothetical protein Csa_016850 [Cucumis sativus]2.0e-13983.81Show/hide
Query:  MNDMDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSN
        MNDMD  QILMN NFVVWTQP   PP+LPLENRG FLQ SQRSLS+  T   GR FRKGKRPD+RRKD  A  YK SNWNELQY NRLKSRRFYPKKKSN
Subjt:  MNDMDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSN

Query:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEG
        YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYEDED   EEDEVG SGDSDVEG
Subjt:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEG

Query:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        HLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D HI+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR H +D NNKEETAASENAF+N
Subjt:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

Query:  VHEVLDCRAHCFQNI
            L C      +I
Subjt:  VHEVLDCRAHCFQNI

XP_004136108.1 uncharacterized protein LOC101208199 [Cucumis sativus]1.8e-13783.65Show/hide
Query:  MDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF
        MD  QILMN NFVVWTQP   PP+LPLENRG FLQ SQRSLS+  T   GR FRKGKRPD+RRKD  A  YK SNWNELQY NRLKSRRFYPKKKSNYRF
Subjt:  MDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF

Query:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEGHLE
        PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYEDED   EEDEVG SGDSDVEGHLE
Subjt:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEGHLE

Query:  VERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFENVHE
        VERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D HI+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR H +D NNKEETAASENAF+N   
Subjt:  VERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFENVHE

Query:  VLDCRAHCFQNI
         L C      +I
Subjt:  VLDCRAHCFQNI

XP_022144844.1 uncharacterized protein LOC111014426 [Momordica charantia]1.3e-164100Show/hide
Query:  MNDMDRAQILMNRNFVVWTQPPPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF
        MNDMDRAQILMNRNFVVWTQPPPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF
Subjt:  MNDMDRAQILMNRNFVVWTQPPPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF

Query:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVER
        PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVER
Subjt:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVER

Query:  RLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        RLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
Subjt:  RLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

XP_038899694.1 uncharacterized protein LOC120086952 [Benincasa hispida]1.4e-14087.67Show/hide
Query:  MNDMDRAQILMNRNFVVWTQP---PPLLP-LENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKS
        MNDMD  QIL+N NFVVWTQP   PP+LP LENR MFLQ SQRSLS+ PT N GRGFRKGKRPD+RRKD  AT YKPSNWNELQY NRLKSRRFYPKKKS
Subjt:  MNDMDRAQILMNRNFVVWTQP---PPLLP-LENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKS

Query:  NYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED--EDEEDEVGGSGDSDVEG
        NYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYED  E+EEDEVG SGDSDVEG
Subjt:  NYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYED--EDEEDEVGGSGDSDVEG

Query:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        HLEVERRLDHDLSRFEMICPTSG EEQSTLLESRVDD+D HI+QLEEENLTLKERVFFMERELEDLR R+QCLETEGWRL  +D NNKEETAASENAF+N
Subjt:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

TrEMBL top hitse value%identityAlignment
A0A0A0K9V7 Uncharacterized protein8.8e-13883.65Show/hide
Query:  MDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF
        MD  QILMN NFVVWTQP   PP+LPLENRG FLQ SQRSLS+  T   GR FRKGKRPD+RRKD  A  YK SNWNELQY NRLKSRRFYPKKKSNYRF
Subjt:  MDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF

Query:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEGHLE
        PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYEDED   EEDEVG SGDSDVEGHLE
Subjt:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEGHLE

Query:  VERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFENVHE
        VERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D HI+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR H +D NNKEETAASENAF+N   
Subjt:  VERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFENVHE

Query:  VLDCRAHCFQNI
         L C      +I
Subjt:  VLDCRAHCFQNI

A0A5D3CGI4 Protein SOGA3-like1.3e-14188Show/hide
Query:  MNDMDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSN
        MNDMD  QILMN NFVVWTQP   PP+LPLENRGMF Q SQRSLS+ PT   GR FRKGKRPD+RRKD  ATPYK SNWNELQY NRLKSRRFYPKKKSN
Subjt:  MNDMDRAQILMNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSN

Query:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEG
        YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS+KDYEDED   EEDEVG SGDSDVEG
Subjt:  YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDED---EEDEVGGSGDSDVEG

Query:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        HLEVERRLDHDLSRFEMICPTS GEEQSTLLE+RVDD+D HI+QLEEENLTLKERVFFMERELEDLRRRVQCLETEGWR H  D NNKEETAASENAF+N
Subjt:  HLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

A0A6J1CTG2 uncharacterized protein LOC1110144266.5e-165100Show/hide
Query:  MNDMDRAQILMNRNFVVWTQPPPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF
        MNDMDRAQILMNRNFVVWTQPPPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF
Subjt:  MNDMDRAQILMNRNFVVWTQPPPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRF

Query:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVER
        PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVER
Subjt:  PPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVER

Query:  RLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        RLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
Subjt:  RLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

A0A6J1H988 uncharacterized protein LOC1114612442.3e-12281.53Show/hide
Query:  MNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRN
        MN+N VVW  P   PP+LPLENRGMFL GSQR LS+APT NQGR  RKG+  D++RK S+++ YKPSNWNELQY NRLKSRRF+PKKK NYRFPPFAPRN
Subjt:  MNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRN

Query:  TTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVERRLDHDLS
        TTSFLIRAKRSGGIASLVSP PVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS +D    +EE E GGSGDSDVEGHLEVERRLDHDLS
Subjt:  TTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVERRLDHDLS

Query:  RFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        RFEMICP SGGEEQS++LESRVDDQDSHIAQL+EENLTLKERVFFMERELE+LRRRVQ LETEG  +   DNNNK+ETAASENA +N
Subjt:  RFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

A0A6J1KRB8 uncharacterized protein LOC1114979713.6e-12381.88Show/hide
Query:  MNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRN
        MN+N VVW  P   PP+LPLENRGMFL GSQR LS+APT NQGR  RKG+  D+RRK S+++ YKPSNWNELQY NRLKSRRF+PKKK NYRFPPFAPRN
Subjt:  MNRNFVVWTQP---PPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRN

Query:  TTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVERRLDHDLS
        TTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWG+DGYGSMKGLIRLRS +D    +EE E GGSGDSDVEGHLEVERRLDHDLS
Subjt:  TTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVERRLDHDLS

Query:  RFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN
        RFEMICP SGGEEQS++LESRVDDQDSHIAQL+EENLTLKERVFFMERELE+LRRRVQ LETEG  +   DNNNK+ETA SENA +N
Subjt:  RFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFEN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G19900.1 PRLI-interacting factor, putative1.1e-7668.75Show/hide
Query:  YKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLR
        YKP   NELQ  NRLK+R+FYPKKK   R+ P+APRNTTSF+IRAK+SGGIA LVSP PVTPAVLPTP+FSP REVL DMAKEEWGVDGYGSMKGLIRLR
Subjt:  YKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLR

Query:  SS----KDYEDEDEEDEVGGSGDSDVEGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQC
        +     + YE++DE++  GGS +SDVE H+EVERRLDHDLSRFEMI P  GG E + +LE+RVDDQDSHIAQLEEENLTLKER+F MEREL D+RRR+Q 
Subjt:  SS----KDYEDEDEEDEVGGSGDSDVEGHLEVERRLDHDLSRFEMICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQC

Query:  LETEGWRLHPVDNNNKEETAASEN
        LE    R   V  +  EE   +E+
Subjt:  LETEGWRLHPVDNNNKEETAASEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATATGGATCGTGCTCAGATCTTGATGAATCGAAACTTCGTCGTTTGGACTCAACCTCCTCCACTTCTGCCGCTTGAGAATCGAGGTATGTTTCTTCAA
GGCTCCCAAAGATCACTGTCCTCAGCTCCTACTGTTAATCAAGGGCGAGGTTTCAGGAAGGGGAAAAGGCCCGACGTGAGGCGAAAAGACTCCGTTGCAACGCCG
TATAAACCCTCGAATTGGAACGAATTGCAGTACCCGAACCGCTTGAAATCCAGGCGGTTCTATCCGAAGAAGAAGTCCAACTACCGATTCCCTCCGTTCGCGCCC
CGTAATACCACTTCCTTCTTAATTCGTGCCAAAAGATCCGGCGGTATTGCCTCGCTTGTATCGCCGTATCCGGTAACCCCTGCCGTGTTACCGACTCCAATATTT
TCGCCTCTCAGGGAAGTTCTTGTCGATATGGCCAAAGAGGAGTGGGGTGTTGACGGCTATGGCTCGATGAAGGGTCTGATTAGGCTTCGGTCCTCAAAAGATTAC
GAGGACGAAGACGAAGAAGATGAAGTTGGGGGATCCGGTGATAGCGATGTGGAAGGACATTTGGAGGTGGAGAGACGGCTGGACCATGATTTAAGTAGGTTTGAA
ATGATCTGCCCAACTTCTGGTGGTGAAGAACAGAGCACCCTTTTGGAGAGTAGAGTGGATGATCAAGATTCTCACATAGCGCAGCTTGAGGAGGAGAATTTGACA
TTGAAGGAAAGGGTTTTCTTCATGGAGAGAGAATTGGAAGACCTGAGAAGAAGGGTCCAGTGCTTGGAAACTGAAGGATGGCGCTTGCATCCGGTGGACAATAAC
AACAAAGAGGAGACTGCAGCGTCCGAGAATGCCTTTGAGAATGTTCATGAAGTGTTAGATTGCAGGGCCCATTGTTTTCAGAACATCGATGGAGCAGCAGCATTA
GAAGAAAAGATTTGTAAAAGAAGTTTTCCTGGTGAGCCTTCACTCCAGGAAATTTTCGCATTTTGGGCCATACAAGCTTGTCTGCCGGAGCCTGGAATATGCCGC
TCCGGCCAGCACCGTCGTTCTCCAATGTTCTTCAGAAAGCTTCCTCAGTTTTCTGAAGCCATGGGCATTGAAGCTGAAGTTCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGATATGGATCGTGCTCAGATCTTGATGAATCGAAACTTCGTCGTTTGGACTCAACCTCCTCCACTTCTGCCGCTTGAGAATCGAGGTATGTTTCTTCAA
GGCTCCCAAAGATCACTGTCCTCAGCTCCTACTGTTAATCAAGGGCGAGGTTTCAGGAAGGGGAAAAGGCCCGACGTGAGGCGAAAAGACTCCGTTGCAACGCCG
TATAAACCCTCGAATTGGAACGAATTGCAGTACCCGAACCGCTTGAAATCCAGGCGGTTCTATCCGAAGAAGAAGTCCAACTACCGATTCCCTCCGTTCGCGCCC
CGTAATACCACTTCCTTCTTAATTCGTGCCAAAAGATCCGGCGGTATTGCCTCGCTTGTATCGCCGTATCCGGTAACCCCTGCCGTGTTACCGACTCCAATATTT
TCGCCTCTCAGGGAAGTTCTTGTCGATATGGCCAAAGAGGAGTGGGGTGTTGACGGCTATGGCTCGATGAAGGGTCTGATTAGGCTTCGGTCCTCAAAAGATTAC
GAGGACGAAGACGAAGAAGATGAAGTTGGGGGATCCGGTGATAGCGATGTGGAAGGACATTTGGAGGTGGAGAGACGGCTGGACCATGATTTAAGTAGGTTTGAA
ATGATCTGCCCAACTTCTGGTGGTGAAGAACAGAGCACCCTTTTGGAGAGTAGAGTGGATGATCAAGATTCTCACATAGCGCAGCTTGAGGAGGAGAATTTGACA
TTGAAGGAAAGGGTTTTCTTCATGGAGAGAGAATTGGAAGACCTGAGAAGAAGGGTCCAGTGCTTGGAAACTGAAGGATGGCGCTTGCATCCGGTGGACAATAAC
AACAAAGAGGAGACTGCAGCGTCCGAGAATGCCTTTGAGAATGTTCATGAAGTGTTAGATTGCAGGGCCCATTGTTTTCAGAACATCGATGGAGCAGCAGCATTA
GAAGAAAAGATTTGTAAAAGAAGTTTTCCTGGTGAGCCTTCACTCCAGGAAATTTTCGCATTTTGGGCCATACAAGCTTGTCTGCCGGAGCCTGGAATATGCCGC
TCCGGCCAGCACCGTCGTTCTCCAATGTTCTTCAGAAAGCTTCCTCAGTTTTCTGAAGCCATGGGCATTGAAGCTGAAGTTCATTGA
Protein sequenceShow/hide protein sequence
MNDMDRAQILMNRNFVVWTQPPPLLPLENRGMFLQGSQRSLSSAPTVNQGRGFRKGKRPDVRRKDSVATPYKPSNWNELQYPNRLKSRRFYPKKKSNYRFPPFAP
RNTTSFLIRAKRSGGIASLVSPYPVTPAVLPTPIFSPLREVLVDMAKEEWGVDGYGSMKGLIRLRSSKDYEDEDEEDEVGGSGDSDVEGHLEVERRLDHDLSRFE
MICPTSGGEEQSTLLESRVDDQDSHIAQLEEENLTLKERVFFMERELEDLRRRVQCLETEGWRLHPVDNNNKEETAASENAFENVHEVLDCRAHCFQNIDGAAAL
EEKICKRSFPGEPSLQEIFAFWAIQACLPEPGICRSGQHRRSPMFFRKLPQFSEAMGIEAEVH