CuGenDBv2

MC07g0577 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC07g0577
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Mucin-16 like
Genome location: MC07:13784844..13785722
RNA-Seq Expression: MC07g0577
Synteny: MC07g0577
Gene Ontology terms: NA
InterPro domains: NA
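
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state), the span can be parsed and its length computed as follows. The function names `parse_location` and `span_length` are illustrative and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end = parse_location("MC07:13784844..13785722")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 879 bp, which corresponds to 293 codons.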


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008461888.1 PREDICTED: uncharacterized protein LOC103500380 [Cucumis melo] | e-value: 1.22e-132 | identity: 67.22%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QN    AKIPL FFG P RSAI + D  +LSFTF+TFFRSGPSFNLSYRPN +++PF+LALKAGIGLFGSPID+PM F AEFNLP N+PPRFFLHFRP+L
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRSVQS I   N PYRN IS++DDD +ALS+GKS+D G ET      DL LGN       IDCS+V  R+DDV S++EI ARS FKV D  AVKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR
        RW+MRFP S+  + FTA+  LSKMPYL LGKIKIE      GE+ES+EA GAGE+S ++K L+DLW ESRWLK NVEQL+SEIGEQKA P TPPVE+RK+
Subjt:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR

Query:  RG
        +G
Subjt:  RG

XP_011652835.1 uncharacterized protein LOC105435123 [Cucumis sativus] | e-value: 9.93e-132 | identity: 65.56%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QN   RAKIPLNFFG P RS++ + D  +LSFTF+TFFRSGPSFNLSYRPN +++PF+LA+KAGIGLFGSPID+PM F AEFNLP N+PPRFFLHFRP+L
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRSVQS+I N  LPYRN IS++DDD++A ++GKSV  G ET      DL LGN       ID S+   R+DDVLS++EI ARS FKV D  AVKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGVG----EQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR
        RW+M FP  ++ ++FTA+  LSKMPYL LGKIKIE       E+ES++AAGAGE+S L+K L+DLW ESRWLK N+EQL+SEIGEQKAAP+TPPVE+RK+
Subjt:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGVG----EQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR

Query:  RG
        +G
Subjt:  RG

XP_022152590.1 uncharacterized protein LOC111020281 [Momordica charantia] | e-value: 1.72e-205 | identity: 98.63%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSI+PFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGIDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKFRWTMRF
        GDFTLRRSVQSHIPNLNLPYRNAISRMDDD+AALSKGKSVDDGAETSGEADSDLTLGNGIDCSQV TRIDDVLSSVEICA SRFKVRDRAAVKFRWTMRF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGIDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKFRWTMRF

Query:  PVSVRTEDFTARTLLSKMPYLTLGKIKIEGVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKRRGA
        PVSVRTEDFTARTLLSKMPYLTLGKIKIEGVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKRRGA
Subjt:  PVSVRTEDFTARTLLSKMPYLTLGKIKIEGVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKRRGA

XP_023526002.1 uncharacterized protein LOC111789609 [Cucurbita pepo subsp. pepo] | e-value: 1.61e-127 | identity: 65.12%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        +NP  RAKIPLNFFGFP RSAI++A+  +LSFTF++FFRSGP+FN SYRPNDS+ PFTLA+KAGIGL+GS ID+PM FTAEFNLP N+PPRFFLHFRPRL
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGI------DCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRSVQSH  + NLP       +DDD+AA+S GKSVD G E+SGE  +DL LGN I       CS +  R  D+LS+ EI ARS FKV++ A+VKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGI------DCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIE----GVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR
        +W MRFP+S++ E+FTA+ +LSK+PYL L KIKIE       E+ES+EA  AGE S L+K LDDL +ESRW+K N+EQLRSEIG+Q AAPA PPVESRK+
Subjt:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIE----GVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR

Query:  R
        R
Subjt:  R

XP_038904343.1 uncharacterized protein LOC120090696 [Benincasa hispida] | e-value: 5.26e-141 | identity: 70.1%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QNPI RAKIPLNFFGFP RSAI +A+  +LSFTF+TFFRSGPSFNLSY PN S++PF+LA+KAGIGLFGSPID+PM  TAEFNLP N+PPRFFLHF+P+L
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRS+QSHI N NLPYRN+IS++D  +AALS+GKSVD G ETS   D  L LGN       IDCS+V  R+DDVLS+VEI ARS FK+RD AAVKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR
        RW+MRFP S++ EDF A+   SKMPYL LGKIKIE       E+ES+EA GAGE+ AL+K LDDLW+ESRWLK N+E+L+SEIG+QKAAPATPPVE+RK+
Subjt:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR

Query:  R
        +
Subjt:  R

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LD41 Uncharacterized protein | e-value: 4.81e-132 | identity: 65.56%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QN   RAKIPLNFFG P RS++ + D  +LSFTF+TFFRSGPSFNLSYRPN +++PF+LA+KAGIGLFGSPID+PM F AEFNLP N+PPRFFLHFRP+L
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRSVQS+I N  LPYRN IS++DDD++A ++GKSV  G ET      DL LGN       ID S+   R+DDVLS++EI ARS FKV D  AVKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGVG----EQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR
        RW+M FP  ++ ++FTA+  LSKMPYL LGKIKIE       E+ES++AAGAGE+S L+K L+DLW ESRWLK N+EQL+SEIGEQKAAP+TPPVE+RK+
Subjt:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGVG----EQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR

Query:  RG
        +G
Subjt:  RG

A0A1S3CFN0 uncharacterized protein LOC103500380 | e-value: 5.89e-133 | identity: 67.22%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QN    AKIPL FFG P RSAI + D  +LSFTF+TFFRSGPSFNLSYRPN +++PF+LALKAGIGLFGSPID+PM F AEFNLP N+PPRFFLHFRP+L
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRSVQS I   N PYRN IS++DDD +ALS+GKS+D G ET      DL LGN       IDCS+V  R+DDV S++EI ARS FKV D  AVKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR
        RW+MRFP S+  + FTA+  LSKMPYL LGKIKIE      GE+ES+EA GAGE+S ++K L+DLW ESRWLK NVEQL+SEIGEQKA P TPPVE+RK+
Subjt:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR

Query:  RG
        +G
Subjt:  RG

A0A5D3CGK3 Uncharacterized protein | e-value: 5.89e-133 | identity: 67.22%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QN    AKIPL FFG P RSAI + D  +LSFTF+TFFRSGPSFNLSYRPN +++PF+LALKAGIGLFGSPID+PM F AEFNLP N+PPRFFLHFRP+L
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRSVQS I   N PYRN IS++DDD +ALS+GKS+D G ET      DL LGN       IDCS+V  R+DDV S++EI ARS FKV D  AVKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR
        RW+MRFP S+  + FTA+  LSKMPYL LGKIKIE      GE+ES+EA GAGE+S ++K L+DLW ESRWLK NVEQL+SEIGEQKA P TPPVE+RK+
Subjt:  RWTMRFPVSVRTEDFTARTLLSKMPYLTLGKIKIEGV----GEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKR

Query:  RG
        +G
Subjt:  RG

A0A6J1DED2 uncharacterized protein LOC111020281 | e-value: 8.35e-206 | identity: 98.63%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSI+PFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGIDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKFRWTMRF
        GDFTLRRSVQSHIPNLNLPYRNAISRMDDD+AALSKGKSVDDGAETSGEADSDLTLGNGIDCSQV TRIDDVLSSVEICA SRFKVRDRAAVKFRWTMRF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGIDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKFRWTMRF

Query:  PVSVRTEDFTARTLLSKMPYLTLGKIKIEGVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKRRGA
        PVSVRTEDFTARTLLSKMPYLTLGKIKIEGVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKRRGA
Subjt:  PVSVRTEDFTARTLLSKMPYLTLGKIKIEGVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKRRGA

A0A6J1F7Z3 uncharacterized protein LOC111441692 | e-value: 3.80e-126 | identity: 65.56%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        +NPI RAKIPLNFFGFP RSAI++A+  +LSFTF++FFRSGP+FN SYRPNDS+ PFTLA+KAGIGL GS ID+PM FTAEFNLP N+PPRFFLHFRPRL
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF
        GDFTLRRSVQSH  + NLP       +DDD+AA+S GKSVD G E+SGE  +DL LGN       I CS +  R  D+LS+ EI ARS FKV++ AAVKF
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNG------IDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKF

Query:  RWTMRFPVSVRTEDFTART-LLSKMPYLTLGKIKIE----GVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRK
        +W MRFP+S++ E+FTA+  LLSK+PYL L KIKIE       E+ES+EA  AGE S L+K LDDL +ESRW+K N+EQLRSEIG+Q AAPA PPVESRK
Subjt:  RWTMRFPVSVRTEDFTART-LLSKMPYLTLGKIKIE----GVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRK

Query:  RR
        ++
Subjt:  RR

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G57990.1 unknown protein | e-value: 8.5e-40 | identity: 35.88%
Query:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL
        Q P+ RAK+PL+  G P +S I   +  +LS   STFF SGPS  ++YRPNDS +PF+L +K G G FGSPI + M  +AEFNL     P F LHF+P+ 
Subjt:  QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRL

Query:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGIDCSQV--STRIDDV---LSSVEICARSRFKVRDRAAVKFR
        GDF++++S  S     +   RN I  M+  ++       V D    +G        G G     V  ST   D+   LS VE+ AR+   VR RA + FR
Subjt:  GDFTLRRSVQSHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGIDCSQV--STRIDDV---LSSVEICARSRFKVRDRAAVKFR

Query:  WTMRFPVSVRTE-DFTARTLLSKMPYLTLGKIKIEGVGEQES---------DEAAGAGEYSA---LRKQLDDLWSESRWLKMNVEQLRSEIGEQKA-APA
        W +R P  +R + D TA   L + P+L + KI IE V   ++          + +G  +++    + + +++L +E++ LK  VE LR  I   +  +PA
Subjt:  WTMRFPVSVRTE-DFTARTLLSKMPYLTLGKIKIEGVGEQES---------DEAAGAGEYSA---LRKQLDDLWSESRWLKMNVEQLRSEIGEQKA-APA

Query:  T
        T
Subjt:  T
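
In BLAST-style pairwise output such as the alignments above, the middle line of each block repeats the residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities from such midlines. It is only an illustration: the helper names are hypothetical, the toy midline is not taken from this page, and the aligned length is passed in explicitly because trailing whitespace in a midline may be lost when a page is copied as text.

```python
def midline_stats(midline):
    """Return (identities, positives) for one BLAST midline segment."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives

def percent_identity(midline_segments, aligned_length):
    """Percent identity over all segments of one alignment."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(midline_stats(seg)[0] for seg in midline_segments)
    return 100.0 * identities / aligned_length

# Toy midline covering 8 aligned columns (illustrative only).
toy_midline = "QN  AK+P"
toy_identities, toy_positives = midline_stats(toy_midline)
toy_percent = percent_identity([toy_midline], aligned_length=8)
print(toy_identities, toy_positives, toy_percent)
```

The percentage reported by the database may be computed over a different denominator (for example, including gap columns), so values from this sketch need not match the table exactly.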


Sequences
CDS sequence
CAAAACCCTATTCTGAGAGCCAAAATCCCTCTCAATTTCTTCGGCTTCCCCCTTCGTTCCGCCATTGAAGTTGCAGACGTCCACGATCTCTCCTTCACCTTCAGCACTTT
CTTCAGATCCGGCCCTTCCTTCAACCTCTCTTACCGCCCCAATGATTCCATCGACCCCTTCACTCTCGCCCTCAAGGCCGGAATCGGACTGTTCGGCTCGCCGATCGACG
CTCCCATGGCTTTCACGGCGGAATTCAACCTTCCACGCAACGAACCCCCTCGGTTCTTCCTCCACTTCCGCCCTCGACTCGGCGATTTCACTCTCCGGAGATCCGTCCAG
TCGCACATCCCGAATTTGAATTTGCCCTATCGGAACGCGATTTCGCGAATGGATGATGATATTGCTGCTCTGAGTAAGGGAAAGTCCGTCGACGATGGCGCCGAAACTTC
TGGAGAAGCCGATTCTGACCTGACGCTCGGAAATGGGATCGACTGCTCGCAGGTCTCGACTCGAATCGACGATGTGCTCTCGAGTGTGGAGATTTGTGCGAGATCGAGGT
TCAAGGTGAGGGACCGGGCGGCGGTGAAGTTCCGATGGACCATGAGATTTCCGGTTAGCGTTAGAACGGAAGATTTTACTGCCAGGACTCTGCTCTCCAAAATGCCGTAT
CTAACATTGGGAAAAATCAAAATCGAAGGCGTCGGCGAACAAGAAAGCGATGAGGCAGCCGGCGCCGGCGAGTATTCAGCTTTGAGGAAGCAATTGGACGATTTATGGAG
CGAAAGCCGGTGGTTGAAGATGAACGTAGAGCAACTTCGGTCGGAAATCGGTGAGCAGAAAGCTGCACCGGCGACGCCGCCGGTTGAATCTCGGAAGAGAAGGGGAGCT
mRNA sequence
CAAAACCCTATTCTGAGAGCCAAAATCCCTCTCAATTTCTTCGGCTTCCCCCTTCGTTCCGCCATTGAAGTTGCAGACGTCCACGATCTCTCCTTCACCTTCAGCACTTT
CTTCAGATCCGGCCCTTCCTTCAACCTCTCTTACCGCCCCAATGATTCCATCGACCCCTTCACTCTCGCCCTCAAGGCCGGAATCGGACTGTTCGGCTCGCCGATCGACG
CTCCCATGGCTTTCACGGCGGAATTCAACCTTCCACGCAACGAACCCCCTCGGTTCTTCCTCCACTTCCGCCCTCGACTCGGCGATTTCACTCTCCGGAGATCCGTCCAG
TCGCACATCCCGAATTTGAATTTGCCCTATCGGAACGCGATTTCGCGAATGGATGATGATATTGCTGCTCTGAGTAAGGGAAAGTCCGTCGACGATGGCGCCGAAACTTC
TGGAGAAGCCGATTCTGACCTGACGCTCGGAAATGGGATCGACTGCTCGCAGGTCTCGACTCGAATCGACGATGTGCTCTCGAGTGTGGAGATTTGTGCGAGATCGAGGT
TCAAGGTGAGGGACCGGGCGGCGGTGAAGTTCCGATGGACCATGAGATTTCCGGTTAGCGTTAGAACGGAAGATTTTACTGCCAGGACTCTGCTCTCCAAAATGCCGTAT
CTAACATTGGGAAAAATCAAAATCGAAGGCGTCGGCGAACAAGAAAGCGATGAGGCAGCCGGCGCCGGCGAGTATTCAGCTTTGAGGAAGCAATTGGACGATTTATGGAG
CGAAAGCCGGTGGTTGAAGATGAACGTAGAGCAACTTCGGTCGGAAATCGGTGAGCAGAAAGCTGCACCGGCGACGCCGCCGGTTGAATCTCGGAAGAGAAGGGGAGCT
Protein sequence
QNPILRAKIPLNFFGFPLRSAIEVADVHDLSFTFSTFFRSGPSFNLSYRPNDSIDPFTLALKAGIGLFGSPIDAPMAFTAEFNLPRNEPPRFFLHFRPRLGDFTLRRSVQ
SHIPNLNLPYRNAISRMDDDIAALSKGKSVDDGAETSGEADSDLTLGNGIDCSQVSTRIDDVLSSVEICARSRFKVRDRAAVKFRWTMRFPVSVRTEDFTARTLLSKMPY
LTLGKIKIEGVGEQESDEAAGAGEYSALRKQLDDLWSESRWLKMNVEQLRSEIGEQKAAPATPPVESRKRRGA
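
The protein sequence above corresponds to a frame-1 translation of the CDS. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and using a hypothetical `translate` helper, is shown below; it is not the pipeline the database itself uses.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 33 nucleotides of the CDS listed above.
cds_prefix = "CAAAACCCTATTCTGAGAGCCAAAATCCCTCTC"
print(translate(cds_prefix))
```

For the first 33 nucleotides this yields `QNPILRAKIPL`, which matches the start of the listed protein sequence.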