; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0587 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0587
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUnknown protein
Genome locationMC05:4497557..4504284
RNA-Seq ExpressionMC05g0587
SyntenyMC05g0587
Gene Ontology termsGO:0000124 - SAGA complex (cellular component)
InterPro domainsIPR037804 - SAGA-associated factor 73


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008438795.1 PREDICTED: uncharacterized protein LOC103483792 isoform X2 [Cucumis melo]2.14e-19681.67Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MV SVGNGRMAVMTRL+AAGSFSRTI GK R+  EVG QK ASEFI RELRDADEANLIDEEDMHVFGLKPM DPLNLVCCN CKKPVKASQYIIH ELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCP--DSASLMHP
        RSL   QGTIMDLDGGMGHRK+SRKEKKKLL +DANIS VEKEGSEST AD+SAAPA PINNQFEM+KLTKRNSTCN +PILD GTG C   D+AS +HP
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCP--DSASLMHP

Query:  STKRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDIIG
        STKRSKLITGEGLLLAS LEPSS KTKIRNVP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC        KENI  FH++S+EE SQEQT+D+IG
Subjt:  STKRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDIIG

Query:  KKMDSRSLMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK
         KMD++SL SAWK DHNLA    +FSSGKCLPAG ASNKFV GSSVAWPQIA VELTQKK
Subjt:  KKMDSRSLMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK

XP_022138020.1 uncharacterized protein LOC111009285 isoform X1 [Momordica charantia]2.64e-24498.01Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MVCSVGNGRMAVMTRLLAAGSFSRTIT      +EVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
        RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST

Query:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKKMDSRSLMS
        KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKKMDSRSLMS
Subjt:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKKMDSRSLMS

Query:  AWKPDHNLAQCLDIFSSGKCLPAGSASNKFVGSSVAWPQIAAVELTQKKLST
        AWKPDHNLAQCLDIFSSGKCLPAGSASNKFVGSSVAWPQIAAVELTQKKLST
Subjt:  AWKPDHNLAQCLDIFSSGKCLPAGSASNKFVGSSVAWPQIAAVELTQKKLST

XP_022138026.1 uncharacterized protein LOC111009285 isoform X2 [Momordica charantia]5.71e-19897.6Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MVCSVGNGRMAVMTRLLAAGSFSRTIT      +EVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
        RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST

Query:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
        KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
Subjt:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK

XP_022138027.1 uncharacterized protein LOC111009285 isoform X3 [Momordica charantia]2.53e-19897.6Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MVCSVGNGRMAVMTRLLAAGSFSRTIT      +EVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
        RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST

Query:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
        KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
Subjt:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK

XP_038900665.1 uncharacterized protein LOC120087804 isoform X2 [Benincasa hispida]1.09e-19782.47Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MVCSVGNGRMAVMTRLLAAGSFSR+I GK R+  EVGHQK ASEFI RELRDADEANLIDEEDMHVFGLKPM DPLNLVCCN CKKPVKASQYIIH ELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPD----SASLM
        RSL   QGTIMDLDGGMGHRK+SRKEKKK+LP DANISAVEKEGSESTYA+YS APAFPINNQFEMVKLTKRNSTC  + ILD  TGVC D    +ASL+
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPD----SASLM

Query:  HPSTKRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDI
        HPSTKRSKLITGEGLLLAS LEPSS KTKIRNVP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC        KENI  FHN+SQEESSQEQT+DI
Subjt:  HPSTKRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDI

Query:  IGKKMDSRS---LMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK
        IG KMDS+    L SAWK DHNL     IFSSGKCLPA  ASNKFV GSSVAWPQIA VELTQKK
Subjt:  IGKKMDSRS---LMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK

TrEMBL top hitse value%identityAlignment
A0A1S3AWW9 uncharacterized protein LOC103483792 isoform X14.16e-19481.16Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MV SVGNGRMAVMTRL+AAGSFSRTI GK R+  EVG QK ASEFI RELRDADEANLIDEEDMHVFGLKPM DPLNLVCCN CKKPVKASQYIIH ELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCP--DSASLMHP
        RSL   QGTIMDLDGGMGHRK+SRKEKKKLL +DANIS VEKEGSEST AD+SAAPA PINNQFEM+KLTKRNSTCN +PILD GTG C   D+AS +HP
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCP--DSASLMHP

Query:  STKRSKLITGEGLLLASALEPSSTKTKIR-NVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDII
        STKRSKLITGEGLLLAS LEPSS KTKIR +VP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC        KENI  FH++S+EE SQEQT+D+I
Subjt:  STKRSKLITGEGLLLASALEPSSTKTKIR-NVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDII

Query:  GKKMDSRSLMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK
        G KMD++SL SAWK DHNLA    +FSSGKCLPAG ASNKFV GSSVAWPQIA VELTQKK
Subjt:  GKKMDSRSLMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK

A0A1S3AX95 uncharacterized protein LOC103483792 isoform X21.04e-19681.67Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MV SVGNGRMAVMTRL+AAGSFSRTI GK R+  EVG QK ASEFI RELRDADEANLIDEEDMHVFGLKPM DPLNLVCCN CKKPVKASQYIIH ELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCP--DSASLMHP
        RSL   QGTIMDLDGGMGHRK+SRKEKKKLL +DANIS VEKEGSEST AD+SAAPA PINNQFEM+KLTKRNSTCN +PILD GTG C   D+AS +HP
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCP--DSASLMHP

Query:  STKRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDIIG
        STKRSKLITGEGLLLAS LEPSS KTKIRNVP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC        KENI  FH++S+EE SQEQT+D+IG
Subjt:  STKRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEIC--------KENINQFHNSSQEESSQEQTNDIIG

Query:  KKMDSRSLMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK
         KMD++SL SAWK DHNLA    +FSSGKCLPAG ASNKFV GSSVAWPQIA VELTQKK
Subjt:  KKMDSRSLMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKFV-GSSVAWPQIAAVELTQKK

A0A6J1C8H1 uncharacterized protein LOC111009285 isoform X22.76e-19897.6Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MVCSVGNGRMAVMTRLLAAGSFSRTIT      +EVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
        RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST

Query:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
        KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
Subjt:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK

A0A6J1C9W0 uncharacterized protein LOC111009285 isoform X11.28e-24498.01Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MVCSVGNGRMAVMTRLLAAGSFSRTIT      +EVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
        RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST

Query:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKKMDSRSLMS
        KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKKMDSRSLMS
Subjt:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKKMDSRSLMS

Query:  AWKPDHNLAQCLDIFSSGKCLPAGSASNKFVGSSVAWPQIAAVELTQKKLST
        AWKPDHNLAQCLDIFSSGKCLPAGSASNKFVGSSVAWPQIAAVELTQKKLST
Subjt:  AWKPDHNLAQCLDIFSSGKCLPAGSASNKFVGSSVAWPQIAAVELTQKKLST

A0A6J1C9W4 uncharacterized protein LOC111009285 isoform X31.23e-19897.6Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
        MVCSVGNGRMAVMTRLLAAGSFSRTIT      +EVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELC

Query:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
        RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST
Subjt:  RSLSSEQGTIMDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPST

Query:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
        KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK
Subjt:  KRSKLITGEGLLLASALEPSSTKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATGCTCAGTTGGAAATGGGAGAATGGCGGTGATGACTAGGCTTCTGGCTGCTGGGAGTTTCTCGCGAACTATTACAGGTAAACTGAGAGATATTAAGGAAGTTGG
TCATCAGAAGTTAGCTTCTGAATTTATCTACAGAGAACTTCGTGATGCAGATGAAGCAAATTTAATTGATGAAGAAGACATGCACGTGTTTGGTTTGAAGCCTATGGCTG
ATCCCCTGAATTTGGTATGTTGCAATAACTGTAAGAAGCCAGTAAAGGCCAGCCAATATATCATTCATGAAGAACTTTGCAGGTCATTAAGTTCTGAACAAGGAACTATT
ATGGATCTCGATGGTGGGATGGGCCATAGAAAAAACTCAAGGAAGGAGAAGAAAAAGTTACTGCCTGCTGATGCTAATATATCAGCTGTGGAGAAAGAAGGATCTGAATC
AACATATGCAGATTATTCTGCTGCCCCAGCATTTCCAATTAATAACCAATTTGAAATGGTCAAGTTGACAAAAAGAAATTCAACTTGTAATGCGTCACCTATACTGGATG
GTGGTACAGGAGTCTGTCCTGATTCAGCTAGTTTGATGCATCCTTCCACCAAGCGGTCCAAATTGATAACTGGTGAAGGGCTGTTACTGGCATCTGCTTTAGAACCATCG
TCAACTAAAACAAAAATTAGAAATGTTCCGTTACCCCTTGCAAGTAAAATATATTACTCTCAAAGAAATAATCGTTTGCGCTCGGCACTTGGTTATCTTTACTGGGAGGC
TGTTGCATCTAGCAAGGAAATTTGTAAGGAAAATATAAATCAATTTCACAATTCTTCTCAGGAGGAGTCATCTCAAGAACAAACTAATGACATTATTGGAAAGAAGATGG
ATAGTCGGTCCTTAATGTCTGCATGGAAACCTGACCATAATCTGGCTCAATGTTTGGACATATTCTCATCTGGGAAATGTTTGCCTGCTGGTAGTGCCTCAAATAAGTTT
GTTGGCAGCAGTGTTGCATGGCCACAAATTGCCGCAGTCGAATTGACACAAAAAAAATTATCTACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTATGCTCAGTTGGAAATGGGAGAATGGCGGTGATGACTAGGCTTCTGGCTGCTGGGAGTTTCTCGCGAACTATTACAGGTAAACTGAGAGATATTAAGGAAGTTGG
TCATCAGAAGTTAGCTTCTGAATTTATCTACAGAGAACTTCGTGATGCAGATGAAGCAAATTTAATTGATGAAGAAGACATGCACGTGTTTGGTTTGAAGCCTATGGCTG
ATCCCCTGAATTTGGTATGTTGCAATAACTGTAAGAAGCCAGTAAAGGCCAGCCAATATATCATTCATGAAGAACTTTGCAGGTCATTAAGTTCTGAACAAGGAACTATT
ATGGATCTCGATGGTGGGATGGGCCATAGAAAAAACTCAAGGAAGGAGAAGAAAAAGTTACTGCCTGCTGATGCTAATATATCAGCTGTGGAGAAAGAAGGATCTGAATC
AACATATGCAGATTATTCTGCTGCCCCAGCATTTCCAATTAATAACCAATTTGAAATGGTCAAGTTGACAAAAAGAAATTCAACTTGTAATGCGTCACCTATACTGGATG
GTGGTACAGGAGTCTGTCCTGATTCAGCTAGTTTGATGCATCCTTCCACCAAGCGGTCCAAATTGATAACTGGTGAAGGGCTGTTACTGGCATCTGCTTTAGAACCATCG
TCAACTAAAACAAAAATTAGAAATGTTCCGTTACCCCTTGCAAGTAAAATATATTACTCTCAAAGAAATAATCGTTTGCGCTCGGCACTTGGTTATCTTTACTGGGAGGC
TGTTGCATCTAGCAAGGAAATTTGTAAGGAAAATATAAATCAATTTCACAATTCTTCTCAGGAGGAGTCATCTCAAGAACAAACTAATGACATTATTGGAAAGAAGATGG
ATAGTCGGTCCTTAATGTCTGCATGGAAACCTGACCATAATCTGGCTCAATGTTTGGACATATTCTCATCTGGGAAATGTTTGCCTGCTGGTAGTGCCTCAAATAAGTTT
GTTGGCAGCAGTGTTGCATGGCCACAAATTGCCGCAGTCGAATTGACACAAAAAAAATTATCTACCTAGACGGTTTGACCAAACTATTTTGAAGGTAATACAGGAAAACC
ACTGGAGACTAGGCAACGGCCAAGCAGAAATGTTCCTGTTGTATAGGAGTCTTGGTGGCACTTTTTTTTGTAGGCACAGTCGTATATGTAGTTAACCCTGAACTTGAGGT
TAGGACCAAGAATCTGTATTTTGATATTGGCAATGAAAATCATTTAAAATTGTCCATTCTATCACATTGTACTTAGTTTCTTCTGTTCACTGCAAAGCGTAAATATTTCC
TTCAGCCACTACATATATGAAGTCATTGAGTTGTTCTGTTTGCTCATATATTAGCTGACAGTTGTCAGCTGAAATAATTAG
Protein sequenceShow/hide protein sequence
MVCSVGNGRMAVMTRLLAAGSFSRTITGKLRDIKEVGHQKLASEFIYRELRDADEANLIDEEDMHVFGLKPMADPLNLVCCNNCKKPVKASQYIIHEELCRSLSSEQGTI
MDLDGGMGHRKNSRKEKKKLLPADANISAVEKEGSESTYADYSAAPAFPINNQFEMVKLTKRNSTCNASPILDGGTGVCPDSASLMHPSTKRSKLITGEGLLLASALEPS
STKTKIRNVPLPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICKENINQFHNSSQEESSQEQTNDIIGKKMDSRSLMSAWKPDHNLAQCLDIFSSGKCLPAGSASNKF
VGSSVAWPQIAAVELTQKKLST