CuGenDBv2

MC03g0736 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0736
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DUF1685 domain-containing protein
Genome location: MC03:14164691..14165410
RNA-Seq Expression: MC03g0736
Synteny: MC03g0736
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
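
The genome location field above packs a sequence ID and a coordinate pair into one string. A minimal sketch of how such a string could be split and measured follows. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalize so that start <= end, in case a record lists them reversed.
    if start > end:
        start, end = end, start
    return seqid, start, end


def span_length(location: str) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    loc = "MC03:14164691..14165410"
    print(parse_location(loc), span_length(loc))
```

Under the inclusive-coordinate assumption, the span for this record is 720 bp.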


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057058.1 DUF1685 domain-containing protein [Cucumis melo var. makuwa] (e value: 2.47e-92; identity: 66.93%)
Query:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MD EQ+LNLFDS WFERG+FN+    SN + + Q+    +S P    ++PR+ TRSISEDLSSKLSFMS+SNSPDS+LFSPKLQTIFSSKDIA  ESPE 
Subjt:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEH------EEEEEGEE----GEISRP
        +RK    E+E  P+   + R RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL KKE       EEEEE EE    GEISRP
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEH------EEEEEGEE----GEISRP

Query:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        YLSEAWEAME+E E     KKP +M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

KAG6583944.1 hypothetical protein SDJN03_19876, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.44e-91; identity: 68.44%)
Query:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+LNLFDSLWFER IFN+   P  P NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSKDIA  E
Subjt:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAW
         PE +R+ +  +   + R     R  GR  R  ES+SLSELEFEELKGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEE G  G ISRPYLSEAW
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAW

Query:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
         AME+E ELKK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

XP_022140169.1 uncharacterized protein LOC111010901 [Momordica charantia] (e value: 9.61e-165; identity: 100%)
Query:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
Subjt:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAWEAME
        NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAWEAME
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAWEAME

Query:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
Subjt:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

XP_023001638.1 uncharacterized protein LOC111495710 [Cucurbita maxima] (e value: 9.77e-91; identity: 66.67%)
Query:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+LNLFDS WFER IFN+   P  P NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSKDIA  E
Subjt:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEE-----GEISRPY
         PE +R+ +        +++ + R  GR  R  ES+SLSELEFEELKGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEEE EE     G ISRPY
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEE-----GEISRPY

Query:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        LSEAW AME+E E+KK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

XP_038895996.1 uncharacterized protein LOC120084174 [Benincasa hispida] (e value: 1.08e-96; identity: 69.35%)
Query:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES
        MD EQ+LNLFDS WFE  IFN+   P  P NP+ ENQ+    NS P  P ++PR+ TRSISEDLSSKLSFMSNSNSPDS+LFSPKLQTIFSSKDIA  ES
Subjt:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES

Query:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEE----GEISRPYLS
        PEN+RK     +E  P+  ++ + RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K E E+ EE EE    GEISRPYLS
Subjt:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEE----GEISRPYLS

Query:  EAWEAMEKENE-LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        EAWEAME+E E LK P  M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  EAWEAMEKENE-LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LV26 Uncharacterized protein (e value: 2.52e-88; identity: 63.71%)
Query:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPE---VLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES
        MD + +LNLFDS WF+R + N    PSNP+     +P    P P P+   ++PR+ TRSISEDLSSKLSFMSNSNSPDS+L SPKLQTIFSSKDIA  ES
Subjt:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPE---VLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES

Query:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEH-------EEEEEGEE-----G
        PE + K    E+E  P+   + R RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K+E        EEEEE EE     G
Subjt:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEH-------EEEEEGEE-----G

Query:  EISRPYLSEAWEAM----EKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        EISRPYLSEAWEA+    EKE  LK+P +M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  EISRPYLSEAWEAM----EKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A5A7UPL7 DUF1685 domain-containing protein (e value: 1.19e-92; identity: 66.93%)
Query:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MD EQ+LNLFDS WFERG+FN+    SN + + Q+    +S P    ++PR+ TRSISEDLSSKLSFMS+SNSPDS+LFSPKLQTIFSSKDIA  ESPE 
Subjt:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEH------EEEEEGEE----GEISRP
        +RK    E+E  P+   + R RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL KKE       EEEEE EE    GEISRP
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEH------EEEEEGEE----GEISRP

Query:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        YLSEAWEAME+E E     KKP +M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A6J1CEZ9 uncharacterized protein LOC111010901 (e value: 4.65e-165; identity: 100%)
Query:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
Subjt:  MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAWEAME
        NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAWEAME
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAWEAME

Query:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
Subjt:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A6J1EH41 uncharacterized protein LOC111434058 (e value: 8.70e-90; identity: 66.8%)
Query:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+L+LFDSLWFER IFN+   P  P NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSK+IA  E
Subjt:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAW
         PE +R+ +        +++ + R  GR  R  ES+SLSELEFEE+KGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEE G  G ISRPYLSEAW
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAW

Query:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
         AME+E ELKK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A6J1KR35 uncharacterized protein LOC111495710 (e value: 4.73e-91; identity: 66.67%)
Query:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+LNLFDS WFER IFN+   P  P NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSKDIA  E
Subjt:  MDVEQILNLFDSLWFERGIFNQ---PLLPSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEE-----GEISRPY
         PE +R+ +        +++ + R  GR  R  ES+SLSELEFEELKGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEEE EE     G ISRPY
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEE-----GEISRPY

Query:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        LSEAW AME+E E+KK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05870.1 Protein of unknown function (DUF1685) (e value: 2.1e-04; identity: 31.09%)
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALG-NE
        SKSL++ + E+L+G +DLGF FS  D+   L + +P L          L  K+++  E     +   P L  A             P+  W   + G N 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALG-NE

Query:  IDMKDNLKWWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  IDMKDNLKWWAHTVASTVR

AT1G05870.2 Protein of unknown function (DUF1685) (e value: 2.1e-04; identity: 31.09%)
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALG-NE
        SKSL++ + E+L+G +DLGF FS  D+   L + +P L          L  K+++  E     +   P L  A             P+  W   + G N 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALG-NE

Query:  IDMKDNLKWWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  IDMKDNLKWWAHTVASTVR

AT2G31560.1 Protein of unknown function (DUF1685) (e value: 1.7e-06; identity: 33.61%)
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-
        +KSL++ + EELKG +DLGF FS  D+   L + +P L          L  K+    +  EE + S P  + A             P+  W   + G++ 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-

Query:  IDMKDNLKWWAHTVASTVR
         D+K  LK+WA TVA TVR
Subjt:  IDMKDNLKWWAHTVASTVR

AT2G31560.2 Protein of unknown function (DUF1685) (e value: 1.7e-06; identity: 33.61%)
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-
        +KSL++ + EELKG +DLGF FS  D+   L + +P L          L  K+    +  EE + S P  + A             P+  W   + G++ 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-

Query:  IDMKDNLKWWAHTVASTVR
         D+K  LK+WA TVA TVR
Subjt:  IDMKDNLKWWAHTVASTVR

AT2G42760.1 unknown protein (e value: 7.7e-31; identity: 41.33%)
Query:  EQILNLFDSLWFERGIFNQPLLPSNP------------ERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSN-----SPDSIL----FSPK
        E++L LF+  W ER IF +     N             E   +E+ LKN P     ++ R  +       SSK S  S+S+     SP S+L       K
Subjt:  EQILNLFDSLWFERGIFNQPLLPSNP------------ERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSN-----SPDSIL----FSPK

Query:  LQTIFSSKDIARTESPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEED-KDSSLASIIPGLNRLKKKE----HE
        LQTI S K++      E  R+ +  E E E RK+ K +   R R+    KS+S+LE+EELKGFMDLGFVFSE+D KDS L SI+PGL RL KK+     E
Subjt:  LQTIFSSKDIARTESPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEED-KDSSLASIIPGLNRLKKKE----HE

Query:  EEEEGEEGEI-----SRPYLSEAWE-AMEKENELKKPPLMEWSF--PALGNEIDMKDNLKWWAHTVASTVR
        EEEE EE +I     +RPYLSEAW+    ++ + +  P ++W    PA  +E+D+KDNL+ WAH VAST+R
Subjt:  EEEEGEEGEI-----SRPYLSEAWE-AMEKENELKKPPLMEWSF--PALGNEIDMKDNLKWWAHTVASTVR


Sequences
CDS sequence
ATGGACGTTGAGCAAATTCTCAATCTCTTCGATTCCCTCTGGTTCGAGCGTGGAATCTTCAACCAACCCCTCCTGCCGTCAAACCCAGAACGTGAAAATCAAGAAAAGCC
ATTAAAAAACTCGCCGCCGCCGTCGCCGGAAGTTTTGCCACGGATTCACACGAGGTCCATAAGCGAAGATCTGAGCTCCAAATTGAGCTTTATGTCCAATTCCAATTCAC
CCGATTCGATCCTCTTCTCTCCAAAGCTTCAGACGATTTTCTCCAGCAAAGACATCGCCAGAACTGAGTCGCCGGAGAACAACCGGAAGGAAATTCGGCCGGAACTTGAA
ACAGAGCCGAGAAAGAGAAACAAGGGAAGAGGGAGAGGGAGAAGAAGACGGTGGCCGGAAAGTAAGAGCCTTTCGGAGCTGGAATTTGAGGAGCTAAAAGGGTTCATGGA
TTTGGGATTTGTTTTCTCGGAGGAAGATAAAGATTCGAGCTTGGCGTCGATTATTCCTGGTTTGAACAGGTTGAAGAAAAAGGAACATGAAGAAGAAGAAGAAGGAGAAG
AGGGGGAAATTTCGAGGCCTTATCTTTCTGAAGCTTGGGAGGCGATGGAGAAAGAGAATGAATTGAAGAAGCCGCCATTGATGGAGTGGAGCTTTCCTGCTTTGGGGAAT
GAAATTGATATGAAAGACAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGA
mRNA sequence
ATGGACGTTGAGCAAATTCTCAATCTCTTCGATTCCCTCTGGTTCGAGCGTGGAATCTTCAACCAACCCCTCCTGCCGTCAAACCCAGAACGTGAAAATCAAGAAAAGCC
ATTAAAAAACTCGCCGCCGCCGTCGCCGGAAGTTTTGCCACGGATTCACACGAGGTCCATAAGCGAAGATCTGAGCTCCAAATTGAGCTTTATGTCCAATTCCAATTCAC
CCGATTCGATCCTCTTCTCTCCAAAGCTTCAGACGATTTTCTCCAGCAAAGACATCGCCAGAACTGAGTCGCCGGAGAACAACCGGAAGGAAATTCGGCCGGAACTTGAA
ACAGAGCCGAGAAAGAGAAACAAGGGAAGAGGGAGAGGGAGAAGAAGACGGTGGCCGGAAAGTAAGAGCCTTTCGGAGCTGGAATTTGAGGAGCTAAAAGGGTTCATGGA
TTTGGGATTTGTTTTCTCGGAGGAAGATAAAGATTCGAGCTTGGCGTCGATTATTCCTGGTTTGAACAGGTTGAAGAAAAAGGAACATGAAGAAGAAGAAGAAGGAGAAG
AGGGGGAAATTTCGAGGCCTTATCTTTCTGAAGCTTGGGAGGCGATGGAGAAAGAGAATGAATTGAAGAAGCCGCCATTGATGGAGTGGAGCTTTCCTGCTTTGGGGAAT
GAAATTGATATGAAAGACAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGA
Protein sequence
MDVEQILNLFDSLWFERGIFNQPLLPSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPENNRKEIRPELE
TEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEEGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGN
EIDMKDNLKWWAHTVASTVR