; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013541 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013541
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDUF1685 domain-containing protein
Genome locationscaffold402:2314115..2314834
RNA-Seq ExpressionMS013541
SyntenyMS013541
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057058.1 DUF1685 domain-containing protein [Cucumis melo var. makuwa]1.2e-7366.93Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MD EQ+LNLFDSFWFERG+FN+    SN + + Q+    +S P    ++PR+ TRSISEDLSSKLSFMS+SNSPDS+LFSPKLQTIFSSKDIA  ESPE 
Subjt:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKK------KEHEEEEGGEE----GEISRP
        +RK    E+E  P+   + R RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K      KE EEEE  EE    GEISRP
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKK------KEHEEEEGGEE----GEISRP

Query:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        YLSEAWEAME+E E     KKP +M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

KAG6583944.1 hypothetical protein SDJN03_19876, partial [Cucurbita argyrosperma subsp. sororia]1.2e-7067.21Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+LNLFDS WFER IFN+   P    NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSKDIA  E
Subjt:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAW
         PE +R+ +        +++   R  GR  R  ES+SLSELEFEELKGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEE G  G ISRPYLSEAW
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAW

Query:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
         AME+E ELKK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

XP_022140169.1 uncharacterized protein LOC111010901 [Momordica charantia]2.1e-12698.75Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MDVEQILNLFDS WFERGIFNQPLL SNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
Subjt:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAWEAME
        NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEE GEEGEISRPYLSEAWEAME
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAWEAME

Query:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
Subjt:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

XP_023001638.1 uncharacterized protein LOC111495710 [Cucurbita maxima]4.2e-7166.27Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+LNLFDSFWFER IFN+   P    NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSKDIA  E
Subjt:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEE-----GEISRPY
         PE +R+ +        +++ + R  GR  R  ES+SLSELEFEELKGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEE  EE     G ISRPY
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEE-----GEISRPY

Query:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        LSEAW AME+E E+KK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

XP_038895996.1 uncharacterized protein LOC120084174 [Benincasa hispida]7.4e-7668.95Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES
        MD EQ+LNLFDSFWFE  IFN+   P    NP+ ENQ+    NS P  P ++PR+ TRSISEDLSSKLSFMSNSNSPDS+LFSPKLQTIFSSKDIA  ES
Subjt:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES

Query:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEE----GEISRPYLS
        PEN+RK     +E  P+  ++ + RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K E E+ E  EE    GEISRPYLS
Subjt:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEE----GEISRPYLS

Query:  EAWEAM-EKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        EAWEAM E+E ELK P  M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  EAWEAM-EKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0LV26 Uncharacterized protein1.5e-6963.32Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPE---VLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES
        MD + +LNLFDSFWF+R + N     SNP+     +P    P P P+   ++PR+ TRSISEDLSSKLSFMSNSNSPDS+L SPKLQTIFSSKDIA  ES
Subjt:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPE---VLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTES

Query:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKK--------EHEEEEGGEE----G
        PE + K    E+E  P+   + R RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K+        E EEEE  EE    G
Subjt:  PENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKK--------EHEEEEGGEE----G

Query:  EISRPYLSEAWEAM----EKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        EISRPYLSEAWEA+    EKE  LK+P +M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  EISRPYLSEAWEAM----EKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A5A7UPL7 DUF1685 domain-containing protein5.7e-7466.93Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MD EQ+LNLFDSFWFERG+FN+    SN + + Q+    +S P    ++PR+ TRSISEDLSSKLSFMS+SNSPDS+LFSPKLQTIFSSKDIA  ESPE 
Subjt:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKK------KEHEEEEGGEE----GEISRP
        +RK    E+E  P+   + R RGRR R  ES+SLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K      KE EEEE  EE    GEISRP
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKK------KEHEEEEGGEE----GEISRP

Query:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        YLSEAWEAME+E E     KKP +M+W FP+  N+IDMKDNLKWWAH VASTVR
Subjt:  YLSEAWEAMEKENE----LKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A6J1CEZ9 uncharacterized protein LOC1110109019.9e-12798.75Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
        MDVEQILNLFDS WFERGIFNQPLL SNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN
Subjt:  MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPEN

Query:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAWEAME
        NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEE GEEGEISRPYLSEAWEAME
Subjt:  NRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAWEAME

Query:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
Subjt:  KENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A6J1EH41 uncharacterized protein LOC1114340586.5e-7065.98Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+L+LFDS WFER IFN+   P    NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSK+IA  E
Subjt:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAW
         PE +R+ +        +++ + R  GR  R  ES+SLSELEFEE+KGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEE G  G ISRPYLSEAW
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAW

Query:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
         AME+E ELKK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  EAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

A0A6J1KR35 uncharacterized protein LOC1114957102.0e-7166.27Show/hide
Query:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE
        MDVEQ+LNLFDSFWFER IFN+   P    NP  ENQ++ PLKNSPP  P V PRI  RSISEDLSSKL+FMS+S+SPDS+LFSPKLQTI SSKDIA  E
Subjt:  MDVEQILNLFDSFWFERGIFNQ---PLLSSNPERENQEK-PLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTE

Query:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEE-----GEISRPY
         PE +R+ +        +++ + R  GR  R  ES+SLSELEFEELKGFMDLGFVFSE DK SSLA I+PGLNRL K++ EEEE  EE     G ISRPY
Subjt:  SPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEE-----GEISRPY

Query:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR
        LSEAW AME+E E+KK  +M+W  PA  NEIDMKDNLKWWAH VASTVR
Subjt:  LSEAWEAMEKENELKKPPLMEWSFPALGNEIDMKDNLKWWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)1.6e-0431.09Show/hide
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALG-NE
        SKSL++ + E+L+G +DLGF FS  D+   L + +P L          L  K+++  E     +   P L  A             P+  W   + G N 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALG-NE

Query:  IDMKDNLKWWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  IDMKDNLKWWAHTVASTVR

AT2G31560.1 Protein of unknown function (DUF1685)7.8e-0733.61Show/hide
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-
        +KSL++ + EELKG +DLGF FS  D+   L + +P L          L  K+    +  EE + S P  + A             P+  W   + G++ 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-

Query:  IDMKDNLKWWAHTVASTVR
         D+K  LK+WA TVA TVR
Subjt:  IDMKDNLKWWAHTVASTVR

AT2G31560.2 Protein of unknown function (DUF1685)7.8e-0733.61Show/hide
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-
        +KSL++ + EELKG +DLGF FS  D+   L + +P L          L  K+    +  EE + S P  + A             P+  W   + G++ 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNR--------LKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGNE-

Query:  IDMKDNLKWWAHTVASTVR
         D+K  LK+WA TVA TVR
Subjt:  IDMKDNLKWWAHTVASTVR

AT2G42760.1 unknown protein1.0e-3040.96Show/hide
Query:  EQILNLFDSFWFERGIFNQPLLSSNP------------ERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSN-----SPDSIL----FSPK
        E++L LF+  W ER IF +   + N             E   +E+ LKN P     ++ R  +       SSK S  S+S+     SP S+L       K
Subjt:  EQILNLFDSFWFERGIFNQPLLSSNP------------ERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSN-----SPDSIL----FSPK

Query:  LQTIFSSKDIARTESPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEED-KDSSLASIIPGLNRLKKKE----HE
        LQTI S K++      E  R+ +  E E E RK+ K +   R R+    KS+S+LE+EELKGFMDLGFVFSE+D KDS L SI+PGL RL KK+     E
Subjt:  LQTIFSSKDIARTESPENNRKEIRPELETEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEED-KDSSLASIIPGLNRLKKKE----HE

Query:  EEEGGEEGEI-----SRPYLSEAWE-AMEKENELKKPPLMEWSF--PALGNEIDMKDNLKWWAHTVASTVR
        EEE  EE +I     +RPYLSEAW+    ++ + +  P ++W    PA  +E+D+KDNL+ WAH VAST+R
Subjt:  EEEGGEEGEI-----SRPYLSEAWE-AMEKENELKKPPLMEWSF--PALGNEIDMKDNLKWWAHTVASTVR

AT2G43340.1 Protein of unknown function (DUF1685)9.5e-0532.46Show/hide
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENE-LKKP--PLMEWSFPALG-NEIDMKD
        +KSL++ + EELKG +DLGF F+ E+    L + +P L        +         I + +   +  + EK++  L  P  P+  W   + G N  D+K 
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENE-LKKP--PLMEWSFPALG-NEIDMKD

Query:  NLKWWAHTVASTVR
         LK+WA  VA TVR
Subjt:  NLKWWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGTTGAGCAAATTCTCAATCTCTTCGATTCCTTCTGGTTCGAGCGTGGAATCTTCAACCAACCCCTCCTGTCGTCAAACCCAGAACGTGAAAATCAAGAAAAGCC
ATTAAAAAACTCGCCGCCGCCGTCGCCGGAAGTTTTGCCACGGATTCACACGAGGTCCATAAGCGAAGATCTGAGCTCCAAATTGAGCTTTATGTCCAATTCCAATTCAC
CCGATTCGATCCTCTTCTCTCCGAAGCTTCAGACGATTTTCTCCAGCAAAGACATCGCCAGAACTGAGTCGCCGGAGAACAACCGGAAGGAAATTCGGCCGGAACTTGAA
ACAGAGCCGAGAAAGAGAAACAAGGGAAGAGGGAGAGGGAGAAGAAGACGGTGGCCGGAAAGTAAGAGCCTTTCGGAGCTGGAATTTGAGGAGCTAAAAGGGTTCATGGA
TTTGGGATTTGTTTTCTCGGAGGAAGATAAAGATTCGAGCTTGGCGTCGATTATTCCTGGTTTGAACAGGTTGAAGAAAAAGGAACATGAAGAAGAAGAAGGAGGAGAAG
AGGGGGAAATTTCGAGGCCTTATCTTTCTGAAGCTTGGGAGGCGATGGAGAAAGAGAATGAATTGAAGAAGCCGCCATTGATGGAGTGGAGCTTTCCTGCTTTGGGGAAT
GAAATTGATATGAAAGACAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGTTGAGCAAATTCTCAATCTCTTCGATTCCTTCTGGTTCGAGCGTGGAATCTTCAACCAACCCCTCCTGTCGTCAAACCCAGAACGTGAAAATCAAGAAAAGCC
ATTAAAAAACTCGCCGCCGCCGTCGCCGGAAGTTTTGCCACGGATTCACACGAGGTCCATAAGCGAAGATCTGAGCTCCAAATTGAGCTTTATGTCCAATTCCAATTCAC
CCGATTCGATCCTCTTCTCTCCGAAGCTTCAGACGATTTTCTCCAGCAAAGACATCGCCAGAACTGAGTCGCCGGAGAACAACCGGAAGGAAATTCGGCCGGAACTTGAA
ACAGAGCCGAGAAAGAGAAACAAGGGAAGAGGGAGAGGGAGAAGAAGACGGTGGCCGGAAAGTAAGAGCCTTTCGGAGCTGGAATTTGAGGAGCTAAAAGGGTTCATGGA
TTTGGGATTTGTTTTCTCGGAGGAAGATAAAGATTCGAGCTTGGCGTCGATTATTCCTGGTTTGAACAGGTTGAAGAAAAAGGAACATGAAGAAGAAGAAGGAGGAGAAG
AGGGGGAAATTTCGAGGCCTTATCTTTCTGAAGCTTGGGAGGCGATGGAGAAAGAGAATGAATTGAAGAAGCCGCCATTGATGGAGTGGAGCTTTCCTGCTTTGGGGAAT
GAAATTGATATGAAAGACAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGA
Protein sequenceShow/hide protein sequence
MDVEQILNLFDSFWFERGIFNQPLLSSNPERENQEKPLKNSPPPSPEVLPRIHTRSISEDLSSKLSFMSNSNSPDSILFSPKLQTIFSSKDIARTESPENNRKEIRPELE
TEPRKRNKGRGRGRRRRWPESKSLSELEFEELKGFMDLGFVFSEEDKDSSLASIIPGLNRLKKKEHEEEEGGEEGEISRPYLSEAWEAMEKENELKKPPLMEWSFPALGN
EIDMKDNLKWWAHTVASTVR